×
1 warning:
  • Access restricted to public data.
    Please sign in at Rnomics Platform to access your data.
Navigation
RefSeq Detection Screen -- Protein maps 
Protein Maps : .
This page contains ASEs represented as a graph of the mRNA exonic structure mapped to the protein primary sequence. Exons are shown in white with black outline. The reference form is represented above the alternative form. Coding regions are shown with blue horizontal bars. These can be interrupted by red blocks for absent regions. Below are the domains in the protein obtained from NCBI's Conserved Domain Database. Domains that are affected by the ASE are outlined in red.

Further information can be obtained in the help page

FAS

  • 113302 TNFRSF6-6 162 409
  • NCBIGene 36.3 355
  • Multiple exon skipping, size difference: 247
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000043

  • Changed! cd TNFR 95aa 5e-16 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! smart DEATH 88aa 2e-16 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • Changed! pfam TNFR_c6 37aa 4e-05 in ref transcript
    • TNFR/NGFR cysteine-rich region.

BCL2L11

  • 113317 BCL2L11-4 130 220
  • NCBIGene 36.2 10018
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006538

  • Changed! pfam Bclx_interact 36aa 3e-12 in ref transcript
    • Bcl-x interacting. This domain is a long alpha helix, required for interaction with Bcl-x. It is found in BAM, Bim and Bcl2-like protein 11.
  • pfam Bim_N 37aa 9e-05 in ref transcript
    • Bim protein N-terminus. This family represents the N-terminal region of several mammal specific Bim proteins. The Bim protein is one of the BH3-only proteins, members of the Bcl-2 family that have only one of the Bcl-2 homology regions, BH3. BH3-only proteins are essential initiators of apoptotic cell death.
  • Changed! pfam Bclx_interact 35aa 9e-11 in modified transcript

ACACA

  • ACACA.F4 rs.ACACA.R1 208 319
  • NCBIGene 36.3 31
  • Single exon skipping, size difference: 111
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_198837

  • Changed! cd biotinyl_domain 64aa 5e-11 in ref transcript
    • The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine.
  • Changed! pfam ACC_central 751aa 0.0 in ref transcript
    • Acetyl-CoA carboxylase, central region. The region featured in this family is found in various eukaryotic acetyl-CoA carboxylases, N-terminal to the catalytic domain (pfam01039). This enzyme (EC:6.4.1.2) is involved in the synthesis of long-chain fatty acids, as it catalyses the rate-limiting step in this process.
  • Changed! pfam Carboxyl_trans 560aa 1e-163 in ref transcript
    • Carboxyl transferase domain. All of the members in this family are biotin dependent carboxylases. The carboxyl transferase domain carries out the following reaction; transcarboxylation from biotin to an acceptor molecule. There are two recognised types of carboxyl transferase. One of them uses acyl-CoA and the other uses 2-oxoacid as the acceptor molecule of carbon dioxide. All of the members in this family utilise acyl-CoA as the acceptor molecule.
  • Changed! TIGR accC 502aa 2e-89 in ref transcript
    • This model represents the biotin carboxylase subunit found usually as a component of acetyl-CoA carboxylase. Acetyl-CoA carboxylase is designated EC 6.4.1.2 and this component, biotin carboxylase, has its own designation, EC 6.3.4.14. Homologous domains are found in eukaryotic forms of acetyl-CoA carboxylase and in a number of other carboxylases (e.g. pyruvate carboxylase), but seed members and trusted cutoff are selected so as to exclude these. In some systems, the biotin carboxyl carrier protein and this protein (biotin carboxylase) may be shared by different carboxyltransferases. However, this model is not intended to identify the biotin carboxylase domain of propionyl-coA carboxylase. The model should hit the full length of proteins, except for chloroplast transit peptides in plants. If it hits a domain only of a longer protein, there may be a problem with the identification.
  • Changed! pfam Biotin_lipoyl 66aa 3e-13 in ref transcript
    • Biotin-requiring enzyme. This family covers two Prosite entries, the conserved lysine residue binds biotin in one group and lipoic acid in the other. May not recognise the Glycine cleavage system H proteins.
  • Changed! COG AccC 503aa 1e-112 in ref transcript
    • Biotin carboxylase [Lipid metabolism].
  • Changed! COG COG4799 561aa 2e-88 in ref transcript
    • Acetyl-CoA carboxylase, carboxyltransferase component (subunits alpha and beta) [Lipid metabolism].
  • Changed! COG AccB 75aa 1e-07 in ref transcript
    • Biotin carboxyl carrier protein [Lipid metabolism].

AFF3

  • AFF3.F2 AFF3.R31 380 455
  • NCBIGene 36.3 3899
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025108

  • pfam AF-4 1204aa 0.0 in ref transcript
    • AF-4 proto-oncoprotein. This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.

AKAP13

  • AKAP13.u.f.21 AKAP13.u.r.24 286 352
  • AceView 36.Apr07 AKAP13
  • Single exon skipping, size difference: 66
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: AKAP13.bApr07

  • cd RhoGEF 195aa 5e-37 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • cd PH 91aa 2e-05 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • smart RhoGEF 192aa 5e-40 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • smart PH 102aa 5e-07 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • COG ROM1 305aa 5e-06 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

APAF1

  • APAF1.F8 APAF1.R8 138 267
  • NCBIGene 36.3 317
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181861

  • Changed! cd WD40 304aa 4e-60 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 274aa 5e-47 in ref transcript
  • pfam NB-ARC 286aa 5e-73 in ref transcript
    • NB-ARC domain.
  • pfam CARD 85aa 4e-14 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • smart WD40 40aa 8e-08 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 40aa 2e-06 in ref transcript
  • smart WD40 40aa 8e-05 in ref transcript
  • smart WD40 42aa 1e-04 in ref transcript
  • smart WD40 37aa 1e-04 in ref transcript
  • Changed! pfam eIF2A 114aa 0.003 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • smart WD40 28aa 0.006 in ref transcript
  • smart WD40 34aa 0.008 in ref transcript
  • Changed! COG COG2319 398aa 4e-33 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG COG2319 408aa 5e-25 in ref transcript
  • Changed! cd WD40 264aa 1e-55 in modified transcript
  • Changed! COG COG2319 390aa 1e-33 in modified transcript

APCandSRP19andZRSR1

  • APC.F16 APC.R14 232 344
  • AceView 36.Apr07 APCandSRP19andZRSR1
  • Single exon skipping, size difference: 112
  • Inclusion in the protein causing a frameshift
  • Reference transcript: APCandSRP19andZRSR1.eApr07

  • Changed! cd RRM 61aa 5e-08 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM_1 72aa 7e-16 in ref transcript
    • RNA recognition motif.
  • Changed! pfam zf-CCCH 25aa 7e-04 in ref transcript
    • Zinc finger C-x8-C-x5-C-x3-H type (and similar).

ATG5

  • APG5L.u.f.9 APG5L.u.r.5 148 276
  • AceView 36.Apr07 ATG5
  • Single exon skipping, size difference: 128
  • Exclusion in the protein causing a frameshift
  • Reference transcript: ATG5.bApr07

  • Changed! pfam APG5 194aa 9e-71 in ref transcript
    • Autophagy protein Apg5. Apg5 is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway.

APP

  • APP.F9 APP.R1 118 286
  • NCBIGene 36.3 351
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201413

  • Changed! cd KU 52aa 1e-17 in ref transcript
    • BPTI/Kunitz family of serine protease inhibitors; Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure.
  • pfam A4_EXTRA 165aa 1e-88 in ref transcript
    • Amyloid A4 extracellular domain.
  • Changed! pfam Kunitz_BPTI 52aa 8e-19 in ref transcript
    • Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.
  • pfam APP_amyloid 43aa 4e-18 in ref transcript
    • beta-amyloid precursor protein C-terminus. This is the amyloid, C-terminal, protein of the beta-Amyloid precursor protein (APP) which is a conserved and ubiquitous transmembrane glycoprotein strongly implicated in the pathogenesis of Alzheimer's disease but whose normal biological function is unknown. The C-terminal 100 residues are released and aggregate into amyloid deposits which are strongly implicated in the pathology of Alzheimer's disease plaque-formation. The domain is associated with family A4_EXTRA, pfam02177, further towards the N-terminus.
  • pfam Beta-APP 34aa 1e-06 in ref transcript
    • Beta-amyloid peptide (beta-APP).
  • pfam OmpH 81aa 0.004 in ref transcript
    • Outer membrane protein (OmpH-like). This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterised as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.
  • Changed! PRK xseA 173aa 0.001 in ref transcript
    • exodeoxyribonuclease VII large subunit; Reviewed.

AXIN1

  • AXIN1.u.f.11 AXIN1.R5 312 420
  • NCBIGene 36.3 8312
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003502

  • pfam RGS 123aa 2e-31 in ref transcript
    • Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
  • smart DAX 83aa 4e-27 in ref transcript
    • Domain present in Dishevelled and axin. Domain of unknown function.
  • pfam Axin_b-cat_bind 33aa 2e-08 in ref transcript
    • Axin beta-catenin binding domain. This domain is found on the scaffolding protein Axin which is a component of the beta-catenin destruction complex. It competes with the tumour suppressor adenomatous polyposis coli protein (APC) for binding to beta-catenin.

AXL

  • AXL_x151_x133_f AXL_x151_x133_r 178 205
  • NCBIGene 36.3 558
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021913

  • cd PTKc_Axl 272aa 1e-167 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Axl. Protein Tyrosine Kinase (PTK) family; Axl; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Axl is a member of the Axl subfamily, which is composed of receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with two immunoglobulin-like domains followed by two fibronectin type III repeats, a transmembrane segment, and an intracellular catalytic domain. Binding to their ligands, Gas6 and protein S, leads to receptor dimerization, autophosphorylation, activation, and intracellular signaling. Axl is widely expressed in a variety of organs and cells including epithelial, mesenchymal, hematopoietic, as well as non-transformed cells. Axl signaling is important in many cellular functions such as survival, anti-apoptosis, proliferation, migration, and adhesion. Axl was originally isolated from patients with chronic myelogenous leukemia and a chronic myeloproliferative disorder. Axl is overexpressed in many human cancers including colon, squamous cell, thyroid, breast, and lung carcinomas.
  • cd FN3 104aa 2e-06 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IG 77aa 2e-05 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 84aa 0.006 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 88aa 0.009 in ref transcript
  • smart TyrKc 266aa 1e-105 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • pfam fn3 85aa 2e-06 in ref transcript
    • Fibronectin type III domain.
  • smart IG_like 83aa 2e-05 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 97aa 1e-04 in ref transcript
  • pfam I-set 85aa 0.008 in ref transcript
    • Immunoglobulin I-set domain.
  • COG SPS1 280aa 7e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

BCAS1

  • BCAS1.F9 BCAS1.R4 226 292
  • AceView 36.Apr07 BCAS1
  • Single exon skipping, size difference: 66
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: BCAS1.aApr07

BCL2L1

  • BCL2L1.u.f.5 BCL2L1.b.r.24 169 187
  • AceView 36.Apr07 BCL2L1
  • Alternative 3-prime, size difference: 18
  • Inclusion in 5'UTR
  • Reference transcript: BCL2L1.cApr07

  • cd Bcl-2_like 114aa 2e-40 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • TIGR bcl-2 233aa 1e-89 in ref transcript
    • in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.

BCL2L12

  • BCL2L12-3 BCL2L12-4 187 330
  • AceView 36.Apr07 BCL2L12
  • Single exon skipping, size difference: 143
  • Exclusion in the protein causing a frameshift
  • Reference transcript: BCL2L12.aApr07

AGR3

  • BCMP11.F2 BCMP11.R2 119 255
  • AceView 36.Apr07 AGR3
  • Single exon skipping, size difference: 136
  • Exclusion of the protein initiation site
  • Reference transcript: AGR3.aApr07

  • Changed! cd AGR 130aa 2e-67 in ref transcript
    • Anterior Gradient (AGR) family; members of this family are similar to secreted proteins encoded by the cement gland-specific genes XAG-1 and XAG-2, expressed in the anterior region of dorsal ectoderm of Xenopus. They are implicated in the formation of the cement gland and the induction of forebrain fate. The human homologs, hAG-2 and hAG-3, are secreted proteins associated with estrogen-positive breast tumors. Yeast two-hybrid studies identified the metastasis-associated C4.4a protein and dystroglycan as binding partners, indicating possible roles in the development and progression of breast cancer. hAG-2 has also been implicated in prostate cancer. Its gene was cloned as an androgen-inducible gene and it was shown to be overexpressed in prostate cancer cells at the mRNA and protein levels. AGR proteins contain one conserved cysteine corresponding to the first cysteine in the CXXC motif of TRX. They show high sequence similarity to ERp19.
  • Changed! cd AGR 104aa 1e-51 in modified transcript

BMP4

  • BMP4.u.f.4 BMP4.u.r.5 128 337
  • NCBIGene 36.3 652
  • Alternative 5-prime, size difference: 209
  • Exclusion in 5'UTR
  • Reference transcript: NM_001202

  • pfam TGFb_propeptide 236aa 9e-62 in ref transcript
    • TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein.
  • smart TGFB 101aa 9e-48 in ref transcript
    • Transforming growth factor-beta (TGF-beta) family. Family members are active as disulphide-linked homo- or heterodimers. TGFB is a multifunctional peptide that controls proliferation, differentiation, and other functions in many cell types.

BTC

  • BTC.F1 BTC.R1 137 284
  • AceView 36.Apr07 BTC
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: BTC.aApr07

C11orf17

  • C11orf17-F1 C11orf17-R1 252 333
  • AceView 36.Apr07 C11orf17
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: C11orf17.bApr07

SMG7

  • C1orf16.F8 C1orf16.R8 99 249
  • AceView 36.Apr07 SMG7
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: SMG7.aApr07

  • cd TPR 79aa 0.003 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • pfam EST1 119aa 1e-24 in ref transcript
    • Telomerase activating protein Est1. Est1 is a protein which recruits or activates telomerase at the site of polymerisation.

CAPN3

  • CAPN3.u.f.38 CAPN3.u.r.40 153 297
  • NCBIGene 36.3 825
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000070

  • Changed! cd CysPc 353aa 1e-106 in ref transcript
    • Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
  • cd Calpain_III 158aa 1e-54 in ref transcript
    • Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
  • cd EFh 55aa 2e-06 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 68aa 0.004 in ref transcript
  • cd EFh 56aa 0.008 in ref transcript
  • Changed! pfam Peptidase_C2 343aa 1e-158 in ref transcript
    • Calpain family cysteine protease.
  • pfam Calpain_III 155aa 8e-65 in ref transcript
    • Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
  • COG FRQ1 99aa 6e-06 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! cd CysPc 305aa 1e-111 in modified transcript
  • Changed! pfam Peptidase_C2 295aa 1e-165 in modified transcript

CASC4

  • CASC4.F1 NM_177974-R1 127 295
  • NCBIGene 36.3 113201
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138423

  • TIGR SMC_prok_B 159aa 3e-06 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 157aa 9e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG FlaD 98aa 0.003 in ref transcript
    • Putative archaeal flagellar protein D/E [Cell motility and secretion].

CCL4

  • CCL4.F1 CCL4.R1 259 372
  • AceView 36.Apr07 CCL4
  • Single exon skipping, size difference: 115
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: CCL4.dApr07

  • Changed! cd Chemokine_CC 26aa 1e-04 in ref transcript
    • Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
  • Changed! pfam IL8 22aa 6e-05 in ref transcript
    • Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds.

CCNE1

  • CCNE1.F14 CCNE1.R3 260 395
  • AceView 36.Apr07 CCNE1
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: CCNE1.cApr07

  • cd CYCLIN 91aa 1e-14 in ref transcript
    • Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
  • Changed! pfam Cyclin_N 128aa 7e-42 in ref transcript
    • Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-terminal domain.
  • Changed! pfam Cyclin_C 126aa 1e-08 in ref transcript
    • Cyclin, C-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). Human UDG2 is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the C-terminal domain.
  • Changed! COG COG5024 213aa 5e-23 in ref transcript
    • Cyclin [Cell division and chromosome partitioning].
  • Changed! pfam Cyclin_N 121aa 7e-39 in modified transcript
  • Changed! pfam Cyclin_C 78aa 3e-07 in modified transcript
  • Changed! COG COG5024 234aa 1e-21 in modified transcript

CD40

  • CD40.F10 CD40.R9 133 229
  • AceView 36.Apr07 CD40
  • Alternative 3-prime, size difference: 96
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: CD40.aApr07

  • Changed! cd TNFR 98aa 8e-16 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! cd TNFR 63aa 3e-04 in ref transcript
  • Changed! smart TNFR 39aa 0.004 in ref transcript
    • Tumor necrosis factor receptor / nerve growth factor receptor repeats. Repeats in growth factor receptors that are involved in growth factor binding. TNF/TNFR.

CDC2L1

  • CDC2L1.u.f.9 refseq_CDC2L1.R1 197 334
  • NCBIGene 36.2 984
  • Single exon skipping, size difference: 137
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001787

  • Changed! cd S_TKc 287aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 276aa 6e-69 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 295aa 2e-55 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CDC2L2

  • CDC2L2.u.f.7 refseq_CDC2L1.R6 140 179
  • NCBIGene 36.2 985
  • Alternative 5-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033531

  • cd S_TKc 287aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 276aa 5e-69 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00024 295aa 4e-55 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CHEK2

  • CHEK2.u.f.31 CHEK2.u.r.29 194 256
  • AceView 36.Apr07 CHEK2
  • Single exon skipping, size difference: 62
  • Exclusion in the protein causing a frameshift
  • Reference transcript: CHEK2.aApr07

  • Changed! cd S_TKc 268aa 7e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd FHA 89aa 5e-07 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • Changed! smart S_TKc 257aa 4e-75 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam FHA 79aa 8e-08 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • Changed! PTZ PTZ00263 226aa 5e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! cd S_TKc 65aa 7e-09 in modified transcript
  • Changed! smart STYKc 64aa 7e-10 in modified transcript
    • Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.

NLRP1

  • DEFCAP.au.11.f NALP1.u.r.25 152 284
  • NCBIGene 36.3 22861
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033004

  • cd LRR_RI 179aa 8e-30 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • cd AAA 95aa 0.009 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam NACHT 170aa 5e-45 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam CARD 82aa 2e-16 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • pfam PAAD_DAPIN 58aa 2e-08 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • smart LRR_RI 28aa 0.002 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.
  • COG RNA1 142aa 0.004 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].

DNMT3B

  • DNMT3B.u.f.26 DNMT3B.u.r.28 137 326
  • NCBIGene 36.3 1789
  • Multiple exon skipping, size difference: 189
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006892

  • cd Dnmt3b_related 87aa 2e-35 in ref transcript
    • The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • Changed! cd Cyt_C5_DNA_methylase 271aa 4e-13 in ref transcript
    • Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
  • pfam PWWP 73aa 8e-24 in ref transcript
    • PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
  • pfam DNA_methylase 123aa 8e-08 in ref transcript
    • C-5 cytosine-specific DNA methylase.
  • COG Dcm 160aa 5e-13 in ref transcript
    • Site-specific DNA methylase [DNA replication, recombination, and repair].
  • Changed! cd Cyt_C5_DNA_methylase 134aa 9e-09 in modified transcript

DBF4B

  • DRF1.F1 DRF1.R1 93 158
  • AceView 36.Apr07 DBF4B
  • Alternative 3-prime, size difference: 65
  • Exclusion in the protein causing a frameshift
  • Reference transcript: DBF4B.aApr07

  • Changed! pfam zf-DBF 49aa 1e-16 in ref transcript
    • DBF zinc finger. This domain is predicted to bind metal ions and is often found associated with pfam00533 and pfam02178.
  • Changed! COG DBF4 58aa 3e-04 in ref transcript
    • Protein kinase essential for the initiation of DNA replication [DNA replication, recombination, and repair / Cell division and chromosome partitioning].

DSC3

  • DSC3.F1 DSC3.R1 229 272
  • NCBIGene 36.3 1825
  • Single exon skipping, size difference: 43
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001941

  • cd CA 218aa 9e-36 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 209aa 8e-31 in ref transcript
  • cd CA 210aa 4e-24 in ref transcript
  • Changed! cd CA 193aa 1e-19 in ref transcript
  • pfam Cadherin_pro 87aa 5e-18 in ref transcript
    • Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
  • pfam Cadherin 99aa 9e-17 in ref transcript
    • Cadherin domain.
  • pfam Cadherin 103aa 2e-11 in ref transcript
  • smart CA 77aa 4e-11 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • pfam Cadherin 95aa 1e-10 in ref transcript
  • Changed! pfam Cadherin_C 63aa 1e-04 in ref transcript
    • Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.
  • Changed! pfam Cadherin 74aa 0.009 in ref transcript
  • Changed! cd CA 192aa 2e-19 in modified transcript

ECT2

  • ECT2.F12 ECT2.R11 100 193
  • AceView 36.Apr07 ECT2
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: ECT2.aApr07

  • cd PH_etc2 129aa 2e-61 in ref transcript
    • Epithelial cell transforming 2 (ECT2) pleckstrin homology (PH) domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
  • cd RhoGEF 187aa 7e-37 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • cd BRCT 71aa 4e-08 in ref transcript
    • Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
  • smart RhoGEF 185aa 2e-43 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • smart BRCT 73aa 2e-07 in ref transcript
    • breast cancer carboxy-terminal domain.
  • pfam BRCT 72aa 7e-04 in ref transcript
    • BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
  • COG ROM1 201aa 1e-08 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

F3

  • F3.F2 F3.R2 270 430
  • AceView 36.Apr07 F3
  • Single exon skipping, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: F3.aApr07

  • pfam Tissue_fac 125aa 3e-52 in ref transcript
    • Tissue factor.

FANCA

  • FANCA.F13 FANCA.R13 102 231
  • AceView 36.Apr07 FANCA
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: FANCA.aApr07

  • pfam Fanconi_A 31aa 5e-10 in ref transcript
    • Fanconi anaemia group A protein.

FANCL

  • FANCL.F9 FANCL.R5 153 431
  • AceView 36.Apr07 FANCL
  • Multiple exon skipping, size difference: 278
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: FANCL.bApr07

  • Changed! pfam WD-3 270aa 1e-118 in ref transcript
    • WD-repeat region. This entry is of a region of approximately 100 residues containing three WD repeats and six cysteine residues possibly as three cystine-bridges. These regions are contained within the Fancl protein in humans which is the putative E3 ubiquitin ligase subunit of the FA complex (Fanconi anaemia). Eight subunits of the Fanconi anaemia gene products form a multisubunit nuclear complex which is required for mono-ubiquitination of a downstream FA protein, FANCD2. The WD repeats are required for interaction with other subunits of the FA complex.
  • Changed! COG COG5219 62aa 0.003 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

FASTK

  • FAST_HUMAN-5 FAST_HUMAN-6 410 730
  • AceView 36.Apr07 FASTK
  • Multiple exon skipping, size difference: 320
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: FASTK.aApr07

  • Changed! pfam FAST_1 67aa 1e-13 in ref transcript
    • FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
  • Changed! pfam FAST_2 92aa 8e-13 in ref transcript
    • FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.
  • Changed! pfam RAP 58aa 5e-12 in ref transcript
    • RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.

FGFR1OP

  • FGFR1OP.u.f.10 FGFR1OP.u.r.12 108 168
  • NCBIGene 36.3 11116
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007045

  • pfam FOP_dimer 81aa 8e-35 in ref transcript
    • FOP N terminal dimerisation domain. Fibroblast growth factor receptor 1 (FGFR1) oncogene partner (FOP) is a centrosomal protein that is involved in anchoring microtubules to subcellular structures. This domain includes a Lis-homology motif. It forms an alpha helical bundle and is involved in dimerisation.

FGFR2

  • FGFR2.F1 FGFR2.R1 205 472
  • AceView 36.Apr07 FGFR2
  • Single exon skipping, size difference: 267
  • Exclusion in the protein (no frameshift)
  • Reference transcript: FGFR2.aApr07

  • cd PTKc_FGFR2 304aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 2. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 2 (FGFR2); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR2 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. There are many splice variants of FGFR2 which show differential expression and binding to FGF ligands. Disruption of either FGFR2 or FGFR2b is lethal in mice, due to defects in the placenta or severe impairment of tissue development including lung, limb, and thyroid, respectively. Disruption of FGFR2c in mice results in defective bone and skull development. Genetic alterations of FGFR2 are associated with many human skeletal disorders including Apert syndrome, Crouzon syndrome, Jackson-Weiss syndrome, and Pfeiffer syndrome.
  • cd IGcam 93aa 1e-16 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 101aa 3e-05 in ref transcript
  • Changed! cd IGcam 64aa 8e-05 in ref transcript
  • pfam Pkinase_Tyr 277aa 1e-115 in ref transcript
    • Protein tyrosine kinase.
  • pfam I-set 75aa 2e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • Changed! smart IGc2 57aa 9e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 96aa 2e-10 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • COG SPS1 252aa 3e-21 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

FGFR3

  • FGFR3.F4 rs.FGFR3.R1 99 435
  • NCBIGene 36.3 2261
  • Multiple exon skipping, size difference: 336
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000142

  • cd PTKc_FGFR3 334aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 3. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 3 (FGFR3); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR3 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Many FGFR3 splice variants have been reported with the IIIb and IIIc isoforms being the predominant forms. FGFR3 IIIc is the isoform expressed in chondrocytes, the cells affected in dwarfism, while IIIb is expressed in epithelial cells. FGFR3 ligands include FGF1, FGF2, FGF4, FGF8, FGF9, and FGF23. It is a negative regulator of long bone growth. In the cochlear duct and in the lens, FGFR3 is involved in differentiation while it appears to have a role in cell proliferation in epithelial cells. Germline mutations in FGFR3 are associated with skeletal disorders including several forms of dwarfism. Some missense mutations are associated with multiple myeloma and carcinomas of the bladder and cervix. Overexpression of FGFR3 is found in thyroid carcinoma.
  • cd IGcam 93aa 4e-16 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! cd IGcam 94aa 8e-08 in ref transcript
  • pfam Pkinase_Tyr 277aa 1e-117 in ref transcript
    • Protein tyrosine kinase.
  • pfam I-set 75aa 2e-13 in ref transcript
    • Immunoglobulin I-set domain.
  • Changed! smart IG_like 94aa 2e-10 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 58aa 6e-06 in ref transcript
    • Immunoglobulin C-2 Type.
  • COG SPS1 275aa 6e-22 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! smart IG_like 43aa 0.009 in modified transcript

FGFR4

  • FGFR4.F23 FGFR4.R11 275 469
  • AceView 36.Apr07 FGFR4
  • Alternative 3-prime, size difference: 194
  • Inclusion in the protein causing a frameshift
  • Reference transcript: FGFR4.bApr07

  • Changed! cd PTKc_FGFR4 311aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 4. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 4 (FGFR4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR4 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Unlike other FGFRs, there is only one splice form of FGFR4. It binds FGF1, FGF2, FGF6, FGF19, and FGF23. FGF19 is a selective ligand for FGFR4. Although disruption of FGFR4 in mice causes no obvious phenotype, in vivo inhibition of FGFR4 in cultured skeletal muscle cells resulted in an arrest of muscle progenitor differentiation. FGF6 and FGFR4 are uniquely expressed in myofibers and satellite cells. FGF6/FGFR4 signaling appears to play a key role in the regulation of muscle regeneration. A polymorphism in FGFR4 is found in head and neck squamous cell carcinoma.
  • cd IGcam 93aa 6e-17 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 101aa 2e-07 in ref transcript
  • cd IGcam 67aa 0.003 in ref transcript
  • Changed! pfam Pkinase_Tyr 277aa 1e-114 in ref transcript
    • Protein tyrosine kinase.
  • smart IGc2 67aa 4e-15 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 96aa 6e-12 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 58aa 8e-09 in ref transcript
  • Changed! COG SPS1 251aa 7e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

FN1

  • FN1.u.f.30 FN1.u.r.34 296 569
  • NCBIGene 36.3 2335
  • Single exon skipping, size difference: 273
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212482

  • cd FN2 48aa 2e-16 in ref transcript
    • Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
  • cd FN2 48aa 3e-16 in ref transcript
  • cd FN3 83aa 4e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN1 44aa 4e-09 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • cd FN3 81aa 8e-09 in ref transcript
  • cd FN3 80aa 1e-08 in ref transcript
  • cd FN1 40aa 4e-08 in ref transcript
  • cd FN1 42aa 5e-08 in ref transcript
  • cd FN1 45aa 8e-08 in ref transcript
  • cd FN3 88aa 1e-07 in ref transcript
  • cd FN3 88aa 2e-07 in ref transcript
  • cd FN1 44aa 2e-07 in ref transcript
  • cd FN3 87aa 3e-07 in ref transcript
  • cd FN1 38aa 4e-07 in ref transcript
  • cd FN3 81aa 9e-07 in ref transcript
  • cd FN3 82aa 1e-06 in ref transcript
  • cd FN3 88aa 1e-06 in ref transcript
  • cd FN1 41aa 1e-06 in ref transcript
  • Changed! cd FN3 93aa 2e-06 in ref transcript
  • cd FN1 45aa 2e-06 in ref transcript
  • cd FN1 42aa 2e-06 in ref transcript
  • cd FN3 81aa 6e-06 in ref transcript
  • cd FN3 90aa 8e-06 in ref transcript
  • cd FN3 73aa 9e-06 in ref transcript
  • cd FN1 44aa 1e-05 in ref transcript
  • cd FN1 43aa 3e-05 in ref transcript
  • cd FN1 39aa 0.003 in ref transcript
  • cd FN3 73aa 0.006 in ref transcript
  • smart FN2 49aa 9e-20 in ref transcript
    • Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
  • smart FN2 49aa 4e-19 in ref transcript
  • pfam fn3 82aa 8e-17 in ref transcript
    • Fibronectin type III domain.
  • smart FN1 45aa 6e-13 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • pfam fn1 39aa 1e-12 in ref transcript
    • Fibronectin type I domain.
  • pfam fn3 81aa 3e-12 in ref transcript
  • pfam fn3 81aa 3e-12 in ref transcript
  • smart FN1 45aa 3e-12 in ref transcript
  • smart FN1 42aa 3e-12 in ref transcript
  • smart FN1 45aa 5e-12 in ref transcript
  • smart FN1 39aa 2e-11 in ref transcript
  • pfam fn3 81aa 4e-11 in ref transcript
  • Changed! pfam fn3 83aa 5e-11 in ref transcript
  • pfam fn3 67aa 8e-11 in ref transcript
  • pfam fn1 39aa 8e-11 in ref transcript
  • pfam fn3 81aa 1e-10 in ref transcript
  • pfam fn3 80aa 1e-10 in ref transcript
  • smart FN1 43aa 2e-10 in ref transcript
  • pfam fn3 80aa 5e-10 in ref transcript
  • smart FN1 44aa 7e-10 in ref transcript
  • pfam fn3 81aa 8e-09 in ref transcript
  • pfam fn3 85aa 9e-09 in ref transcript
  • pfam fn3 70aa 1e-08 in ref transcript
  • pfam fn3 65aa 2e-08 in ref transcript
  • smart FN1 41aa 3e-08 in ref transcript
  • pfam fn3 81aa 3e-07 in ref transcript
  • smart FN1 41aa 5e-07 in ref transcript
  • pfam fn3 54aa 5e-05 in ref transcript
  • pfam fn1 31aa 9e-05 in ref transcript

FN1

  • FN1.u.f.40 FN1.u.r.45 126 396
  • NCBIGene 36.3 2335
  • Single exon skipping, size difference: 270
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212482

  • cd FN2 48aa 2e-16 in ref transcript
    • Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
  • cd FN2 48aa 3e-16 in ref transcript
  • cd FN3 83aa 4e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN1 44aa 4e-09 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • cd FN3 81aa 8e-09 in ref transcript
  • cd FN3 80aa 1e-08 in ref transcript
  • cd FN1 40aa 4e-08 in ref transcript
  • cd FN1 42aa 5e-08 in ref transcript
  • cd FN1 45aa 8e-08 in ref transcript
  • cd FN3 88aa 1e-07 in ref transcript
  • cd FN3 88aa 2e-07 in ref transcript
  • cd FN1 44aa 2e-07 in ref transcript
  • cd FN3 87aa 3e-07 in ref transcript
  • cd FN1 38aa 4e-07 in ref transcript
  • Changed! cd FN3 81aa 9e-07 in ref transcript
  • cd FN3 82aa 1e-06 in ref transcript
  • Changed! cd FN3 88aa 1e-06 in ref transcript
  • cd FN1 41aa 1e-06 in ref transcript
  • cd FN3 93aa 2e-06 in ref transcript
  • cd FN1 45aa 2e-06 in ref transcript
  • cd FN1 42aa 2e-06 in ref transcript
  • cd FN3 81aa 6e-06 in ref transcript
  • cd FN3 90aa 8e-06 in ref transcript
  • cd FN3 73aa 9e-06 in ref transcript
  • cd FN1 44aa 1e-05 in ref transcript
  • cd FN1 43aa 3e-05 in ref transcript
  • cd FN1 39aa 0.003 in ref transcript
  • cd FN3 73aa 0.006 in ref transcript
  • smart FN2 49aa 9e-20 in ref transcript
    • Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
  • smart FN2 49aa 4e-19 in ref transcript
  • pfam fn3 82aa 8e-17 in ref transcript
    • Fibronectin type III domain.
  • smart FN1 45aa 6e-13 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • Changed! pfam fn1 39aa 1e-12 in ref transcript
    • Fibronectin type I domain.
  • pfam fn3 81aa 3e-12 in ref transcript
  • pfam fn3 81aa 3e-12 in ref transcript
  • smart FN1 45aa 3e-12 in ref transcript
  • smart FN1 42aa 3e-12 in ref transcript
  • smart FN1 45aa 5e-12 in ref transcript
  • smart FN1 39aa 2e-11 in ref transcript
  • pfam fn3 81aa 4e-11 in ref transcript
  • pfam fn3 83aa 5e-11 in ref transcript
  • pfam fn3 67aa 8e-11 in ref transcript
  • pfam fn1 39aa 8e-11 in ref transcript
  • pfam fn3 81aa 1e-10 in ref transcript
  • pfam fn3 80aa 1e-10 in ref transcript
  • smart FN1 43aa 2e-10 in ref transcript
  • pfam fn3 80aa 5e-10 in ref transcript
  • smart FN1 44aa 7e-10 in ref transcript
  • pfam fn3 81aa 8e-09 in ref transcript
  • pfam fn3 85aa 9e-09 in ref transcript
  • pfam fn3 70aa 1e-08 in ref transcript
  • pfam fn3 65aa 2e-08 in ref transcript
  • smart FN1 41aa 3e-08 in ref transcript
  • Changed! pfam fn3 81aa 3e-07 in ref transcript
  • smart FN1 41aa 5e-07 in ref transcript
  • pfam fn3 54aa 5e-05 in ref transcript
  • pfam fn1 31aa 9e-05 in ref transcript
  • Changed! cd FN3 89aa 3e-07 in modified transcript
  • Changed! smart FN1 44aa 2e-12 in modified transcript

FN1

  • FN1.u.f.55 FN1.u.r.62 129 204
  • NCBIGene 36.3 2335
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212482

  • cd FN2 48aa 2e-16 in ref transcript
    • Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
  • cd FN2 48aa 3e-16 in ref transcript
  • cd FN3 83aa 4e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN1 44aa 4e-09 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • cd FN3 81aa 8e-09 in ref transcript
  • cd FN3 80aa 1e-08 in ref transcript
  • cd FN1 40aa 4e-08 in ref transcript
  • cd FN1 42aa 5e-08 in ref transcript
  • cd FN1 45aa 8e-08 in ref transcript
  • cd FN3 88aa 1e-07 in ref transcript
  • cd FN3 88aa 2e-07 in ref transcript
  • cd FN1 44aa 2e-07 in ref transcript
  • cd FN3 87aa 3e-07 in ref transcript
  • cd FN1 38aa 4e-07 in ref transcript
  • cd FN3 81aa 9e-07 in ref transcript
  • cd FN3 82aa 1e-06 in ref transcript
  • cd FN3 88aa 1e-06 in ref transcript
  • cd FN1 41aa 1e-06 in ref transcript
  • cd FN3 93aa 2e-06 in ref transcript
  • cd FN1 45aa 2e-06 in ref transcript
  • cd FN1 42aa 2e-06 in ref transcript
  • cd FN3 81aa 6e-06 in ref transcript
  • cd FN3 90aa 8e-06 in ref transcript
  • cd FN3 73aa 9e-06 in ref transcript
  • cd FN1 44aa 1e-05 in ref transcript
  • cd FN1 43aa 3e-05 in ref transcript
  • cd FN1 39aa 0.003 in ref transcript
  • cd FN3 73aa 0.006 in ref transcript
  • smart FN2 49aa 9e-20 in ref transcript
    • Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
  • smart FN2 49aa 4e-19 in ref transcript
  • pfam fn3 82aa 8e-17 in ref transcript
    • Fibronectin type III domain.
  • smart FN1 45aa 6e-13 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • pfam fn1 39aa 1e-12 in ref transcript
    • Fibronectin type I domain.
  • pfam fn3 81aa 3e-12 in ref transcript
  • pfam fn3 81aa 3e-12 in ref transcript
  • smart FN1 45aa 3e-12 in ref transcript
  • smart FN1 42aa 3e-12 in ref transcript
  • smart FN1 45aa 5e-12 in ref transcript
  • smart FN1 39aa 2e-11 in ref transcript
  • pfam fn3 81aa 4e-11 in ref transcript
  • pfam fn3 83aa 5e-11 in ref transcript
  • pfam fn3 67aa 8e-11 in ref transcript
  • pfam fn1 39aa 8e-11 in ref transcript
  • pfam fn3 81aa 1e-10 in ref transcript
  • pfam fn3 80aa 1e-10 in ref transcript
  • smart FN1 43aa 2e-10 in ref transcript
  • pfam fn3 80aa 5e-10 in ref transcript
  • smart FN1 44aa 7e-10 in ref transcript
  • pfam fn3 81aa 8e-09 in ref transcript
  • pfam fn3 85aa 9e-09 in ref transcript
  • pfam fn3 70aa 1e-08 in ref transcript
  • pfam fn3 65aa 2e-08 in ref transcript
  • smart FN1 41aa 3e-08 in ref transcript
  • pfam fn3 81aa 3e-07 in ref transcript
  • smart FN1 41aa 5e-07 in ref transcript
  • pfam fn3 54aa 5e-05 in ref transcript
  • pfam fn1 31aa 9e-05 in ref transcript

FN1

  • FN1.u.f.56 FN1.u.r.67 231 319
  • NCBIGene 36.3 2335
  • Intron retention, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212482

  • cd FN2 48aa 2e-16 in ref transcript
    • Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
  • cd FN2 48aa 3e-16 in ref transcript
  • cd FN3 83aa 4e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN1 44aa 4e-09 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • cd FN3 81aa 8e-09 in ref transcript
  • cd FN3 80aa 1e-08 in ref transcript
  • cd FN1 40aa 4e-08 in ref transcript
  • cd FN1 42aa 5e-08 in ref transcript
  • cd FN1 45aa 8e-08 in ref transcript
  • cd FN3 88aa 1e-07 in ref transcript
  • cd FN3 88aa 2e-07 in ref transcript
  • cd FN1 44aa 2e-07 in ref transcript
  • cd FN3 87aa 3e-07 in ref transcript
  • cd FN1 38aa 4e-07 in ref transcript
  • cd FN3 81aa 9e-07 in ref transcript
  • cd FN3 82aa 1e-06 in ref transcript
  • cd FN3 88aa 1e-06 in ref transcript
  • cd FN1 41aa 1e-06 in ref transcript
  • cd FN3 93aa 2e-06 in ref transcript
  • cd FN1 45aa 2e-06 in ref transcript
  • cd FN1 42aa 2e-06 in ref transcript
  • cd FN3 81aa 6e-06 in ref transcript
  • cd FN3 90aa 8e-06 in ref transcript
  • cd FN3 73aa 9e-06 in ref transcript
  • cd FN1 44aa 1e-05 in ref transcript
  • cd FN1 43aa 3e-05 in ref transcript
  • cd FN1 39aa 0.003 in ref transcript
  • cd FN3 73aa 0.006 in ref transcript
  • smart FN2 49aa 9e-20 in ref transcript
    • Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
  • smart FN2 49aa 4e-19 in ref transcript
  • pfam fn3 82aa 8e-17 in ref transcript
    • Fibronectin type III domain.
  • smart FN1 45aa 6e-13 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • pfam fn1 39aa 1e-12 in ref transcript
    • Fibronectin type I domain.
  • pfam fn3 81aa 3e-12 in ref transcript
  • pfam fn3 81aa 3e-12 in ref transcript
  • smart FN1 45aa 3e-12 in ref transcript
  • smart FN1 42aa 3e-12 in ref transcript
  • smart FN1 45aa 5e-12 in ref transcript
  • smart FN1 39aa 2e-11 in ref transcript
  • pfam fn3 81aa 4e-11 in ref transcript
  • pfam fn3 83aa 5e-11 in ref transcript
  • pfam fn3 67aa 8e-11 in ref transcript
  • pfam fn1 39aa 8e-11 in ref transcript
  • pfam fn3 81aa 1e-10 in ref transcript
  • pfam fn3 80aa 1e-10 in ref transcript
  • smart FN1 43aa 2e-10 in ref transcript
  • pfam fn3 80aa 5e-10 in ref transcript
  • smart FN1 44aa 7e-10 in ref transcript
  • pfam fn3 81aa 8e-09 in ref transcript
  • pfam fn3 85aa 9e-09 in ref transcript
  • pfam fn3 70aa 1e-08 in ref transcript
  • pfam fn3 65aa 2e-08 in ref transcript
  • smart FN1 41aa 3e-08 in ref transcript
  • pfam fn3 81aa 3e-07 in ref transcript
  • smart FN1 41aa 5e-07 in ref transcript
  • pfam fn3 54aa 5e-05 in ref transcript
  • pfam fn1 31aa 9e-05 in ref transcript

ARID4B

  • FOX.ARID4BandRBM34.F1 FOX.ARID4BandRBM34.R1 137 395
  • NCBIGene 36.3 51742
  • Single exon skipping, size difference: 258
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016374

  • pfam ARID 94aa 6e-27 in ref transcript
    • ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
  • pfam RBB1NT 93aa 3e-15 in ref transcript
    • RBB1NT (NUC162) domain. This domain is found N terminal to the ARID/BRIGHT domain in DNA-binding proteins of the Retinoblastoma-binding protein 1 family.
  • smart TUDOR 56aa 1e-05 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).

FN1

  • FOX.FN1.F1 FOX.FN1.R1 124 397
  • NCBIGene 36.3 2335
  • Single exon skipping, size difference: 273
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212482

  • cd FN2 48aa 2e-16 in ref transcript
    • Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
  • cd FN2 48aa 3e-16 in ref transcript
  • cd FN3 83aa 4e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN1 44aa 4e-09 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • cd FN3 81aa 8e-09 in ref transcript
  • cd FN3 80aa 1e-08 in ref transcript
  • cd FN1 40aa 4e-08 in ref transcript
  • cd FN1 42aa 5e-08 in ref transcript
  • cd FN1 45aa 8e-08 in ref transcript
  • cd FN3 88aa 1e-07 in ref transcript
  • cd FN3 88aa 2e-07 in ref transcript
  • cd FN1 44aa 2e-07 in ref transcript
  • cd FN3 87aa 3e-07 in ref transcript
  • cd FN1 38aa 4e-07 in ref transcript
  • cd FN3 81aa 9e-07 in ref transcript
  • cd FN3 82aa 1e-06 in ref transcript
  • cd FN3 88aa 1e-06 in ref transcript
  • cd FN1 41aa 1e-06 in ref transcript
  • Changed! cd FN3 93aa 2e-06 in ref transcript
  • cd FN1 45aa 2e-06 in ref transcript
  • cd FN1 42aa 2e-06 in ref transcript
  • cd FN3 81aa 6e-06 in ref transcript
  • cd FN3 90aa 8e-06 in ref transcript
  • cd FN3 73aa 9e-06 in ref transcript
  • cd FN1 44aa 1e-05 in ref transcript
  • cd FN1 43aa 3e-05 in ref transcript
  • cd FN1 39aa 0.003 in ref transcript
  • cd FN3 73aa 0.006 in ref transcript
  • smart FN2 49aa 9e-20 in ref transcript
    • Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
  • smart FN2 49aa 4e-19 in ref transcript
  • pfam fn3 82aa 8e-17 in ref transcript
    • Fibronectin type III domain.
  • smart FN1 45aa 6e-13 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • pfam fn1 39aa 1e-12 in ref transcript
    • Fibronectin type I domain.
  • pfam fn3 81aa 3e-12 in ref transcript
  • pfam fn3 81aa 3e-12 in ref transcript
  • smart FN1 45aa 3e-12 in ref transcript
  • smart FN1 42aa 3e-12 in ref transcript
  • smart FN1 45aa 5e-12 in ref transcript
  • smart FN1 39aa 2e-11 in ref transcript
  • pfam fn3 81aa 4e-11 in ref transcript
  • Changed! pfam fn3 83aa 5e-11 in ref transcript
  • pfam fn3 67aa 8e-11 in ref transcript
  • pfam fn1 39aa 8e-11 in ref transcript
  • pfam fn3 81aa 1e-10 in ref transcript
  • pfam fn3 80aa 1e-10 in ref transcript
  • smart FN1 43aa 2e-10 in ref transcript
  • pfam fn3 80aa 5e-10 in ref transcript
  • smart FN1 44aa 7e-10 in ref transcript
  • pfam fn3 81aa 8e-09 in ref transcript
  • pfam fn3 85aa 9e-09 in ref transcript
  • pfam fn3 70aa 1e-08 in ref transcript
  • pfam fn3 65aa 2e-08 in ref transcript
  • smart FN1 41aa 3e-08 in ref transcript
  • pfam fn3 81aa 3e-07 in ref transcript
  • smart FN1 41aa 5e-07 in ref transcript
  • pfam fn3 54aa 5e-05 in ref transcript
  • pfam fn1 31aa 9e-05 in ref transcript

INSR

  • FOX.INSR.F1 FOX.INSR.R1 194 230
  • NCBIGene 36.3 3643
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000208

  • cd PTKc_InsR 288aa 1e-179 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Insulin Receptor. Protein Tyrosine Kinase (PTK) family; Insulin Receptor (InsR); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. InsR is a receptor tyr kinase (RTK) that is composed of two alphabeta heterodimers. Binding of the insulin ligand to the extracellular alpha subunit activates the intracellular tyr kinase domain of the transmembrane beta subunit. Receptor activation leads to autophosphorylation, stimulating downstream kinase activities, which initiate signaling cascades and biological function. InsR signaling plays an important role in many cellular processes including glucose homeostasis, glycogen synthesis, lipid and protein metabolism, ion and amino acid transport, cell cycle and proliferation, cell differentiation, gene transcription, and nitric oxide synthesis. Insulin resistance, caused by abnormalities in InsR signaling, has been described in diabetes, hypertension, cardiovascular disease, metabolic syndrome, heart failure, and female infertility.
  • cd FN3 78aa 3e-07 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FU 48aa 2e-06 in ref transcript
    • Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
  • cd FN3 45aa 7e-04 in ref transcript
  • pfam Pkinase_Tyr 268aa 1e-118 in ref transcript
    • Protein tyrosine kinase.
  • pfam Furin-like 162aa 5e-57 in ref transcript
    • Furin-like cysteine rich region.
  • pfam Recep_L_domain 114aa 1e-25 in ref transcript
    • Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain.
  • pfam Recep_L_domain 113aa 2e-25 in ref transcript
  • smart FN3 71aa 5e-05 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • COG SPS1 265aa 2e-18 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

KIF2A

  • FOX.KIF2AandIPO11.F1 rs.KIF2A.R1 157 271
  • NCBIGene 36.3 3796
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098511

  • Changed! cd KISc_KIF2_like 330aa 1e-142 in ref transcript
    • Kinesin motor domain, KIF2-like group. KIF2 is a protein expressed in neurons, which has been associated with axonal transport and neuron development; alternative splice forms have been implicated in lysosomal translocation. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In this subgroup the motor domain is found in the middle (M-type) of the protein chain. M-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second (KIF2 may be slower). To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
  • Changed! smart KISc 337aa 1e-109 in ref transcript
    • Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
  • Changed! COG KIP1 327aa 7e-51 in ref transcript
    • Kinesin-like protein [Cytoskeleton].
  • Changed! cd KISc_KIF2_like 330aa 1e-142 in modified transcript
  • Changed! smart KISc 336aa 1e-109 in modified transcript
  • Changed! COG KIP1 344aa 5e-51 in modified transcript

SYNE2

  • FOX.SYNE2.F1 FOX.SYNE2.R1 266 335
  • NCBIGene 36.3 23224
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182914

  • cd SPEC 214aa 1e-12 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd CH 103aa 5e-12 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 215aa 1e-10 in ref transcript
  • cd CH 103aa 6e-10 in ref transcript
  • cd SPEC 214aa 4e-09 in ref transcript
  • cd SPEC 221aa 8e-08 in ref transcript
  • cd SPEC 216aa 1e-07 in ref transcript
  • cd SPEC 207aa 2e-05 in ref transcript
  • pfam KASH 60aa 7e-14 in ref transcript
    • Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
  • pfam CH 99aa 2e-13 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • smart CH 102aa 2e-13 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam SMC_N 302aa 1e-06 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • TIGR SMC_prok_B 305aa 4e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 102aa 9e-05 in ref transcript
    • Spectrin repeats.
  • smart SPEC 104aa 3e-04 in ref transcript
  • TIGR SMC_prok_A 303aa 3e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_A 375aa 4e-04 in ref transcript
  • pfam SMC_N 281aa 0.002 in ref transcript
  • COG SAC6 255aa 3e-22 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 266aa 3e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

GDNF

  • GDNF.u.f.4 refseq_GDNF.R2 190 268
  • NCBIGene 36.3 2668
  • Alternative 5-prime, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000514

  • pfam TGF_beta 94aa 8e-15 in ref transcript
    • Transforming growth factor beta like domain.

GNB3

  • GNB3.F12 GNB3.R2 177 418
  • AceView 36.Apr07 GNB3
  • Intron retention, size difference: 241
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: GNB3.aApr07

  • Changed! cd WD40 293aa 1e-65 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! smart WD40 39aa 3e-05 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • Changed! smart WD40 35aa 2e-04 in ref transcript
  • Changed! pfam WD40 39aa 3e-04 in ref transcript
    • WD domain, G-beta repeat.
  • Changed! pfam WD40 39aa 0.002 in ref transcript
  • Changed! COG COG2319 297aa 1e-33 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

HMGA1

  • HMGA1.au.0.f HMGA1.u.r.18 268 334
  • AceView 36.Apr07 HMGA1
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 71
  • Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: HMGA1.aApr07

HMMR

  • HMMR.F1 HMMR.R1 144 189
  • NCBIGene 36.3 3161
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012484

  • pfam SMC_N 239aa 3e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! TIGR SMC_prok_A 213aa 9e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • COG Smc 236aa 4e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • PRK PRK05771 196aa 0.002 in ref transcript
    • V-type ATP synthase subunit I; Validated.
  • Changed! TIGR SMC_prok_A 223aa 3e-04 in modified transcript

HNRPAB

  • HNRPAB-F1 HNRPAB-R1 269 416
  • NCBIGene 36.3 3182
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031266

  • cd RRM 70aa 6e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 48aa 4e-10 in ref transcript
  • TIGR SF-CC1 137aa 2e-18 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • pfam CBFNT 71aa 6e-06 in ref transcript
    • CBFNT (NUC161) domain. This N terminal domain is found in proteins of CARG-binding factor A-like proteins.
  • COG COG0724 142aa 6e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

CADM1

  • IGSF4.F6 IGSF4.R1 132 165
  • AceView 36.Apr07 CADM1
  • Single exon skipping, size difference: 33
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: CADM1.bApr07

  • cd IGcam 71aa 1e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IG 72aa 0.004 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • smart IG_like 80aa 3e-10 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam C2-set_2 70aa 3e-08 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
  • smart IG_like 92aa 9e-08 in ref transcript

INSR

  • INSR.u.f.15 INSR.u.r.18 130 166
  • NCBIGene 36.3 3643
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000208

  • cd PTKc_InsR 288aa 1e-179 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Insulin Receptor. Protein Tyrosine Kinase (PTK) family; Insulin Receptor (InsR); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. InsR is a receptor tyr kinase (RTK) that is composed of two alphabeta heterodimers. Binding of the insulin ligand to the extracellular alpha subunit activates the intracellular tyr kinase domain of the transmembrane beta subunit. Receptor activation leads to autophosphorylation, stimulating downstream kinase activities, which initiate signaling cascades and biological function. InsR signaling plays an important role in many cellular processes including glucose homeostasis, glycogen synthesis, lipid and protein metabolism, ion and amino acid transport, cell cycle and proliferation, cell differentiation, gene transcription, and nitric oxide synthesis. Insulin resistance, caused by abnormalities in InsR signaling, has been described in diabetes, hypertension, cardiovascular disease, metabolic syndrome, heart failure, and female infertility.
  • cd FN3 78aa 3e-07 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FU 48aa 2e-06 in ref transcript
    • Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
  • cd FN3 45aa 7e-04 in ref transcript
  • pfam Pkinase_Tyr 268aa 1e-118 in ref transcript
    • Protein tyrosine kinase.
  • pfam Furin-like 162aa 5e-57 in ref transcript
    • Furin-like cysteine rich region.
  • pfam Recep_L_domain 114aa 1e-25 in ref transcript
    • Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain.
  • pfam Recep_L_domain 113aa 2e-25 in ref transcript
  • smart FN3 71aa 5e-05 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • COG SPS1 265aa 2e-18 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

KITLG

  • KITLG.u.f.9 KITLG.u.r.10 215 299
  • NCBIGene 36.3 4254
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000899

  • Changed! pfam SCF 273aa 1e-134 in ref transcript
    • Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site identified.
  • Changed! pfam SCF 245aa 1e-121 in modified transcript

LGALS9

  • LGALS9.F10 LGALS9.R6 135 231
  • NCBIGene 36.3 3965
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_009587

  • cd GLECT 131aa 2e-36 in ref transcript
    • Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
  • cd GLECT 128aa 5e-33 in ref transcript
  • smart GLECT 131aa 4e-40 in ref transcript
    • Galectin. Galectin - galactose-binding lectin.
  • pfam Gal-bind_lectin 128aa 3e-35 in ref transcript
    • Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).

LIG3

  • LIG3.u.f.1 LIG3.u.r.6 148 378
  • AceView 36.Apr07 LIG3
  • Alternative 3-prime, size difference: 230
  • Exclusion of the protein initiation site
  • Reference transcript: LIG3.aApr07

  • TIGR dnl1 513aa 0.0 in ref transcript
    • All proteins in this family with known functions are ATP-dependent DNA ligases. Functions include DNA repair, DNA replication, and DNA recombination (or any process requiring ligation of two single-stranded DNA sections). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • pfam zf-PARP 57aa 4e-16 in ref transcript
    • Poly(ADP-ribose) polymerase and DNA-Ligase Zn-finger region. Poly(ADP-ribose) polymerase is an important regulatory component of the cellular response to DNA damage. The amino-terminal region of Poly(ADP-ribose) polymerase consists of two PARP-type zinc fingers. This region acts as a DNA nick sensor.
  • smart BRCT 68aa 5e-05 in ref transcript
    • breast cancer carboxy-terminal domain.
  • PRK PRK01109 556aa 6e-74 in ref transcript
    • ATP-dependent DNA ligase; Provisional.

LIG4

  • LIG4.F1 LIG4.R1 267 340
  • AceView 36.Apr07 LIG4
  • Single exon skipping, size difference: 73
  • Exclusion in 5'UTR
  • Reference transcript: LIG4.bApr07

  • cd BRCT 71aa 7e-06 in ref transcript
    • Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
  • TIGR dnl1 522aa 1e-171 in ref transcript
    • All proteins in this family with known functions are ATP-dependent DNA ligases. Functions include DNA repair, DNA replication, and DNA recombination (or any process requiring ligation of two single-stranded DNA sections). This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • smart BRCT 77aa 2e-07 in ref transcript
    • breast cancer carboxy-terminal domain.
  • pfam BRCT 70aa 0.003 in ref transcript
    • BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
  • PRK PRK01109 574aa 6e-59 in ref transcript
    • ATP-dependent DNA ligase; Provisional.

LRDD

  • LRDD.u.f.7 LRDD.u.r.3 121 181
  • NCBIGene 36.3 55367
  • Alternative 3-prime, size difference: 60
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_145886

  • Changed! pfam Death 79aa 3e-09 in ref transcript
    • Death domain.
  • Changed! pfam Peptidase_S68 34aa 2e-08 in ref transcript
    • Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
  • Changed! smart ZU5 63aa 9e-04 in ref transcript
    • Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
  • Changed! COG COG4886 96aa 3e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

MAPT

  • MAPT_x56_x127_f MAPT_x56_x127_r 162 360
  • NCBIGene 36.3 4137
  • Single exon skipping, size difference: 198
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016835

  • pfam Tubulin-binding 31aa 8e-10 in ref transcript
    • Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
  • pfam Tubulin-binding 31aa 2e-08 in ref transcript
  • pfam Tubulin-binding 32aa 2e-06 in ref transcript
  • pfam Tubulin-binding 32aa 3e-06 in ref transcript
  • pfam Hid1 123aa 3e-04 in ref transcript
    • High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.

MCL1

  • MCL1.F1 MCL1.R1 134 382
  • NCBIGene 36.3 4170
  • Single exon skipping, size difference: 248
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_021960

  • Changed! cd Bcl-2_like 144aa 7e-37 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! pfam Bcl-2 100aa 5e-32 in ref transcript
    • Apoptosis regulator proteins, Bcl-2 family.
  • Changed! cd Bcl-2_like 56aa 2e-04 in modified transcript

NOTCH3

  • NOTCH3.F2 NOTCH3.R2 286 442
  • AceView 36.Apr07 NOTCH3
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NOTCH3.aApr07

  • cd ANK 127aa 4e-24 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 148aa 8e-07 in ref transcript
  • cd EGF_CA 38aa 1e-06 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • Changed! cd EGF_CA 38aa 8e-05 in ref transcript
  • cd EGF_CA 37aa 3e-04 in ref transcript
  • cd EGF_CA 39aa 4e-04 in ref transcript
  • Changed! cd EGF_CA 36aa 6e-04 in ref transcript
  • cd EGF_CA 33aa 0.001 in ref transcript
  • cd EGF_CA 37aa 0.002 in ref transcript
  • cd EGF_CA 38aa 0.002 in ref transcript
  • cd EGF_CA 37aa 0.003 in ref transcript
  • cd EGF_CA 32aa 0.003 in ref transcript
  • cd EGF_CA 37aa 0.005 in ref transcript
  • cd EGF_CA 36aa 0.008 in ref transcript
  • pfam NODP 55aa 4e-14 in ref transcript
    • NOTCH protein. NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. The role of the NOD and NODP domains remains to be elucidated.
  • pfam NOD 55aa 2e-11 in ref transcript
    • NOTCH protein. NOTCH signalling plays a fundamental role during a great number of developmental processes in multicellular animals. NOD and NODP represent a region present in many NOTCH proteins and NOTCH homologs in multiple species such as NOTCH2 and NOTCH3, LIN12, SC1 and TAN1. Role of NOD domain remains to be elucidated.
  • smart EGF_CA 38aa 2e-07 in ref transcript
    • Calcium-binding EGF-like domain.
  • pfam Notch 33aa 5e-06 in ref transcript
    • LNR domain. The LNR (Lin-12/Notch repeat) domain is found in three tandem copies in Notch related proteins. The structure of the domain has been determined by NMR and was shown to contain three disulphide bonds and coordinate a calcium ion. Three repeats are also found in the PAPP-A peptidase.
  • smart NL 38aa 5e-06 in ref transcript
    • Domain found in Notch and Lin-12. The Notch protein is essential for the proper differentiation of the Drosophila ectoderm. This protein contains 3 NL domains.
  • pfam Notch 29aa 2e-05 in ref transcript
  • Changed! smart EGF_CA 38aa 8e-05 in ref transcript
  • smart EGF_CA 39aa 8e-05 in ref transcript
  • smart EGF_CA 37aa 1e-04 in ref transcript
  • TIGR trp 139aa 1e-04 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! smart EGF_CA 36aa 3e-04 in ref transcript
  • smart EGF_CA 37aa 3e-04 in ref transcript
  • smart EGF_CA 37aa 0.001 in ref transcript
  • smart EGF_CA 37aa 0.001 in ref transcript
  • smart EGF_CA 33aa 0.001 in ref transcript
  • smart EGF_CA 38aa 0.002 in ref transcript
  • smart EGF_CA 36aa 0.003 in ref transcript
  • smart EGF_CA 32aa 0.004 in ref transcript
  • COG Arp 154aa 5e-17 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

NRG1

  • NRG1.u.f.21 NRG1.u.r.25 372 396
  • NCBIGene 36.3 3084
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013956

  • cd IGcam 89aa 2e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam Neuregulin 396aa 1e-109 in ref transcript
    • Neuregulin family.
  • smart IG_like 87aa 1e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam EGF 32aa 0.006 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.

NUP98

  • NUP98.F1 NUP98.R1 260 482
  • NCBIGene 36.3 4928
  • Single exon skipping, size difference: 222
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016320

  • pfam Nucleoporin2 156aa 2e-63 in ref transcript
    • Nucleoporin autopeptidase.

OPA1

  • OPA1.u.f.9 OPA1.u.r.8 143 254
  • NCBIGene 36.3 4976
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130837

  • pfam Dynamin_N 179aa 4e-38 in ref transcript
    • Dynamin family.

PAXIP1

  • PAXIP1.F1 PAXIP1.R1 114 185
  • AceView 36.Apr07 PAXIP1
  • Alternative 5-prime, size difference: 71
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: PAXIP1.aApr07

  • cd BRCT 68aa 6e-07 in ref transcript
    • Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
  • cd BRCT 68aa 7e-07 in ref transcript
  • cd BRCT 70aa 8e-07 in ref transcript
  • cd BRCT 75aa 4e-06 in ref transcript
  • pfam BRCT 73aa 4e-11 in ref transcript
    • BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
  • pfam BRCT 72aa 6e-10 in ref transcript
  • pfam BRCT 66aa 8e-07 in ref transcript
  • smart BRCT 81aa 3e-06 in ref transcript
    • breast cancer carboxy-terminal domain.

PCSK6

  • PCSK6.F2 PCSK6.R2 116 260
  • AceView 36.Apr07 PCSK6
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: PCSK6.aApr07

  • cd FU 51aa 6e-07 in ref transcript
    • Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
  • cd FU 43aa 2e-06 in ref transcript
  • cd FU 47aa 3e-06 in ref transcript
  • cd FU 53aa 1e-05 in ref transcript
  • Changed! pfam Peptidase_S8 301aa 2e-94 in ref transcript
    • Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold containing a 7-stranded parallel beta sheet, order 2314567.
  • pfam P_proprotein 91aa 4e-34 in ref transcript
    • Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic domain.
  • smart FU 48aa 2e-08 in ref transcript
    • Furin-like repeats.
  • smart FU 44aa 2e-08 in ref transcript
  • smart FU 44aa 1e-06 in ref transcript
  • pfam Furin-like 112aa 4e-06 in ref transcript
    • Furin-like cysteine rich region.
  • pfam VSP 115aa 5e-04 in ref transcript
    • Giardia variant-specific surface protein.
  • Changed! COG AprE 324aa 1e-20 in ref transcript
    • Subtilisin-like serine proteases [Posttranslational modification, protein turnover, chaperones].
  • COG COG4935 91aa 1e-15 in ref transcript
    • Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones].
  • COG NapH 83aa 0.009 in ref transcript
    • Polyferredoxin [Energy production and conversion].
  • Changed! pfam Peptidase_S8 261aa 2e-76 in modified transcript
  • Changed! COG AprE 281aa 2e-13 in modified transcript

PLD1

  • PLD1.F1 PLD1.R1 124 238
  • AceView 36.Apr07 PLD1
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: PLD1.aApr07

  • cd PLDc 142aa 1e-21 in ref transcript
    • Phospholipase D. Active site motifs; The PLD superfamily includes enzymes involved in signal transduction, lipid biosynthesis, endonucleases and open reading frames in pathogenic viruses and bacteria. PLD hydrolyzes the terminal phosphodiester bond of phospholipids to phosphatidic acid and a hydrophilic constituent. Phosphatidic acid is a compound that is heavily involved in signal transduction. The common features of the family members are that they can bind to a phosphodiester moiety, and that most of these enzymes are active as bi-lobed monomers or dimers.
  • cd PLDc 221aa 1e-15 in ref transcript
  • cd PH_PLD 62aa 8e-12 in ref transcript
    • Phospholipase D (PLD) pleckstrin homology (PH) domain. PLD hydrolyzes phosphatidylcholine to phosphatidic acid (PtdOH), which can bind target proteins. PLD contains a PH domain, a PX domain and four conserved PLD signature domains. The PLD PH domain is specific for bisphosphorylated inositides. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam PLDc 28aa 0.001 in ref transcript
    • Phospholipase D Active site motif. Phosphatidylcholine-hydrolysing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, and/or asparagine residues which may contribute to the active site. aspartic acid. An Escherichia coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not been achieved.
  • smart PLDc 27aa 0.002 in ref transcript
    • Phospholipase D. Active site motifs. Phosphatidylcholine-hydrolyzing phospholipase D (PLD) isoforms are activated by ADP-ribosylation factors (ARFs). PLD produces phosphatidic acid from phosphatidylcholine, which may be essential for the formation of certain types of transport vesicles or may be constitutive vesicular transport to signal transduction pathways. PC-hydrolysing PLD is a homologue of cardiolipin synthase, phosphatidylserine synthase, bacterial PLDs, and viral proteins. Each of these appears to possess a domain duplication which is apparent by the presence of two motifs containing well-conserved histidine, lysine, aspartic acid, and/or asparagine residues which may contribute to the active site. An E. coli endonuclease (nuc) and similar proteins appear to be PLD homologues but possess only one of these motifs. The profile contained here represents only the putative active site regions, since an accurate multiple alignment of the repeat units has not been achieved.
  • COG Cls 261aa 9e-15 in ref transcript
    • Phosphatidylserine/phosphatidylglycerophosphate/cardioli pin synthases and related enzymes [Lipid metabolism].
  • COG Cls 171aa 0.001 in ref transcript

POLB

  • POLB.F7 POLB.R13 213 271
  • AceView 36.Apr07 POLB
  • Single exon skipping, size difference: 58
  • Exclusion in the protein causing a frameshift
  • Reference transcript: POLB.bApr07

  • Changed! cd POLXc 332aa 1e-104 in ref transcript
    • DNA polymerase X family; includes vertebrate DNA polymerase beta and terminal deoxynucleotidyltransferase. An N-terminal 8kD domain and a 31kD C-terminal polymerase domain are connected with a protease-sensitive hinge. The activity of the N-terminal domain seems to be variable, in DNA polymerase beta it has metal dependent nuclease activity and metal independent lyase activity.
  • Changed! smart POLXc 325aa 1e-103 in ref transcript
    • DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases.
  • Changed! COG POL4 299aa 5e-28 in ref transcript
    • DNA polymerase IV (family X) [DNA replication, recombination, and repair].

POLI

  • POLI.F8 POLI.R14 197 303
  • AceView 36.Apr07 POLI
  • Intron retention, size difference: 106
  • Inclusion in 5'UTR
  • Reference transcript: POLI.dApr07

  • cd Pol_iota 33aa 2e-07 in ref transcript
    • Pol iota is member of the DNA polymerase Y-family, and has also been called Rad30 homolog B. Unlike classic DNA polymerases,Y-family polymerases are induced by DNA damage. They can transverse normal replication-blocking DNA lesions. Unlike Pol eta, Pol iota is unable to replicate through a cis-syn T-T dimer. In human Pol iota, the base-pairing mode in the active site at the replicative end mat bee Hoogsteen instead of Watson-Click. Human Pol iota can incorporate the correct nucleotide opposite a purine much more efficiently than opposite a pyrimidine. Pol iota prefers to insert Guanosine instead of Adenosine opposite Thymidine.
  • TIGR dmsA_ynfE 133aa 0.002 in ref transcript
    • Members of this family include known and probable dimethyl sulfoxide reductase (DMSO reductase) A chains. In E. coli, dmsA encodes the canonical anaerobic DMSO reductase A chain. The paralog ynfE, as part of ynfFGH expressed from a multicopy plasmid, could complement a dmsABC deletion, suggesting a similar function and some overlap in specificity, although YnfE could not substitute for DmsA in a mixed complex.

POLM

  • POLM.u.f.1 POLM.u.r.6 316 586
  • AceView 36.Apr07 POLM
  • Multiple exon skipping, size difference: 270
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: POLM.aApr07

  • cd POLXc 118aa 1e-17 in ref transcript
    • DNA polymerase X family; includes vertebrate DNA polymerase beta and terminal deoxynucleotidyltransferase. An N-terminal 8kD domain and a 31kD C-terminal polymerase domain are connected with a protease-sensitive hinge. The activity of the N-terminal domain seems to be variable, in DNA polymerase beta it has metal dependent nuclease activity and metal independent lyase activity.
  • Changed! cd POLXc 77aa 3e-10 in ref transcript
  • smart POLXc 105aa 2e-14 in ref transcript
    • DNA polymerase X family. includes vertebrate polymerase beta and terminal deoxynucleotidyltransferases.
  • Changed! smart POLXc 49aa 3e-07 in ref transcript
  • COG POL4 48aa 1e-05 in ref transcript
    • DNA polymerase IV (family X) [DNA replication, recombination, and repair].

PPP3CB

  • PPP3CB.u.f.21 PPP3CB.u.r.22 117 147
  • AceView 36.Apr07 PPP3CB
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: PPP3CB.aApr07

  • cd PP2Ac 278aa 1e-100 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
  • smart PP2Ac 272aa 1e-112 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
  • PTZ PTZ00239 278aa 6e-60 in ref transcript
    • serine/threonine protein phosphatase 2A; Provisional.

PTK2B

  • PTK2B_x66_x147_f PTK2B_x66_x147_r 170 296
  • NCBIGene 36.3 2185
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_173174

  • cd PTKc_FAK 270aa 1e-146 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Focal Adhesion Kinase. Protein Tyrosine Kinase (PTK) family; Focal Adhesion kinase (FAK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FAK is a cytoplasmic (or nonreceptor) tyr kinase that contains an autophosphorylation site and a FERM domain at the N-terminus, a central tyr kinase domain, proline-rich regions, and a C-terminal FAT (focal adhesion targeting) domain. FAK activity is dependent on integrin-mediated cell adhesion, which facilitates N-terminal autophosphorylation. Full activation is achieved by the phosphorylation of its two adjacent A-loop tyrosines. FAK is important in mediating signaling initiated at sites of cell adhesions and at growth factor receptors. Through diverse molecular interactions, FAK functions as a biosensor or integrator to control cell motility. It is a key regulator of cell survival, proliferation, migration and invasion, and thus plays an important role in the development and progression of cancer. Src binds to autophosphorylated FAK forming the FAK-Src dual kinase complex, which is activated in a wide variety of tumor cells and generates signals promoting growth and metastasis. FAK is being developed as a target for cancer therapy.
  • pfam Pkinase_Tyr 233aa 5e-98 in ref transcript
    • Protein tyrosine kinase.
  • pfam Focal_AT 139aa 6e-59 in ref transcript
    • Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion targeting' (FAT) region is necessary and sufficient for localising FAK to focal adhesions. The crystal structure of FAT shows it forms a four-helix bundle that resembles those found in two other proteins involved in cell adhesion, alpha-catenin and vinculin. The binding of FAT to the focal adhesion protein, paxillin, requires the integrity of the helical bundle, whereas binding to another focal adhesion protein, talin, does not.
  • smart B41 227aa 2e-31 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • COG SPS1 229aa 2e-16 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PTPN13

  • PTPN13.F7 PTPN13.R7 133 190
  • NCBIGene 36.3 5783
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080685

  • cd PTPc 228aa 3e-79 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PDZ_signaling 85aa 7e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 80aa 1e-15 in ref transcript
  • cd PDZ_signaling 90aa 2e-13 in ref transcript
  • cd FERM_C 94aa 2e-13 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • cd PDZ_signaling 79aa 2e-08 in ref transcript
  • cd PDZ_signaling 88aa 6e-08 in ref transcript
  • smart PTPc 253aa 5e-91 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart KIND 188aa 2e-42 in ref transcript
    • kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
  • smart B41 210aa 4e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam FERM_C 88aa 2e-17 in ref transcript
    • FERM C-terminal PH-like domain.
  • smart PDZ 90aa 1e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 92aa 2e-15 in ref transcript
  • smart PDZ 83aa 1e-13 in ref transcript
  • smart PDZ 92aa 2e-11 in ref transcript
  • smart PDZ 84aa 9e-08 in ref transcript
  • COG PTP2 267aa 1e-40 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG Prc 80aa 9e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 54aa 3e-04 in ref transcript
  • COG Prc 76aa 3e-04 in ref transcript
  • COG Prc 68aa 0.006 in ref transcript

PTPRB

  • PTPRB.F6 PTPRB.R6 131 395
  • AceView 36.Apr07 PTPRB
  • Single exon skipping, size difference: 264
  • Exclusion in the protein (no frameshift)
  • Reference transcript: PTPRB.aApr07

  • cd PTPc 232aa 6e-89 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd FN3 83aa 2e-05 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 89aa 7e-05 in ref transcript
  • cd FN3 87aa 2e-04 in ref transcript
  • cd FN3 76aa 3e-04 in ref transcript
  • cd RICIN 73aa 0.003 in ref transcript
    • Ricin-type beta-trefoil; Carbohydrate-binding domain formed from presumed gene triplication. The domain is found in a variety of molecules serving diverse functions such as enzymatic activity, inhibitory toxicity and signal transduction. Highly specific ligand binding occurs on exposed surfaces of the compact domain sturcture.
  • smart PTPc 259aa 1e-100 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • pfam fn3 84aa 6e-11 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 78aa 5e-09 in ref transcript
  • pfam fn3 81aa 1e-08 in ref transcript
  • pfam fn3 78aa 3e-08 in ref transcript
  • pfam fn3 75aa 3e-08 in ref transcript
  • pfam fn3 76aa 3e-07 in ref transcript
  • pfam fn3 76aa 9e-07 in ref transcript
  • pfam fn3 75aa 1e-06 in ref transcript
  • pfam fn3 78aa 4e-06 in ref transcript
  • pfam fn3 76aa 1e-05 in ref transcript
  • pfam fn3 76aa 2e-05 in ref transcript
  • pfam fn3 82aa 3e-05 in ref transcript
  • Changed! pfam fn3 63aa 4e-05 in ref transcript
  • pfam fn3 68aa 0.003 in ref transcript
  • pfam fn3 65aa 0.004 in ref transcript
  • pfam Ricin_B_lectin 77aa 0.006 in ref transcript
    • Ricin-type beta-trefoil lectin domain.
  • COG PTP2 266aa 5e-48 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

RAC1

  • RAC1.F18 rs.RAC1.R1 328 386
  • NCBIGene 36.3 5879
  • Single exon skipping, size difference: 58
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_018890

  • Changed! cd Rac1_like 192aa 1e-107 in ref transcript
    • Rac1-like subfamily. The Rac1-like subfamily consists of Rac1, Rac2, and Rac3 proteins, plus the splice variant Rac1b that contains a 19-residue insertion near switch II relative to Rac1. While Rac1 is ubiquitously expressed, Rac2 and Rac3 are largely restricted to hematopoietic and neural tissues respectively. Rac1 stimulates the formation of actin lamellipodia and membrane ruffles. It also plays a role in cell-matrix adhesion and cell anoikis. In intestinal epithelial cells, Rac1 is an important regulator of migration and mediates apoptosis. Rac1 is also essential for RhoA-regulated actin stress fiber and focal adhesion complex formation. In leukocytes, Rac1 and Rac2 have distinct roles in regulating cell morphology, migration, and invasion, but are not essential for macrophage migration or chemotaxis. Rac3 has biochemical properties that are closely related to Rac1, such as effector interaction, nucleotide binding, and hydrolysis; Rac2 has a slower nucleotide association and is more efficiently activated by the RacGEF Tiam1. Both Rac1 and Rac3 have been implicated in the regulation of cell migration and invasion in human metastatic breast cancer. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • Changed! smart RHO 189aa 1e-98 in ref transcript
    • Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
  • Changed! COG COG1100 193aa 1e-27 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

CLIP1

  • RSN.F10 RSN.R9 155 500
  • AceView 36.Apr07 CLIP1
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 345
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: CLIP1.aApr07

  • pfam CAP_GLY 66aa 6e-27 in ref transcript
    • CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
  • pfam CAP_GLY 66aa 6e-26 in ref transcript
  • Changed! TIGR SMC_prok_A 315aa 1e-13 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_B 189aa 2e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! pfam SMC_N 386aa 5e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG NIP100 61aa 2e-11 in ref transcript
    • Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
  • Changed! COG Smc 306aa 4e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG NIP100 62aa 2e-10 in ref transcript
  • Changed! PRK PRK03918 589aa 1e-06 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! TIGR SMC_prok_B 231aa 5e-07 in modified transcript
  • Changed! TIGR SMC_prok_B 360aa 6e-06 in modified transcript
  • Changed! COG Smc 191aa 9e-04 in modified transcript
  • Changed! PRK PRK03918 512aa 0.001 in modified transcript

CLIP1

  • RSN.F35 RSN.R13 248 281
  • AceView 36.Apr07 CLIP1
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: CLIP1.aApr07

  • pfam CAP_GLY 66aa 6e-27 in ref transcript
    • CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
  • pfam CAP_GLY 66aa 6e-26 in ref transcript
  • TIGR SMC_prok_A 315aa 1e-13 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! TIGR SMC_prok_B 189aa 2e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! pfam SMC_N 386aa 5e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG NIP100 61aa 2e-11 in ref transcript
    • Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
  • COG Smc 306aa 4e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG NIP100 62aa 2e-10 in ref transcript
  • Changed! PRK PRK03918 589aa 1e-06 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! TIGR SMC_prok_B 178aa 8e-08 in modified transcript
  • Changed! TIGR SMC_prok_B 349aa 2e-07 in modified transcript
  • Changed! PRK PRK03918 578aa 5e-07 in modified transcript

RUNX2

  • RUNX2.F1 RUNX2.R1 169 235
  • NCBIGene 36.3 860
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001024630

  • pfam Runt 134aa 9e-74 in ref transcript
    • Runt domain.
  • pfam RunxI 78aa 2e-32 in ref transcript
    • Runx inhibition domain. This domain lies to the C-terminus of Runx-related transcription factors and homologous proteins (AML, CBF-alpha, PEBP2). Its function might be to interact with functional cofactors.

SDCCAG8

  • SDCCAG8-F1 SDCCAG8-R1 180 417
  • AceView 36.Apr07 SDCCAG8
  • Multiple exon skipping, size difference: 237
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: SDCCAG8.aApr07

  • Changed! TIGR SMC_prok_B 270aa 2e-06 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! TIGR SMC_prok_B 306aa 5e-06 in ref transcript
  • Changed! COG Smc 382aa 4e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 283aa 2e-10 in modified transcript
  • Changed! COG Smc 300aa 6e-07 in modified transcript

SHC1

  • SHC1.F7 SHC1.R7 111 165
  • AceView 36.Apr07 SHC1
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: SHC1.aApr07

  • Changed! cd SHC 166aa 3e-73 in ref transcript
    • SHC phosphotyrosine-binding (PTB) domain. SHC is a substrate for receptor tyrosine kinases, which can interact with phosphoproteins at NPXY motifs. SHC contains an PTB domain followed by an SH2 domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
  • cd SH2 92aa 7e-13 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • Changed! pfam PID 156aa 6e-39 in ref transcript
    • Phosphotyrosine interaction domain (PTB/PID).
  • pfam SH2 45aa 4e-14 in ref transcript
    • SH2 domain.
  • Changed! cd SHC 148aa 2e-59 in modified transcript
  • Changed! pfam PID 138aa 2e-26 in modified transcript

SHMT1

  • SHMT1.u.f.24 SHMT1.u.r.18 108 225
  • NCBIGene 36.3 6470
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004169

  • Changed! cd SHMT 427aa 0.0 in ref transcript
    • Serine-glycine hydroxymethyltransferase (SHMT). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). SHMT carries out interconversion of serine and glycine; it catalyzes the transfer of hydroxymethyl group of N5, N10-methylene tetrahydrofolate to glycine resulting in the formation of serine and tetrahydrofolate. Both eukaryotic and prokaryotic SHMT enzymes form tight obligate homodimers; the mammalian enzyme forms a homotetramer comprising four pyridoxal phosphate-bound active sites.
  • Changed! pfam SHMT 400aa 0.0 in ref transcript
    • Serine hydroxymethyltransferase.
  • Changed! PTZ PTZ00094 457aa 0.0 in ref transcript
    • serine hydroxymethyltransferase; Provisional.
  • Changed! cd SHMT 388aa 1e-167 in modified transcript
  • Changed! pfam SHMT 361aa 1e-178 in modified transcript
  • Changed! PTZ PTZ00094 418aa 0.0 in modified transcript

SLIT2

  • SLIT2.F36 SLIT2.R4 222 234
  • AceView 36.Apr07 SLIT2
  • Single exon skipping, size difference: 12
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: SLIT2.aApr07

  • cd LamG 154aa 3e-26 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd EGF_CA 37aa 1e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • cd EGF_CA 37aa 0.002 in ref transcript
  • smart LamG 137aa 5e-33 in ref transcript
    • Laminin G domain.
  • TIGR PCC 77aa 4e-10 in ref transcript
    • Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
  • TIGR PCC 156aa 1e-09 in ref transcript
  • Changed! TIGR PCC 63aa 3e-05 in ref transcript
  • smart EGF_CA 37aa 3e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • TIGR PCC 64aa 4e-04 in ref transcript
  • smart LRRNT 32aa 8e-04 in ref transcript
    • Leucine rich repeat N-terminal domain.
  • smart LRRNT 32aa 0.001 in ref transcript
  • smart LRRNT 33aa 0.003 in ref transcript
  • pfam EGF 32aa 0.003 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
  • COG COG4886 164aa 2e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • COG COG4886 156aa 6e-06 in ref transcript
  • Changed! COG COG4886 230aa 3e-04 in ref transcript
  • COG COG4886 286aa 6e-04 in ref transcript
  • Changed! TIGR PCC 79aa 2e-05 in modified transcript
  • Changed! COG COG4886 234aa 0.001 in modified transcript

SPP1

  • SPP1-3 SPP1-4 181 223
  • NCBIGene 36.3 6696
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040058

  • Changed! pfam Osteopontin 314aa 1e-96 in ref transcript
    • Osteopontin.
  • Changed! pfam Osteopontin 300aa 9e-89 in modified transcript

STIM1

  • STIM1.F1 STIM1.R1 202 295
  • AceView 36.Apr07 STIM1
  • Single exon skipping, size difference: 93
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: STIM1.aApr07

  • cd SAM 69aa 4e-04 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • TIGR SMC_prok_A 132aa 3e-06 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • smart SAM 65aa 3e-05 in ref transcript
    • Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
  • COG Smc 255aa 7e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

SYK

  • SYK.F1 SYK.R16 244 313
  • AceView 36.Apr07 SYK
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: SYK.bApr07

  • cd PTKc_Syk 257aa 1e-155 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Spleen tyrosine kinase. Protein Tyrosine Kinase (PTK) family; Spleen tyrosine kinase (Syk); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Syk, together with Zap-70, form the Syk subfamily of kinases which are cytoplasmic (or nonreceptor) tyr kinases containing two Src homology 2 (SH2) domains N-terminal to the catalytic tyr kinase domain. Syk was first cloned from the spleen, and its function in hematopoietic cells is well-established. Syk is involved in the signaling downstream of activated receptors (including B-cell and Fc receptors) that contain ITAMs (immunoreceptor tyr activation motifs), leading to processes such as cell proliferation, differentiation, survival, adhesion, migration, and phagocytosis. More recently, Syk expression has been detected in other cell types (including epithelial cells, vascular endothelial cells, neurons, hepatocytes, and melanocytes), suggesting a variety of biological functions in non-immune cells. Syk plays a critical role in maintaining vascular integrity and in wound healing during embryogenesis. It also regulates Vav3, which is important in osteoclast function including bone development. In breast epithelial cells, where Syk acts as a negative regulator for epidermal growth factor receptor (EGFR) signaling, loss of Syk expression is associated with abnormal proliferation during cancer development suggesting a potential role as a tumor suppressor. In mice, Syk has been shown to inhibit malignant transformation of mammary epithelial cells induced with murine mammary tumor virus (MMTV).
  • cd SH2 93aa 4e-18 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • cd SH2 93aa 1e-16 in ref transcript
  • pfam Pkinase_Tyr 253aa 2e-96 in ref transcript
    • Protein tyrosine kinase.
  • pfam SH2 77aa 4e-22 in ref transcript
    • SH2 domain.
  • smart SH2 85aa 2e-19 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
  • COG SPS1 251aa 4e-18 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

SYNE2

  • SYNE2.u.f.128 SYNE2.u.r.124 119 188
  • NCBIGene 36.3 23224
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182914

  • cd SPEC 214aa 1e-12 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd CH 103aa 5e-12 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 215aa 1e-10 in ref transcript
  • cd CH 103aa 6e-10 in ref transcript
  • cd SPEC 214aa 4e-09 in ref transcript
  • cd SPEC 221aa 8e-08 in ref transcript
  • cd SPEC 216aa 1e-07 in ref transcript
  • cd SPEC 207aa 2e-05 in ref transcript
  • pfam KASH 60aa 7e-14 in ref transcript
    • Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
  • pfam CH 99aa 2e-13 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • smart CH 102aa 2e-13 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam SMC_N 302aa 1e-06 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • TIGR SMC_prok_B 305aa 4e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 102aa 9e-05 in ref transcript
    • Spectrin repeats.
  • smart SPEC 104aa 3e-04 in ref transcript
  • TIGR SMC_prok_A 303aa 3e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_A 375aa 4e-04 in ref transcript
  • pfam SMC_N 281aa 0.002 in ref transcript
  • COG SAC6 255aa 3e-22 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 266aa 3e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

TGFBR2

  • TGFBR2.F3 rs.TGFBR2.R1 219 294
  • NCBIGene 36.3 7048
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001024847

  • cd S_TKc 288aa 5e-35 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam ecTbetaR2 104aa 4e-49 in ref transcript
    • Transforming growth factor beta receptor 2 ectodomain. The Transforming growth factor beta receptor 2 ectodomain is a compact fold consisting of nine beta-strands and a single helix stabilised by a network of six intra strand disulphide bonds. The folding topology includes a central five-stranded antiparallel beta-sheet, eight-residues long at its centre, covered by a second layer consisting of two segments of two-stranded antiparallel beta-sheets (beta1-beta4, beta3-beta9).
  • pfam Pkinase 291aa 8e-45 in ref transcript
    • Protein kinase domain.
  • COG SPS1 287aa 2e-16 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MYO18A

  • TIAF1-3 TIAF1-4 279 324
  • NCBIGene 36.3 399687
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_078471

  • cd MYSc_type_XVIII 779aa 0.0 in ref transcript
    • Myosin motor domain, type XVIII myosins. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • cd PDZ_signaling 80aa 6e-10 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart MYSc 786aa 1e-105 in ref transcript
    • Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
  • pfam Myosin_tail_1 527aa 1e-36 in ref transcript
    • Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
  • Changed! TIGR SMC_prok_B 288aa 2e-11 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart PDZ 81aa 5e-08 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG COG5022 1200aa 1e-103 in ref transcript
    • Myosin heavy chain [Cytoskeleton].
  • Changed! COG Smc 738aa 5e-21 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 331aa 2e-08 in modified transcript
  • Changed! COG Smc 691aa 1e-18 in modified transcript

TNFRSF10B

  • TNFRSF10B.u.f.10 tnfrsf10b.r.3 303 390
  • NCBIGene 36.3 8795
  • Intron retention, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003842

  • cd TNFR 86aa 2e-16 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • smart DEATH 87aa 2e-17 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • smart TNFR 41aa 3e-06 in ref transcript
    • Tumor necrosis factor receptor / nerve growth factor receptor repeats. Repeats in growth factor receptors that are involved in growth factor binding. TNF/TNFR.
  • smart TNFR 40aa 2e-04 in ref transcript

TOPBP1

  • TOPBP1.u.f.8 TOPBP1.u.r.8 128 143
  • AceView 36.Apr07 TOPBP1
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: TOPBP1.aApr07

  • cd BRCT 71aa 4e-09 in ref transcript
    • Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
  • cd BRCT 70aa 6e-08 in ref transcript
  • cd BRCT 71aa 2e-06 in ref transcript
  • cd BRCT 70aa 2e-06 in ref transcript
  • cd BRCT 58aa 7e-05 in ref transcript
  • pfam BRCT 73aa 2e-13 in ref transcript
    • BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
  • pfam BRCT 74aa 5e-11 in ref transcript
  • pfam BRCT 72aa 3e-10 in ref transcript
  • pfam BRCT 71aa 5e-08 in ref transcript
  • pfam BRCT 79aa 1e-07 in ref transcript
  • smart BRCT 53aa 3e-06 in ref transcript
    • breast cancer carboxy-terminal domain.

TRAF3

  • TRAF3.u.f.1 TRAF3.u.r.3 161 300
  • NCBIGene 36.3 7187
  • Single exon skipping, size difference: 139
  • Exclusion in 5'UTR
  • Reference transcript: NM_145725

  • cd MATH_TRAF3 186aa 1e-106 in ref transcript
    • Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF3 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF3 was first described as a molecule that binds the cytoplasmic tail of CD40. However, it is not required for CD40 signaling. More recently, TRAF3 has been identified as a key regulator of type I interferon (IFN) production and the mammalian innate antiviral immunity. It mediates IFN responses in Toll-like receptor (TLR)-dependent as well as TLR-independent viral recognition pathways. It is also a key element in immunological homeostasis through its regulation of the anti-inflammatory cytokine interleukin-10. TRAF3 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
  • pfam MATH 126aa 4e-18 in ref transcript
    • MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
  • pfam zf-TRAF 57aa 1e-11 in ref transcript
    • TRAF-type zinc finger.
  • pfam Cast 176aa 2e-05 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • smart RING 36aa 0.002 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • TIGR rad18 203aa 0.005 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • COG Smc 161aa 5e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG COG5222 59aa 0.005 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

TSSC4

  • TSSC4.F10 TSSC4.R7 140 332
  • AceView 36.Apr07 TSSC4
  • Intron retention, size difference: 192
  • Exclusion in the protein (no frameshift)
  • Reference transcript: TSSC4.dApr07

TUBA4A

  • TUBA1.F5 TUBA1.R5 101 324
  • AceView 36.Apr07 TUBA4A
  • Single exon skipping, size difference: 223
  • Exclusion in the protein causing a frameshift
  • Reference transcript: TUBA4A.aApr07

  • Changed! cd alpha_tubulin 434aa 0.0 in ref transcript
    • The tubulin superfamily includes five distinct families, the alpha-, beta-, gamma-, delta-, and epsilon-tubulins and a sixth family (zeta-tubulin) which is present only in kinetoplastid protozoa. The alpha- and beta-tubulins are the major components of microtubules, while gamma-tubulin plays a major role in the nucleation of microtubule assembly. The delta- and epsilon-tubulins are widespread but unlike the alpha, beta, and gamma-tubulins they are not ubiquitous among eukaryotes. The alpha/beta-tubulin heterodimer is the structural subunit of microtubules. The alpha- and beta-tubulins share 40% amino-acid sequence identity, exist in several isotype forms, and undergo a variety of posttranslational modifications. The structures of alpha- and beta-tubulin are basically identical: each monomer is formed by a core of two beta-sheets surrounded by alpha-helices. The monomer structure is very compact, but can be divided into three regions based on function: the amino-terminal nucleotide-binding region, an intermediate taxol-binding region and the carboxy-terminal region which probably constitutes the binding surface for motor proteins.
  • Changed! pfam Tubulin 243aa 2e-67 in ref transcript
    • Tubulin/FtsZ family, GTPase domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial cell division. It is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria and archaea. Tubulin is the major component of microtubules.
  • Changed! pfam Tubulin_C 136aa 3e-52 in ref transcript
    • Tubulin/FtsZ family, C-terminal domain. This family includes the tubulin alpha, beta and gamma chains, as well as the bacterial FtsZ family of proteins. Members of this family are involved in polymer formation. FtsZ is the polymer-forming protein of bacterial cell division. It is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in eubacteria and archaea. Tubulin is the major component of microtubules.
  • Changed! PTZ PTZ00012 447aa 0.0 in ref transcript
    • alpha-tubulin II; Provisional.

UTRN

  • UTRN.u.f.81 UTRN.u.r.78 231 270
  • AceView 36.Apr07 UTRN
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: UTRN.aApr07

  • cd ZZ_dystrophin 49aa 2e-23 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • cd SPEC 219aa 2e-18 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd CH 105aa 2e-14 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 203aa 6e-14 in ref transcript
  • cd SPEC 241aa 6e-13 in ref transcript
  • cd SPEC 211aa 1e-09 in ref transcript
  • cd SPEC 208aa 4e-09 in ref transcript
  • cd CH 104aa 3e-08 in ref transcript
  • cd SPEC 213aa 6e-08 in ref transcript
  • cd SPEC 117aa 2e-06 in ref transcript
  • cd SPEC 214aa 2e-05 in ref transcript
  • cd SPEC 201aa 4e-04 in ref transcript
  • cd SPEC 205aa 5e-04 in ref transcript
  • cd WW 29aa 6e-04 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • pfam efhand_1 121aa 2e-49 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 92aa 1e-36 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam ZZ 46aa 1e-17 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
  • pfam CH 104aa 5e-17 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • pfam CH 103aa 2e-10 in ref transcript
  • smart SPEC 102aa 9e-10 in ref transcript
    • Spectrin repeats.
  • smart SPEC 102aa 2e-08 in ref transcript
  • smart SPEC 106aa 7e-07 in ref transcript
  • smart SPEC 94aa 2e-06 in ref transcript
  • TIGR SMC_prok_B 742aa 2e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • pfam SMC_N 241aa 2e-04 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • pfam SMC_N 302aa 2e-04 in ref transcript
  • pfam WW 30aa 4e-04 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
  • TIGR SMC_prok_B 783aa 0.001 in ref transcript
  • TIGR SMC_prok_B 348aa 0.008 in ref transcript
  • COG SAC6 238aa 2e-23 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 764aa 8e-08 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG SbcC 623aa 7e-06 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • COG Smc 335aa 8e-06 in ref transcript
  • COG Smc 817aa 5e-05 in ref transcript

BCL2L1

  • bcl2l1.f.2 bcl2l1.r.2 193 382
  • NCBIGene 36.3 598
  • Alternative 5-prime, size difference: 189
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138578

  • Changed! cd Bcl-2_like 114aa 2e-40 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! TIGR bcl-2 233aa 1e-89 in ref transcript
    • in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
  • Changed! cd Bcl-2_like 45aa 1e-08 in modified transcript
  • Changed! TIGR bcl-2 125aa 1e-35 in modified transcript
  • Changed! TIGR bcl-2 45aa 6e-09 in modified transcript

CASP9

  • primer-113212 primer-113213 360 670
  • AceView 36.Apr07 CASP9
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 310
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: CASP9.dApr07

  • Changed! cd CASc 125aa 2e-26 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 124aa 3e-26 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • smart CARD 91aa 1e-15 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.

A2BP1

  • refseq_A2BP1.F1 refseq_A2BP1.R1 189 267
  • NCBIGene 36.3 54715
  • Alternative 3-prime, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145891

  • cd RRM 72aa 8e-21 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 72aa 1e-20 in ref transcript
    • RNA recognition motif.
  • TIGR hnRNP-R-Q 159aa 7e-04 in ref transcript
    • Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
  • pfam PAT1 133aa 0.009 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
  • COG COG0724 101aa 1e-12 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

A2BP1

  • refseq_A2BP1.F3 refseq_A2BP1.R3 112 165
  • NCBIGene 36.3 54715
  • Single exon skipping, size difference: 53
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_145891

  • cd RRM 72aa 8e-21 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 72aa 1e-20 in ref transcript
    • RNA recognition motif.
  • TIGR hnRNP-R-Q 159aa 7e-04 in ref transcript
    • Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
  • Changed! pfam PAT1 133aa 0.009 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
  • COG COG0724 101aa 1e-12 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

AAA1

  • refseq_AAA1.F1 refseq_AAA1.R1 121 271
  • NCBIGene 36.2 404744
  • Single exon skipping, size difference: 150
  • Exclusion of the stop codon
  • Reference transcript: NM_207288

AAA1

  • refseq_AAA1.F2 refseq_AAA1.R3 204 324
  • NCBIGene 36.2 404744
  • Single exon skipping, size difference: 120
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_207288

ABCB4

  • refseq_ABCB4.F1 refseq_ABCB4.R1 106 127
  • NCBIGene 36.3 5244
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018849

  • cd ABC_MTABC3_MDL1_MDL2 238aa 1e-116 in ref transcript
    • MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another.
  • Changed! cd ABC_MTABC3_MDL1_MDL2 247aa 1e-113 in ref transcript
  • TIGR 3a01208 587aa 1e-115 in ref transcript
  • Changed! TIGR 3a01208 495aa 2e-95 in ref transcript
  • COG MdlB 524aa 1e-111 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! PTZ PTZ00265 773aa 3e-98 in ref transcript
    • multidrug resistance protein (mdr1); Provisional.
  • Changed! cd ABC_MTABC3_MDL1_MDL2 240aa 1e-115 in modified transcript
  • Changed! TIGR 3a01208 488aa 1e-97 in modified transcript
  • Changed! COG MdlB 498aa 3e-98 in modified transcript

ABCB4

  • refseq_ABCB4.F3 refseq_ABCB4.R3 179 320
  • NCBIGene 36.3 5244
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018849

  • cd ABC_MTABC3_MDL1_MDL2 238aa 1e-116 in ref transcript
    • MTABC3 (also known as ABCB6) is a mitochondrial ATP-binding cassette protein involved in iron homeostasis and one of four ABC transporters expressed in the mitochondrial inner membrane, the other three being MDL1(ABC7), MDL2, and ATM1. In fact, the yeast MDL1 (multidrug resistance-like protein 1) and MDL2 (multidrug resistance-like protein 2) transporters are also included in this CD. MDL1 is an ATP-dependent permease that acts as a high-copy suppressor of ATM1 and is thought to have a role in resistance to oxidative stress. Interestingly, subfamily B is more closely related to the carboxyl-terminal component of subfamily C than the two halves of ABCC molecules are with one another.
  • cd ABC_MTABC3_MDL1_MDL2 247aa 1e-113 in ref transcript
  • TIGR 3a01208 587aa 1e-115 in ref transcript
  • Changed! TIGR 3a01208 495aa 2e-95 in ref transcript
  • COG MdlB 524aa 1e-111 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! PTZ PTZ00265 773aa 3e-98 in ref transcript
    • multidrug resistance protein (mdr1); Provisional.
  • Changed! TIGR 3a01208 448aa 2e-91 in modified transcript
  • Changed! PTZ PTZ00265 726aa 2e-86 in modified transcript

ABCB9

  • refseq_ABCB9.F1 refseq_ABCB9.R1 184 373
  • NCBIGene 36.2 23457
  • Single exon skipping, size difference: 189
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_019625

  • Changed! cd ABCC_TAP 226aa 1e-110 in ref transcript
    • TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes.
  • Changed! TIGR 3a01208 695aa 0.0 in ref transcript
  • Changed! COG MdlB 573aa 1e-127 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! cd ABCC_TAP 197aa 4e-94 in modified transcript
  • Changed! TIGR 3a01208 632aa 1e-180 in modified transcript
  • Changed! COG SunT 506aa 1e-94 in modified transcript
    • ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain [Defense mechanisms].

ABCB9

  • refseq_ABCB9.F4 refseq_ABCB9.R4 219 348
  • NCBIGene 36.3 23457
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203444

  • cd ABCC_TAP 226aa 1e-110 in ref transcript
    • TAP, the Transporter Associated with Antigen Processing; TAP is essential for peptide delivery from the cytosol into the lumen of the endoplasmic reticulum (ER), where these peptides are loaded on major histocompatibility complex (MHC) I molecules. Loaded MHC I leave the ER and display their antigenic cargo on the cell surface to cytotoxic T cells. Subsequently, virus-infected or malignantly transformed cells can be eliminated. TAP belongs to the large family of ATP-binding cassette (ABC) transporters, which translocate a vast variety of solutes across membranes.
  • Changed! TIGR 3a01208 695aa 0.0 in ref transcript
  • Changed! COG MdlB 573aa 1e-127 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! TIGR MsbA_rel 525aa 1e-110 in modified transcript
    • This protein is related to a Proteobacterial ATP transporter that exports lipid A and to eukaryotic P-glycoproteins.
  • Changed! COG MdlB 530aa 1e-110 in modified transcript

ABCC11

  • refseq_ABCC11.F1 refseq_ABCC11.R1 241 355
  • NCBIGene 36.3 85320
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032583

  • Changed! cd ABCC_MRP_domain2 221aa 2e-95 in ref transcript
    • Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • cd ABCC_MRP_domain1 193aa 7e-80 in ref transcript
    • Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • Changed! TIGR MRP_assoc_pro 1298aa 0.0 in ref transcript
    • This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
  • Changed! PTZ PTZ00243 893aa 1e-153 in ref transcript
    • ABC transporter; Provisional.
  • COG MdlB 583aa 2e-51 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! cd ABCC_MRP_domain2 183aa 2e-73 in modified transcript
  • Changed! TIGR MRP_assoc_pro 1260aa 0.0 in modified transcript
  • Changed! PTZ PTZ00243 855aa 1e-139 in modified transcript

ABCC9

  • refseq_ABCC9.F2 refseq_ABCC9.R2 236 344
  • NCBIGene 36.3 10060
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020297

  • cd ABCC_SUR2 240aa 1e-137 in ref transcript
    • The SUR domain 2. The sulfonylurea receptor SUR is an ATP binding cassette (ABC) protein of the ABCC/MRP family. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity.
  • Changed! cd ABCC_SUR1_N 218aa 1e-125 in ref transcript
    • The SUR domain 1. The sulfonylurea receptor SUR is an ATP transporter of the ABCC/MRP family with tandem ATPase binding domains. Unlike other ABC proteins, it has no intrinsic transport function, neither active nor passive, but associates with the potassium channel proteins Kir6.1 or Kir6.2 to form the ATP-sensitive potassium (K(ATP)) channel. Within the channel complex, SUR serves as a regulatory subunit that fine-tunes the gating of Kir6.x in response to alterations in cellular metabolism. It constitutes a major pharmaceutical target as it binds numerous drugs, K(ATP) channel openers and blockers, capable of up- or down-regulating channel activity.
  • Changed! TIGR MRP_assoc_pro 1325aa 0.0 in ref transcript
    • This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
  • Changed! PTZ PTZ00243 1183aa 1e-170 in ref transcript
    • ABC transporter; Provisional.
  • Changed! cd ABCC_SUR1_N 217aa 1e-124 in modified transcript
  • Changed! TIGR MRP_assoc_pro 1289aa 0.0 in modified transcript
  • Changed! PTZ PTZ00243 1147aa 1e-170 in modified transcript

ABCD4

  • refseq_ABCD4.F1 refseq_ABCD4.R1 155 283
  • NCBIGene 36.3 5826
  • Single exon skipping, size difference: 128
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005050

  • Changed! cd ABCD_peroxisomal_ALDP 212aa 2e-55 in ref transcript
    • Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asymptomatic).
  • Changed! TIGR 3a01203 522aa 8e-77 in ref transcript
  • Changed! COG COG4178 534aa 1e-75 in ref transcript
    • ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only].

ABCD4

  • refseq_ABCD4.F3 refseq_ABCD4.R3 159 299
  • NCBIGene 36.2 5826
  • Single exon skipping, size difference: 140
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005050

  • Changed! cd ABCD_peroxisomal_ALDP 212aa 2e-55 in ref transcript
    • Peroxisomal ATP-binding cassette transporter (Pat) is involved in the import of very long-chain fatty acids (VLCFA) into the peroxisome. The peroxisomal membrane forms a permeability barrier for a wide variety of metabolites required for and formed during fatty acid beta-oxidation. To communicate with the cytoplasm and mitochondria, peroxisomes need dedicated proteins to transport such hydrophilic molecules across their membranes. X-linked adrenoleukodystrophy (X-ALD) is caused by mutations in the ALD gene, which encodes ALDP (adrenoleukodystrophy protein ), a peroxisomal integral membrane protein that is a member of the ATP-binding cassette (ABC) transporter protein family. The disease is characterized by a striking and unpredictable variation in phenotypic expression. Phenotypes include the rapidly progressive childhood cerebral form (CCALD), the milder adult form, adrenomyeloneuropathy (AMN), and variants without neurologic involvement (i.e. asymptomatic).
  • Changed! TIGR 3a01203 522aa 8e-77 in ref transcript
  • Changed! COG COG4178 534aa 1e-75 in ref transcript
    • ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only].
  • Changed! pfam ABC_membrane_2 67aa 9e-09 in modified transcript
    • ABC transporter transmembrane region 2. This domain covers the transmembrane of a small family of ABC transporters and shares sequence similarity with pfam00664. Mutations in this domain in human PMP70 (70 kDa peroxisomal membrane protein) are believed responsible for Zellweger Syndrome-2; mutations in human ALDP (Adrenoleukodystrophy protein) are responsible for recessive X-linked adrenoleukodystrophy. A Saccharomyces cerevisiae homolog is involved in the import of long-chain fatty acids.

ABCF1

  • refseq_ABCF1.F2 refseq_ABCF1.R2 135 249
  • NCBIGene 36.3 23
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025091

  • cd ABCF_EF-3 189aa 3e-32 in ref transcript
    • ABCF_EF-3 Elongation factor 3 (EF-3) is a cytosolic protein required by fungal ribosomes for in vitro protein synthesis and for in vivo growth. EF-3 stimulates the binding of the EF-1: GTP: aa-tRNA ternary complex to the ribosomal A site by facilitated release of the deacylated tRNA from the E site. The reaction requires ATP hydrolysis. EF-3 contains two ATP nucleotide binding sequence (NBS) motifs. NBSI is sufficient for the intrinsic ATPase activity. NBSII is essential for the ribosome-stimulated functions.
  • cd ABCF_EF-3 76aa 4e-27 in ref transcript
  • cd ABC_cobalt_CbiO_domain1 217aa 6e-18 in ref transcript
    • Domain I of the ABC component of a cobalt transport family found in bacteria, archaea, and eukaryota. The transition metal cobalt is an essential component of many enzymes and must be transported into cells in appropriate amounts when needed. This ABC transport system of the CbiMNQO family is involved in cobalt transport in association with the cobalamin (vitamin B12) biosynthetic pathways. Most of cobalt (Cbi) transport systems possess a separate CbiN component, the cobalt-binding periplasmic protein, and they are encoded by the conserved gene cluster cbiMNQO. Both the CbiM and CbiQ proteins are integral cytoplasmic membrane proteins, and the CbiO protein has the linker peptide and the Walker A and B motifs commonly found in the ATPase components of the ABC-type transport systems.
  • pfam ABC_tran 196aa 4e-29 in ref transcript
    • ABC transporter. ABC transporters for a large family of proteins responsible for translocation of a variety of compounds across biological membranes. ABC transporters are the largest family of proteins in many completely sequenced bacteria. ABC transporters are composed of two copies of this domain and two copies of a transmembrane domain pfam00664. These four domains may belong to a single polypeptide or belong in different polypeptide chains.
  • pfam ABC_tran 163aa 9e-25 in ref transcript
  • TIGR PhnT 49aa 0.004 in ref transcript
    • This ATP-binding component of an ABC transport system is found in Salmonella and Burkholderia lineages in the vicinity of enzymes for the breakdown of 2-aminoethylphosphonate.
  • COG Uup 532aa 1e-107 in ref transcript
    • ATPase components of ABC transporters with duplicated ATPase domains [General function prediction only].

ABCG1

  • refseq_ABCG1.F1 refseq_ABCG1.R1 125 161
  • NCBIGene 36.3 9619
  • Alternative 5-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004915

  • cd ABCG_EPDR 226aa 4e-74 in ref transcript
    • ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus.
  • Changed! TIGR 3a01204 619aa 0.0 in ref transcript
  • COG CcmA 220aa 7e-46 in ref transcript
    • ABC-type multidrug transport system, ATPase component [Defense mechanisms].
  • Changed! TIGR 3a01204 607aa 0.0 in modified transcript

ABCG1

  • refseq_ABCG1.F3 refseq_ABCG1.R3 204 245
  • NCBIGene 36.3 9619
  • Single exon skipping, size difference: 41
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_207627

  • Changed! cd ABCG_EPDR 226aa 2e-74 in ref transcript
    • ABCG transporters are involved in eye pigment (EP) precursor transport, regulation of lipid-trafficking mechanisms, and pleiotropic drug resistance (DR). DR is a well-described phenomenon occurring in fungi and shares several similarities with processes in bacteria and higher eukaryotes. Compared to other members of the ABC transporter subfamilies, the ABCG transporter family is composed of proteins that have an ATP-binding cassette domain at the N-terminus and a TM (transmembrane) domain at the C-terminus.
  • Changed! TIGR 3a01204 607aa 0.0 in ref transcript
  • Changed! COG CcmA 220aa 3e-46 in ref transcript
    • ABC-type multidrug transport system, ATPase component [Defense mechanisms].

ABHD11

  • refseq_ABHD11.F1 refseq_ABHD11.R1 114 135
  • NCBIGene 36.2 83451
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_148912

  • pfam Abhydrolase_1 216aa 4e-13 in ref transcript
    • alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes.
  • PRK PRK10673 246aa 2e-37 in ref transcript
    • hypothetical protein; Provisional.

ABHD2

  • refseq_ABHD2.F1 refseq_ABHD2.R1 100 498
  • NCBIGene 36.3 11057
  • Multiple exon skipping, size difference: 398
  • Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_007011

  • pfam Abhydrolase_1 230aa 3e-26 in ref transcript
    • alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes.
  • COG COG0429 336aa 4e-44 in ref transcript
    • Predicted hydrolase of the alpha/beta-hydrolase fold [General function prediction only].

ABI1

  • refseq_ABI1.F1 refseq_ABI1.R1 135 216
  • NCBIGene 36.3 10006
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005470

  • cd SH3 52aa 3e-15 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam Abi_HHR 79aa 4e-32 in ref transcript
    • Abl-interactor HHR. The region featured in this family is found towards the N-terminus of a number of adaptor proteins that interact with Abl-family tyrosine kinases. More specifically, it is termed the homeo-domain homologous region (HHR), as it is similar to the DNA-binding region of homeo-domain proteins. Other homeo-domain proteins have been implicated in specifying positional information during embryonic development, and in the regulation of the expression of cell-type specific genes. The Abl-interactor proteins are thought to coordinate the cytoplasmic and nuclear functions of the Abl-family kinases, and seem to be involved in cytoskeletal reorganisation, but their precise role remains unclear.
  • smart SH3 56aa 2e-17 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

ABI1

  • refseq_ABI1.F3 refseq_ABI1.R3 165 252
  • NCBIGene 36.3 10006
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005470

  • cd SH3 52aa 3e-15 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam Abi_HHR 79aa 4e-32 in ref transcript
    • Abl-interactor HHR. The region featured in this family is found towards the N-terminus of a number of adaptor proteins that interact with Abl-family tyrosine kinases. More specifically, it is termed the homeo-domain homologous region (HHR), as it is similar to the DNA-binding region of homeo-domain proteins. Other homeo-domain proteins have been implicated in specifying positional information during embryonic development, and in the regulation of the expression of cell-type specific genes. The Abl-interactor proteins are thought to coordinate the cytoplasmic and nuclear functions of the Abl-family kinases, and seem to be involved in cytoskeletal reorganisation, but their precise role remains unclear.
  • smart SH3 56aa 2e-17 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

ABLIM1

  • refseq_ABLIM1.F1 refseq_ABLIM1.R1 167 272
  • NCBIGene 36.3 3983
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002313

  • smart VHP 36aa 2e-12 in ref transcript
    • Villin headpiece domain.
  • pfam LIM 56aa 4e-10 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 56aa 1e-08 in ref transcript
  • pfam LIM 45aa 6e-07 in ref transcript
  • smart LIM 33aa 5e-05 in ref transcript
    • Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.

ABLIM1

  • refseq_ABLIM1.F3 refseq_ABLIM1.R3 119 203
  • NCBIGene 36.3 3983
  • Single exon skipping, size difference: 84
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_002313

  • smart VHP 36aa 2e-12 in ref transcript
    • Villin headpiece domain.
  • pfam LIM 56aa 4e-10 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 56aa 1e-08 in ref transcript
  • pfam LIM 45aa 6e-07 in ref transcript
  • smart LIM 33aa 5e-05 in ref transcript
    • Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.

ABTB1

  • refseq_ABTB1.F1 refseq_ABTB1.R1 167 202
  • NCBIGene 36.3 80325
  • Alternative 5-prime, size difference: 35
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172027

  • Changed! cd ANK 73aa 6e-08 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! pfam BTB 76aa 3e-15 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • Changed! smart BTB 96aa 1e-14 in ref transcript
    • Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.

ABTB1

  • refseq_ABTB1.F2 refseq_ABTB1.R2 112 158
  • NCBIGene 36.3 80325
  • Alternative 3-prime, size difference: 46
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_172027

  • Changed! cd ANK 73aa 6e-08 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! pfam BTB 76aa 3e-15 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • Changed! smart BTB 96aa 1e-14 in ref transcript
    • Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.

ACADVL

  • refseq_ACADVL.F2 refseq_ACADVL.R2 327 393
  • NCBIGene 36.3 37
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000018

  • cd VLCAD 410aa 0.0 in ref transcript
    • Very long chain acyl-CoA dehydrogenase (VLCAD). VLCAD acyl-CoA dehydrogenases (ACAD), which is found in the mitochondria of eukaryotes and in some bacteria. It catalyzes the alpha,beta dehydrogenation of the corresponding trans-enoyl-CoA by FAD, which becomes reduced. The reduced form of ACAD is reoxidized in the oxidative half-reaction by electron-transferring flavoprotein (ETF), from which the electrons are transferred to the mitochondrial respiratory chain coupled with ATP synthesis. VLCAD, which is a homodimer.
  • pfam Acyl-CoA_dh_1 147aa 2e-35 in ref transcript
    • Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle.
  • TIGR cyc_hxne_CoA_dh 380aa 1e-26 in ref transcript
    • Cyclohex-1-ene-1carboxyl-CoA is an intermediate in the anaerobic degradation of benzoyl-CoA derived from varioius aromatic compounds, in Rhodopseudomonas palustris but not Thauera aromatica. The aliphatic compound cyclohexanecarboxylate, can be converted to the same intermediate in two steps. The first step is its ligation to coenzyme A. The second is the action of this enzyme, cyclohexanecarboxyl-CoA dehydrogenase.
  • COG CaiA 380aa 2e-79 in ref transcript
    • Acyl-CoA dehydrogenases [Lipid metabolism].

ACCN2

  • refseq_ACCN2.F1 refseq_ACCN2.R1 140 278
  • NCBIGene 36.3 41
  • Alternative 5-prime, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020039

  • Changed! TIGR ENaC 538aa 1e-160 in ref transcript
    • This Hmm is designed from the vertebrate members of the ENaC family.
  • Changed! TIGR ENaC 492aa 1e-168 in modified transcript

ACCN3

  • refseq_ACCN3.F1 refseq_ACCN3.R1 113 133
  • NCBIGene 36.3 9311
  • Alternative 3-prime, size difference: 20
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020321

  • Changed! pfam ASC 441aa 1e-120 in ref transcript
    • Amiloride-sensitive sodium channel.
  • Changed! TIGR ENaC 395aa 1e-120 in modified transcript
    • This Hmm is designed from the vertebrate members of the ENaC family.
  • Changed! TIGR deg-1 87aa 4e-05 in modified transcript
    • This Hmm is designed from the invertebrate members of the ENaC family.

ACCN4

  • refseq_ACCN4.F1 refseq_ACCN4.R1 182 239
  • NCBIGene 36.3 55515
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018674

  • Changed! TIGR ENaC 362aa 5e-95 in ref transcript
    • This Hmm is designed from the vertebrate members of the ENaC family.
  • TIGR ENaC 103aa 0.002 in ref transcript
  • Changed! TIGR ENaC 343aa 2e-92 in modified transcript

A1CF

  • refseq_ACF.F1 refseq_ACF.R1 113 137
  • NCBIGene 36.3 29974
  • Alternative 3-prime, size difference: 24
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_138933

  • cd RRM 69aa 1e-16 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 68aa 4e-13 in ref transcript
  • cd RRM 79aa 7e-09 in ref transcript
  • Changed! TIGR hnRNP-R-Q 566aa 0.0 in ref transcript
    • Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
  • COG COG0724 76aa 4e-09 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 160aa 8e-08 in ref transcript
  • Changed! TIGR hnRNP-R-Q 574aa 0.0 in modified transcript

A1CF

  • refseq_ACF.F3 refseq_ACF.R3 193 241
  • NCBIGene 36.3 29974
  • Single exon skipping, size difference: 48
  • Exclusion in 5'UTR
  • Reference transcript: NM_138932

  • cd RRM 69aa 1e-16 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 68aa 5e-13 in ref transcript
  • cd RRM 79aa 9e-09 in ref transcript
  • TIGR hnRNP-R-Q 593aa 0.0 in ref transcript
    • Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
  • COG COG0724 76aa 7e-09 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 160aa 2e-07 in ref transcript

A1CF

  • refseq_ACF.F5 refseq_ACF.R5 200 343
  • NCBIGene 36.3 29974
  • Single exon skipping, size difference: 143
  • Exclusion of the protein initiation site
  • Reference transcript: NM_138933

  • cd RRM 69aa 1e-16 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 68aa 4e-13 in ref transcript
  • cd RRM 79aa 7e-09 in ref transcript
  • Changed! TIGR hnRNP-R-Q 566aa 0.0 in ref transcript
    • Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
  • COG COG0724 76aa 4e-09 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 160aa 8e-08 in ref transcript
  • Changed! TIGR hnRNP-R-Q 585aa 0.0 in modified transcript

ACLY

  • refseq_ACLY.F1 refseq_ACLY.R1 141 171
  • NCBIGene 36.3 47
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001096

  • cd CCL_ACL-C 238aa 2e-78 in ref transcript
    • Citryl-CoA lyase (CCL), the C-terminal portion of the single-subunit type ATP-citrate lyase (ACL) and the C-terminal portion of the large subunit of the two-subunit type ACL. CCL cleaves citryl-CoA (CiCoA) to acetyl-CoA (AcCoA) and oxaloacetate (OAA). ACL catalyzes an ATP- and a CoA- dependant cleavage of citrate to form AcCoA and OAA in a multistep reaction, the final step of which is likely to involve the cleavage of CiCoA to generate AcCoA and OAA. In fungi, yeast, plants, and animals ACL is cytosolic and generates AcCoA for lipogenesis. ACL may be required for fruiting body maturation in the filamentous fungus Sordaria macrospore. In several groups of autotrophic prokaryotes and archaea, ACL carries out the citrate-cleavage reaction of the reductive tricarboxylic acid (rTCA) cycle. In the family Aquificaceae this latter reaction in the rTCA cycle is carried out via a two enzyme system the second enzyme of which is CCL; the first enzyme is citryl-CoA synthetase (CCS) which is not included in this group. Chlorobium limicola ACL is an example of a two-subunit type ACL. It is comprised of a large and a small subunit; it has been speculated that the large subunit arose from a fusion of the small subunit of the two subunit CCS with CCL. The small ACL subunit is a homolog of the larger CCS subunit. Mammalian ACL is of the single-subunit type and may have arisen from the two-subunit ACL by another gene fusion. Mammalian ACLs are homotetramers; the ACLs of C. limicola and Arabidopsis are a heterooctomers (alpha4beta4). In cancer cells there is a shift in energy metabolism to aerobic glycolysis, the glycolytic end product pyruvate enters a truncated TCA cycle generating citrate which is cleaved in the cytosol by ACL. Inhibiting ACL limits the in-vitro proliferation and survival of these cancer cells, reduces in vivo tumor growth, and induces differentiation.
  • TIGR sucCoAalpha 257aa 3e-31 in ref transcript
    • ATP citrate lyases appear to form an outgroup.
  • pfam Citrate_synt 188aa 1e-12 in ref transcript
    • Citrate synthase.
  • TIGR sucCoAbeta 349aa 3e-12 in ref transcript
    • This family contains a split seen both in a maximum parsimony tree (which ignores gaps) and in the gap pattern near position 85 of the seed alignment. Eukaryotic and most bacterial sequences are longer and contain a region similar to TXQTXXXG. Sequences from Deinococcus radiodurans, Mycobacterium tuberculosis, Streptomyces coelicolor, and the Archaea are 6 amino acids shorter in that region and contain a motif resembling [KR]G.
  • COG SucD 256aa 8e-55 in ref transcript
    • Succinyl-CoA synthetase, alpha subunit [Energy production and conversion].
  • COG SucC 410aa 5e-46 in ref transcript
    • Succinyl-CoA synthetase, beta subunit [Energy production and conversion].
  • COG GltA 235aa 1e-29 in ref transcript
    • Citrate synthase [Energy production and conversion].

ACOT8

  • refseq_ACOT8.F1 refseq_ACOT8.R1 146 341
  • NCBIGene 36.3 10005
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005469

  • Changed! cd Thioesterase_II_repeat1 104aa 2e-33 in ref transcript
    • Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
  • cd Thioesterase_II_repeat2 91aa 3e-30 in ref transcript
    • Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
  • Changed! TIGR tesB 275aa 1e-105 in ref transcript
    • Subunit: homotetramer.
  • Changed! COG TesB 283aa 9e-78 in ref transcript
    • Acyl-CoA thioesterase [Lipid metabolism].
  • Changed! cd Thioesterase_II_repeat1 29aa 9e-06 in modified transcript
  • Changed! TIGR tesB 182aa 5e-60 in modified transcript
  • Changed! TIGR tesB 29aa 2e-08 in modified transcript
  • Changed! COG TesB 184aa 1e-46 in modified transcript
  • Changed! COG TesB 33aa 1e-04 in modified transcript

ACOT8

  • refseq_ACOT8.F3 refseq_ACOT8.R3 234 368
  • NCBIGene 36.2 10005
  • Single exon skipping, size difference: 134
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005469

  • Changed! cd Thioesterase_II_repeat1 104aa 2e-33 in ref transcript
    • Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
  • Changed! cd Thioesterase_II_repeat2 91aa 3e-30 in ref transcript
    • Thioesterase II (TEII) is thought to regenerate misprimed nonribosomal peptide synthetases (NRPSs) as well as modular polyketide synthases (PKSs) by hydrolyzing acetyl groups bound to the peptidyl carrier protein (PCP) and acyl carrier protein (ACP) domains, respectively. TEII has two tandem asymmetric hot dog folds that are structurally similar to one found in PaaI thioesterase, 4-hydroxybenzoyl-CoA thioesterase (4HBT) and beta-hydroxydecanoyl-ACP dehydratase and thus, the TEII monomer is equivalent to the homodimeric form of the latter three enzymes. Human TEII is expressed in T cells and has been shown to bind the product of the HIV-1 Nef gene.
  • Changed! TIGR tesB 275aa 1e-105 in ref transcript
    • Subunit: homotetramer.
  • Changed! COG TesB 283aa 9e-78 in ref transcript
    • Acyl-CoA thioesterase [Lipid metabolism].

ACOT9

  • refseq_ACOT9.F2 refseq_ACOT9.R2 103 130
  • NCBIGene 36.3 23597
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037171

  • cd BFIT_BACH 121aa 1e-26 in ref transcript
    • Brown fat-inducible thioesterase (BFIT). Brain acyl-CoA hydrolase (BACH). These enzymes deacylate long-chain fatty acids by hydrolyzing acyl-CoA thioesters to free fatty acids and CoA-SH. Eukaryotic members of this family are expressed in brain, testis, and brown adipose tissues. The archeal and eukaryotic members of this family have two tandem copies of the conserved hot dog fold, while most bacterial members have only one copy.
  • cd BFIT_BACH 126aa 2e-17 in ref transcript
  • COG COG1607 134aa 6e-11 in ref transcript
    • Acyl-CoA hydrolase [Lipid metabolism].
  • COG COG1607 103aa 5e-05 in ref transcript

ACPT

  • refseq_ACPT.F1 refseq_ACPT.R1 134 413
  • NCBIGene 36.2 93650
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 279
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033068

  • Changed! cd HP_HAP_like 285aa 1e-24 in ref transcript
    • Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
  • Changed! pfam Acid_phosphat_A 307aa 1e-62 in ref transcript
    • Histidine acid phosphatase.
  • Changed! PRK PRK10172 345aa 3e-06 in ref transcript
    • phosphoanhydride phosphorylase; Provisional.
  • Changed! cd HP_HAP_like 192aa 4e-18 in modified transcript
  • Changed! pfam Acid_phosphat_A 170aa 6e-19 in modified transcript
  • Changed! pfam Acid_phosphat_A 63aa 6e-16 in modified transcript
  • Changed! PRK PRK10172 175aa 0.008 in modified transcript

ACPT

  • refseq_ACPT.F3 refseq_ACPT.R3 231 364
  • NCBIGene 36.2 93650
  • Single exon skipping, size difference: 133
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_033068

  • Changed! cd HP_HAP_like 285aa 1e-24 in ref transcript
    • Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
  • Changed! pfam Acid_phosphat_A 307aa 1e-62 in ref transcript
    • Histidine acid phosphatase.
  • Changed! PRK PRK10172 345aa 3e-06 in ref transcript
    • phosphoanhydride phosphorylase; Provisional.
  • Changed! cd HP_HAP_like 119aa 9e-23 in modified transcript
  • Changed! pfam Acid_phosphat_A 183aa 2e-39 in modified transcript
  • Changed! PRK PRK10172 109aa 0.002 in modified transcript

ACRV1

  • refseq_ACRV1.F2 refseq_ACRV1.R2 267 387
  • NCBIGene 36.3 56
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001612

  • Changed! cd LU 77aa 4e-05 in ref transcript
    • Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.

ACRV1

  • refseq_ACRV1.F3 refseq_ACRV1.R3 116 173
  • NCBIGene 36.3 56
  • Alternative 5-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001612

  • cd LU 77aa 4e-05 in ref transcript
    • Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.

ACSL3

  • refseq_ACSL3.F1 refseq_ACSL3.R1 249 356
  • NCBIGene 36.3 2181
  • Single exon skipping, size difference: 107
  • Exclusion in 5'UTR
  • Reference transcript: NM_004457

  • pfam AMP-binding 477aa 2e-82 in ref transcript
    • AMP-binding enzyme.
  • PTZ PTZ00216 617aa 1e-121 in ref transcript
    • acyl-CoA synthetase; Provisional.

ACSL4

  • refseq_ACSL4.F1 refseq_ACSL4.R1 137 449
  • NCBIGene 36.3 2182
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 312
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_022977

  • pfam AMP-binding 476aa 5e-82 in ref transcript
    • AMP-binding enzyme.
  • PTZ PTZ00216 585aa 1e-120 in ref transcript
    • acyl-CoA synthetase; Provisional.
  • PRK PRK07868 119aa 0.004 in ref transcript
    • acyl-CoA synthetase; Validated.

ACTL6A

  • refseq_ACTL6A.F1 refseq_ACTL6A.R1 124 172
  • NCBIGene 36.3 86
  • Alternative 5-prime, size difference: 48
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004301

  • Changed! cd ACTIN 415aa 1e-111 in ref transcript
    • Actin; An ubiquitous protein involved in the formation of filaments that are a major component of the cytoskeleton. Interaction with myosin provides the basis of muscular contraction and many aspects of cell motility. Each actin protomer binds one molecule of ATP and either calcium or magnesium ions. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Polymerization is regulated by so-called capping proteins. The ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins.
  • Changed! pfam Actin 421aa 1e-145 in ref transcript
    • Actin.
  • Changed! PTZ PTZ00281 420aa 2e-70 in ref transcript
    • actin; Provisional.

ACTR3B

  • refseq_ACTR3B.F1 refseq_ACTR3B.R1 218 428
  • NCBIGene 36.3 57180
  • Multiple exon skipping, size difference: 210
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020445

  • Changed! cd ACTIN 405aa 1e-120 in ref transcript
    • Actin; An ubiquitous protein involved in the formation of filaments that are a major component of the cytoskeleton. Interaction with myosin provides the basis of muscular contraction and many aspects of cell motility. Each actin protomer binds one molecule of ATP and either calcium or magnesium ions. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Polymerization is regulated by so-called capping proteins. The ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins.
  • Changed! smart ACTIN 407aa 1e-129 in ref transcript
    • Actin. ACTIN subfamily of ACTIN/mreB/sugarkinase/Hsp70 superfamily.
  • Changed! PTZ PTZ00280 415aa 0.0 in ref transcript
    • actin; Provisional.
  • Changed! cd ACTIN 311aa 7e-91 in modified transcript
  • Changed! smart ACTIN 312aa 4e-97 in modified transcript
  • Changed! PTZ PTZ00280 314aa 1e-148 in modified transcript
  • Changed! PTZ PTZ00280 31aa 1e-09 in modified transcript

ACYP1

  • refseq_ACYP1.F1 refseq_ACYP1.R1 189 268
  • NCBIGene 36.3 97
  • Single exon skipping, size difference: 79
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001107

  • Changed! pfam Acylphosphatase 93aa 3e-24 in ref transcript
    • Acylphosphatase.
  • Changed! COG AcyP 92aa 2e-11 in ref transcript
    • Acylphosphatases [Energy production and conversion].
  • Changed! pfam Acylphosphatase 23aa 0.001 in modified transcript

ADAM22

  • refseq_ADAM22.F2 refseq_ADAM22.R2 156 243
  • NCBIGene 36.3 53616
  • Single exon skipping, size difference: 87
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_021723

  • cd ZnMc_adamalysin_II_like 198aa 4e-48 in ref transcript
    • Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
  • pfam Reprolysin 200aa 1e-63 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • smart ACR 141aa 2e-40 in ref transcript
    • ADAM Cysteine-Rich Domain.
  • pfam Disintegrin 77aa 4e-25 in ref transcript
    • Disintegrin.
  • pfam Pep_M12B_propep 104aa 2e-22 in ref transcript
    • Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.

ADAM22

  • refseq_ADAM22.F3 refseq_ADAM22.R3 182 290
  • NCBIGene 36.3 53616
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021723

  • cd ZnMc_adamalysin_II_like 198aa 4e-48 in ref transcript
    • Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
  • pfam Reprolysin 200aa 1e-63 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • smart ACR 141aa 2e-40 in ref transcript
    • ADAM Cysteine-Rich Domain.
  • pfam Disintegrin 77aa 4e-25 in ref transcript
    • Disintegrin.
  • pfam Pep_M12B_propep 104aa 2e-22 in ref transcript
    • Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.

ADAM33

  • refseq_ADAM33.F1 refseq_ADAM33.R1 105 183
  • NCBIGene 36.3 80332
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025220

  • cd ZnMc_adamalysin_II_like 198aa 7e-63 in ref transcript
    • Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
  • pfam Reprolysin 200aa 8e-68 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • Changed! smart ACR 142aa 1e-35 in ref transcript
    • ADAM Cysteine-Rich Domain.
  • pfam Disintegrin 76aa 2e-29 in ref transcript
    • Disintegrin.
  • pfam Pep_M12B_propep 70aa 3e-21 in ref transcript
    • Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
  • Changed! smart ACR 141aa 8e-35 in modified transcript

ADAMTS13

  • refseq_ADAMTS13.F2 refseq_ADAMTS13.R2 135 303
  • NCBIGene 36.3 11093
  • Alternative 5-prime, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139025

  • cd ZnMc_ADAMTS_like 204aa 8e-68 in ref transcript
    • Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
  • pfam Reprolysin 206aa 1e-11 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • smart TSP1 53aa 3e-11 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.

ADAMTS13

  • refseq_ADAMTS13.F3 refseq_ADAMTS13.R3 193 432
  • NCBIGene 36.2 11093
  • Alternative 3-prime, size difference: 239
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_139025

  • Changed! cd ZnMc_ADAMTS_like 204aa 8e-68 in ref transcript
    • Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
  • Changed! pfam Reprolysin 206aa 1e-11 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • Changed! smart TSP1 53aa 3e-11 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
  • Changed! cd ZnMc_ADAMTS_like 183aa 4e-59 in modified transcript
  • Changed! pfam Reprolysin 171aa 6e-09 in modified transcript

ADAMTS13

  • refseq_ADAMTS13.F5 refseq_ADAMTS13.R5 127 207
  • NCBIGene 36.2 11093
  • Alternative 5-prime, size difference: 80
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_139025

  • cd ZnMc_ADAMTS_like 204aa 8e-68 in ref transcript
    • Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
  • pfam Reprolysin 206aa 1e-11 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • Changed! smart TSP1 53aa 3e-11 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.

ADAMTSL1

  • refseq_ADAMTSL1.F2 refseq_ADAMTSL1.R2 260 311
  • NCBIGene 36.2 92949
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139238

  • smart TSP1 47aa 5e-05 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.

ADAR

  • refseq_ADAR.F2 refseq_ADAR.R2 138 195
  • NCBIGene 36.3 103
  • Alternative 3-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001111

  • cd DSRM 67aa 5e-13 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • cd DSRM 67aa 1e-10 in ref transcript
  • cd DSRM 50aa 0.002 in ref transcript
  • smart ADEAMc 384aa 1e-149 in ref transcript
    • tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
  • pfam z-alpha 65aa 2e-19 in ref transcript
    • Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this region to B-DNA has been shown to be disfavoured by steric hindrance.
  • pfam z-alpha 67aa 4e-19 in ref transcript
  • smart DSRM 66aa 1e-15 in ref transcript
    • Double-stranded RNA binding motif.
  • smart DSRM 66aa 4e-11 in ref transcript
  • smart DSRM 50aa 2e-06 in ref transcript
  • PRK rnc 59aa 2e-05 in ref transcript
    • ribonuclease III; Reviewed.
  • COG Rnc 67aa 5e-05 in ref transcript
    • dsRNA-specific ribonuclease [Transcription].

ADAR

  • refseq_ADAR.F3 refseq_ADAR.R3 143 221
  • NCBIGene 36.3 103
  • Alternative 5-prime, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001111

  • cd DSRM 67aa 5e-13 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • cd DSRM 67aa 1e-10 in ref transcript
  • cd DSRM 50aa 0.002 in ref transcript
  • smart ADEAMc 384aa 1e-149 in ref transcript
    • tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
  • pfam z-alpha 65aa 2e-19 in ref transcript
    • Adenosine deaminase z-alpha domain. This family consists of the N-terminus and thus the z-alpha domain of double-stranded RNA-specific adenosine deaminase (ADAR), an RNA- editing enzyme. The z-alpha domain is a Z-DNA binding domain, and binding of this region to B-DNA has been shown to be disfavoured by steric hindrance.
  • pfam z-alpha 67aa 4e-19 in ref transcript
  • smart DSRM 66aa 1e-15 in ref transcript
    • Double-stranded RNA binding motif.
  • smart DSRM 66aa 4e-11 in ref transcript
  • smart DSRM 50aa 2e-06 in ref transcript
  • PRK rnc 59aa 2e-05 in ref transcript
    • ribonuclease III; Reviewed.
  • Changed! COG Rnc 67aa 5e-05 in ref transcript
    • dsRNA-specific ribonuclease [Transcription].
  • Changed! COG Rnc 76aa 5e-05 in modified transcript

ADARB1

  • refseq_ADARB1.F2 refseq_ADARB1.R2 112 232
  • NCBIGene 36.3 104
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015833

  • cd DSRM 48aa 5e-06 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • cd DSRM 46aa 3e-05 in ref transcript
  • Changed! smart ADEAMc 417aa 1e-133 in ref transcript
    • tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
  • smart DSRM 48aa 2e-07 in ref transcript
    • Double-stranded RNA binding motif.
  • smart DSRM 45aa 4e-06 in ref transcript
  • PRK rnc 48aa 0.008 in ref transcript
    • ribonuclease III; Reviewed.
  • Changed! smart ADEAMc 377aa 1e-137 in modified transcript

ADARB1

  • refseq_ADARB1.F4 refseq_ADARB1.R4 180 298
  • NCBIGene 36.3 104
  • Single exon skipping, size difference: 118
  • Inclusion in 5'UTR
  • Reference transcript: NM_015833

  • cd DSRM 48aa 5e-06 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • cd DSRM 46aa 3e-05 in ref transcript
  • smart ADEAMc 417aa 1e-133 in ref transcript
    • tRNA-specific and double-stranded RNA adenosine deaminase (RNA-specific editase).
  • smart DSRM 48aa 2e-07 in ref transcript
    • Double-stranded RNA binding motif.
  • smart DSRM 45aa 4e-06 in ref transcript
  • PRK rnc 48aa 0.008 in ref transcript
    • ribonuclease III; Reviewed.

ADCY6

  • refseq_ADCY6.F1 refseq_ADCY6.R1 235 394
  • NCBIGene 36.3 112
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015270

  • pfam Guanylate_cyc 183aa 6e-59 in ref transcript
    • Adenylate and Guanylate cyclase catalytic domain.
  • pfam Guanylate_cyc 194aa 8e-57 in ref transcript
  • pfam DUF1053 110aa 3e-38 in ref transcript
    • Domain of Unknown Function (DUF1053). This domain is found in Adenylate cyclases.
  • COG CyaA 160aa 6e-17 in ref transcript
    • Adenylate cyclase, family 3 (some proteins contain HAMP domain) [Signal transduction mechanisms].
  • COG CyaA 238aa 1e-12 in ref transcript

ADD1

  • refseq_ADD1.F1 refseq_ADD1.R1 210 303
  • NCBIGene 36.3 118
  • Alternative 5-prime, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014189

  • cd Aldolase_II 211aa 2e-52 in ref transcript
    • Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
  • pfam Aldolase_II 183aa 1e-50 in ref transcript
    • Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
  • PRK PRK07044 247aa 8e-82 in ref transcript
    • aldolase_II super-family; Provisional.

ADD1

  • refseq_ADD1.F3 refseq_ADD1.R3 161 195
  • NCBIGene 36.3 118
  • Single exon skipping, size difference: 34
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_014189

  • cd Aldolase_II 211aa 2e-52 in ref transcript
    • Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
  • pfam Aldolase_II 183aa 1e-50 in ref transcript
    • Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
  • PRK PRK07044 247aa 8e-82 in ref transcript
    • aldolase_II super-family; Provisional.

ADD2

  • refseq_ADD2.F1 refseq_ADD2.R1 226 312
  • NCBIGene 36.3 119
  • Single exon skipping, size difference: 86
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001617

  • cd Aldolase_II 211aa 1e-54 in ref transcript
    • Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
  • pfam Aldolase_II 183aa 9e-45 in ref transcript
    • Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
  • PRK PRK07044 250aa 2e-61 in ref transcript
    • aldolase_II super-family; Provisional.

ADD3

  • refseq_ADD3.F2 refseq_ADD3.R2 284 380
  • NCBIGene 36.3 120
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016824

  • cd Aldolase_II 210aa 6e-40 in ref transcript
    • Class II Aldolase and Adducin head (N-terminal) domain. Aldolases are ubiquitous enzymes catalyzing central steps of carbohydrate metabolism. Based on enzymatic mechanisms, this superfamily has been divided into two distinct classes (Class I and II). Class II enzymes are further divided into two sub-classes A and B. This family includes class II A aldolases and adducins which has not been ascribed any enzymatic function. Members of this class are primarily bacterial and eukaryotic in origin and include L-fuculose-1-phosphate, L-rhamnulose-1-phosphate aldolases and L-ribulose-5-phosphate 4-epimerases. They all share the ability to promote carbon-carbon bond cleavage and stabilize enolate intermediates using divalent cations.
  • pfam Aldolase_II 183aa 1e-43 in ref transcript
    • Class II Aldolase and Adducin N-terminal domain. This family includes class II aldolases and adducins which have not been ascribed any enzymatic function.
  • PRK PRK07044 249aa 5e-60 in ref transcript
    • aldolase_II super-family; Provisional.

ADNP

  • refseq_ADNP.F1 refseq_ADNP.R1 116 291
  • NCBIGene 36.3 23394
  • Single exon skipping, size difference: 175
  • Exclusion in 5'UTR
  • Reference transcript: NM_015339

  • cd homeodomain 46aa 3e-06 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • smart HOX 43aa 5e-08 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.

AFG3L1

  • refseq_AFG3L1.F1 refseq_AFG3L1.R1 327 393
  • NCBIGene 36.2 172
  • Alternative 3-prime, size difference: 66
  • Inclusion in 3'UTR
  • Reference transcript: NM_001132

  • cd AAA 50aa 7e-08 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • TIGR FtsH_fam 46aa 5e-20 in ref transcript
    • HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
  • CHL ftsH 46aa 3e-19 in ref transcript
    • cell division protein; Validated.
  • PRK PRK08116 102aa 4e-04 in ref transcript
    • hypothetical protein; Validated.

AFG3L1

  • refseq_AFG3L1.F3 refseq_AFG3L1.R3 112 212
  • NCBIGene 36.2 172
  • Single exon skipping, size difference: 100
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001132

  • Changed! cd AAA 50aa 7e-08 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! TIGR FtsH_fam 46aa 5e-20 in ref transcript
    • HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
  • Changed! CHL ftsH 46aa 3e-19 in ref transcript
    • cell division protein; Validated.
  • Changed! PRK PRK08116 102aa 4e-04 in ref transcript
    • hypothetical protein; Validated.

AFG3L1

  • refseq_AFG3L1.F6 refseq_AFG3L1.R6 208 281
  • NCBIGene 36.2 172
  • Alternative 3-prime, size difference: 73
  • Inclusion in 5'UTR
  • Reference transcript: NM_001132

  • cd AAA 50aa 7e-08 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • TIGR FtsH_fam 46aa 5e-20 in ref transcript
    • HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
  • CHL ftsH 46aa 3e-19 in ref transcript
    • cell division protein; Validated.
  • PRK PRK08116 102aa 4e-04 in ref transcript
    • hypothetical protein; Validated.

AFTPH

  • refseq_AFTPH.F1 refseq_AFTPH.R1 147 231
  • NCBIGene 36.3 54812
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203437

ACAN

  • refseq_AGC1.F1 refseq_AGC1.R1 166 349
  • NCBIGene 36.3 176
  • Single exon skipping, size difference: 183
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013227

  • cd CLECT_CSPGs 124aa 9e-79 in ref transcript
    • CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classification system to be a variety of CTLD, but are omitted from this hierarchical classification based on insignificant sequence similarity.
  • cd Link_domain_CSPGs_modules_1_3 95aa 5e-46 in ref transcript
    • Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
  • cd Link_domain_CSPGs_modules_2_4 96aa 5e-45 in ref transcript
    • Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
  • cd Link_domain_CSPGs_modules_2_4 96aa 4e-44 in ref transcript
  • cd Link_domain_CSPGs_modules_1_3 95aa 4e-40 in ref transcript
  • Changed! cd CCP 57aa 6e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd EGF_CA 31aa 3e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • pfam Xlink 95aa 2e-40 in ref transcript
    • Extracellular link domain.
  • pfam Xlink 95aa 9e-40 in ref transcript
  • pfam Xlink 96aa 1e-36 in ref transcript
  • smart CLECT 122aa 6e-36 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
  • smart LINK 99aa 7e-35 in ref transcript
    • Link (Hyaluronan-binding).
  • pfam V-set 120aa 9e-09 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • Changed! smart CCP 57aa 3e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • pfam EGF 31aa 1e-04 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.

ACAN

  • refseq_AGC1.F3 refseq_AGC1.R3 104 218
  • NCBIGene 36.3 176
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013227

  • cd CLECT_CSPGs 124aa 9e-79 in ref transcript
    • CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classification system to be a variety of CTLD, but are omitted from this hierarchical classification based on insignificant sequence similarity.
  • cd Link_domain_CSPGs_modules_1_3 95aa 5e-46 in ref transcript
    • Link_domain_CSPGs_modules_1_3; this extracellular link domain is found in the first and third link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan. In addition, it is found in the first link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. In addition, aggrecan contains a second globular domain (G2) which contains link modules 3 and 4. G2 appears to lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
  • cd Link_domain_CSPGs_modules_2_4 96aa 5e-45 in ref transcript
    • Link_domain_CSPGs_modules_2_4; this link domain is found in the second and fourth link modules of the chondroitin sulfate proteoglycan core protein (CSPG) aggrecan and, in the second link module of three other CSPGs: versican, neurocan, and brevican. The link domain is a hyaluronan (HA)-binding domain. CSPGs are characterized by an N-terminal globular domain (G1 domain) containing two contiguous link modules (modules 1 and 2). Both link modules of the G1 domain of aggrecan are involved in interaction with HA. Aggrecan in addition contains a second globular domain (G2) having link modules 3 and 4 which lack HA-binding activity. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with HA. These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggregan may contribute to the structural integrity of many different tissues. Members of the vertebrate HPLN (hyaluronan/HA and proteoglycan binding link) protein family are physically linked adjacent to CSPG genes.
  • cd Link_domain_CSPGs_modules_2_4 96aa 4e-44 in ref transcript
  • cd Link_domain_CSPGs_modules_1_3 95aa 4e-40 in ref transcript
  • cd CCP 57aa 6e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • Changed! cd EGF_CA 31aa 3e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • pfam Xlink 95aa 2e-40 in ref transcript
    • Extracellular link domain.
  • pfam Xlink 95aa 9e-40 in ref transcript
  • pfam Xlink 96aa 1e-36 in ref transcript
  • smart CLECT 122aa 6e-36 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
  • smart LINK 99aa 7e-35 in ref transcript
    • Link (Hyaluronan-binding).
  • pfam V-set 120aa 9e-09 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • smart CCP 57aa 3e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • Changed! pfam EGF 31aa 1e-04 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.

AGER

  • refseq_AGER.F1 refseq_AGER.R1 120 233
  • NCBIGene 36.3 177
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 113
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001136

  • Changed! cd IGcam 66aa 3e-06 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IG 77aa 0.004 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam C2-set_2 96aa 4e-18 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
  • smart IG_like 95aa 1e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • Changed! smart IGc2 58aa 4e-07 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! cd IGcam 27aa 0.008 in modified transcript
  • Changed! smart IG_like 26aa 3e-04 in modified transcript

AGER

  • refseq_AGER.F3 refseq_AGER.R3 100 142
  • NCBIGene 36.3 177
  • Alternative 3-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001136

  • cd IGcam 66aa 3e-06 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! cd IG 77aa 0.004 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam C2-set_2 96aa 4e-18 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
  • Changed! smart IG_like 95aa 1e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 58aa 4e-07 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! cd IG 63aa 6e-04 in modified transcript
  • Changed! smart IG_like 81aa 6e-08 in modified transcript

AGL

  • refseq_AGL.F1 refseq_AGL.R1 117 260
  • NCBIGene 36.3 178
  • Single exon skipping, size difference: 143
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000642

  • Changed! TIGR glyc_debranch 1511aa 0.0 in ref transcript
    • glycogen debranching enzyme possesses two different catalytic activities; oligo-1,4-->1,4-glucantransferase (EC 2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site directed mutagenesis studies in S. cerevisiae indicate that the transferase and glucosidase activities are independent and located in different regions of the polypeptide chain. Proteins in this model belong to the larger alpha-amylase family. The model covers eukaryotic proteins with a seed composed of human, nematode and yeast sequences. Yeast seed sequence is well characterized. The model is quite rigorous; either query sequence yields large bit score or it fails to hit the model altogether. There doesn't appear to be any middle ground.
  • Changed! COG GDB1 462aa 5e-33 in ref transcript
    • Glycogen debranching enzyme [Carbohydrate transport and metabolism].
  • Changed! COG AmyA 175aa 8e-05 in ref transcript
    • Glycosidases [Carbohydrate transport and metabolism].

AGL

  • refseq_AGL.F2 refseq_AGL.R2 112 392
  • NCBIGene 36.3 178
  • Alternative 5-prime, size difference: 280
  • Exclusion in 5'UTR
  • Reference transcript: NM_000028

  • TIGR glyc_debranch 1511aa 0.0 in ref transcript
    • glycogen debranching enzyme possesses two different catalytic activities; oligo-1,4-->1,4-glucantransferase (EC 2.4.1.25) and amylo-1,6-glucosidase (EC 3.2.1.33). Site directed mutagenesis studies in S. cerevisiae indicate that the transferase and glucosidase activities are independent and located in different regions of the polypeptide chain. Proteins in this model belong to the larger alpha-amylase family. The model covers eukaryotic proteins with a seed composed of human, nematode and yeast sequences. Yeast seed sequence is well characterized. The model is quite rigorous; either query sequence yields large bit score or it fails to hit the model altogether. There doesn't appear to be any middle ground.
  • COG GDB1 462aa 5e-33 in ref transcript
    • Glycogen debranching enzyme [Carbohydrate transport and metabolism].
  • COG AmyA 175aa 8e-05 in ref transcript
    • Glycosidases [Carbohydrate transport and metabolism].

AGPAT4

  • refseq_AGPAT4.F1 refseq_AGPAT4.R1 102 239
  • NCBIGene 36.2 56895
  • Single exon skipping, size difference: 137
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_020133

  • Changed! smart PlsC 122aa 1e-17 in ref transcript
    • Phosphate acyltransferases. Function in phospholipid biosynthesis and have either glycerolphosphate, 1-acylglycerolphosphate, or 2-acylglycerolphosphoethanolamine acyltransferase activities. Tafazzin, the product of the gene mutated in patients with Barth syndrome, is a member of this family.
  • Changed! COG PlsC 196aa 2e-07 in ref transcript
    • 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism].

AGTRAP

  • refseq_AGTRAP.F1 refseq_AGTRAP.R1 115 136
  • NCBIGene 36.3 57085
  • Alternative 5-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020350

  • Changed! pfam AGTRAP 159aa 1e-78 in ref transcript
    • Angiotensin II, type I receptor-associated protein (AGTRAP). This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the carboxyl-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
  • Changed! pfam AGTRAP 152aa 4e-75 in modified transcript

AGTRAP

  • refseq_AGTRAP.F2 refseq_AGTRAP.R2 140 237
  • NCBIGene 36.3 57085
  • Single exon skipping, size difference: 97
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_020350

  • Changed! pfam AGTRAP 159aa 1e-78 in ref transcript
    • Angiotensin II, type I receptor-associated protein (AGTRAP). This family consists of several angiotensin II, type I receptor-associated protein (AGTRAP) sequences. AGTRAP is known to interact specifically with the carboxyl-terminal cytoplasmic region of the angiotensin II type 1 (AT(1)) receptor to regulate different aspects of AT(1) receptor physiology. The function of this family is unclear.
  • Changed! pfam AGTRAP 20aa 2e-06 in modified transcript

AIFM3

  • refseq_AIFL.F1 refseq_AIFL.R1 110 131
  • NCBIGene 36.3 150209
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144704

  • cd Rieske_AIFL_N 95aa 2e-41 in ref transcript
    • AIFL (apoptosis-inducing factor like) family, N-terminal Rieske domain; members of this family show similarity to human AIFL, containing an N-terminal Rieske domain and a C-terminal pyridine nucleotide-disulfide oxidoreductase domain (Pyr_redox). The Rieske domain is a [2Fe-2S] cluster binding domain involved in electron transfer. AIFL shares 35% homology with human AIF (apoptosis-inducing factor), mainly in the Pyr_redox domain. AIFL is predominantly localized to the mitochondria. AIFL induces apoptosis in a caspase-dependent manner.
  • pfam Pyr_redox_2 278aa 6e-43 in ref transcript
    • Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain.
  • pfam Rieske 88aa 7e-14 in ref transcript
    • Rieske [2Fe-2S] domain. The rieske domain has a [2Fe-2S] centre. Two conserved cysteines that one Fe ion while the other Fe ion is coordinated by two conserved histidines.
  • PRK PRK09754 382aa 1e-33 in ref transcript
    • phenylpropionate dioxygenase ferredoxin reductase subunit; Provisional.
  • COG {NirD} 96aa 1e-16 in ref transcript
    • Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases [Inorganic ion transport and metabolism / General function prediction only].

AIPL1

  • refseq_AIPL1.F2 refseq_AIPL1.R2 136 325
  • NCBIGene 36.3 23746
  • Single exon skipping, size difference: 189
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014336

  • cd TPR 117aa 7e-04 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • pfam FKBP_C 61aa 0.002 in ref transcript
    • FKBP-type peptidyl-prolyl cis-trans isomerase.

AIPL1

  • refseq_AIPL1.F4 refseq_AIPL1.R4 186 366
  • NCBIGene 36.3 23746
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014336

  • cd TPR 117aa 7e-04 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • Changed! pfam FKBP_C 61aa 0.002 in ref transcript
    • FKBP-type peptidyl-prolyl cis-trans isomerase.

AKAP14

  • refseq_AKAP14.F1 refseq_AKAP14.R1 144 324
  • NCBIGene 36.3 158798
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_178813

AKAP7

  • refseq_AKAP7.F1 refseq_AKAP7.R1 267 336
  • NCBIGene 36.3 9465
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138633

  • pfam AKAP7_RIRII_bdg 61aa 2e-19 in ref transcript
    • PKA-RI-RII subunit binding domain of A-kinase anchor protein. AKAP7_RIRII_bdg is the C-terminal domain of the cyclic AMP-dependent protein kinase A, PKA, anchor protein AKAP7. This protein anchors PKA, for its role in regulating PKA-mediated gene transcription in both somatic cells and oocytes, by binding to its regulatory subunits, RI and RII, hence being known as a dual-specific AKAP. The 25 crucial amino acids of RII-binding domains in general form structurally conserved amphipathic helices with unrelated sequences; hydrophobic amino acid residues form the backbone of the interaction and hydrogen bond- and salt-bridge-forming amino acid residues increase the affinity of the interaction. The N-terminus, of family AKAP7_NLS, carries the nuclear localisation signal.

AKAP9

  • refseq_AKAP9.F1 refseq_AKAP9.R1 197 233
  • NCBIGene 36.3 10142
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_147171

  • pfam PACT_coil_coil 83aa 2e-26 in ref transcript
    • Pericentrin-AKAP-450 domain of centrosomal targeting protein. This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
  • TIGR SMC_prok_B 404aa 6e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • pfam SMC_N 259aa 6e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • TIGR SMC_prok_A 252aa 0.001 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_B 709aa 0.002 in ref transcript
  • COG Smc 264aa 3e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 317aa 7e-04 in ref transcript
  • PRK PRK05771 168aa 0.009 in ref transcript
    • V-type ATP synthase subunit I; Validated.

AKAP9

  • refseq_AKAP9.F2 refseq_AKAP9.R2 133 157
  • NCBIGene 36.3 10142
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 24
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_147171

  • pfam PACT_coil_coil 83aa 2e-26 in ref transcript
    • Pericentrin-AKAP-450 domain of centrosomal targeting protein. This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
  • TIGR SMC_prok_B 404aa 6e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • pfam SMC_N 259aa 6e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • TIGR SMC_prok_A 252aa 0.001 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_B 709aa 0.002 in ref transcript
  • COG Smc 264aa 3e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 317aa 7e-04 in ref transcript
  • PRK PRK05771 168aa 0.009 in ref transcript
    • V-type ATP synthase subunit I; Validated.

AKR1A1

  • refseq_AKR1A1.F2 refseq_AKR1A1.R2 166 294
  • NCBIGene 36.3 10327
  • Single exon skipping, size difference: 128
  • Exclusion in 5'UTR
  • Reference transcript: NM_006066

  • cd Aldo_ket_red 284aa 2e-72 in ref transcript
    • Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
  • pfam Aldo_ket_red 287aa 2e-97 in ref transcript
    • Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity.
  • COG ARA1 298aa 4e-81 in ref transcript
    • Aldo/keto reductases, related to diketogulonate reductase [General function prediction only].

ALAD

  • refseq_ALAD.F1 refseq_ALAD.R1 188 354
  • NCBIGene 36.3 210
  • Alternative 3-prime, size difference: 166
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001003945

  • Changed! cd eu_ALAD_PBGS_cysteine_rich 313aa 1e-151 in ref transcript
    • Porphobilinogen synthase (PBGS), which is also called delta-aminolevulinic acid dehydratase (ALAD), catalyzes the condensation of two 5-aminolevulinic acid (ALA) molecules to form the pyrrole porphobilinogen (PBG), which is the second step in the biosynthesis of tetrapyrroles, such as heme, vitamin B12 and chlorophyll. This reaction involves the formation of a Schiff base link between the substrate and the enzyme. PBGSs are metalloenzymes, some of which have a second, allosteric metal binding site, beside the metal ion binding site in their active site. Although PBGS is a family of homologous enzymes, its metal ion utilization at catalytic site varies between zinc and magnesium and/or potassium. PBGS can be classified into two groups based on differences in their active site metal binding site. The eukaryotic PBGSs represented by this model, which contain a cysteine-rich zinc binding motif (DXCXCX(Y/F)X3G(H/Q)CG), require zinc for their activity, they do not contain an additional allosteric metal binding site and do not bind magnesium.
  • Changed! pfam ALAD 289aa 1e-138 in ref transcript
    • Delta-aminolevulinic acid dehydratase.
  • Changed! PRK PRK09283 284aa 1e-112 in ref transcript
    • delta-aminolevulinic acid dehydratase; Validated.

ALAS1

  • refseq_ALAS1.F2 refseq_ALAS1.R2 186 363
  • NCBIGene 36.3 211
  • Single exon skipping, size difference: 177
  • Exclusion in 5'UTR
  • Reference transcript: NM_000688

  • cd KBL_like 356aa 1e-137 in ref transcript
    • KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
  • TIGR 5aminolev_synth 406aa 1e-178 in ref transcript
    • This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
  • PRK PRK09064 405aa 1e-167 in ref transcript
    • 5-aminolevulinate synthase; Validated.

ALAS2

  • refseq_ALAS2.F1 refseq_ALAS2.R1 178 374
  • NCBIGene 36.3 212
  • Single exon skipping, size difference: 196
  • Exclusion of the protein initiation site
  • Reference transcript: NM_000032

  • cd KBL_like 352aa 1e-149 in ref transcript
    • KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
  • TIGR 5aminolev_synth 406aa 0.0 in ref transcript
    • This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
  • Changed! pfam Preseq_ALAS 101aa 1e-55 in ref transcript
    • 5-aminolevulinate synthase presequence. The N terminal presequence domain found in 5-aminolevulinate synthase exists as an amphipathic helix, with a positively charged surface provided by lysine residues and no stable helix at the N-terminus. The domain is essential for the import process by which ALAS is transported into the mitochondria: translocase of the outer membrane (Tom) and translocase of the inner membrane protein complexes appear responsible for recognition and import through the mitochondrial membrane. The protein Tom20 is anchored to the mitochondrial outer membrane, and its interaction with presequences is thought to be the recognition step which allows subsequent import.
  • PRK PRK09064 405aa 0.0 in ref transcript
    • 5-aminolevulinate synthase; Validated.
  • Changed! pfam Preseq_ALAS 29aa 7e-10 in modified transcript

ALAS2

  • refseq_ALAS2.F1 refseq_ALAS2.R5 160 267
  • NCBIGene 36.3 212
  • Single exon skipping, size difference: 107
  • Inclusion in 5'UTR
  • Reference transcript: NM_000032

  • cd KBL_like 352aa 1e-149 in ref transcript
    • KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
  • TIGR 5aminolev_synth 406aa 0.0 in ref transcript
    • This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
  • pfam Preseq_ALAS 101aa 1e-55 in ref transcript
    • 5-aminolevulinate synthase presequence. The N terminal presequence domain found in 5-aminolevulinate synthase exists as an amphipathic helix, with a positively charged surface provided by lysine residues and no stable helix at the N-terminus. The domain is essential for the import process by which ALAS is transported into the mitochondria: translocase of the outer membrane (Tom) and translocase of the inner membrane protein complexes appear responsible for recognition and import through the mitochondrial membrane. The protein Tom20 is anchored to the mitochondrial outer membrane, and its interaction with presequences is thought to be the recognition step which allows subsequent import.
  • PRK PRK09064 405aa 0.0 in ref transcript
    • 5-aminolevulinate synthase; Validated.

ALAS2

  • refseq_ALAS2.F3 refseq_ALAS2.R3 186 297
  • NCBIGene 36.3 212
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000032

  • cd KBL_like 352aa 1e-149 in ref transcript
    • KBL_like; this family belongs to the pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD corresponds to serine palmitoyltransferase (SPT), 5-aminolevulinate synthase (ALAS), 8-amino-7-oxononanoate synthase (AONS), and 2-amino-3-ketobutyrate CoA ligase (KBL). SPT is responsible for the condensation of L-serine with palmitoyl-CoA to produce 3-ketodihydrospingosine, the reaction of the first step in sphingolipid biosynthesis. ALAS is involved in heme biosynthesis; it catalyzes the synthesis of 5-aminolevulinic acid from glycine and succinyl-coenzyme A. AONS catalyses the decarboxylative condensation of l-alanine and pimeloyl-CoA in the first committed step of biotin biosynthesis. KBL catalyzes the second reaction step of the metabolic degradation pathway for threonine converting 2-amino-3-ketobutyrate, to glycine and acetyl-CoA. The members of this CD are widely found in all three forms of life.
  • TIGR 5aminolev_synth 406aa 0.0 in ref transcript
    • This model represents 5-aminolevulinic acid synthase, an enzyme for one of two routes to the heme precursor 5-aminolevulinate. The protein is a pyridoxal phosphate-dependent enzyme related to 2-amino-3-ketobutyrate CoA tranferase and 8-amino-7-oxononanoate synthase. This enzyme appears restricted to the alpha Proteobacteria and mitochondrial derivatives.
  • pfam Preseq_ALAS 101aa 1e-55 in ref transcript
    • 5-aminolevulinate synthase presequence. The N terminal presequence domain found in 5-aminolevulinate synthase exists as an amphipathic helix, with a positively charged surface provided by lysine residues and no stable helix at the N-terminus. The domain is essential for the import process by which ALAS is transported into the mitochondria: translocase of the outer membrane (Tom) and translocase of the inner membrane protein complexes appear responsible for recognition and import through the mitochondrial membrane. The protein Tom20 is anchored to the mitochondrial outer membrane, and its interaction with presequences is thought to be the recognition step which allows subsequent import.
  • PRK PRK09064 405aa 0.0 in ref transcript
    • 5-aminolevulinate synthase; Validated.

ALDH1A2

  • refseq_ALDH1A2.F1 refseq_ALDH1A2.R1 311 425
  • NCBIGene 36.3 8854
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003888

  • Changed! pfam Aldedh 464aa 0.0 in ref transcript
    • Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
  • Changed! COG PutA 477aa 1e-155 in ref transcript
    • NAD-dependent aldehyde dehydrogenases [Energy production and conversion].
  • Changed! pfam Aldedh 426aa 1e-174 in modified transcript
  • Changed! COG PutA 439aa 1e-136 in modified transcript

ALDH3A2

  • refseq_ALDH3A2.F1 refseq_ALDH3A2.R1 231 356
  • NCBIGene 36.3 224
  • Single exon skipping, size difference: 125
  • Exclusion of the stop codon
  • Reference transcript: NM_001031806

  • pfam Aldedh 416aa 7e-73 in ref transcript
    • Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
  • COG PutA 423aa 8e-82 in ref transcript
    • NAD-dependent aldehyde dehydrogenases [Energy production and conversion].

ALDH3B1

  • refseq_ALDH3B1.F1 refseq_ALDH3B1.R1 101 211
  • NCBIGene 36.3 221
  • Single exon skipping, size difference: 110
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001030010

  • Changed! pfam Aldedh 250aa 1e-49 in ref transcript
    • Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
  • Changed! COG PutA 258aa 1e-50 in ref transcript
    • NAD-dependent aldehyde dehydrogenases [Energy production and conversion].
  • Changed! TIGR arg_catab_astD 75aa 0.009 in modified transcript
    • Members of this protein family are succinylglutamic semialdehyde dehydrogenase (EC 1.2.1.71), the fourth enzyme in the arginine succinyltransferase (AST) pathway for arginine catabolism.
  • Changed! COG PutA 54aa 0.005 in modified transcript

ALDH5A1

  • refseq_ALDH5A1.F1 refseq_ALDH5A1.R1 125 164
  • NCBIGene 36.3 7915
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170740

  • Changed! TIGR SSADH 464aa 0.0 in ref transcript
    • SSADH enzyme belongs to the aldehyde dehydrogenase family (pfam00171), sharing a common evolutionary origin and enzymatic mechanism with lactaldehyde dehydrogenase. Like in lactaldehyde dehydrogenase and succinate semialdehyde dehydrogenase, the mammalian catalytic glutamic acid and cysteine residues are conserved in all the enzymes of this family (PS00687, PS00070).
  • Changed! PRK gabD 484aa 1e-161 in ref transcript
    • succinate-semialdehyde dehydrogenase I; Provisional.
  • Changed! TIGR SSADH 451aa 0.0 in modified transcript
  • Changed! PRK gabD 471aa 1e-160 in modified transcript

ALDH8A1

  • refseq_ALDH8A1.F1 refseq_ALDH8A1.R1 183 345
  • NCBIGene 36.3 64577
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022568

  • Changed! pfam Aldedh 464aa 1e-172 in ref transcript
    • Aldehyde dehydrogenase family. This family of dehydrogenases act on aldehyde substrates. Members use NADP as a cofactor. The family includes the following members: The prototypical members are the aldehyde dehydrogenases EC:1.2.1.3. Succinate-semialdehyde dehydrogenase EC:1.2.1.16. Lactaldehyde dehydrogenase EC:1.2.1.22. Benzaldehyde dehydrogenase EC:1.2.1.28. Methylmalonate-semialdehyde dehydrogenase EC:1.2.1.27. Glyceraldehyde-3-phosphate dehydrogenase EC:1.2.1.9. Delta-1-pyrroline-5-carboxylate dehydrogenase EC: 1.5.1.12. Acetaldehyde dehydrogenase EC:1.2.1.10. Glutamate-5-semialdehyde dehydrogenase EC:1.2.1.41. This family also includes omega crystallin, an eye lens protein from squid and octopus that has little aldehyde dehydrogenase activity.
  • Changed! COG PutA 476aa 1e-143 in ref transcript
    • NAD-dependent aldehyde dehydrogenases [Energy production and conversion].
  • Changed! pfam Aldedh 410aa 1e-138 in modified transcript
  • Changed! COG PutA 422aa 1e-115 in modified transcript

ALG8

  • refseq_ALG8.F1 refseq_ALG8.R1 308 383
  • NCBIGene 36.3 79053
  • Single exon skipping, size difference: 75
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_024079

  • Changed! pfam Alg6_Alg8 494aa 1e-139 in ref transcript
    • ALG6, ALG8 glycosyltransferase family. N-linked (asparagine-linked) glycosylation of proteins is mediated by a highly conserved pathway in eukaryotes, in which a lipid (dolichol phosphate)-linked oligosaccharide is assembled at the endoplasmic reticulum membrane prior to the transfer of the oligosaccharide moiety to the target asparagine residues. This oligosaccharide is composed of Glc(3)Man(9)GlcNAc(2). The addition of the three glucose residues is the final series of steps in the synthesis of the oligosaccharide precursor. Alg6 transfers the first glucose residue, and Alg8 transfers the second one. In the human alg6 gene, a C->T transition, which causes Ala333 to be replaced with Val, has been identified as the cause of a congenital disorder of glycosylation, designated as type Ic OMIM:603147.
  • Changed! pfam Alg6_Alg8 437aa 1e-121 in modified transcript

ALKBH1

  • refseq_ALKBH1.F2 refseq_ALKBH1.R2 117 280
  • NCBIGene 36.2 8846
  • Single exon skipping, size difference: 163
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006020

  • Changed! TIGR alkb 171aa 3e-72 in ref transcript
    • Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
  • Changed! COG AlkB 137aa 2e-16 in ref transcript
    • Alkylated DNA repair protein [DNA replication, recombination, and repair].

ALKBH6

  • refseq_ALKBH6.F2 refseq_ALKBH6.R2 137 175
  • NCBIGene 36.3 84964
  • Alternative 3-prime, size difference: 38
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032878

ALKBH6

  • refseq_ALKBH6.F4 refseq_ALKBH6.R4 145 177
  • NCBIGene 36.2 84964
  • Alternative 3-prime, size difference: 32
  • Exclusion in 5'UTR
  • Reference transcript: NM_032878

ALOX15B

  • refseq_ALOX15B.F1 refseq_ALOX15B.R1 128 263
  • NCBIGene 36.3 247
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 135
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001141

  • cd PLAT_LOX 120aa 7e-37 in ref transcript
    • PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they catalyze enzymatic lipid peroxidation in complex biological structures via direct dioxygenation of phospholipids and cholesterol esters of biomembranes and plasma lipoproteins. Both types of enzymes are cytosolic but need this domain to access their sequestered membrane or micelle bound substrates.
  • Changed! pfam Lipoxygenase 437aa 1e-106 in ref transcript
    • Lipoxygenase.
  • pfam PLAT 114aa 2e-17 in ref transcript
    • PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.
  • Changed! pfam Lipoxygenase 392aa 3e-85 in modified transcript

ALOX15B

  • refseq_ALOX15B.F4 refseq_ALOX15B.R4 134 221
  • NCBIGene 36.3 247
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001141

  • cd PLAT_LOX 120aa 7e-37 in ref transcript
    • PLAT domain of 12/15-lipoxygenase. As a unique subfamily of the mammalian lipoxygenases, they catalyze enzymatic lipid peroxidation in complex biological structures via direct dioxygenation of phospholipids and cholesterol esters of biomembranes and plasma lipoproteins. Both types of enzymes are cytosolic but need this domain to access their sequestered membrane or micelle bound substrates.
  • Changed! pfam Lipoxygenase 437aa 1e-106 in ref transcript
    • Lipoxygenase.
  • pfam PLAT 114aa 2e-17 in ref transcript
    • PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.
  • Changed! pfam Lipoxygenase 408aa 2e-91 in modified transcript

PARD3B

  • refseq_ALS2CR19.F1 refseq_ALS2CR19.R1 119 326
  • NCBIGene 36.3 117583
  • Single exon skipping, size difference: 207
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152526

  • cd PDZ_signaling 79aa 9e-12 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 88aa 1e-05 in ref transcript
  • cd PDZ_signaling 32aa 0.008 in ref transcript
  • smart PDZ 81aa 2e-13 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 91aa 8e-07 in ref transcript
  • COG Prc 81aa 1e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

PARD3B

  • refseq_ALS2CR19.F3 refseq_ALS2CR19.R3 100 403
  • NCBIGene 36.3 117583
  • Single exon skipping, size difference: 303
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152526

  • cd PDZ_signaling 79aa 9e-12 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 88aa 1e-05 in ref transcript
  • cd PDZ_signaling 32aa 0.008 in ref transcript
  • smart PDZ 81aa 2e-13 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 91aa 8e-07 in ref transcript
  • COG Prc 81aa 1e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

PARD3B

  • refseq_ALS2CR19.F4 refseq_ALS2CR19.R4 156 342
  • NCBIGene 36.3 117583
  • Single exon skipping, size difference: 186
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_152526

  • cd PDZ_signaling 79aa 9e-12 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 88aa 1e-05 in ref transcript
  • Changed! cd PDZ_signaling 32aa 0.008 in ref transcript
  • smart PDZ 81aa 2e-13 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 91aa 8e-07 in ref transcript
  • COG Prc 81aa 1e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • Changed! cd PDZ_signaling 78aa 2e-10 in modified transcript
  • Changed! smart PDZ 88aa 2e-10 in modified transcript
  • Changed! TIGR degP_htrA_DO 147aa 1e-05 in modified transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.

AMACR

  • refseq_AMACR.F1 refseq_AMACR.R1 189 350
  • NCBIGene 36.3 23600
  • Single exon skipping, size difference: 161
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014324

  • Changed! pfam CoA_transf_3 192aa 1e-68 in ref transcript
    • CoA-transferase family III. CoA-transferases are found in organisms from all lines of descent. Most of these enzymes belong to two well-known enzyme families, but recent work on unusual biochemical pathways of anaerobic bacteria has revealed the existence of a third family of CoA-transferases. The members of this enzyme family differ in sequence and reaction mechanism from CoA-transferases of the other families. Currently known enzymes of the new family are a formyl-CoA: oxalate CoA-transferase, a succinyl-CoA: (R)-benzylsuccinate CoA-transferase, an (E)-cinnamoyl-CoA: (R)-phenyllactate CoA-transferase, and a butyrobetainyl-CoA: (R)-carnitine CoA-transferase. In addition, a large number of proteins of unknown or differently annotated function from Bacteria, Archaea and Eukarya apparently belong to this enzyme family. Properties and reaction mechanisms of the CoA-transferases of family III are described and compared to those of the previously known CoA-transferases.
  • Changed! COG CaiB 371aa 2e-85 in ref transcript
    • Predicted acyl-CoA transferases/carnitine dehydratase [Energy production and conversion].
  • Changed! pfam CoA_transf_3 79aa 6e-28 in modified transcript
  • Changed! TIGR oxalate_frc 161aa 1e-21 in modified transcript
    • This enzyme, formyl-CoA transferase, transfers coenzyme A from formyl-CoA to oxalate. It forms a pathway, together with oxalyl-CoA decarboxylase, for oxalate degradation; decarboxylation by the latter gene regenerates formyl-CoA. The two enzymes typically are encoded by a two-gene operon.
  • Changed! COG CaiB 129aa 3e-37 in modified transcript

AMD1

  • refseq_AMD1.F2 refseq_AMD1.R2 108 425
  • NCBIGene 36.3 262
  • Multiple exon skipping, size difference: 317
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001634

  • Changed! pfam SAM_decarbox 328aa 1e-124 in ref transcript
    • Adenosylmethionine decarboxylase. This is a family of S-adenosylmethionine decarboxylase (SAMDC) proenzymes. In the biosynthesis of polyamines SAMDC produces decarboxylated S-adenosylmethionine, which serves as the aminopropyl moiety necessary for spermidine and spermine biosynthesis from putrescine. The Pfam alignment contains both the alpha and beta chains that are cleaved to form the active enzyme.
  • Changed! pfam SAM_decarbox 41aa 2e-09 in modified transcript

AMMECR1

  • refseq_AMMECR1.F2 refseq_AMMECR1.R2 252 363
  • NCBIGene 36.3 9949
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015365

  • Changed! pfam AMMECR1 172aa 4e-45 in ref transcript
    • AMMECR1. This family consists of several AMMECR1 as well as several uncharacterised proteins. The contiguous gene deletion syndrome AMME is characterised by Alport syndrome, midface hypoplasia, mental retardation and elliptocytosis and is caused by a deletion in Xq22.3, comprising several genes including COL4A5, FACL4 and AMMECR1. This family contains sequences from several eukaryotic species as well as archaebacteria and it has been suggested that the AMMECR1 protein may have a basic cellular function, potentially in either the transcription, replication, repair or translation machinery.
  • Changed! COG AMMECR1 153aa 1e-24 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam AMMECR1 106aa 6e-28 in modified transcript
  • Changed! COG AMMECR1 85aa 7e-15 in modified transcript

AMZ2

  • refseq_AMZ2.F1 refseq_AMZ2.R1 175 349
  • NCBIGene 36.3 51321
  • Single exon skipping, size difference: 174
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016627

  • Changed! pfam Peptidase_M54 154aa 9e-12 in ref transcript
    • Peptidase family M54. This is a family of metallopeptidases. Two human proteins have been reported to degrade synthetic substrates and peptides.
  • Changed! COG COG1913 178aa 2e-18 in ref transcript
    • Predicted Zn-dependent proteases [General function prediction only].
  • Changed! pfam Peptidase_M54 128aa 6e-12 in modified transcript
  • Changed! COG COG1913 143aa 3e-17 in modified transcript

ANAPC11

  • refseq_ANAPC11.F2 refseq_ANAPC11.R3 141 204
  • NCBIGene 36.3 51529
  • Single exon skipping, size difference: 63
  • Inclusion in 5'UTR
  • Reference transcript: NM_001002244

  • COG APC11 37aa 5e-07 in ref transcript
    • Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning].

ANAPC11

  • refseq_ANAPC11.F3 refseq_ANAPC11.R4 102 403
  • NCBIGene 36.3 51529
  • Single exon skipping, size difference: 301
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001002244

  • Changed! COG APC11 37aa 5e-07 in ref transcript
    • Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning].
  • Changed! COG APC11 84aa 1e-16 in modified transcript

ANGPTL4

  • refseq_ANGPTL4.F2 refseq_ANGPTL4.R2 104 218
  • NCBIGene 36.3 51129
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139314

  • Changed! cd FReD 218aa 1e-68 in ref transcript
    • Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
  • Changed! smart FBG 217aa 2e-62 in ref transcript
    • Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
  • Changed! cd FReD 179aa 5e-56 in modified transcript
  • Changed! smart FBG 179aa 1e-50 in modified transcript

ANK1

  • refseq_ANK1.F1 refseq_ANK1.R1 157 223
  • NCBIGene 36.2 286
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000037

  • cd ANK 126aa 1e-31 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 126aa 2e-31 in ref transcript
  • cd ANK 126aa 3e-31 in ref transcript
  • cd ANK 124aa 1e-30 in ref transcript
  • cd ANK 126aa 1e-30 in ref transcript
  • cd ANK 120aa 2e-28 in ref transcript
  • cd ANK 126aa 3e-28 in ref transcript
  • cd ANK 155aa 2e-27 in ref transcript
  • cd ANK 126aa 8e-24 in ref transcript
  • pfam ZU5 105aa 2e-42 in ref transcript
    • ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
  • smart DEATH 87aa 1e-18 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • TIGR trp 253aa 4e-10 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • TIGR trp 143aa 4e-09 in ref transcript
  • TIGR trp 163aa 5e-09 in ref transcript
  • TIGR trp 237aa 5e-06 in ref transcript
  • pfam Ank 31aa 3e-05 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • COG Arp 158aa 2e-17 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 178aa 2e-16 in ref transcript
  • COG Arp 187aa 4e-14 in ref transcript
  • COG Arp 128aa 1e-13 in ref transcript
  • COG Arp 160aa 1e-12 in ref transcript

ANK3

  • refseq_ANK3.F1 refseq_ANK3.R1 104 413
  • NCBIGene 36.3 288
  • Single exon skipping, size difference: 309
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_020987

  • cd ANK 126aa 2e-33 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 126aa 5e-33 in ref transcript
  • cd ANK 126aa 5e-33 in ref transcript
  • cd ANK 126aa 8e-33 in ref transcript
  • cd ANK 126aa 4e-32 in ref transcript
  • cd ANK 125aa 2e-31 in ref transcript
  • cd ANK 122aa 2e-31 in ref transcript
  • pfam ZU5 105aa 9e-37 in ref transcript
    • ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
  • smart DEATH 87aa 3e-21 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • TIGR trp 213aa 3e-14 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • TIGR trp 308aa 1e-07 in ref transcript
  • TIGR trp 186aa 2e-07 in ref transcript
  • TIGR trp 131aa 6e-06 in ref transcript
  • pfam Ank 32aa 3e-05 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 32aa 7e-05 in ref transcript
  • COG Arp 191aa 2e-17 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 145aa 1e-14 in ref transcript
  • COG Arp 219aa 3e-14 in ref transcript
  • COG Arp 127aa 3e-13 in ref transcript
  • COG Arp 130aa 2e-12 in ref transcript
  • COG Arp 221aa 9e-11 in ref transcript
  • PTZ PTZ00322 70aa 6e-04 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

ANK3

  • refseq_ANK3.F2 refseq_ANK3.R2 108 135
  • NCBIGene 36.3 288
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020987

  • cd ANK 126aa 2e-33 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 126aa 5e-33 in ref transcript
  • cd ANK 126aa 5e-33 in ref transcript
  • cd ANK 126aa 8e-33 in ref transcript
  • cd ANK 126aa 4e-32 in ref transcript
  • cd ANK 125aa 2e-31 in ref transcript
  • cd ANK 122aa 2e-31 in ref transcript
  • pfam ZU5 105aa 9e-37 in ref transcript
    • ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
  • smart DEATH 87aa 3e-21 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • TIGR trp 213aa 3e-14 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • TIGR trp 308aa 1e-07 in ref transcript
  • TIGR trp 186aa 2e-07 in ref transcript
  • TIGR trp 131aa 6e-06 in ref transcript
  • pfam Ank 32aa 3e-05 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 32aa 7e-05 in ref transcript
  • COG Arp 191aa 2e-17 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 145aa 1e-14 in ref transcript
  • COG Arp 219aa 3e-14 in ref transcript
  • COG Arp 127aa 3e-13 in ref transcript
  • COG Arp 130aa 2e-12 in ref transcript
  • COG Arp 221aa 9e-11 in ref transcript
  • PTZ PTZ00322 70aa 6e-04 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

ANKMY1

  • refseq_ANKMY1.F2 refseq_ANKMY1.R2 123 372
  • NCBIGene 36.3 51281
  • Multiple exon skipping, size difference: 249
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016552

  • Changed! cd ANK 166aa 6e-13 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 83aa 1e-08 in ref transcript
  • Changed! pfam Ank 34aa 3e-04 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 33aa 0.004 in ref transcript
  • Changed! COG Arp 119aa 1e-06 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! COG Arp 86aa 6e-05 in ref transcript
  • COG COG4642 83aa 1e-04 in ref transcript
    • Uncharacterized protein conserved in bacteria [Function unknown].
  • Changed! cd ANK 105aa 1e-07 in modified transcript
  • Changed! COG Arp 107aa 1e-05 in modified transcript

ANKMY1

  • refseq_ANKMY1.F4 refseq_ANKMY1.R4 234 397
  • NCBIGene 36.3 51281
  • Single exon skipping, size difference: 163
  • Inclusion in 5'UTR
  • Reference transcript: NM_016552

  • cd ANK 166aa 6e-13 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 83aa 1e-08 in ref transcript
  • pfam Ank 34aa 3e-04 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 33aa 0.004 in ref transcript
  • COG Arp 119aa 1e-06 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 86aa 6e-05 in ref transcript
  • COG COG4642 83aa 1e-04 in ref transcript
    • Uncharacterized protein conserved in bacteria [Function unknown].

ANKRD16

  • refseq_ANKRD16.F1 refseq_ANKRD16.R1 136 215
  • NCBIGene 36.3 54522
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_019046

  • cd ANK 121aa 5e-22 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd ANK 156aa 5e-20 in ref transcript
  • cd ANK 128aa 3e-19 in ref transcript
  • TIGR trp 142aa 4e-05 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! TIGR trp 223aa 3e-04 in ref transcript
  • Changed! COG Arp 184aa 7e-12 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 122aa 1e-10 in ref transcript
  • Changed! cd ANK 112aa 2e-18 in modified transcript
  • Changed! pfam Ank 32aa 0.003 in modified transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • Changed! TIGR trp 177aa 0.004 in modified transcript
  • Changed! COG Arp 145aa 4e-13 in modified transcript

ANKRD5

  • refseq_ANKRD5.F2 refseq_ANKRD5.R2 339 403
  • NCBIGene 36.3 63926
  • Single exon skipping, size difference: 64
  • Exclusion in 5'UTR
  • Reference transcript: NM_022096

  • cd ANK 125aa 1e-25 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 115aa 4e-15 in ref transcript
  • cd ANK 124aa 6e-14 in ref transcript
  • cd EFh 59aa 0.005 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • pfam Ank 30aa 3e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • TIGR trp 119aa 7e-06 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • pfam Ank 33aa 4e-04 in ref transcript
  • pfam Ank 28aa 0.002 in ref transcript
  • COG Arp 196aa 2e-13 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 143aa 4e-04 in ref transcript
  • COG Arp 91aa 0.003 in ref transcript

ANKS1B

  • refseq_ANKS1B.F2 refseq_ANKS1B.R2 169 241
  • NCBIGene 36.3 56899
  • Single exon skipping, size difference: 72
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_152788

  • cd AIDA-1b 132aa 1e-50 in ref transcript
    • AIDA-1b Phosphotyrosine-binding (PTB) domain. AIDA-1b is an amyloid-beta precursor protein interacting protein. It consists of ankyrin repeats, a SAM domain and a C-terminal PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
  • cd ANK 129aa 3e-29 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 125aa 4e-25 in ref transcript
  • cd SAM 62aa 2e-11 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • cd SAM 63aa 3e-08 in ref transcript
  • pfam PID 131aa 3e-29 in ref transcript
    • Phosphotyrosine interaction domain (PTB/PID).
  • smart SAM 66aa 2e-13 in ref transcript
    • Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
  • smart SAM 62aa 3e-11 in ref transcript
  • TIGR trp 191aa 1e-06 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • pfam Ank 32aa 4e-04 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • COG Arp 134aa 2e-14 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 184aa 2e-08 in ref transcript

ANXA13

  • refseq_ANXA13.F1 refseq_ANXA13.R1 100 223
  • NCBIGene 36.3 312
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003954

  • pfam Annexin 66aa 6e-21 in ref transcript
    • Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
  • pfam Annexin 66aa 2e-20 in ref transcript
  • pfam Annexin 67aa 7e-17 in ref transcript
  • pfam Annexin 66aa 3e-12 in ref transcript

ANXA6

  • refseq_ANXA6.F1 refseq_ANXA6.R1 102 120
  • NCBIGene 36.3 309
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001155

  • pfam Annexin 66aa 3e-21 in ref transcript
    • Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
  • pfam Annexin 66aa 8e-20 in ref transcript
  • pfam Annexin 66aa 2e-18 in ref transcript
  • pfam Annexin 65aa 8e-18 in ref transcript
  • pfam Annexin 66aa 3e-17 in ref transcript
  • pfam Annexin 67aa 2e-15 in ref transcript
  • pfam Annexin 65aa 1e-12 in ref transcript
  • Changed! pfam Annexin 67aa 3e-11 in ref transcript
  • Changed! pfam Annexin 67aa 6e-11 in modified transcript

ANXA7

  • refseq_ANXA7.F2 refseq_ANXA7.R2 288 354
  • NCBIGene 36.3 310
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004034

  • pfam Annexin 66aa 7e-24 in ref transcript
    • Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
  • pfam Annexin 66aa 3e-22 in ref transcript
  • pfam Annexin 66aa 5e-21 in ref transcript
  • pfam Annexin 67aa 6e-14 in ref transcript

AOC2

  • refseq_AOC2.F1 refseq_AOC2.R1 140 221
  • NCBIGene 36.3 314
  • Alternative 5-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_009590

  • Changed! pfam Cu_amine_oxid 416aa 1e-136 in ref transcript
    • Copper amine oxidase, enzyme domain. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrogen peroxide. The enzymes are dimers of identical 70-90 kDa subunits, each of which contains a single copper ion and a covalently bound cofactor formed by the post-translational modification of a tyrosine side chain to 2,4,5-trihydroxyphenylalanine quinone (TPQ). This family corresponds to the catalytic domain of the enzyme.
  • pfam Cu_amine_oxidN3 99aa 1e-23 in ref transcript
    • Copper amine oxidase, N3 domain. This domain is the second or third structural domain in copper amine oxidases, it is known as the N3 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrogen peroxide. The enzymes are dimers of identical 70-90 kDa subunits, each of which contains a single copper ion and a covalently bound cofactor formed by the post-translational modification of a tyrosine side chain to 2,4,5-trihydroxyphenylalanine quinone (TPQ).
  • pfam Cu_amine_oxidN2 86aa 9e-20 in ref transcript
    • Copper amine oxidase, N2 domain. This domain is the first or second structural domain in copper amine oxidases, it is known as the N2 domain. Its function is uncertain. The catalytic domain can be found in pfam01179. Copper amine oxidases are a ubiquitous and novel group of quinoenzymes that catalyse the oxidative deamination of primary amines to the corresponding aldehydes, with concomitant reduction of molecular oxygen to hydrogen peroxide. The enzymes are dimers of identical 70-90 kDa subunits, each of which contains a single copper ion and a covalently bound cofactor formed by the post-translational modification of a tyrosine side chain to 2,4,5-trihydroxyphenylalanine quinone (TPQ).
  • Changed! COG TynA 416aa 1e-45 in ref transcript
    • Cu2+-containing amine oxidase [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam Cu_amine_oxid 389aa 1e-126 in modified transcript
  • Changed! COG TynA 389aa 1e-41 in modified transcript

AP1B1

  • refseq_AP1B1.F1 refseq_AP1B1.R1 119 140
  • NCBIGene 36.3 162
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001127

  • cd ARM 113aa 2e-09 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • pfam Adaptin_N 524aa 1e-163 in ref transcript
    • Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
  • pfam B2-adapt-app_C 114aa 2e-34 in ref transcript
    • Beta2-adaptin appendage, C-terminal sub-domain. Members of this family adopt a structure consisting of a 5 stranded beta-sheet, flanked by one alpha helix on the outer side, and by two alpha helices on the inner side. This domain is required for binding to clathrin, and its subsequent polymerisation. Furthermore, a hydrophobic patch present in the domain also binds to a subset of D-phi-F/W motif-containing proteins that are bound by the alpha-adaptin appendage domain (epsin, AP180, eps15).
  • smart Alpha_adaptinC2 101aa 2e-16 in ref transcript
    • Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
  • COG COG5096 548aa 1e-113 in ref transcript
    • Vesicle coat complex, various subunits [Intracellular trafficking and secretion].

AP1GBP1

  • refseq_AP1GBP1.F1 refseq_AP1GBP1.R1 159 195
  • NCBIGene 36.3 11276
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007247

  • cd EH 52aa 3e-09 in ref transcript
    • Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
  • smart EH 62aa 6e-08 in ref transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.

AP2A1

  • refseq_AP2A1.F1 refseq_AP2A1.R1 132 198
  • NCBIGene 36.3 160
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014203

  • pfam Adaptin_N 563aa 1e-151 in ref transcript
    • Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
  • pfam Alpha_adaptin_C 109aa 7e-50 in ref transcript
    • Alpha adaptin AP2, C-terminal domain. Alpha adaptin is a hetero tetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site.
  • pfam Alpha_adaptinC2 106aa 2e-13 in ref transcript
    • Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found in alpha, beta and gamma adaptins.
  • COG COG5096 549aa 3e-11 in ref transcript
    • Vesicle coat complex, various subunits [Intracellular trafficking and secretion].

AP2B1

  • refseq_AP2B1.F1 refseq_AP2B1.R1 196 238
  • NCBIGene 36.3 163
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001030006

  • cd ARM 113aa 1e-09 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • pfam Adaptin_N 497aa 1e-160 in ref transcript
    • Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
  • pfam B2-adapt-app_C 111aa 4e-35 in ref transcript
    • Beta2-adaptin appendage, C-terminal sub-domain. Members of this family adopt a structure consisting of a 5 stranded beta-sheet, flanked by one alpha helix on the outer side, and by two alpha helices on the inner side. This domain is required for binding to clathrin, and its subsequent polymerisation. Furthermore, a hydrophobic patch present in the domain also binds to a subset of D-phi-F/W motif-containing proteins that are bound by the alpha-adaptin appendage domain (epsin, AP180, eps15).
  • smart Alpha_adaptinC2 101aa 3e-17 in ref transcript
    • Adaptin C-terminal domain. Adaptins are components of the adaptor complexes which link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. Gamma-adaptin is a subunit of the golgi adaptor. Alpha adaptin is a heterotetramer that regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This Ig-fold domain is found in alpha, beta and gamma adaptins and consists of a beta-sandwich containing 7 strands in 2 beta-sheets in a greek-key topology PUBMED:10430869, PUBMED:12176391. The adaptor appendage contains an additional N-terminal strand.
  • COG COG5096 548aa 1e-112 in ref transcript
    • Vesicle coat complex, various subunits [Intracellular trafficking and secretion].

AP2S1

  • refseq_AP2S1.F1 refseq_AP2S1.R1 179 293
  • NCBIGene 36.3 1175
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004069

  • Changed! pfam Clat_adaptor_s 142aa 6e-57 in ref transcript
    • Clathrin adaptor complex small chain.
  • Changed! COG APS2 142aa 6e-42 in ref transcript
    • Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].
  • Changed! pfam Clat_adaptor_s 104aa 2e-33 in modified transcript
  • Changed! COG APS2 104aa 3e-22 in modified transcript

AP3M1

  • refseq_AP3M1.F1 refseq_AP3M1.R1 268 394
  • NCBIGene 36.3 26985
  • Single exon skipping, size difference: 126
  • Exclusion in 5'UTR
  • Reference transcript: NM_207012

  • pfam Adap_comp_sub 254aa 3e-77 in ref transcript
    • Adaptor complexes medium subunit family. This family also contains members which are coatomer subunits.
  • pfam Clat_adaptor_s 144aa 0.002 in ref transcript
    • Clathrin adaptor complex small chain.
  • COG APS2 140aa 0.002 in ref transcript
    • Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].

AP3S1

  • refseq_AP3S1.F1 refseq_AP3S1.R1 179 215
  • NCBIGene 36.2 1176
  • Single exon skipping, size difference: 36
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001284

  • Changed! pfam Clat_adaptor_s 148aa 7e-47 in ref transcript
    • Clathrin adaptor complex small chain.
  • Changed! COG APS2 152aa 5e-44 in ref transcript
    • Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].
  • Changed! pfam Clat_adaptor_s 23aa 4e-05 in modified transcript
  • Changed! COG APS2 22aa 2e-04 in modified transcript

APOL1

  • refseq_APOL1.F1 refseq_APOL1.R1 245 399
  • NCBIGene 36.3 8542
  • Single exon skipping, size difference: 154
  • Exclusion of the protein initiation site
  • Reference transcript: NM_145343

  • pfam ApoL 299aa 1e-108 in ref transcript
    • Apolipoprotein L. Apo L belongs to the high density lipoprotein family that plays a central role in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as modulating gene transcription and signal transduction both in the adult brain and during neurodevelopment. There are six apo L genes located in close proximity to each other on chromosome 22q12 in humans. 22q12 is a confirmed high-susceptibility locus for schizophrenia and close to the region associated with velocardiofacial syndrome that includes symptoms of schizophrenia.

APOL3

  • refseq_APOL3.F1 refseq_APOL3.R1 112 175
  • NCBIGene 36.3 80833
  • Single exon skipping, size difference: 63
  • Exclusion in 5'UTR
  • Reference transcript: NM_145641

  • pfam ApoL 196aa 3e-50 in ref transcript
    • Apolipoprotein L. Apo L belongs to the high density lipoprotein family that plays a central role in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as modulating gene transcription and signal transduction both in the adult brain and during neurodevelopment. There are six apo L genes located in close proximity to each other on chromosome 22q12 in humans. 22q12 is a confirmed high-susceptibility locus for schizophrenia and close to the region associated with velocardiofacial syndrome that includes symptoms of schizophrenia.

APOL4

  • refseq_APOL4.F1 refseq_APOL4.R1 218 407
  • NCBIGene 36.3 80832
  • Single exon skipping, size difference: 189
  • Exclusion in 5'UTR
  • Reference transcript: NM_030643

  • pfam ApoL 190aa 1e-49 in ref transcript
    • Apolipoprotein L. Apo L belongs to the high density lipoprotein family that plays a central role in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as modulating gene transcription and signal transduction both in the adult brain and during neurodevelopment. There are six apo L genes located in close proximity to each other on chromosome 22q12 in humans. 22q12 is a confirmed high-susceptibility locus for schizophrenia and close to the region associated with velocardiofacial syndrome that includes symptoms of schizophrenia.

APRT

  • refseq_APRT.F1 refseq_APRT.R1 263 397
  • NCBIGene 36.3 353
  • Alternative 3-prime, size difference: 134
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000485

  • Changed! TIGR apt 171aa 1e-70 in ref transcript
    • A phylogenetic analysis suggested omitting the bi-directional best hit homologs from the spirochetes from the seed for this HMM and making only tentative predictions of adenine phosphoribosyltransferase function for this lineage. The trusted cutoff score is made high for this reason. Most proteins scoring between the trusted and noise cutoffs are likely to act as adenine phosphotransferase.
  • Changed! PRK PRK02304 175aa 7e-67 in ref transcript
    • adenine phosphoribosyltransferase; Provisional.
  • Changed! TIGR apt 125aa 7e-53 in modified transcript
  • Changed! PRK PRK02304 128aa 1e-49 in modified transcript

APTX

  • refseq_APTX.F1 refseq_APTX.R1 192 307
  • NCBIGene 36.3 54840
  • Alternative 3-prime, size difference: 115
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_175073

  • Changed! cd aprataxin_related 102aa 6e-35 in ref transcript
    • aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin.
  • Changed! cd FHA 94aa 2e-05 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • Changed! TIGR PNK-3'Pase 105aa 1e-14 in ref transcript
    • Note that the EC number for the kinase function is: 2.7.1.78.
  • Changed! pfam HIT 91aa 6e-06 in ref transcript
    • HIT domain.
  • Changed! COG Hit 83aa 1e-05 in ref transcript
    • Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only].
  • Changed! TIGR PNK-3'Pase 58aa 2e-07 in modified transcript

APTX

  • refseq_APTX.F3 refseq_APTX.R3 195 231
  • NCBIGene 36.3 54840
  • Alternative 3-prime, size difference: 36
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_175073

  • cd aprataxin_related 102aa 6e-35 in ref transcript
    • aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin.
  • cd FHA 94aa 2e-05 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • TIGR PNK-3'Pase 105aa 1e-14 in ref transcript
    • Note that the EC number for the kinase function is: 2.7.1.78.
  • pfam HIT 91aa 6e-06 in ref transcript
    • HIT domain.
  • COG Hit 83aa 1e-05 in ref transcript
    • Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only].

APTX

  • refseq_APTX.F4 refseq_APTX.R4 114 220
  • NCBIGene 36.3 54840
  • Single exon skipping, size difference: 106
  • Exclusion in 5'UTR
  • Reference transcript: NM_175073

  • cd aprataxin_related 102aa 6e-35 in ref transcript
    • aprataxin related: Aprataxin, a HINT family hydrolase is mutated in ataxia oculomotor apraxia syndrome. All the members of this subgroup have the conserved HxHxHxx (where x is a hydrophobic residue) signature motif. Members of this subgroup are predominantly eukaryotic in origin.
  • cd FHA 94aa 2e-05 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • TIGR PNK-3'Pase 105aa 1e-14 in ref transcript
    • Note that the EC number for the kinase function is: 2.7.1.78.
  • pfam HIT 91aa 6e-06 in ref transcript
    • HIT domain.
  • COG Hit 83aa 1e-05 in ref transcript
    • Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases [Nucleotide transport and metabolism / Carbohydrate transport and metabolism / General function prediction only].

ARF1

  • refseq_ARF1.F1 refseq_ARF1.R1 114 136
  • NCBIGene 36.3 375
  • Alternative 5-prime, size difference: 22
  • Exclusion in 5'UTR
  • Reference transcript: NM_001024227

  • cd Arf1_5_like 159aa 1e-98 in ref transcript
    • Arf1-Arf5-like subfamily. This subfamily contains Arf1, Arf2, Arf3, Arf4, Arf5, and related proteins. Arfs1-5 are soluble proteins that are crucial for assembling coat proteins during vesicle formation. Each contains an N-terminal myristoylated amphipathic helix that is folded into the protein in the GDP-bound state. GDP/GTP exchange exposes the helix, which anchors to the membrane. Following GTP hydrolysis, the helix dissociates from the membrane and folds back into the protein. A general feature of Arf1-5 signaling may be the cooperation of two Arfs at the same site. Arfs1-5 are generally considered to be interchangeable in function and location, but some specific functions have been assigned. Arf1 localizes to the early/cis-Golgi, where it is activated by GBF1 and recruits the coat protein COPI. It also localizes to the trans-Golgi network (TGN), where it is activated by BIG1/BIG2 and recruits the AP1, AP3, AP4, and GGA proteins. Humans, but not rodents and other lower eukaryotes, lack Arf2. Human Arf3 shares 96% sequence identity with Arf1 and is believed to generally function interchangeably with Arf1. Human Arf4 in the activated (GTP-bound) state has been shown to interact with the cytoplasmic domain of epidermal growth factor receptor (EGFR) and mediate the EGF-dependent activation of phospholipase D2 (PLD2), leading to activation of the activator protein 1 (AP-1) transcription factor. Arf4 has also been shown to recognize the C-terminal sorting signal of rhodopsin and regulate its incorporation into specialized post-Golgi rhodopsin transport carriers (RTCs). There is some evidence that Arf5 functions at the early-Golgi and the trans-Golgi to affect Golgi-associated alpha-adaptin homology Arf-binding proteins (GGAs).
  • smart ARF 175aa 1e-98 in ref transcript
    • ARF-like small GTPases; ARF, ADP-ribosylation factor. Ras homologues involved in vesicular transport. Activator of phospholipase D isoforms. Unlike Ras proteins they lack cysteine residues at their C-termini and therefore are unlikely to be prenylated. ARFs are N-terminally myristoylated. Contains ATP/GTP-binding motif (P-loop).
  • PTZ PTZ00133 181aa 4e-96 in ref transcript
    • ADP-ribosylation factor; Provisional.

ARF1

  • refseq_ARF1.F3 refseq_ARF1.R3 187 272
  • NCBIGene 36.3 375
  • Alternative 5-prime, size difference: 85
  • Exclusion in 5'UTR
  • Reference transcript: NM_001024226

  • cd Arf1_5_like 159aa 1e-98 in ref transcript
    • Arf1-Arf5-like subfamily. This subfamily contains Arf1, Arf2, Arf3, Arf4, Arf5, and related proteins. Arfs1-5 are soluble proteins that are crucial for assembling coat proteins during vesicle formation. Each contains an N-terminal myristoylated amphipathic helix that is folded into the protein in the GDP-bound state. GDP/GTP exchange exposes the helix, which anchors to the membrane. Following GTP hydrolysis, the helix dissociates from the membrane and folds back into the protein. A general feature of Arf1-5 signaling may be the cooperation of two Arfs at the same site. Arfs1-5 are generally considered to be interchangeable in function and location, but some specific functions have been assigned. Arf1 localizes to the early/cis-Golgi, where it is activated by GBF1 and recruits the coat protein COPI. It also localizes to the trans-Golgi network (TGN), where it is activated by BIG1/BIG2 and recruits the AP1, AP3, AP4, and GGA proteins. Humans, but not rodents and other lower eukaryotes, lack Arf2. Human Arf3 shares 96% sequence identity with Arf1 and is believed to generally function interchangeably with Arf1. Human Arf4 in the activated (GTP-bound) state has been shown to interact with the cytoplasmic domain of epidermal growth factor receptor (EGFR) and mediate the EGF-dependent activation of phospholipase D2 (PLD2), leading to activation of the activator protein 1 (AP-1) transcription factor. Arf4 has also been shown to recognize the C-terminal sorting signal of rhodopsin and regulate its incorporation into specialized post-Golgi rhodopsin transport carriers (RTCs). There is some evidence that Arf5 functions at the early-Golgi and the trans-Golgi to affect Golgi-associated alpha-adaptin homology Arf-binding proteins (GGAs).
  • smart ARF 175aa 1e-98 in ref transcript
    • ARF-like small GTPases; ARF, ADP-ribosylation factor. Ras homologues involved in vesicular transport. Activator of phospholipase D isoforms. Unlike Ras proteins they lack cysteine residues at their C-termini and therefore are unlikely to be prenylated. ARFs are N-terminally myristoylated. Contains ATP/GTP-binding motif (P-loop).
  • PTZ PTZ00133 181aa 4e-96 in ref transcript
    • ADP-ribosylation factor; Provisional.

ARFGAP1

  • refseq_ARFGAP1.F1 refseq_ARFGAP1.R1 136 166
  • NCBIGene 36.3 55738
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_175609

  • pfam ArfGap 118aa 3e-48 in ref transcript
    • Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • COG COG5347 159aa 4e-31 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].

ARFIP1

  • refseq_ARFIP1.F1 refseq_ARFIP1.R1 145 280
  • NCBIGene 36.3 27236
  • Alternative 5-prime, size difference: 135
  • Exclusion in 5'UTR
  • Reference transcript: NM_001025595

  • cd Arfaptin 202aa 1e-51 in ref transcript
    • Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
  • pfam Arfaptin 196aa 5e-70 in ref transcript
    • Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.

ARFIP1

  • refseq_ARFIP1.F2 refseq_ARFIP1.R2 290 386
  • NCBIGene 36.3 27236
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025595

  • cd Arfaptin 202aa 1e-51 in ref transcript
    • Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
  • pfam Arfaptin 196aa 5e-70 in ref transcript
    • Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.

ARHGAP17

  • refseq_ARHGAP17.F1 refseq_ARHGAP17.R1 162 396
  • NCBIGene 36.3 55114
  • Single exon skipping, size difference: 234
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001006634

  • cd RhoGAP_nadrin 201aa 1e-85 in ref transcript
    • RhoGAP_nadrin: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of Nadrin-like proteins. Nadrin, also named Rich-1, has been shown to be involved in the regulation of Ca2+-dependent exocytosis in neurons and recently has been implicated in tight junction maintenance in mammalian epithelium. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • pfam BAR 238aa 1e-51 in ref transcript
    • BAR domain. BAR domains are dimerisation, lipid binding and curvature sensing modules found in many different protein families. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysin, endophilin, BRAP and Nadrin. BAR domains are also frequently found alongside domains that determine lipid specificity, like pfam00169 and pfam00787 domains in beta centaurins and sorting nexins respectively.
  • smart RhoGAP 176aa 6e-41 in ref transcript
    • GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.

ARHGAP6

  • refseq_ARHGAP6.F1 refseq_ARHGAP6.R1 103 199
  • NCBIGene 36.3 395
  • Single exon skipping, size difference: 96
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_013427

  • cd RhoGAP_ARHGAP6 207aa 1e-93 in ref transcript
    • RhoGAP_ARHGAP6: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP6-like proteins. ArhGAP6 shows GAP activity towards RhoA, but not towards Cdc42 and Rac1. ArhGAP6 is often deleted in microphthalmia with linear skin defects syndrome (MLS); MLS is a severe X-linked developmental disorder. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • pfam RhoGAP 160aa 4e-45 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.

ARHGAP6

  • refseq_ARHGAP6.F4 refseq_ARHGAP6.R4 186 276
  • NCBIGene 36.3 395
  • Single exon skipping, size difference: 90
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_013427

  • cd RhoGAP_ARHGAP6 207aa 1e-93 in ref transcript
    • RhoGAP_ARHGAP6: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP6-like proteins. ArhGAP6 shows GAP activity towards RhoA, but not towards Cdc42 and Rac1. ArhGAP6 is often deleted in microphthalmia with linear skin defects syndrome (MLS); MLS is a severe X-linked developmental disorder. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • pfam RhoGAP 160aa 4e-45 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.

ARHGAP8

  • refseq_ARHGAP8.F2 refseq_ARHGAP8.R2 141 234
  • NCBIGene 36.3 23779
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001017526

  • cd RhoGAP-p50rhoGAP 191aa 4e-73 in ref transcript
    • RhoGAP-p50rhoGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of p50RhoGAP-like proteins; p50RhoGAP, also known as RhoGAP-1, contains a C-terminal RhoGAP domain and an N-terminal Sec14 domain which binds phosphatidylinositol 3,4,5-trisphosphate (PtdIns(3,4,5)P3). It is ubiquitously expressed and preferentially active on Cdc42. This subgroup also contains closely related ARHGAP8. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • Changed! cd SEC14 177aa 2e-18 in ref transcript
    • Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
  • pfam RhoGAP 149aa 3e-41 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
  • Changed! smart SEC14 181aa 1e-17 in ref transcript
    • Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p). Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p) and in RhoGAPs, RhoGEFs and the RasGAP, neurofibromin (NF1). Lipid-binding domain. The SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
  • Changed! cd SEC14 146aa 3e-22 in modified transcript
  • Changed! smart SEC14 150aa 2e-22 in modified transcript

ARHGEF1

  • refseq_ARHGEF1.F2 refseq_ARHGEF1.R2 265 364
  • NCBIGene 36.3 9138
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199002

  • cd RhoGEF 187aa 3e-37 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • Changed! pfam RGS-like 191aa 3e-59 in ref transcript
    • Regulator of G protein signalling-like domain. Members of this family adopt a structure consisting of twelve helices that fold into a compact domain that contains the overall structural scaffold observed in other RGS proteins and three additional helical elements that pack closely to it. Helices 1-9 comprise the RGS (pfam00615) fold, in which helices 4-7 form a classic antiparallel bundle adjacent to the other helices. Like other RGS structures, helices 7 and 8 span the length of the folded domain and form essentially one continuous helix with a kink in the middle. Helices 10-12 form an apparently stable C-terminal extension of the structural domain, and although other RGS proteins lack this structure, these elements are intimately associated with the rest of the structural framework by hydrophobic interactions. Members of the family bind to active G-alpha proteins, promoting GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways.
  • smart RhoGEF 185aa 2e-42 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • COG ROM1 256aa 3e-13 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].
  • Changed! pfam RGS-like 158aa 1e-32 in modified transcript

ARHGEF10L

  • refseq_ARHGEF10L.F1 refseq_ARHGEF10L.R1 192 309
  • NCBIGene 36.3 55160
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018125

  • cd RhoGEF 167aa 4e-31 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • smart RhoGEF 164aa 2e-33 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • COG ROM1 175aa 2e-05 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

ARHGEF11

  • refseq_ARHGEF11.F2 refseq_ARHGEF11.R2 316 436
  • NCBIGene 36.3 9826
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198236

  • cd RhoGEF 187aa 3e-39 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • cd PDZ_signaling 75aa 2e-13 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • pfam RGS-like 180aa 3e-69 in ref transcript
    • Regulator of G protein signalling-like domain. Members of this family adopt a structure consisting of twelve helices that fold into a compact domain that contains the overall structural scaffold observed in other RGS proteins and three additional helical elements that pack closely to it. Helices 1-9 comprise the RGS (pfam00615) fold, in which helices 4-7 form a classic antiparallel bundle adjacent to the other helices. Like other RGS structures, helices 7 and 8 span the length of the folded domain and form essentially one continuous helix with a kink in the middle. Helices 10-12 form an apparently stable C-terminal extension of the structural domain, and although other RGS proteins lack this structure, these elements are intimately associated with the rest of the structural framework by hydrophobic interactions. Members of the family bind to active G-alpha proteins, promoting GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways.
  • smart RhoGEF 185aa 7e-45 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • smart PDZ 77aa 1e-15 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG ROM1 260aa 5e-15 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].
  • COG DegQ 79aa 6e-05 in ref transcript
    • Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones].

ARID1B

  • refseq_ARID1B.F1 refseq_ARID1B.R1 191 230
  • NCBIGene 36.3 57492
  • Single exon skipping, size difference: 39
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_017519

  • pfam ARID 104aa 1e-30 in ref transcript
    • ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
  • pfam PAT1 152aa 2e-05 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.

ARID1B

  • refseq_ARID1B.F3 refseq_ARID1B.R3 157 316
  • NCBIGene 36.3 57492
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017519

  • pfam ARID 104aa 1e-30 in ref transcript
    • ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
  • pfam PAT1 152aa 2e-05 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.

ARID5A

  • refseq_ARID5A.F1 refseq_ARID5A.R1 107 246
  • NCBIGene 36.2 10865
  • Single exon skipping, size difference: 139
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_212481

  • Changed! smart BRIGHT 88aa 5e-25 in ref transcript
    • BRIGHT, ARID (A/T-rich interaction domain) domain. DNA-binding domain containing a helix-turn-helix structure.

ARL5A

  • refseq_ARL5A.F1 refseq_ARL5A.R1 181 437
  • NCBIGene 36.3 26225
  • Alternative 5-prime and 3-prime, size difference: 256
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_012097

  • Changed! cd Arl5_Arl8 173aa 1e-94 in ref transcript
    • Arl5/Arl8 subfamily. Arl5 (Arf-like 5) and Arl8, like Arl4 and Arl7, are localized to the nucleus and nucleolus. Arl5 is developmentally regulated during embryogenesis in mice. Human Arl5 interacts with the heterochromatin protein 1-alpha (HP1alpha), a nonhistone chromosomal protein that is associated with heterochromatin and telomeres, and prevents telomere fusion. Arl5 may also play a role in embryonic nuclear dynamics and/or signaling cascades. Arl8 was identified from a fetal cartilage cDNA library. It is found in brain, heart, lung, cartilage, and kidney. No function has been assigned for Arl8 to date.
  • Changed! pfam Arf 158aa 2e-61 in ref transcript
    • ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
  • Changed! PTZ PTZ00133 179aa 5e-55 in ref transcript
    • ADP-ribosylation factor; Provisional.
  • Changed! cd Arl5_Arl8 138aa 6e-76 in modified transcript
  • Changed! pfam Arf 135aa 3e-50 in modified transcript
  • Changed! PTZ PTZ00133 142aa 4e-43 in modified transcript

ARL6

  • refseq_ARL6.F2 refseq_ARL6.R2 179 219
  • NCBIGene 36.3 84100
  • Mutually exclusive exon skipping, size difference: 40
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_177976

  • cd Arl6 162aa 1e-88 in ref transcript
    • Arl6 subfamily. Arl6 (Arf-like 6) forms a subfamily of the Arf family of small GTPases. Arl6 expression is limited to the brain and kidney in adult mice, but it is expressed in the neural plate and somites during embryogenesis, suggesting a possible role for Arl6 in early development. Arl6 is also believed to have a role in cilia or flagella function. Several proteins have been identified that bind Arl6, including Arl6 interacting protein (Arl6ip), and SEC61beta, a subunit of the heterotrimeric conducting channel SEC61p. Based on Arl6 binding to these effectors, Arl6 is also proposed to play a role in protein transport, membrane trafficking, or cell signaling during hematopoietic maturation. At least three specific homozygous Arl6 mutations in humans have been found to cause Bardet-Biedl syndrome, a disorder characterized by obesity, retinopathy, polydactyly, renal and cardiac malformations, learning disabilities, and hypogenitalism. Older literature suggests that Arl6 is a part of the Arl4/Arl7 subfamily, but analyses based on more recent sequence data place Arl6 in its own subfamily.
  • pfam Arf 167aa 3e-49 in ref transcript
    • ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
  • PTZ PTZ00133 182aa 2e-37 in ref transcript
    • ADP-ribosylation factor; Provisional.

ARMC8

  • refseq_ARMC8.F1 refseq_ARMC8.R1 281 335
  • NCBIGene 36.3 25852
  • Alternative 3-prime, size difference: 54
  • Inclusion in 5'UTR
  • Reference transcript: NM_015396

  • cd ARM 116aa 7e-07 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 111aa 2e-06 in ref transcript
  • cd ARM 112aa 0.007 in ref transcript
  • COG COG5369 234aa 1e-18 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • COG SRP1 185aa 2e-05 in ref transcript
    • Karyopherin (importin) alpha [Intracellular trafficking and secretion].

ARMCX2

  • refseq_ARMCX2.F1 refseq_ARMCX2.R1 137 232
  • NCBIGene 36.3 9823
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 95
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_177949

  • cd ARM 118aa 1e-06 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • pfam DUF634 244aa 1e-107 in ref transcript
    • Protein of unknown function (DUF634). Mammalian protein of unknown function.

ARMCX2

  • refseq_ARMCX2.F4 refseq_ARMCX2.R4 158 202
  • NCBIGene 36.3 9823
  • Alternative 3-prime, size difference: 44
  • Inclusion in 5'UTR
  • Reference transcript: NM_177949

  • cd ARM 118aa 1e-06 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • pfam DUF634 244aa 1e-107 in ref transcript
    • Protein of unknown function (DUF634). Mammalian protein of unknown function.

ARMCX3

  • refseq_ARMCX3.F1 refseq_ARMCX3.R1 102 130
  • NCBIGene 36.3 51566
  • Alternative 3-prime, size difference: 28
  • Inclusion in 5'UTR
  • Reference transcript: NM_177947

  • cd ARM 86aa 3e-04 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • pfam DUF634 253aa 1e-106 in ref transcript
    • Protein of unknown function (DUF634). Mammalian protein of unknown function.

ARMCX6

  • refseq_ARMCX6.F2 refseq_ARMCX6.R2 291 373
  • NCBIGene 36.3 54470
  • Single exon skipping, size difference: 82
  • Exclusion in 5'UTR
  • Reference transcript: NM_019007

  • pfam DUF634 191aa 6e-32 in ref transcript
    • Protein of unknown function (DUF634). Mammalian protein of unknown function.

ARNT

  • refseq_ARNT.F1 refseq_ARNT.R1 149 194
  • NCBIGene 36.3 405
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001668

  • cd PAS 97aa 1e-11 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • Changed! cd HLH 57aa 2e-09 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • cd PAS 63aa 1e-05 in ref transcript
  • pfam PAS_3 89aa 5e-14 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • Changed! pfam HLH 54aa 3e-11 in ref transcript
    • Helix-loop-helix DNA-binding domain.
  • pfam PAS 107aa 4e-08 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • Changed! cd HLH 53aa 1e-09 in modified transcript
  • Changed! pfam HLH 54aa 3e-11 in modified transcript

ARNTL

  • refseq_ARNTL.F1 refseq_ARNTL.R1 162 217
  • NCBIGene 36.3 406
  • Single exon skipping, size difference: 55
  • Exclusion in 5'UTR
  • Reference transcript: NM_001178

  • cd PAS 96aa 9e-12 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd HLH 56aa 5e-11 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • cd PAS 95aa 4e-08 in ref transcript
  • pfam HLH 54aa 4e-13 in ref transcript
    • Helix-loop-helix DNA-binding domain.
  • smart PAS 60aa 1e-09 in ref transcript
    • PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
  • smart PAS 54aa 1e-07 in ref transcript

ARNTL

  • refseq_ARNTL.F4 refseq_ARNTL.R4 254 325
  • NCBIGene 36.3 406
  • Single exon skipping, size difference: 71
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001178

  • Changed! cd PAS 96aa 9e-12 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • Changed! cd HLH 56aa 5e-11 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • Changed! cd PAS 95aa 4e-08 in ref transcript
  • Changed! pfam HLH 54aa 4e-13 in ref transcript
    • Helix-loop-helix DNA-binding domain.
  • Changed! smart PAS 60aa 1e-09 in ref transcript
    • PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
  • Changed! smart PAS 54aa 1e-07 in ref transcript

ARRB1

  • refseq_ARRB1.F1 refseq_ARRB1.R1 123 147
  • NCBIGene 36.3 408
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004041

  • pfam Arrestin_N 157aa 5e-30 in ref transcript
    • Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain.
  • Changed! pfam Arrestin_C 160aa 4e-14 in ref transcript
    • Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain.
  • Changed! pfam Arrestin_C 152aa 4e-14 in modified transcript

ARRB2

  • refseq_ARRB2.F1 refseq_ARRB2.R1 127 172
  • NCBIGene 36.3 409
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004313

  • Changed! pfam Arrestin_N 157aa 1e-24 in ref transcript
    • Arrestin (or S-antigen), N-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with C-terminal domain.
  • pfam Arrestin_C 156aa 2e-18 in ref transcript
    • Arrestin (or S-antigen), C-terminal domain. Ig-like beta-sandwich fold. Scop reports duplication with N-terminal domain.
  • Changed! pfam Arrestin_N 142aa 4e-23 in modified transcript

ARS2

  • refseq_ARS2.F1 refseq_ARS2.R1 135 209
  • NCBIGene 36.3 51593
  • Single exon skipping, size difference: 74
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015908

  • Changed! pfam ARS2 222aa 8e-79 in ref transcript
    • Arsenite-resistance protein 2. Arsenite is a carcinogenic compound which can act as a co-mutagen by inhibiting DNA repair. Arsenite-resistance protein 2 is thought to play a role in arsenite resistance.

ASB3

  • refseq_ASB3.F1 refseq_ASB3.R1 167 376
  • NCBIGene 36.3 51130
  • Single exon skipping, size difference: 209
  • Exclusion of the protein initiation site
  • Reference transcript: NM_016115

  • cd ANK 122aa 3e-22 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd ANK 130aa 9e-22 in ref transcript
  • cd SOCS_ASB3 51aa 6e-20 in ref transcript
    • SOCS (suppressors of cytokine signaling) box of ASB3-like proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. ABS3 has been shown to be negative regulator of TNF-R2-mediated cellular responses to TNF-alpha by direct targeting of tumor necrosis factor receptor II (TNF-R2) for ubiquitination and proteasome-mediated degradation. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
  • cd ANK 101aa 3e-07 in ref transcript
  • pfam SOCS_box 39aa 2e-06 in ref transcript
    • SOCS box. The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
  • pfam Ank 32aa 3e-05 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • TIGR trp 154aa 1e-04 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! COG Arp 183aa 4e-12 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! PTZ PTZ00322 82aa 0.004 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.
  • Changed! cd ANK 122aa 3e-24 in modified transcript
  • Changed! COG Arp 124aa 2e-13 in modified transcript
  • Changed! COG Arp 153aa 1e-07 in modified transcript

ASB6

  • refseq_ASB6.F1 refseq_ASB6.R1 169 278
  • NCBIGene 36.3 140459
  • Single exon skipping, size difference: 109
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017873

  • Changed! cd SOCS_ASB6 44aa 2e-18 in ref transcript
    • SOCS (suppressors of cytokine signaling) box of ASB6-like proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. ASB6 interacts with the adaptor protein APS and recruits elongin B/C to the insulin receptor signaling complex. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
  • Changed! cd ANK 127aa 1e-14 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd ANK 124aa 3e-04 in ref transcript
  • Changed! pfam SOCS_box 37aa 8e-06 in ref transcript
    • SOCS box. The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
  • pfam Ank 31aa 2e-05 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • Changed! TIGR trp 167aa 2e-04 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! PTZ PTZ00322 83aa 1e-04 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.
  • Changed! PTZ PTZ00322 57aa 0.001 in ref transcript
  • Changed! PTZ PTZ00322 131aa 0.002 in ref transcript
  • Changed! cd ANK 104aa 4e-10 in modified transcript
  • Changed! TIGR trp 119aa 0.004 in modified transcript
  • Changed! COG Arp 127aa 0.001 in modified transcript
    • FOG: Ankyrin repeat [General function prediction only].

ASB9

  • refseq_ASB9.F1 refseq_ASB9.R1 219 280
  • NCBIGene 36.3 140462
  • Alternative 5-prime, size difference: 61
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001031739

  • cd ANK 125aa 6e-22 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd SOCS_ASB_9_11 42aa 9e-15 in ref transcript
    • SOCS (suppressors of cytokine signaling) box of ASB9 and 11 proteins. ASB family members have a C-terminal SOCS box and an N-terminal ankyrin-related sequence. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
  • Changed! pfam SOCS_box 36aa 2e-06 in ref transcript
    • SOCS box. The SOCS box acts as a bridge between specific substrate- binding domains and more generic proteins that comprise a large family of E3 ubiquitin protein ligases.
  • TIGR trp 134aa 2e-05 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • pfam Ank 29aa 6e-04 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • COG Arp 173aa 2e-12 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! COG Arp 156aa 1e-05 in ref transcript
  • Changed! PTZ PTZ00322 108aa 0.002 in modified transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

ASGR2

  • refseq_ASGR2.F1 refseq_ASGR2.R1 101 189
  • NCBIGene 36.3 433
  • Alternative 3-prime, size difference: 88
  • Inclusion in 5'UTR
  • Reference transcript: NM_001181

  • cd CLECT_DC-SIGN_like 126aa 1e-40 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • pfam Lectin_N 143aa 5e-55 in ref transcript
    • Hepatic lectin, N-terminal domain.
  • pfam Lectin_C 109aa 4e-32 in ref transcript
    • Lectin C-type domain. This family includes both long and short form C-type.

ASL

  • refseq_ASL.F1 refseq_ASL.R1 130 190
  • NCBIGene 36.3 435
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000048

  • Changed! cd Argininosuccinate_lyase 430aa 1e-173 in ref transcript
    • Argininosuccinate lyase (argininosuccinase, ASAL). This group contains proteins similar to ASAL, a member of the Lyase class I family. Members of this family for the most part catalyze similar beta-elimination reactions in which a C-N or C-O bond is cleaved with the release of fumarate as one of the products. These proteins are active as tetramers. The four active sites of the homotetrameric enzyme are each formed by residues from three different subunits. ASAL is a cytosolic enzyme which catalyzes the reversible breakdown of argininosuccinate to arginine and fumarate during arginine biosynthesis. In ureotleic species ASAL also catalyzes a reaction involved in the production of urea. Included in this group are the major soluble avian eye lens proteins from duck, delta 1 and delta 2 crystallin. Of these two isoforms only delta 2 has retained ASAL activity. These crystallins may have evolved by, gene recruitment of ASAL followed by gene duplication. In humans, mutations in ASAL result in the autosomal recessive disorder argininosuccinic aciduria.
  • Changed! TIGR argH 445aa 1e-163 in ref transcript
    • This model describes argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens.
  • Changed! PRK PRK00855 452aa 0.0 in ref transcript
    • argininosuccinate lyase; Provisional.
  • Changed! cd Argininosuccinate_lyase 410aa 1e-159 in modified transcript
  • Changed! TIGR argH 425aa 1e-149 in modified transcript
  • Changed! PRK PRK00855 432aa 1e-167 in modified transcript

ASL

  • refseq_ASL.F3 refseq_ASL.R3 119 197
  • NCBIGene 36.3 435
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000048

  • Changed! cd Argininosuccinate_lyase 430aa 1e-173 in ref transcript
    • Argininosuccinate lyase (argininosuccinase, ASAL). This group contains proteins similar to ASAL, a member of the Lyase class I family. Members of this family for the most part catalyze similar beta-elimination reactions in which a C-N or C-O bond is cleaved with the release of fumarate as one of the products. These proteins are active as tetramers. The four active sites of the homotetrameric enzyme are each formed by residues from three different subunits. ASAL is a cytosolic enzyme which catalyzes the reversible breakdown of argininosuccinate to arginine and fumarate during arginine biosynthesis. In ureotleic species ASAL also catalyzes a reaction involved in the production of urea. Included in this group are the major soluble avian eye lens proteins from duck, delta 1 and delta 2 crystallin. Of these two isoforms only delta 2 has retained ASAL activity. These crystallins may have evolved by, gene recruitment of ASAL followed by gene duplication. In humans, mutations in ASAL result in the autosomal recessive disorder argininosuccinic aciduria.
  • Changed! TIGR argH 445aa 1e-163 in ref transcript
    • This model describes argininosuccinate lyase, but may include examples of avian delta crystallins, in which argininosuccinate lyase activity may or may not be present and the biological role is to provide the optically clear cellular protein of the eye lens.
  • Changed! PRK PRK00855 452aa 0.0 in ref transcript
    • argininosuccinate lyase; Provisional.
  • Changed! cd Argininosuccinate_lyase 404aa 1e-159 in modified transcript
  • Changed! TIGR argH 419aa 1e-149 in modified transcript
  • Changed! PRK PRK00855 426aa 1e-167 in modified transcript

ASNS

  • refseq_ASNS.F2 refseq_ASNS.R2 115 393
  • NCBIGene 36.3 440
  • Single exon skipping, size difference: 278
  • Exclusion in 5'UTR
  • Reference transcript: NM_183356

  • cd AsnB 190aa 2e-43 in ref transcript
    • Glutamine amidotransferases class-II (GATase) asparagine synthase_B type. Asparagine synthetase B catalyses the ATP-dependent conversion of aspartate to asparagine. This enzyme is a homodimer, with each monomer composed of a glutaminase domain and a synthetase domain. The N-terminal glutaminase domain hydrolyzes glutamine to glutamic acid and ammonia.
  • cd Asn_Synthase_B_C 238aa 2e-34 in ref transcript
    • The C-terminal domain of Asparagine Synthase B. This domain is always found associated n-terminal amidotransferase domain. Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase.
  • TIGR asn_synth_AEB 467aa 1e-134 in ref transcript
    • This model describes the glutamine-hydrolysing asparagine synthase. A poorly conserved C-terminal extension was removed from the model. Bacterial members of the family tend to have a long, poorly conserved insert lacking from archaeal and eukaryotic sequences. Multiple isozymes have been demonstrated, such as in Bacillus subtilis. Long-branch members of the phylogenetic tree (which typically were also second or third candidate members from their genomes) were removed from the seed alignment and score below trusted cutoff.
  • TIGR eps_aminotran_1 141aa 1e-12 in ref transcript
    • The predicted protein-sorting transpeptidase that we call exosortase (see TIGR02602) has distinct subclasses that associated with different types of exopolysaccharide production loci. This model represents a distinct clade among a set of amidotransferases largely annotated (not necessarily accurately) as glutatime-hydrolyzing asparagine synthases. Members of this clade are essentially restricted to the characteristic exopolysaccharide (EPS) regions that contain the exosortase 1 genome (xrtA), in genomes that also have numbers of PEP-CTERM domain (TIGR02595) proteins.
  • PRK asnB 555aa 1e-136 in ref transcript
    • asparagine synthetase B; Provisional.

ASPH

  • refseq_ASPH.F1 refseq_ASPH.R1 178 223
  • NCBIGene 36.3 444
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020164

  • Changed! pfam Asp-B-Hydro_N 71aa 1e-15 in ref transcript
    • Aspartyl beta-hydroxylase N-terminal region. This family includes the N-terminal regions of the junctin, junctate and aspartyl beta-hydroxylase proteins. Junctate is an integral ER/SR membrane calcium binding protein, which comes from an alternatively spliced form of the same gene that generates aspartyl beta-hydroxylase and junctin. Aspartyl beta-hydroxylase catalyses the post-translational hydroxylation of aspartic acid or asparagine residues contained within epidermal growth factor (EGF) domains of proteins.
  • pfam Asp-B-Hydro_N 77aa 1e-10 in ref transcript
  • Changed! pfam Asp-B-Hydro_N 56aa 2e-18 in modified transcript

ASTN2

  • refseq_ASTN2.F1 refseq_ASTN2.R1 217 370
  • NCBIGene 36.3 23245
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198187

  • smart MACPF 184aa 1e-36 in ref transcript
    • membrane-attack complex / perforin.

ATF3

  • refseq_ATF3.F2 refseq_ATF3.R2 208 359
  • NCBIGene 36.2 467
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 151
  • Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001030287

  • Changed! smart BRLZ 45aa 7e-05 in ref transcript
    • basic region leucin zipper.

ATG16L1

  • refseq_ATG16L1.F1 refseq_ATG16L1.R1 254 311
  • NCBIGene 36.3 55054
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_030803

  • cd WD40 291aa 4e-48 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam ATG16 195aa 2e-56 in ref transcript
    • Autophagy protein 16 (ATG16). Autophagy is a ubiquitous intracellular degradation system for eukaryotic cells. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. ATG16 (also known as Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate in the yeast autophagy pathway.
  • smart WD40 36aa 7e-06 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 38aa 1e-05 in ref transcript
  • TIGR propeller_TolB 52aa 2e-05 in ref transcript
    • The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
  • pfam SGL 128aa 0.009 in ref transcript
    • SMP-30/Gluconolaconase/LRE-like region. This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30), gluconolactonase and luciferin-regenerating enzyme (LRE). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 and LRE.
  • COG COG2319 260aa 6e-25 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG Smc 153aa 4e-06 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG Smc 145aa 5e-06 in modified transcript

ATG4A

  • refseq_ATG4A.F1 refseq_ATG4A.R1 184 370
  • NCBIGene 36.3 115201
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 186
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_052936

  • Changed! pfam Peptidase_C54 300aa 1e-123 in ref transcript
    • Peptidase family C54.
  • Changed! pfam Peptidase_C54 238aa 7e-93 in modified transcript

ATG4C

  • refseq_ATG4C.F1 refseq_ATG4C.R1 143 191
  • NCBIGene 36.3 84938
  • Alternative 5-prime, size difference: 48
  • Exclusion in 5'UTR
  • Reference transcript: NM_032852

  • pfam Peptidase_C54 330aa 1e-108 in ref transcript
    • Peptidase family C54.

ATP11C

  • refseq_ATP11C.F2 refseq_ATP11C.R2 127 232
  • NCBIGene 36.3 286410
  • Single exon skipping, size difference: 105
  • Exclusion of the stop codon
  • Reference transcript: NM_173694

  • TIGR ATPase-Plipid 1028aa 0.0 in ref transcript
    • This model describes the P-type ATPase responsible for transporting phospholipids from one leaflet of bilayer membranes to the other. These ATPases are found only in eukaryotes.
  • COG MgtA 659aa 5e-70 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].
  • COG MgtA 205aa 8e-24 in ref transcript

ATP2A1

  • refseq_ATP2A1.F1 refseq_ATP2A1.R1 103 145
  • NCBIGene 36.3 487
  • Single exon skipping, size difference: 42
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_173201

  • TIGR ATPase-IIA1_Ca 937aa 0.0 in ref transcript
    • The calcium P-type ATPases have been characterized as Type IIA based on a phylogenetic analysis which distinguishes this group from the Type IIB PMCA calcium pump modelled by TIGR01517. A separate analysis divides Type IIA into sub-types, SERCA and PMR1 the latter of which is modelled by TIGR01522.
  • pfam Cation_ATPase_N 75aa 1e-23 in ref transcript
    • Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport.
  • COG MgtA 991aa 0.0 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].

ATP2B3

  • refseq_ATP2B3.F1 refseq_ATP2B3.R1 218 372
  • NCBIGene 36.3 492
  • Single exon skipping, size difference: 154
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001001344

  • TIGR ATPase-IIB_Ca 704aa 0.0 in ref transcript
    • The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
  • TIGR ATPase-IIB_Ca 266aa 2e-80 in ref transcript
  • TIGR ATPase-IB_hvy 313aa 2e-13 in ref transcript
    • This model encompasses two equivalog models for the copper and cadmium-type heavy metal transporting P-type ATPases (TIGR01511 and TIGR01512) as well as those species which score ambiguously between both models. For more comments and references, see the files on TIGR01511 and 01512.
  • COG MgtA 993aa 1e-154 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].

ATP2B4

  • refseq_ATP2B4.F1 refseq_ATP2B4.R1 198 376
  • NCBIGene 36.3 493
  • Single exon skipping, size difference: 178
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001684

  • cd HAD_like 159aa 0.007 in ref transcript
    • Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
  • TIGR ATPase-IIB_Ca 1037aa 0.0 in ref transcript
    • The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
  • COG MgtA 681aa 1e-132 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].
  • COG MgtA 236aa 3e-28 in ref transcript

ATP5C1

  • refseq_ATP5C1.F1 refseq_ATP5C1.R1 132 169
  • NCBIGene 36.3 509
  • Single exon skipping, size difference: 37
  • Exclusion of the stop codon
  • Reference transcript: NM_001001973

  • TIGR ATPsyn_F1gamma 272aa 6e-95 in ref transcript
    • This model describes the ATP synthase gamma subunit in bacteria and its equivalents in organelles, namely, mitochondria and chloroplast. F1/F0-ATP synthase is a multisubunit, membrane associated enzyme found in bacteria and organelles of higher eukaryotes, namely, mitochondria and chloroplast. This enzyme is principally involed in the synthesis of ATP from ADP and inorganic phosphate by coupling the energy derived from the proton electrochemical gradient across the biological membrane. A brief description of this multisubunit enzyme complex: F1 and F0 represent two major clusters of subunits. The gamma subunit is the part of F1 cluster. Surrounding the gamma subunit in a cylinder-like structure are three alpha and three subunits in an alternating fashion. This is the central catalytic unit whose different conformations permit the binding of ADP and inorganic phosphate and release of ATP.
  • PRK PRK05621 273aa 7e-66 in ref transcript
    • F0F1 ATP synthase subunit gamma; Validated.

ATP5G1

  • refseq_ATP5G1.F1 refseq_ATP5G1.R1 115 185
  • NCBIGene 36.3 516
  • Alternative 5-prime, size difference: 70
  • Exclusion in 5'UTR
  • Reference transcript: NM_005175

  • pfam ATP-synt_C 46aa 1e-10 in ref transcript
    • ATP synthase subunit C.
  • PRK PRK07558 46aa 7e-09 in ref transcript
    • F0F1 ATP synthase subunit C; Validated.

ATP5G2

  • refseq_ATP5G2.F1 refseq_ATP5G2.R1 213 336
  • NCBIGene 36.3 517
  • Alternative 5-prime, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005176

  • pfam ATP-synt_C 46aa 3e-11 in ref transcript
    • ATP synthase subunit C.
  • PRK PRK07558 46aa 3e-09 in ref transcript
    • F0F1 ATP synthase subunit C; Validated.

ATP5H

  • refseq_ATP5H.F1 refseq_ATP5H.R1 104 176
  • NCBIGene 36.3 10476
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006356

  • Changed! pfam Mt_ATP-synt_D 160aa 1e-60 in ref transcript
    • ATP synthase D chain, mitochondrial (ATP5H). This family consists of several ATP synthase D chain, mitochondrial (ATP5H) proteins. Subunit d has no extensive hydrophobic sequences, and is not apparently related to any subunit described in the simpler ATP synthases in bacteria and chloroplasts.
  • Changed! pfam Mt_ATP-synt_D 136aa 5e-47 in modified transcript

ATP5J

  • refseq_ATP5J.F2 refseq_ATP5J.R2 251 301
  • NCBIGene 36.3 522
  • Alternative 5-prime, size difference: 50
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001003701

  • pfam ATP-synt_F6 92aa 6e-33 in ref transcript
    • Mitochondrial ATP synthase coupling factor 6. Coupling factor 6 (F6) is a component of mitochondrial ATP synthase which is required for the interactions of the catalytic and proton-translocating segments.

ATP5J2

  • refseq_ATP5J2.F2 refseq_ATP5J2.R2 100 118
  • NCBIGene 36.3 9551
  • Alternative 5-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004889

  • pfam WRW 76aa 3e-10 in ref transcript
    • Mitochondrial F1F0-ATP synthase, subunit f. This is a family of small proteins of approximately 110 amino acids, which are highly conserved from nematodes to humans. Some members of the family have been annotated in Swiss-Prot as being the f subunit of mitochondrial F1F0-ATP synthase but this could not be confirmed. The sequence has a well-conserved WRW motif. The exact function of the protein is not known.

ATP5J2

  • refseq_ATP5J2.F3 refseq_ATP5J2.R3 266 383
  • NCBIGene 36.3 9551
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004889

  • Changed! pfam WRW 76aa 3e-10 in ref transcript
    • Mitochondrial F1F0-ATP synthase, subunit f. This is a family of small proteins of approximately 110 amino acids, which are highly conserved from nematodes to humans. Some members of the family have been annotated in Swiss-Prot as being the f subunit of mitochondrial F1F0-ATP synthase but this could not be confirmed. The sequence has a well-conserved WRW motif. The exact function of the protein is not known.
  • Changed! pfam WRW 37aa 0.001 in modified transcript

ATP5S

  • refseq_ATP5S.F1 refseq_ATP5S.R1 179 351
  • NCBIGene 36.3 27109
  • Single exon skipping, size difference: 172
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001003803

ATP6V0A4

  • refseq_ATP6V0A4.F1 refseq_ATP6V0A4.R1 292 395
  • NCBIGene 36.3 50617
  • Single exon skipping, size difference: 103
  • Exclusion in 5'UTR
  • Reference transcript: NM_020632

  • pfam V_ATPase_I 806aa 0.0 in ref transcript
    • V-type ATPase 116kDa subunit family. This family consists of the 116kDa V-type ATPase (vacuolar (H+)-ATPases) subunits, as well as V-type ATP synthase subunit i. The V-type ATPases family are proton pumps that acidify intracellular compartments in eukaryotic cells for example yeast central vacuoles, clathrin-coated and synaptic vesicles. They have important roles in membrane trafficking processes. The 116kDa subunit (subunit a) in the V-type ATPase is part of the V0 functional domain responsible for proton transport. The a subunit is a transmembrane glycoprotein with multiple putative transmembrane helices it has a hydrophilic amino terminal and a hydrophobic carboxy terminal. It has roles in proton transport and assembly of the V-type ATPase complex. This subunit is encoded by two homologous gene in yeast VPH1 and STV1.
  • COG NtpI 595aa 5e-45 in ref transcript
    • Archaeal/vacuolar-type H+-ATPase subunit I [Energy production and conversion].
  • COG NtpI 115aa 5e-22 in ref transcript

ATP6V0B

  • refseq_ATP6V0B.F2 refseq_ATP6V0B.R2 164 213
  • NCBIGene 36.3 533
  • Single exon skipping, size difference: 49
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004047

  • Changed! TIGR V_ATP_synt_C 108aa 7e-06 in ref transcript
    • The principal role V-ATPases are the acidification of intracellular compartments of eukaryotic cells.
  • Changed! pfam ATP-synt_C 62aa 0.002 in ref transcript
    • ATP synthase subunit C.
  • Changed! COG AtpE 64aa 4e-04 in ref transcript
    • F0F1-type ATP synthase, subunit c/Archaeal/vacuolar-type H+-ATPase, subunit K [Energy production and conversion].

ATP6V1E1

  • refseq_ATP6V1E1.F1 refseq_ATP6V1E1.R1 149 215
  • NCBIGene 36.3 529
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001696

  • Changed! pfam vATP-synt_E 199aa 4e-33 in ref transcript
    • ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit.
  • COG NtpE 146aa 4e-05 in ref transcript
    • Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion].
  • Changed! pfam vATP-synt_E 188aa 5e-34 in modified transcript

ATP6V1E1

  • refseq_ATP6V1E1.F3 refseq_ATP6V1E1.R3 246 336
  • NCBIGene 36.3 529
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001696

  • Changed! pfam vATP-synt_E 199aa 4e-33 in ref transcript
    • ATP synthase (E/31 kDa) subunit. This family includes the vacuolar ATP synthase E subunit, as well as the archaebacterial ATP synthase E subunit.
  • Changed! COG NtpE 146aa 4e-05 in ref transcript
    • Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion].
  • Changed! pfam vATP-synt_E 169aa 2e-28 in modified transcript
  • Changed! COG NtpE 116aa 7e-05 in modified transcript

ATP6V1G2

  • refseq_ATP6V1G2.F2 refseq_ATP6V1G2.R2 122 276
  • NCBIGene 36.3 534
  • Alternative 5-prime, size difference: 154
  • Exclusion of the protein initiation site
  • Reference transcript: NM_130463

  • Changed! pfam V-ATPase_G 105aa 1e-13 in ref transcript
    • Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins in several organelles. They are also found in the plasma membranes of some specialised cells. V-ATPases consist of peripheral (V1) and membrane integral (V0) heteromultimeric complexes. The G subunit is part of the V1 subunit, but is also thought to be strongly attached to the V0 complex. It may be involved in the coupling of ATP degradation to H+ translocation.
  • Changed! pfam V-ATPase_G 66aa 3e-11 in modified transcript

ATP6V1G3

  • refseq_ATP6V1G3.F1 refseq_ATP6V1G3.R1 119 165
  • NCBIGene 36.3 127124
  • Single exon skipping, size difference: 46
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_133262

  • Changed! pfam V-ATPase_G 105aa 3e-18 in ref transcript
    • Vacuolar (H+)-ATPase G subunit. This family represents the eukaryotic vacuolar (H+)-ATPase (V-ATPase) G subunit. V-ATPases generate an acidic environment in several intracellular compartments. Correspondingly, they are found as membrane-attached proteins in several organelles. They are also found in the plasma membranes of some specialised cells. V-ATPases consist of peripheral (V1) and membrane integral (V0) heteromultimeric complexes. The G subunit is part of the V1 subunit, but is also thought to be strongly attached to the V0 complex. It may be involved in the coupling of ATP degradation to H+ translocation.
  • Changed! pfam V-ATPase_G 25aa 2e-04 in modified transcript

ATP6V1H

  • refseq_ATP6V1H.F2 refseq_ATP6V1H.R2 181 235
  • NCBIGene 36.3 51606
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015941

  • Changed! cd VATPase_H 449aa 1e-176 in ref transcript
    • VATPase_H, regulatory vacuolar ATP synthase subunit H (Vma13p); activation component of the peripheral V1 complex of V-ATPase, a heteromultimeric enzyme which uses ATP to actively transport protons into organelles and extracellular compartments. The topology is that of a superhelical spiral, in part the geometry is similar to superhelices composed of armadillo repeat motifs, as found in importins for example.
  • Changed! pfam V-ATPase_H 415aa 1e-178 in ref transcript
    • V-ATPase subunit H.
  • COG VMA13 261aa 6e-49 in ref transcript
    • Vacuolar H+-ATPase V1 sector, subunit H [Energy production and conversion].
  • Changed! cd VATPase_H 431aa 1e-177 in modified transcript
  • Changed! pfam V-ATPase_H 397aa 1e-178 in modified transcript

ATP7B

  • refseq_ATP7B.F1 refseq_ATP7B.R1 164 299
  • NCBIGene 36.3 540
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000053

  • cd HMA 63aa 1e-14 in ref transcript
    • Heavy-metal-associated domain (HMA) is a conserved domain of approximately 30 amino acid residues found in a number of proteins that transport or detoxify heavy metals, for example, the CPx-type heavy metal ATPases and copper chaperones. HMA domain contains two cysteine residues that are important in binding and transfer of metal ions, such as copper, cadmium, cobalt and zinc. In the case of copper, stoichiometry of binding is one Cu+ ion per binding domain. Repeats of the HMA domain in copper chaperone has been associated with Menkes/Wilson disease due to binding of multiple copper ions.
  • cd HMA 64aa 1e-12 in ref transcript
  • cd HMA 63aa 5e-12 in ref transcript
  • cd HMA 59aa 8e-12 in ref transcript
  • cd HMA 64aa 1e-11 in ref transcript
  • cd HMA 64aa 2e-11 in ref transcript
  • cd HAD_like 106aa 0.009 in ref transcript
    • Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
  • Changed! TIGR ATPase-IB1_Cu 646aa 0.0 in ref transcript
    • A member from Halobacterium is annotated as "molybdenum-binding protein" although no evidence can be found for this classification.
  • pfam HMA 64aa 2e-15 in ref transcript
    • Heavy-metal-associated domain.
  • pfam HMA 65aa 5e-14 in ref transcript
  • pfam HMA 64aa 9e-14 in ref transcript
  • pfam HMA 65aa 3e-13 in ref transcript
  • pfam HMA 59aa 6e-13 in ref transcript
  • pfam HMA 64aa 7e-13 in ref transcript
  • Changed! COG ZntA 791aa 0.0 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].
  • PRK copA 199aa 3e-09 in ref transcript
    • copper exporting ATPase; Provisional.
  • COG CopZ 65aa 1e-08 in ref transcript
    • Copper chaperone [Inorganic ion transport and metabolism].
  • PRK copA 172aa 2e-07 in ref transcript
  • PRK copA 145aa 9e-07 in ref transcript
  • PRK copA 170aa 6e-06 in ref transcript
  • Changed! TIGR ATPase-IB1_Cu 601aa 0.0 in modified transcript
  • Changed! COG ZntA 746aa 0.0 in modified transcript

ATRX

  • refseq_ATRX.F1 refseq_ATRX.R1 112 235
  • NCBIGene 36.2 546
  • Single exon skipping, size difference: 123
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000489

  • Changed! cd HELICc 147aa 4e-16 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! cd DEXDc 163aa 4e-11 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • Changed! pfam SNF2_N 317aa 9e-84 in ref transcript
    • SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
  • Changed! pfam Helicase_C 77aa 4e-16 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • Changed! COG HepA 321aa 1e-27 in ref transcript
    • Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
  • Changed! COG HepA 194aa 6e-22 in ref transcript

ATXN3

  • refseq_ATXN3.F1 refseq_ATXN3.R1 240 326
  • NCBIGene 36.2 4287
  • Single exon skipping, size difference: 86
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004993

  • Changed! pfam Josephin 161aa 2e-69 in ref transcript
    • Josephin.
  • Changed! pfam Josephin 71aa 2e-25 in modified transcript

AURKA

  • refseq_AURKA.F2 refseq_AURKA.R2 133 341
  • NCBIGene 36.3 6790
  • Multiple exon skipping, size difference: 208
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_198433

  • cd S_TKc 252aa 1e-80 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 240aa 2e-80 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00263 244aa 4e-53 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

AURKA

  • refseq_AURKA.F3 refseq_AURKA.R3 97 111
  • NCBIGene 36.3 6790
  • Alternative 5-prime, size difference: 14
  • Exclusion in 5'UTR
  • Reference transcript: NM_198434

  • cd S_TKc 252aa 1e-80 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 240aa 2e-80 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00263 244aa 4e-53 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

AURKC

  • refseq_AURKC.F1 refseq_AURKC.R1 208 400
  • NCBIGene 36.3 6795
  • Alternative 5-prime, size difference: 192
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001015878

  • cd S_TKc 252aa 7e-75 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 240aa 9e-75 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00263 262aa 2e-50 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

AXL

  • refseq_AXL.F1 refseq_AXL.R1 129 156
  • NCBIGene 36.3 558
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021913

  • cd PTKc_Axl 272aa 1e-167 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Axl. Protein Tyrosine Kinase (PTK) family; Axl; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Axl is a member of the Axl subfamily, which is composed of receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with two immunoglobulin-like domains followed by two fibronectin type III repeats, a transmembrane segment, and an intracellular catalytic domain. Binding to their ligands, Gas6 and protein S, leads to receptor dimerization, autophosphorylation, activation, and intracellular signaling. Axl is widely expressed in a variety of organs and cells including epithelial, mesenchymal, hematopoietic, as well as non-transformed cells. Axl signaling is important in many cellular functions such as survival, anti-apoptosis, proliferation, migration, and adhesion. Axl was originally isolated from patients with chronic myelogenous leukemia and a chronic myeloproliferative disorder. Axl is overexpressed in many human cancers including colon, squamous cell, thyroid, breast, and lung carcinomas.
  • cd FN3 104aa 2e-06 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IG 77aa 2e-05 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 84aa 0.006 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 88aa 0.009 in ref transcript
  • smart TyrKc 266aa 1e-105 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • pfam fn3 85aa 2e-06 in ref transcript
    • Fibronectin type III domain.
  • smart IG_like 83aa 2e-05 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 97aa 1e-04 in ref transcript
  • pfam I-set 85aa 0.008 in ref transcript
    • Immunoglobulin I-set domain.
  • COG SPS1 280aa 7e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

AZI1

  • refseq_AZI1.F2 refseq_AZI1.R2 278 386
  • NCBIGene 36.3 22994
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014984

  • TIGR SMC_prok_B 223aa 5e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 243aa 5e-08 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

AZIN1

  • refseq_AZIN1.F2 refseq_AZIN1.R2 136 282
  • NCBIGene 36.3 51582
  • Single exon skipping, size difference: 146
  • Exclusion in 5'UTR
  • Reference transcript: NM_015878

  • pfam Orn_Arg_deC_N 236aa 3e-62 in ref transcript
    • Pyridoxal-dependent decarboxylase, pyridoxal binding domain. These pyridoxal-dependent decarboxylases acting on ornithine, lysine, arginine and related substrates This domain has a TIM barrel fold.
  • pfam Orn_DAP_Arg_deC 125aa 2e-25 in ref transcript
    • Pyridoxal-dependent decarboxylase, C-terminal sheet domain. These pyridoxal-dependent decarboxylases act on ornithine, lysine, arginine and related substrates.
  • COG LysA 367aa 2e-51 in ref transcript
    • Diaminopimelate decarboxylase [Amino acid transport and metabolism].

B3GALNT1

  • refseq_B3GALNT1.F1 refseq_B3GALNT1.R1 247 340
  • NCBIGene 36.3 8706
  • Single exon skipping, size difference: 93
  • Exclusion in 5'UTR
  • Reference transcript: NM_001038628

  • pfam Galactosyl_T 194aa 4e-55 in ref transcript
    • Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.

B3GALNT1

  • refseq_B3GALNT1.F3 refseq_B3GALNT1.R3 217 336
  • NCBIGene 36.3 8706
  • Single exon skipping, size difference: 119
  • Exclusion in 5'UTR
  • Reference transcript: NM_001038628

  • pfam Galactosyl_T 194aa 4e-55 in ref transcript
    • Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.

B3GALNT1

  • refseq_B3GALNT1.F5 refseq_B3GALNT1.R5 100 211
  • NCBIGene 36.3 8706
  • Single exon skipping, size difference: 111
  • Exclusion in 5'UTR
  • Reference transcript: NM_001038628

  • pfam Galactosyl_T 194aa 4e-55 in ref transcript
    • Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.

B3GALT5

  • refseq_B3GALT5.F2 refseq_B3GALT5.R2 274 383
  • NCBIGene 36.3 10317
  • Single exon skipping, size difference: 109
  • Exclusion in 5'UTR
  • Reference transcript: NM_033171

  • pfam Galactosyl_T 191aa 3e-44 in ref transcript
    • Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.

B4GALT4

  • refseq_B4GALT4.F1 refseq_B4GALT4.R1 228 311
  • NCBIGene 36.3 8702
  • Single exon skipping, size difference: 83
  • Exclusion in 5'UTR
  • Reference transcript: NM_212543

  • cd b4GalT 218aa 1e-102 in ref transcript
    • Beta-4-Galactosyltransferase is involved in the formation of the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. Beta-4-Galactosyltransferase transfers galactose from uridine diphosphogalactose to the terminal beta-N-acetylglucosamine residues, hereby forming the poly-N-acetyllactosamine core structures present in glycoproteins and glycosphingolipids. At least seven homologous beta-4-galactosyltransferase isoforms have been identified that use different types of glycoproteins and glycolipids as substrates. Of the seven identified members of the beta-1,4-galactosyltransferase subfamily (beta1,4-Gal-T1 to -T7), b1,4-Gal-T1 is most characterized (biochemically). It is a Golgi-resident type II membrane enzyme with a cytoplasmic domain, membrane spanning region, and a stem region and catalytic domain facing the lumen.
  • pfam Galactosyl_T_2 268aa 1e-127 in ref transcript
    • Galactosyltransferase. This is a family of galactosyltransferases from a wide range of Metazoa with three related galactosyltransferases activitys; all three of which are possessed by one sequence in some cases. EC:2.4.1.90, N-acetyllactosamine synthase; EC:2.4.1.38, Beta-N-acetylglucosaminyl-glycopeptide beta-1,4- galactosyltransferase; and EC:2.4.1.22 Lactose synthase. Note that N-acetyllactosamine synthase is a component of Lactose synthase along with alpha-lactalbumin, in the absence of alpha-lactalbumin EC:2.4.1.90 is the catalysed reaction.

BAALC

  • refseq_BAALC.F1 refseq_BAALC.R1 215 382
  • NCBIGene 36.3 79870
  • Single exon skipping, size difference: 167
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024812

  • pfam BAALC_N 31aa 1e-11 in ref transcript
    • BAALC N-terminus. This family represents the N-terminal region of the mammalian BAALC proteins. BAALC (brain and acute leukaemia, cytoplasmic), that is highly conserved among mammals but evidently absent from lower organisms. Two isoforms are specifically expressed in neuroectoderm-derived tissues, but not in tumours or cancer cell lines of non-neural tissue origin. It has been shown that blasts from a subset of patients with acute leukaemia greatly overexpress eight different BAALC transcripts, resulting in five protein isoforms. Among patients with acute myeloid leukaemia, those overexpressing BAALC show distinctly poor prognosis, pointing to a key role of the BAALC products in leukaemia. It has been suggested that BAALC is a gene implicated in both neuroectodermal and hematopoietic cell functions.

BACE1

  • refseq_BACE1.F1 refseq_BACE1.R1 113 245
  • NCBIGene 36.3 23621
  • Alternative 5-prime, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012104

  • Changed! cd beta_secretase_like 366aa 0.0 in ref transcript
    • Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
  • Changed! pfam Asp 342aa 5e-36 in ref transcript
    • Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
  • Changed! PTZ PTZ00165 371aa 1e-15 in ref transcript
    • aspartyl protease; Provisional.
  • Changed! cd beta_secretase_like 322aa 1e-178 in modified transcript
  • Changed! pfam Asp 298aa 2e-26 in modified transcript
  • Changed! PTZ PTZ00165 327aa 2e-10 in modified transcript

BACE1

  • refseq_BACE1.F4 refseq_BACE1.R4 284 359
  • NCBIGene 36.3 23621
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012104

  • Changed! cd beta_secretase_like 366aa 0.0 in ref transcript
    • Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
  • Changed! pfam Asp 342aa 5e-36 in ref transcript
    • Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
  • Changed! PTZ PTZ00165 371aa 1e-15 in ref transcript
    • aspartyl protease; Provisional.
  • Changed! cd beta_secretase_like 341aa 0.0 in modified transcript
  • Changed! pfam Asp 317aa 2e-31 in modified transcript
  • Changed! PTZ PTZ00013 319aa 5e-12 in modified transcript
    • plasmepsin 4 (PM4); Provisional.

BAIAP2

  • refseq_BAIAP2.F1 refseq_BAIAP2.R1 102 148
  • NCBIGene 36.3 10458
  • Single exon skipping, size difference: 46
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_017451

  • cd SH3 57aa 8e-07 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam IMD 220aa 5e-90 in ref transcript
    • IRSp53/MIM homology domain. The N-terminal predicted helical stretch of the insulin receptor tyrosine kinase substrate p53 (IRSp53) is an evolutionary conserved F-actin bundling domain involved in filopodium formation. The domain has been named IMD after the IRSp53 and missing in metastasis (MIM) proteins in which it occurs. Filopodium-inducing IMD activity is regulated by Cdc42 and Rac1 and is SH3-independent.
  • smart SH3 61aa 8e-07 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! COG SbcC 220aa 0.008 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].

BAT1

  • refseq_BAT1.F1 refseq_BAT1.R1 233 363
  • NCBIGene 36.3 7919
  • Alternative 3-prime, size difference: 130
  • Inclusion in 5'UTR
  • Reference transcript: NM_080598

  • cd DEADc 204aa 9e-65 in ref transcript
    • DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
  • cd HELICc 127aa 1e-25 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • pfam DEAD 168aa 3e-42 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • pfam Helicase_C 78aa 1e-18 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • COG SrmB 383aa 8e-98 in ref transcript
    • Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].

BAT3

  • refseq_BAT3.F1 refseq_BAT3.R1 116 140
  • NCBIGene 36.3 7917
  • Alternative 5-prime, size difference: 24
  • Exclusion in 5'UTR
  • Reference transcript: NM_004639

  • cd Scythe_N 71aa 5e-24 in ref transcript
    • Scythe protein (also known as Bat3) is an apoptotic regulator that is highly conserved in eukaryotes and contains a ubiquitin-like domain near its N-terminus. Scythe binds reaper, a potent apoptotic inducer, and Scythe/Reaper are thought to signal apoptosis, in part through regulating the folding and activity of apoptotic signaling molecules.
  • smart UBQ 62aa 3e-14 in ref transcript
    • Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
  • PTZ PTZ00044 69aa 3e-05 in ref transcript
    • ubiquitin; Provisional.

BAT3

  • refseq_BAT3.F3 refseq_BAT3.R3 102 120
  • NCBIGene 36.3 7917
  • Alternative 5-prime and 3-prime, size difference: 18
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004639

  • cd Scythe_N 71aa 5e-24 in ref transcript
    • Scythe protein (also known as Bat3) is an apoptotic regulator that is highly conserved in eukaryotes and contains a ubiquitin-like domain near its N-terminus. Scythe binds reaper, a potent apoptotic inducer, and Scythe/Reaper are thought to signal apoptosis, in part through regulating the folding and activity of apoptotic signaling molecules.
  • smart UBQ 62aa 3e-14 in ref transcript
    • Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
  • PTZ PTZ00044 69aa 3e-05 in ref transcript
    • ubiquitin; Provisional.

BAX

  • refseq_BAX.F1 refseq_BAX.R1 134 173
  • NCBIGene 36.3 581
  • Alternative 3-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138761

  • Changed! cd Bcl-2_like 140aa 7e-33 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! TIGR bcl-2 188aa 2e-63 in ref transcript
    • in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
  • Changed! cd Bcl-2_like 134aa 7e-31 in modified transcript
  • Changed! TIGR bcl-2 175aa 2e-55 in modified transcript

BAX

  • refseq_BAX.F3 refseq_BAX.R3 128 226
  • NCBIGene 36.3 581
  • Single exon skipping, size difference: 98
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_004324

  • Changed! cd Bcl-2_like 139aa 5e-31 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! TIGR bcl-2 158aa 2e-51 in ref transcript
    • in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
  • Changed! cd Bcl-2_like 100aa 6e-18 in modified transcript
  • Changed! TIGR bcl-2 120aa 3e-35 in modified transcript

BAX

  • refseq_BAX.F4 bax.r.7 105 252
  • NCBIGene 36.3 581
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004324

  • Changed! cd Bcl-2_like 139aa 5e-31 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! TIGR bcl-2 158aa 2e-51 in ref transcript
    • in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
  • Changed! cd Bcl-2_like 81aa 2e-24 in modified transcript
  • Changed! TIGR bcl-2 109aa 1e-30 in modified transcript

BAZ1A

  • refseq_BAZ1A.F1 refseq_BAZ1A.R1 207 303
  • NCBIGene 36.3 11177
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013448

  • cd Bromo_Acf1_like 113aa 4e-51 in ref transcript
    • Bromodomain; Acf1_like or BAZ1A_like subfamily. Bromo adjacent to zinc finger 1A (BAZ1A) was identified as a novel human bromodomain gene by cDNA library screening. The Drosophila homologue, Acf1, is part of the CHRAC (chromatin accessibility complex) and regulates ISWI-induced nucleosome remodeling. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BAH_plant_2 24aa 0.001 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • pfam WAC_Acf1_DNA_bd 101aa 2e-39 in ref transcript
    • ATP-utilising chromatin assembly and remodelling N-terminal. ACF (for ATP-utilising chromatin assembly and remodelling factor) is a chromatin-remodelling complex that catalyses the ATP-dependent assembly of periodic nucleosome arrays. The WAC (WSTF/Acf1/cbp146) domain is an approximately 110-residue module present at the N-termini of Acf1-related proteins in a variety of organisms. The DNA-binding region of Acf1 includes the WAC domain, which is necessary for the efficient binding of ACF complex to DNA.
  • smart BROMO 92aa 8e-26 in ref transcript
    • bromo domain.
  • pfam DDT 63aa 3e-16 in ref transcript
    • DDT domain. This domain is approximately 60 residues in length, and is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors). It is exclusively associated with nuclear domains, and is thought to be arranged into three alpha helices.
  • pfam PHD 46aa 2e-13 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • Changed! COG COG5076 181aa 3e-14 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • Changed! COG COG5076 143aa 2e-14 in modified transcript

BCAP29

  • refseq_BCAP29.F1 refseq_BCAP29.R1 208 287
  • NCBIGene 36.3 55973
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001008405

  • pfam Bap31 220aa 4e-45 in ref transcript
    • B-cell receptor-associated protein 31-like. Bap31 is a polytopic integral protein of the endoplasmic reticulum membrane and a substrate of caspase-8. Bap31 is cleaved within its cytosolic domain, generating pro-apoptotic p20 Bap31.
  • Changed! COG COG4372 160aa 0.001 in ref transcript
    • Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].

BCL11A

  • refseq_BCL11A.F1 refseq_BCL11A.R1 101 119
  • NCBIGene 36.2 53335
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138553

BCL11B

  • refseq_BCL11B.F2 refseq_BCL11B.R2 129 342
  • NCBIGene 36.3 64919
  • Single exon skipping, size difference: 213
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138576

BCL2L14

  • refseq_BCL2L14.F2 refseq_BCL2L14.R2 199 352
  • NCBIGene 36.3 79370
  • Alternative 3-prime, size difference: 153
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_138723

  • Changed! cd Bcl-2_like 135aa 2e-04 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! TIGR bcl-2 29aa 0.008 in ref transcript
    • in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.

BCL7A

  • refseq_BCL7A.F2 refseq_BCL7A.R2 334 397
  • NCBIGene 36.3 605
  • Alternative 5-prime, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020993

  • pfam BCL_N 49aa 5e-17 in ref transcript
    • BCL7, N-terminal conserver region. Members of the BCL family have significant sequence similarity at their N-terminus, represented in this family. The function of BCL7 proteins is unknown. They may be involved in early development. In addition, BCL7B is commonly hemizygously deleted in patients with Williams syndrome.

BCR

  • refseq_BCR.F1 refseq_BCR.R1 138 270
  • NCBIGene 36.3 613
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004327

  • cd RhoGAP_Bcr 201aa 1e-85 in ref transcript
    • RhoGAP_Bcr: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of Bcr (breakpoint cluster region protein)-like proteins. Bcr is a multidomain protein with a variety of enzymatic functions. It contains a RhoGAP and a Rho GEF domain, a Ser/Thr kinase domain, an N-terminal oligomerization domain, and a C-terminal PDZ binding domain, in addition to PH and C2 domains. Bcr is a negative regulator of: i) RacGTPase, via the Rho GAP domain, ii) the Ras-Raf-MEK-ERK pathway, via phosphorylation of the Ras binding protein AF-6, and iii) the Wnt signaling pathway through binding beta-catenin. Bcr can form a complex with beta-catenin and Tcf1. The Wnt signaling pathway is involved in cell proliferation, differentiation, and cell renewal. Bcr was discovered as a fusion partner of Abl. The Bcr-Abl fusion is characteristic for a large majority of chronic myelogenous leukemias (CML). Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • cd RhoGEF 191aa 3e-29 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • cd PH_BCR-related 61aa 6e-26 in ref transcript
    • BCR (breakpoint cluster region)-related pleckstrin homology (PH) domain. The BCR-related protein has a RhoGEF(DH) domain followed by a PH domain, a C2 domain and a RhoGAP domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinases, tyrosine kinases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
  • cd PH_BCR-related 38aa 7e-12 in ref transcript
  • Changed! cd C2 94aa 5e-06 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • pfam RhoGEF 189aa 5e-53 in ref transcript
    • RhoGEF domain. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that pfam00169 domains invariably occur C-terminal to RhoGEF/DH domains.
  • smart RhoGAP 164aa 3e-42 in ref transcript
    • GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.
  • pfam Bcr-Abl_Oligo 71aa 3e-33 in ref transcript
    • Bcr-Abl oncoprotein oligomerisation domain. The Bcr-Abl oncoprotein oligomerisation domain consists of a short N-terminal helix (alpha-1), a flexible loop and a long C-terminal helix (alpha-2). Together these form an N-shaped structure, with the loop allowing the two helices to assume a parallel orientation. The monomeric domains associate into a dimer through the formation of an antiparallel coiled coil between the alpha-2 helices and domain swapping of two alpha-1 helices, where one alpha-1 helix swings back and packs against the alpha-2 helix from the second monomer. Two dimers then associate into a tetramer. The oligomerisation domain is essential for the oncogenicity of the Bcr-Abl protein.
  • Changed! smart C2 105aa 2e-09 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • Changed! smart C2 49aa 3e-04 in modified transcript

BDH1

  • refseq_BDH1.F1 refseq_BDH1.R1 100 252
  • NCBIGene 36.3 622
  • Single exon skipping, size difference: 152
  • Exclusion in 5'UTR
  • Reference transcript: NM_203314

  • TIGR 3oxo_ACP_reduc 215aa 3e-18 in ref transcript
    • This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
  • PRK PRK06182 263aa 7e-37 in ref transcript
    • short chain dehydrogenase; Validated.

BICD1

  • refseq_BICD1.F2 refseq_BICD1.R2 101 405
  • NCBIGene 36.3 636
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 304
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001714

  • pfam BicD 726aa 0.0 in ref transcript
    • Microtubule-associated protein Bicaudal-D. BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER). For full functioning they bind with GSK-3beta pfam05350 to maintain the anchoring of microtubules to the centromere. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells.
  • COG Smc 393aa 9e-09 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

BIN1

  • refseq_BIN1.F1 refseq_BIN1.R1 128 260
  • NCBIGene 36.3 274
  • Multiple exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139343

  • cd SH3 68aa 5e-08 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart BAR 253aa 2e-51 in ref transcript
    • BAR domain.
  • smart SH3 70aa 1e-09 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! PRK PRK07764 241aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.
  • Changed! PRK PRK07764 191aa 2e-04 in modified transcript

BIN1

  • refseq_BIN1.F3 refseq_BIN1.R3 255 384
  • NCBIGene 36.3 274
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139346

  • cd SH3 68aa 4e-08 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart BAR 222aa 2e-45 in ref transcript
    • BAR domain.
  • smart SH3 70aa 1e-09 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

BIN1

  • refseq_BIN1.F4 refseq_BIN1.R4 167 296
  • NCBIGene 36.3 274
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139343

  • cd SH3 68aa 5e-08 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart BAR 253aa 2e-51 in ref transcript
    • BAR domain.
  • smart SH3 70aa 1e-09 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! PRK PRK07764 241aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.
  • Changed! PRK PRK07764 175aa 1e-05 in modified transcript

BIN1

  • refseq_BIN1.F6 refseq_BIN1.R6 306 399
  • NCBIGene 36.3 274
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139343

  • cd SH3 68aa 5e-08 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! smart BAR 253aa 2e-51 in ref transcript
    • BAR domain.
  • smart SH3 70aa 1e-09 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • PRK PRK07764 241aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.
  • Changed! smart BAR 222aa 1e-47 in modified transcript

BIN1

  • refseq_BIN1.F7 refseq_BIN1.R7 126 171
  • NCBIGene 36.3 274
  • Single exon skipping, size difference: 45
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_139343

  • cd SH3 68aa 5e-08 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart BAR 253aa 2e-51 in ref transcript
    • BAR domain.
  • smart SH3 70aa 1e-09 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! PRK PRK07764 241aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.
  • Changed! PRK PRK07764 212aa 0.010 in modified transcript

XAF1

  • refseq_BIRC4BP.F2 refseq_BIRC4BP.R2 217 274
  • NCBIGene 36.3 54739
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017523

BMF

  • refseq_BMF.F1 refseq_BMF.R1 324 387
  • NCBIGene 36.3 90427
  • Alternative 5-prime, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003940

BMP1

  • refseq_BMP1.F1 refseq_BMP1.R1 103 469
  • NCBIGene 36.3 649
  • Single exon skipping, size difference: 366
  • Exclusion of the stop codon
  • Reference transcript: NM_006128

  • cd ZnMc_BMP1_TLD 195aa 1e-111 in ref transcript
    • Zinc-dependent metalloprotease; BMP1/TLD-like subfamily. BMP1 (Bone morphogenetic protein 1) and TLD (tolloid)-like metalloproteases play vital roles in extracellular matrix formation, by cleaving precursor proteins such as enzymes, structural proteins, and proteins involved in the mineralization of the extracellular matrix. The drosophila protein tolloid and its Xenopus homologue xolloid cleave and inactivate Sog and chordin, respectively, which are inhibitors of Dpp (the Drosophila decapentaplegic gene product) and its homologue BMP4, involved in dorso-ventral patterning.
  • cd CUB 112aa 1e-31 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 110aa 2e-31 in ref transcript
  • cd CUB 112aa 8e-30 in ref transcript
  • cd EGF_CA 42aa 2e-05 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • pfam Astacin 194aa 4e-89 in ref transcript
    • Astacin (Peptidase family M12A). The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family contain two conserved disulphide bridges, these are joined 1-4 and 2-3. Members of this family have an amino terminal propeptide which is cleaved to give the active protease domain. All other linked domains are found to the carboxyl terminus of this domain. This family includes: Astacin, a digestive enzyme from Crayfish. Meprin, a multiple domain membrane component that is constructed from a homologous alpha and beta chain. Proteins involved in morphogenesis, and Tolloid from drosophila.
  • pfam CUB 110aa 6e-46 in ref transcript
    • CUB domain.
  • pfam CUB 110aa 4e-41 in ref transcript
  • pfam CUB 110aa 2e-39 in ref transcript
  • smart EGF_CA 42aa 2e-06 in ref transcript
    • Calcium-binding EGF-like domain.

BNIP1

  • refseq_BNIP1.F1 refseq_BNIP1.R1 130 232
  • NCBIGene 36.3 662
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013979

  • pfam Sec20 92aa 6e-21 in ref transcript
    • Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway.

BNIP1

  • refseq_BNIP1.F3 refseq_BNIP1.R3 119 248
  • NCBIGene 36.3 662
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013979

  • pfam Sec20 92aa 6e-21 in ref transcript
    • Sec20. Sec20 is a membrane glycoprotein associated with secretory pathway.

BNIPL

  • refseq_BNIPL.F1 refseq_BNIPL.R1 125 279
  • NCBIGene 36.2 149428
  • Single exon skipping, size difference: 154
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_138278

  • Changed! cd SEC14 147aa 2e-17 in ref transcript
    • Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
  • Changed! smart SEC14 148aa 2e-14 in ref transcript
    • Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p). Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p) and in RhoGAPs, RhoGEFs and the RasGAP, neurofibromin (NF1). Lipid-binding domain. The SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
  • Changed! cd SEC14 89aa 0.004 in modified transcript

BOLA3

  • refseq_BOLA3.F1 refseq_BOLA3.R1 248 337
  • NCBIGene 36.3 388962
  • Single exon skipping, size difference: 89
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_212552

  • Changed! pfam BolA 68aa 8e-08 in ref transcript
    • BolA-like protein. This family consist of the morphoprotein BolA from Escherichia coli and its various homologues. In Escherichia coli over expression of this protein causes round morphology and may be involved in switching the cell between elongation and septation systems during cell division. The expression of BolA is growth rate regulated and is induced during the transition into the the stationary phase. BolA is also induced by stress during early stages of growth and may have a general role in stress response. It has also been suggested that BolA can induce the transcription of penicillin binding proteins 6 and 5.
  • Changed! COG BolA 73aa 5e-05 in ref transcript
    • Stress-induced morphogen (activity unknown) [Signal transduction mechanisms].

BPGM

  • refseq_BPGM.F1 refseq_BPGM.R1 131 452
  • NCBIGene 36.3 669
  • Single exon skipping, size difference: 321
  • Exclusion in 5'UTR
  • Reference transcript: NM_199186

  • cd HP_PGM_like 108aa 7e-24 in ref transcript
    • Histidine phosphatase domain found in phosphoglycerate mutases and related proteins, mostly phosphatases; contains a His residue which is phosphorylated during the reaction. Subgroup of the catalytic domain of a functionally diverse set of proteins, most of which are phosphatases. The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. This subgroup contains cofactor-dependent and cofactor-independent phosphoglycerate mutases (dPGM, and BPGM respectively), fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, and related proteins. Functions include roles in metabolism, signaling, or regulation, for example, F26BPase affects glycolysis and gluconeogenesis through controlling the concentration of F26BP; BPGM controls the concentration of 2,3-BPG (the main allosteric effector of hemoglobin in human blood cells); human Sts-1 is a T-cell regulator; Escherichia coli Six A participates in the ArcB-dependent His-to-Asp phosphorelay signaling system. Deficiency and mutation in many of the human members result in disease, for example erythrocyte BPGM deficiency is a disease associated with a decrease in the concentration of 2,3-BPG.
  • cd HP 67aa 2e-13 in ref transcript
    • Histidine phosphatase domain found in a functionally diverse set of proteins, mostly phosphatases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of a functionally diverse set of proteins, most of which are phosphatases. The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. This set of proteins includes cofactor-dependent and cofactor-independent phosphoglycerate mutases (dPGM, and BPGM respectively), fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, histidine acid phosphatases, phytases, and related proteins. Functions include roles in metabolism, signaling, or regulation, for example F26BPase affects glycolysis and gluconeogenesis through controlling the concentration of F26BP; BPGM controls the concentration of 2,3-BPG (the main allosteric effector of hemoglobin in human blood cells); human Sts-1 is a T-cell regulator; Escherichia coli Six A participates in the ArcB-dependent His-to-Asp phosphorelay signaling system; phytases scavenge phosphate from extracellular sources. Deficiency and mutation in many of the human members result in disease, for example erythrocyte BPGM deficiency is a disease associated with a decrease in the concentration of 2,3-BPG. Clinical applications include the use of prostatic acid phosphatase (PAP) as a serum marker for prostate cancer. Agricultural applications include the addition of phytases to animal feed.
  • TIGR pgm_1 249aa 2e-94 in ref transcript
    • Most members of this family are phosphoglycerate mutase (EC 5.4.2.1). This enzyme interconverts 2-phosphoglycerate and 3-phosphoglycerate. The enzyme is transiently phosphorylated on an active site histidine by 2,3-diphosphoglyerate, which is both substrate and product. Some members of this family have are phosphoglycerate mutase as a minor activity and act primarily as a bisphoglycerate mutase, interconverting 2,3-diphosphoglycerate and 1,3-diphosphoglycerate (EC 5.4.2.4). This model is designated as a subfamily for this reason. The second and third paralogs in S. cerevisiae are somewhat divergent and apparently inactive (see PUBMED:9544241) but are also part of this subfamily phylogenetically.
  • PTZ PTZ00123 240aa 2e-86 in ref transcript
    • phosphoglycerate mutase I; Provisional.

BRCC3

  • refseq_BRCC3.F1 refseq_BRCC3.R1 104 179
  • NCBIGene 36.3 79184
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024332

  • smart JAB_MPN 166aa 3e-23 in ref transcript
    • JAB/MPN domain. Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.
  • COG COG1310 31aa 0.001 in ref transcript
    • Predicted metal-dependent protease of the PAD1/JAB1 superfamily [General function prediction only].

BRD8

  • refseq_BRD8.F2 refseq_BRD8.R2 191 236
  • NCBIGene 36.3 10902
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006696

  • cd Bromo_brd8_like 104aa 3e-53 in ref transcript
    • Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • smart BROMO 103aa 1e-30 in ref transcript
    • bromo domain.
  • COG COG5076 105aa 6e-13 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].

BRD8

  • refseq_BRD8.F4 refseq_BRD8.R4 153 372
  • NCBIGene 36.3 10902
  • Single exon skipping, size difference: 219
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_139199

  • cd Bromo_brd8_like 104aa 8e-53 in ref transcript
    • Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd Bromo_brd8_like 104aa 3e-45 in ref transcript
  • smart BROMO 103aa 5e-30 in ref transcript
    • bromo domain.
  • smart BROMO 98aa 1e-26 in ref transcript
  • COG COG5076 201aa 8e-14 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • COG COG5076 105aa 8e-13 in ref transcript

BRD8

  • refseq_BRD8.F5 refseq_BRD8.R5 172 289
  • NCBIGene 36.3 10902
  • Alternative 3-prime, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139199

  • cd Bromo_brd8_like 104aa 8e-53 in ref transcript
    • Bromodomain, brd8_like subgroup. In mammals, brd8 (bromodomain containing 8) interacts with the thyroid hormone receptor in a ligand-dependent fashion and enhances thyroid hormone-dependent activation from thyroid response elements. Brd8 is thought to be a nuclear receptor coactivator. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd Bromo_brd8_like 104aa 3e-45 in ref transcript
  • smart BROMO 103aa 5e-30 in ref transcript
    • bromo domain.
  • smart BROMO 98aa 1e-26 in ref transcript
  • COG COG5076 201aa 8e-14 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • Changed! COG COG5076 105aa 8e-13 in ref transcript
  • Changed! COG COG5076 81aa 1e-12 in modified transcript

BRD9

  • refseq_BRD9.F1 refseq_BRD9.R1 108 168
  • NCBIGene 36.3 65980
  • Alternative 3-prime, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_023924

  • cd Bromo_brd7_like 98aa 2e-45 in ref transcript
    • Bromodomain, brd7_like subgroup. The BRD7 gene encodes a nuclear protein that has been shown to inhibit cell growth and the progression of the cell cycle by regulating cell-cycle genes at the transcriptional level. BRD7 has been identified as a gene involved in nasopharyngeal carcinoma. The protein interacts with acetylated histone H3 via its bromodomain. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • Changed! smart BROMO 104aa 2e-22 in ref transcript
    • bromo domain.
  • Changed! COG COG5076 125aa 2e-11 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • PRK PRK08099 62aa 0.002 in ref transcript
    • nicotinamide-nucleotide adenylyltransferase; Provisional.
  • Changed! smart BROMO 103aa 1e-22 in modified transcript
  • Changed! COG COG5076 99aa 2e-11 in modified transcript

BRE

  • refseq_BRE.F1 refseq_BRE.R1 112 178
  • NCBIGene 36.3 9577
  • Single exon skipping, size difference: 66
  • Inclusion in 5'UTR
  • Reference transcript: NM_004899

  • pfam BRE 333aa 0.0 in ref transcript
    • Brain and reproductive organ-expressed protein (BRE). This family consists of several eukaryotic brain and reproductive organ-expressed (BRE) proteins. BRE is a putative stress-modulating gene, found able to down-regulate TNF-alpha-induced-NF-kappaB activation upon over expression. A total of six isoforms are produced by alternative splicing predominantly at either end of the gene.Compared to normal cells, immortalised human cell lines uniformly express higher levels of BRE. Peripheral blood monocytes respond to LPS by down-regulating the expression of all the BRE isoforms.It is thought that the function of BRE and its isoforms is to regulate peroxisomal activities.

BRMS1

  • refseq_BRMS1.F1 refseq_BRMS1.R1 255 337
  • NCBIGene 36.3 25855
  • Alternative 3-prime, size difference: 82
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001024957

  • pfam Sds3 170aa 1e-33 in ref transcript
    • Sds3-like. Repression of gene transcription is mediated by histone deacetylases containing repressor-co-repressor complexes, which are recruited to promoters of target genes via interactions with sequence-specific transcription factors. The co-repressor complex contains a core of at least seven proteins. This family represents the conserved region found in Sds3, Dep1 and BRMS1-homologue p40 proteins.

BRPF1

  • refseq_BRPF1.F2 refseq_BRPF1.R2 102 120
  • NCBIGene 36.3 7862
  • Alternative 5-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003694

  • Changed! cd Bromo_brd1_like 104aa 2e-50 in ref transcript
    • Bromodomain; brd1_like subfamily. BRD1 is a mammalian gene which encodes for a nuclear protein assumed to be a transcriptional regulator. BRD1 has been implicated with brain development and susceptibility to schizophrenia and bipolar affective disorder. Bromodomains are 110 amino acid long domains that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BR140_related 116aa 2e-50 in ref transcript
    • The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2. BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region. In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
  • Changed! smart BROMO 113aa 9e-25 in ref transcript
    • bromo domain.
  • pfam EPL1 151aa 2e-15 in ref transcript
    • Enhancer of polycomb-like. This is a family of EPL1 (Enhancer of polycomb-like) proteins. The EPL1 protein is a member of a histone acetyltransferase complex which is involved in transcriptional activation of selected genes.
  • smart PWWP 84aa 5e-15 in ref transcript
    • domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues.
  • smart PHD 46aa 3e-07 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
  • COG COG5141 435aa 5e-72 in ref transcript
    • PHD zinc finger-containing protein [General function prediction only].
  • Changed! COG COG5076 129aa 2e-09 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • COG SFP1 51aa 0.008 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].
  • Changed! cd Bromo_brd1_like 98aa 2e-52 in modified transcript
  • Changed! smart BROMO 107aa 1e-26 in modified transcript
  • Changed! COG COG5076 123aa 4e-11 in modified transcript

BSG

  • refseq_BSG.F1 refseq_BSG.R1 100 448
  • NCBIGene 36.3 682
  • Single exon skipping, size difference: 348
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001728

  • Changed! cd IGcam 89aa 3e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 98aa 5e-04 in ref transcript
  • smart IG_like 92aa 7e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • Changed! smart IG_like 85aa 3e-07 in ref transcript

BSG

  • refseq_BSG.F3 refseq_BSG.R3 238 395
  • NCBIGene 36.3 682
  • Single exon skipping, size difference: 157
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_198591

  • Changed! cd IGcam 98aa 2e-04 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! smart IG_like 92aa 1e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.

BTBD1

  • refseq_BTBD1.F2 refseq_BTBD1.R2 250 338
  • NCBIGene 36.3 53339
  • Single exon skipping, size difference: 88
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_025238

  • Changed! pfam PHR 150aa 1e-49 in ref transcript
    • PHR domain. This domain is called PHR as it was original found in the proteins PAM, highwire and RPM. This domain can be duplicated in the highwire, PFAM and PRM sequence. The C-terminal region of the protein BTBD1 includes the PHR domain and is known to interact with Topoisomerase I, an enzyme which relaxes DNA supercoils.
  • pfam BTB 114aa 1e-22 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • smart BACK 100aa 9e-12 in ref transcript
    • BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.

BTF3

  • refseq_BTF3.F2 refseq_BTF3.R2 113 354
  • NCBIGene 36.3 689
  • Alternative 5-prime, size difference: 241
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001037637

  • pfam NAC 60aa 1e-18 in ref transcript
    • NAC domain.

BTN2A1

  • refseq_BTN2A1.F1 refseq_BTN2A1.R1 168 397
  • NCBIGene 36.3 11120
  • Alternative 3-prime, size difference: 229
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_007049

  • Changed! smart SPRY 119aa 5e-29 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • Changed! smart PRY 55aa 4e-19 in ref transcript
    • associated with SPRY domains.
  • pfam V-set 97aa 2e-10 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • pfam C2-set_2 83aa 0.003 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

BTN3A3

  • refseq_BTN3A3.F1 refseq_BTN3A3.R1 226 316
  • NCBIGene 36.3 10384
  • Single exon skipping, size difference: 90
  • Exclusion of the protein initiation site
  • Reference transcript: NM_006994

  • smart SPRY 123aa 9e-35 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart PRY 52aa 2e-17 in ref transcript
    • associated with SPRY domains.
  • Changed! pfam V-set 105aa 1e-09 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • Changed! pfam V-set 91aa 5e-07 in modified transcript

BTNL3

  • refseq_BTNL3.F1 refseq_BTNL3.R1 119 227
  • NCBIGene 36.2 10917
  • Alternative 5-prime, size difference: 108
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_197975

  • smart SPRY 124aa 5e-25 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart PRY 50aa 2e-11 in ref transcript
    • associated with SPRY domains.
  • pfam V-set 97aa 2e-06 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

BTNL3

  • refseq_BTNL3.F3 refseq_BTNL3.R3 108 318
  • NCBIGene 36.2 10917
  • Alternative 5-prime and 3-prime, size difference: 210
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_197975

  • smart SPRY 124aa 5e-25 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart PRY 50aa 2e-11 in ref transcript
    • associated with SPRY domains.
  • pfam V-set 97aa 2e-06 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

BTRC

  • refseq_BTRC.F1 refseq_BTRC.R1 284 392
  • NCBIGene 36.3 8945
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033637

  • cd WD40 279aa 2e-60 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • smart WD40 38aa 7e-06 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 37aa 2e-05 in ref transcript
  • smart FBOX 39aa 1e-04 in ref transcript
    • A Receptor for Ubiquitination Targets.
  • pfam WD40 36aa 2e-04 in ref transcript
    • WD domain, G-beta repeat.
  • smart WD40 38aa 3e-04 in ref transcript
  • COG COG2319 311aa 8e-30 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! smart WD40 38aa 0.009 in modified transcript
  • Changed! smart WD40 35aa 0.010 in modified transcript

C10orf128

  • refseq_C10orf128.F1 refseq_C10orf128.R1 106 184
  • NCBIGene 36.2 170371
  • Alternative 3-prime, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_926220

C10orf128

  • refseq_C10orf128.F1 refseq_C10orf128.R3 206 285
  • NCBIGene 36.2 170371
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_931097

C10orf61

  • refseq_C10orf61.F2 refseq_C10orf61.R2 116 197
  • NCBIGene 36.2 26123
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001013840

  • Changed! pfam DUF1619 301aa 8e-86 in ref transcript
    • Protein of unknown function (DUF1619). This is a family of sequences derived from hypothetical eukaryotic proteins. The region in question is approximately 330 residues long and has a cysteine rich amino-terminus.
  • Changed! pfam DUF1619 274aa 2e-73 in modified transcript

C13orf23

  • refseq_C13orf23.F2 refseq_C13orf23.R2 216 282
  • NCBIGene 36.3 80209
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025138

C14orf102

  • refseq_C14orf102.F2 refseq_C14orf102.R2 100 443
  • NCBIGene 36.3 55051
  • Multiple exon skipping, size difference: 343
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017970

  • Changed! pfam DUF1740 238aa 8e-74 in ref transcript
    • Protein of unknown function (DUF1740). This is a family of eukaryotic proteins of unknown function.

INF2

  • refseq_C14orf173.F1 refseq_C14orf173.R1 295 352
  • NCBIGene 36.3 64423
  • Single exon skipping, size difference: 57
  • Exclusion of the stop codon
  • Reference transcript: NM_022489

  • pfam FH2 372aa 2e-66 in ref transcript
    • Formin Homology 2 Domain.
  • pfam Drf_FH3 67aa 2e-09 in ref transcript
    • Diaphanous FH3 Domain. This region is found in the Formin-like and and diaphanous proteins.
  • pfam Drf_GBD 60aa 6e-08 in ref transcript
    • Diaphanous GTPase-binding Domain. This domain is bound to by GTP-attached Rho proteins, leading to activation of the Drf protein.

ABHD12B

  • refseq_C14orf29.F1 refseq_C14orf29.R1 144 247
  • NCBIGene 36.3 145447
  • Single exon skipping, size difference: 103
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_181814

  • Changed! TIGR hydr2_PEP 125aa 1e-09 in ref transcript
    • This group of proteins are members of the alpha/beta hydrolase superfamily. These proteins are generally found in genomes containing the exosortase/PEP-CTERM protein expoert system, specifically the type 1 variant of this system described by the Genome Property GenProp0652. When found in this context they are invariably present in the vicinity of a second, relatively unrelated enzyme (ortholog 1, TIGR03100) of the same superfamily.
  • Changed! pfam Peptidase_S9 173aa 2e-04 in ref transcript
    • Prolyl oligopeptidase family.
  • Changed! COG COG1073 222aa 2e-15 in ref transcript
    • Hydrolases of the alpha/beta superfamily [General function prediction only].

C15orf21

  • refseq_C15orf21.F1 refseq_C15orf21.R1 200 305
  • NCBIGene 36.3 283651
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 105
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001005266

C15orf21

  • refseq_C15orf21.F4 refseq_C15orf21.R4 148 268
  • NCBIGene 36.3 283651
  • Single exon skipping, size difference: 120
  • Exclusion in 5'UTR
  • Reference transcript: NM_001005266

C15orf21

  • refseq_C15orf21.F4 refseq_C15orf21.R5 104 349
  • NCBIGene 36.3 283651
  • Multiple exon skipping, size difference: 245
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001005266

C15orf21

  • refseq_C15orf21.F6 refseq_C15orf21.R7 194 271
  • NCBIGene 36.3 283651
  • Single exon skipping, size difference: 77
  • Inclusion in 5'UTR
  • Reference transcript: NM_001005267

C17orf58

  • refseq_C17orf58.F1 refseq_C17orf58.R1 227 370
  • NCBIGene 36.3 284018
  • Alternative 5-prime, size difference: 143
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_181655

  • Changed! cd NTR_PCOLCE 52aa 3e-07 in ref transcript
    • NTR domain, PCOLCE subfamily; Procollagen C-endopeptidase enhancers (PCOLCEs) are extracellular matrix proteins that enhance the activity of procollagen C-proteases, by binding to the procollagen I C-peptide. They contain a C-terminal NTR domain, which have been suggested to possess inhibitory functions towards specific serine proteases but not towards metzincins, which are inhibited by the related TIMPs.

C18orf1

  • refseq_C18orf1.F1 refseq_C18orf1.R1 256 310
  • NCBIGene 36.3 753
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181481

  • cd LDLa 32aa 2e-05 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • pfam Ldl_recept_a 32aa 1e-05 in ref transcript
    • Low-density lipoprotein receptor domain class A.

C18orf24

  • refseq_C18orf24.F1 refseq_C18orf24.R1 143 188
  • NCBIGene 36.3 220134
  • Alternative 5-prime, size difference: 45
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039535

  • pfam DUF1395 235aa 9e-68 in ref transcript
    • Protein of unknown function (DUF1395). This family consists of several hypothetical eukaryotic proteins of around 250 residues in length. The function of this family is unknown.

C19orf12

  • refseq_C19orf12.F2 refseq_C19orf12.R2 175 345
  • NCBIGene 36.2 83636
  • Single exon skipping, size difference: 170
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001031726

C19orf2

  • refseq_C19orf2.F1 refseq_C19orf2.R1 107 142
  • NCBIGene 36.3 8725
  • Single exon skipping, size difference: 35
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003796

  • Changed! cd Prefoldin_alpha 103aa 2e-08 in ref transcript
    • Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
  • Changed! pfam Prefoldin 110aa 1e-09 in ref transcript
    • Prefoldin subunit. This family comprises of several prefoldin subunits. The biogenesis of the cytoskeletal proteins actin and tubulin involves interaction of nascent chains of each of the two proteins with the oligomeric protein prefoldin (PFD) and their subsequent transfer to the cytosolic chaperonin CCT (chaperonin containing TCP-1). Electron microscopy shows that eukaryotic PFD, which has a similar structure to its archaeal counterpart, interacts with unfolded actin along the tips of its projecting arms. In its PFD-bound state, actin seems to acquire a conformation similar to that adopted when it is bound to CCT.
  • Changed! PRK PRK03947 93aa 0.001 in ref transcript
    • prefoldin subunit alpha; Reviewed.

C19orf36

  • refseq_C19orf36.F1 refseq_C19orf36.R1 178 232
  • NCBIGene 36.3 113177
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039846

C19orf48

  • refseq_C19orf48.F1 refseq_C19orf48.R1 162 323
  • NCBIGene 36.3 84798
  • Single exon skipping, size difference: 161
  • Exclusion in 5'UTR
  • Reference transcript: NM_199249

C19orf6

  • refseq_C19orf6.F1 refseq_C19orf6.R1 308 408
  • NCBIGene 36.3 91304
  • Single exon skipping, size difference: 100
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001033026

  • pfam Membralin 370aa 1e-161 in ref transcript
    • Tumour-associated protein. Membralin is evolutionarily highly conserved; though it seems to represent a unique protein family. The protein appears to contain several transmembrane regions. In humans it is expressed in certain cancers, particularly ovarian cancers.

C1GALT1C1

  • refseq_C1GALT1C1.F1 refseq_C1GALT1C1.R1 185 357
  • NCBIGene 36.3 29071
  • Single exon skipping, size difference: 172
  • Exclusion in 5'UTR
  • Reference transcript: NM_152692

CAPRIN2

  • refseq_C1QDC1.F1 refseq_C1QDC1.R1 161 244
  • NCBIGene 36.3 65981
  • Single exon skipping, size difference: 83
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001002259

  • Changed! pfam C1q 126aa 8e-34 in ref transcript
    • C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.
  • Changed! pfam Herpes_BLLF1 204aa 0.009 in ref transcript
    • Herpes virus major outer envelope glycoprotein (BLLF1). This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.

CAPRIN2

  • refseq_C1QDC1.F3 refseq_C1QDC1.R3 205 352
  • NCBIGene 36.3 65981
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001002259

  • pfam C1q 126aa 8e-34 in ref transcript
    • C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.
  • Changed! pfam Herpes_BLLF1 204aa 0.009 in ref transcript
    • Herpes virus major outer envelope glycoprotein (BLLF1). This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.

C1QTNF1

  • refseq_C1QTNF1.F1 refseq_C1QTNF1.R1 231 400
  • NCBIGene 36.3 114897
  • Single exon skipping, size difference: 169
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_030968

  • Changed! pfam C1q 129aa 1e-36 in ref transcript
    • C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.
  • Changed! pfam Collagen 42aa 1e-05 in ref transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.

C1QTNF3

  • refseq_C1QTNF3.F2 refseq_C1QTNF3.R2 131 350
  • NCBIGene 36.3 114899
  • Alternative 5-prime, size difference: 219
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181435

  • pfam C1q 124aa 2e-35 in ref transcript
    • C1q domain. C1q is a subunit of the C1 enzyme complex that activates the serum complement system.

C1S

  • refseq_C1S.F1 refseq_C1S.R1 112 199
  • NCBIGene 36.3 716
  • Alternative 3-prime, size difference: 87
  • Inclusion in 5'UTR
  • Reference transcript: NM_001734

  • cd Tryp_SPc 241aa 2e-53 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • cd CUB 112aa 1e-20 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 113aa 9e-18 in ref transcript
  • cd CCP 62aa 1e-04 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd EGF_CA 35aa 0.005 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart Tryp_SPc 239aa 3e-57 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • pfam CUB 113aa 1e-24 in ref transcript
    • CUB domain.
  • smart CUB 107aa 7e-22 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • pfam Sushi 61aa 6e-06 in ref transcript
    • Sushi domain (SCR repeat).
  • smart EGF_CA 41aa 5e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • pfam Sushi 63aa 0.008 in ref transcript
  • COG COG5640 251aa 1e-16 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].

C1orf102

  • refseq_C1orf102.F1 refseq_C1orf102.R1 166 196
  • NCBIGene 36.3 127700
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145047

  • Changed! pfam Oscp1 184aa 7e-63 in ref transcript
    • Organic solute transport protein 1. Oscp1 is a family of proteins conserved from plants to humans. It is called organic solute transport protein or oxido-red- nitro domain-containing protein 1, however no reference could be find to confirm the function of the protein.
  • Changed! pfam Oscp1 174aa 4e-65 in modified transcript

C1orf125

  • refseq_C1orf125.F1 refseq_C1orf125.R1 260 482
  • NCBIGene 36.3 126859
  • Multiple exon skipping, size difference: 222
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144696

  • pfam Ax_dynein_light 109aa 1e-04 in ref transcript
    • Axonemal dynein light chain. Axonemal dynein light chain proteins play a dynamic role in flagellar and cilia motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains.

C1orf178

  • refseq_C1orf178.F1 refseq_C1orf178.R1 180 302
  • NCBIGene 36.2 440603
  • Single exon skipping, size difference: 122
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001010922

C1orf178

  • refseq_C1orf178.F3 refseq_C1orf178.R3 102 327
  • NCBIGene 36.2 440603
  • Single exon skipping, size difference: 225
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001010922

C1orf2

  • refseq_C1orf2.F1 refseq_C1orf2.R1 112 400
  • NCBIGene 36.3 10712
  • Multiple exon skipping, size difference: 288
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006589

C1orf34

  • refseq_C1orf34.F1 refseq_C1orf34.R1 130 235
  • NCBIGene 36.2 22996
  • Alternative 3-prime, size difference: 105
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: XM_930328

  • Changed! pfam Deme6 441aa 1e-157 in ref transcript
    • Breast cancer protein. This is a family of proteins conserved from fungi to humans, which, in humans is expressed in primary breast carcinomas but not in normal breast tissue. There appears to be a putative eukaryotic RNP-1 motif and a candidate anchoring transmembrane domain. Deme6 is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive. Members of this family also carry a TPR_2 domain pfam07719 at their C-terminus.
  • Changed! pfam Deme6 476aa 1e-155 in modified transcript

C1orf43

  • refseq_C1orf43.F2 refseq_C1orf43.R2 253 307
  • NCBIGene 36.3 25912
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098616

  • Changed! pfam NICE-3 189aa 1e-85 in ref transcript
    • NICE-3 protein. This family consists of several eukaryotic NICE-3 and related proteins. The gene coding for NICE-3 is part of the epidermal differentiation complex (EDC) which comprises a large number of genes that are of crucial importance for the maturation of the human epidermis. The function of NICE-3 is unknown.
  • Changed! pfam NICE-3 171aa 4e-72 in modified transcript

C1orf9

  • refseq_C1orf9.F2 refseq_C1orf9.R2 310 421
  • NCBIGene 36.3 51430
  • Single exon skipping, size difference: 111
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_016227

  • pfam Sad1_UNC 123aa 6e-40 in ref transcript
    • Sad1 / UNC-like C-terminal. The Caenorhabditis elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localises at the spindle pole body. UNC-84 and and Sad1 share a common C-terminal region, that is often termed the SUN (Sad1 and UNC) domain. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2. The SUN domain of Sun2 has been demonstrated to be in the periplasm.
  • pfam SMC_N 159aa 0.008 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.

C1orf9

  • refseq_C1orf9.F3 refseq_C1orf9.R3 108 129
  • NCBIGene 36.3 51430
  • Single exon skipping, size difference: 21
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_016227

  • Changed! pfam Sad1_UNC 123aa 6e-40 in ref transcript
    • Sad1 / UNC-like C-terminal. The Caenorhabditis elegans UNC-84 protein is a nuclear envelope protein that is involved in nuclear anchoring and migration during development. The S. pombe Sad1 protein localises at the spindle pole body. UNC-84 and and Sad1 share a common C-terminal region, that is often termed the SUN (Sad1 and UNC) domain. In mammals, the SUN domain is present in two proteins, Sun1 and Sun2. The SUN domain of Sun2 has been demonstrated to be in the periplasm.
  • pfam SMC_N 159aa 0.008 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! pfam Sad1_UNC 130aa 9e-40 in modified transcript

C20orf121

  • refseq_C20orf121.F1 refseq_C20orf121.R1 103 127
  • NCBIGene 36.3 79183
  • Single exon skipping, size difference: 24
  • Exclusion in 5'UTR
  • Reference transcript: NM_024331

  • cd SEC14 157aa 2e-31 in ref transcript
    • Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
  • pfam CRAL_TRIO 116aa 2e-25 in ref transcript
    • CRAL/TRIO domain. The original profile has been extended to include the carboxyl domain from the known structure of Sec14.
  • pfam CRAL_TRIO_N 67aa 2e-07 in ref transcript
    • CRAL/TRIO, N-terminus. This all-alpha domain is found to the N-terminus of pfam00650.

C20orf132

  • refseq_C20orf132.F1 refseq_C20orf132.R1 210 315
  • NCBIGene 36.3 140699
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152503

MACROD2

  • refseq_C20orf133.F2 refseq_C20orf133.R2 218 287
  • NCBIGene 36.3 140733
  • Single exon skipping, size difference: 69
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_080676

  • cd Macro_Appr_pase_like 167aa 3e-69 in ref transcript
    • Macro domain, Appr-1"-pase_like family. The macro domain is a high-affinity ADP-ribose binding module found in a variety of proteins as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Some macro domains recognize poly ADP-ribose as a ligand. Previously identified as displaying an Appr-1"-p (ADP-ribose-1"-monophosphate) processing activity, the macro domain may play roles in distinct ADP-ribose pathways, such as the ADP-ribosylation of proteins, an important post-translational modification which occurs in DNA repair, transcription, chromatin biology, and long-term memory formation, among other processes. This family is composed of uncharacterized proteins that show similarity to Appr-1"-pase, containing conserved putative active site residues. Appr-1"-pase is a phosphatase specific for ADP-ribose-1"-monophosphate.
  • pfam Macro 114aa 5e-42 in ref transcript
    • Macro domain. This domain is an ADP-ribose binding module. It is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein. This domain is found in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alphaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria, and eukaryotes.
  • Changed! pfam UPF0560 113aa 4e-04 in ref transcript
    • Uncharacterised protein family UPF0560. This family of proteins has no known function.
  • PRK PRK00431 164aa 1e-55 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! pfam UPF0560 125aa 2e-04 in modified transcript
  • Changed! COG MDN1 169aa 0.007 in modified transcript
    • AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].

BANF2

  • refseq_C20orf179.F2 refseq_C20orf179.R2 202 365
  • NCBIGene 36.3 140836
  • Single exon skipping, size difference: 163
  • Exclusion in 5'UTR
  • Reference transcript: NM_178477

  • pfam BAF 89aa 2e-31 in ref transcript
    • Barrier to autointegration factor. The BAF protein has a SAM-domain-like bundle of orthogonally packed alpha-hairpins - one classic and one pseudo helix-hairpin-helix motif. The protein is involved in the prevention of retroviral DNA integration.

C20orf24

  • refseq_C20orf24.F1 refseq_C20orf24.R1 105 235
  • NCBIGene 36.3 55969
  • Single exon skipping, size difference: 130
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_199483

  • Changed! pfam Rab5ip 64aa 2e-23 in ref transcript
    • Rab5-interacting protein (Rab5ip). This family consists of several Rab5-interacting protein (RIP5 or Rab5ip ) sequences. The ras-related GTPase rab5 is rate-limiting for homotypic early endosome fusion. Rab5ip represents a novel rab5 interacting protein that may function on endocytic vesicles as a receptor for rab5-GDP and participate in the activation of rab5.
  • Changed! pfam Rab5ip 114aa 2e-49 in modified transcript

C20orf30

  • refseq_C20orf30.F2 refseq_C20orf30.R2 127 220
  • NCBIGene 36.3 29058
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 93
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001009924

  • pfam DUF872 96aa 4e-15 in ref transcript
    • Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterised eukaryotic proteins. The function of this family is unknown.

UQCC

  • refseq_C20orf44.F1 refseq_C20orf44.R1 111 192
  • NCBIGene 36.3 55245
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 81
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_018244

  • Changed! pfam Ubiq_cyt_C_chap 149aa 3e-64 in ref transcript
    • Ubiquinol-cytochrome C chaperone.
  • Changed! COG COG5452 170aa 2e-11 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam Ubiq_cyt_C_chap 148aa 2e-64 in modified transcript
  • Changed! COG COG5452 138aa 3e-09 in modified transcript

C20orf7

  • refseq_C20orf7.F1 refseq_C20orf7.R1 133 217
  • NCBIGene 36.3 79133
  • Single exon skipping, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024120

  • Changed! cd AdoMet_MTases 92aa 7e-09 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! TIGR BioC 237aa 1e-27 in ref transcript
    • This enzyme, which is found in biotin biosynthetic gene clusters in proteobacteria, firmicutes, green-sulfur bacteria, fusobacterium and bacteroides, is believed to carry out an enzymatic step prior to the formation of pimeloyl-CoA (although attribution of this annotation is not traceable). The enzyme appears related to methyltransferases by homology.
  • Changed! COG UbiE 105aa 3e-12 in ref transcript
    • Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
  • Changed! PRK PRK10258 217aa 8e-08 in ref transcript
    • biotin biosynthesis protein BioC; Provisional.

C20orf7

  • refseq_C20orf7.F2 refseq_C20orf7.R2 285 369
  • NCBIGene 36.3 79133
  • Single exon skipping, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024120

  • Changed! cd AdoMet_MTases 92aa 7e-09 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! TIGR BioC 237aa 1e-27 in ref transcript
    • This enzyme, which is found in biotin biosynthetic gene clusters in proteobacteria, firmicutes, green-sulfur bacteria, fusobacterium and bacteroides, is believed to carry out an enzymatic step prior to the formation of pimeloyl-CoA (although attribution of this annotation is not traceable). The enzyme appears related to methyltransferases by homology.
  • Changed! COG UbiE 105aa 3e-12 in ref transcript
    • Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
  • Changed! PRK PRK10258 217aa 8e-08 in ref transcript
    • biotin biosynthesis protein BioC; Provisional.

C20orf7

  • refseq_C20orf7.F3 refseq_C20orf7.R3 214 298
  • NCBIGene 36.3 79133
  • Single exon skipping, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024120

  • Changed! cd AdoMet_MTases 92aa 7e-09 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! TIGR BioC 237aa 1e-27 in ref transcript
    • This enzyme, which is found in biotin biosynthetic gene clusters in proteobacteria, firmicutes, green-sulfur bacteria, fusobacterium and bacteroides, is believed to carry out an enzymatic step prior to the formation of pimeloyl-CoA (although attribution of this annotation is not traceable). The enzyme appears related to methyltransferases by homology.
  • Changed! COG UbiE 105aa 3e-12 in ref transcript
    • Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
  • Changed! PRK PRK10258 217aa 8e-08 in ref transcript
    • biotin biosynthesis protein BioC; Provisional.

C21orf57

  • refseq_C21orf57.F1 refseq_C21orf57.R1 105 427
  • NCBIGene 36.3 54059
  • Alternative 5-prime, size difference: 322
  • Exclusion in 5'UTR
  • Reference transcript: NM_058181

  • TIGR TIGR00043 115aa 8e-36 in ref transcript
    • This uncharacterized protein family is represented by a single member sequence only in nearly every bacterium.
  • PRK PRK00016 117aa 5e-24 in ref transcript
    • putative metalloprotease; Provisional.

C21orf58

  • refseq_C21orf58.F1 refseq_C21orf58.R1 203 372
  • NCBIGene 36.2 54058
  • Alternative 3-prime, size difference: 169
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_058180

C21orf58

  • refseq_C21orf58.F3 refseq_C21orf58.R3 176 256
  • NCBIGene 36.2 54058
  • Alternative 3-prime, size difference: 80
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_058180

C21orf66

  • refseq_C21orf66.F1 refseq_C21orf66.R1 96 113
  • NCBIGene 36.2 94104
  • Alternative 5-prime, size difference: 17
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_016631

  • Changed! pfam GCFC 501aa 1e-172 in ref transcript
    • GC-rich sequence DNA-binding factor-like protein. Sequences found in this family are similar to a region of a human GC-rich sequence DNA-binding factor homolog. This is thought to be a protein involved in transcriptional regulation due to partial homologies to a transcription repressor and histone-interacting protein.
  • Changed! pfam GCFC 106aa 4e-38 in modified transcript

C2orf28

  • refseq_C2orf28.F1 refseq_C2orf28.R1 201 341
  • NCBIGene 36.3 51374
  • Alternative 5-prime, size difference: 140
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_080592

C3orf17

  • refseq_C3orf17.F1 refseq_C3orf17.R1 131 286
  • NCBIGene 36.3 25871
  • Single exon skipping, size difference: 155
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015412

C3orf43

  • refseq_C3orf43.F1 refseq_C3orf43.R1 176 232
  • NCBIGene 36.2 255798
  • Alternative 5-prime, size difference: 56
  • Inclusion in the protein causing a frameshift
  • Reference transcript: XM_931549

C4orf13

  • refseq_C4orf13.F1 refseq_C4orf13.R1 232 450
  • NCBIGene 36.2 84068
  • Multiple exon skipping, size difference: 218
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001029998

  • Changed! COG COG0385 336aa 1e-21 in ref transcript
    • Predicted Na+-dependent transporter [General function prediction only].
  • Changed! COG COG0385 185aa 2e-09 in modified transcript

C4orf18

  • refseq_C4orf18.F1 refseq_C4orf18.R1 124 148
  • NCBIGene 36.3 51313
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 24
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031700

C6orf1

  • refseq_C6orf1.F1 refseq_C6orf1.R1 179 239
  • NCBIGene 36.3 221491
  • Alternative 5-prime, size difference: 60
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008703

  • TIGR FadB 33aa 0.002 in ref transcript
    • Members represent alpha subunit of multifunctional enzyme complex of the fatty acid degradation cycle. Activities include: enoyl-CoA hydratase (EC 4.2.1.17), dodecenoyl-CoA delta-isomerase activity (EC 5.3.3.8), 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), 3-hydroxybutyryl-CoA epimerase (EC 5.1.2.3). A representative is E. coli FadB. This model excludes the FadJ family.

C6orf1

  • refseq_C6orf1.F2 refseq_C6orf1.R2 134 386
  • NCBIGene 36.3 221491
  • Alternative 5-prime, size difference: 252
  • Exclusion in 5'UTR
  • Reference transcript: NM_178508

  • TIGR FadB 33aa 0.002 in ref transcript
    • Members represent alpha subunit of multifunctional enzyme complex of the fatty acid degradation cycle. Activities include: enoyl-CoA hydratase (EC 4.2.1.17), dodecenoyl-CoA delta-isomerase activity (EC 5.3.3.8), 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), 3-hydroxybutyryl-CoA epimerase (EC 5.1.2.3). A representative is E. coli FadB. This model excludes the FadJ family.

C6orf106

  • refseq_C6orf106.F2 refseq_C6orf106.R2 171 369
  • NCBIGene 36.3 64771
  • Single exon skipping, size difference: 198
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024294

C6orf134

  • refseq_C6orf134.F1 refseq_C6orf134.R1 121 403
  • NCBIGene 36.3 79969
  • Alternative 5-prime, size difference: 282
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001031722

  • pfam DUF738 121aa 1e-54 in ref transcript
    • Protein of unknown function (DUF738). This family consists of several uncharacterised eukaryotic proteins of unknown function.

C6orf25

  • refseq_C6orf25.F1 refseq_C6orf25.R1 278 369
  • NCBIGene 36.3 80739
  • Single exon skipping, size difference: 91
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_138272

  • Changed! pfam V-set 105aa 0.008 in modified transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

C6orf48

  • refseq_C6orf48.F1 refseq_C6orf48.R1 175 269
  • NCBIGene 36.3 50854
  • Single exon skipping, size difference: 94
  • Exclusion in 5'UTR
  • Reference transcript: NM_001040437

C9orf105

  • refseq_C9orf105.F1 refseq_C9orf105.R1 240 342
  • NCBIGene 36.2 401505
  • Alternative 5-prime, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_933960

C9orf131

  • refseq_C9orf131.F1 refseq_C9orf131.R1 194 299
  • NCBIGene 36.3 138724
  • Alternative 5-prime, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203299

C9orf23

  • refseq_C9orf23.F1 refseq_C9orf23.R1 156 360
  • NCBIGene 36.3 138716
  • Alternative 5-prime, size difference: 204
  • Exclusion in 5'UTR
  • Reference transcript: NM_148179

  • pfam Alba 65aa 5e-14 in ref transcript
    • Alba. Alba is a novel chromosomal protein that coats archaeal DNA without compacting it.

C9orf24

  • refseq_C9orf24.F1 refseq_C9orf24.R1 152 306
  • NCBIGene 36.3 84688
  • Alternative 5-prime, size difference: 154
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032596

C9orf58

  • refseq_C9orf58.F1 refseq_C9orf58.R1 97 119
  • NCBIGene 36.2 83543
  • Alternative 3-prime, size difference: 22
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_031426

  • Changed! cd EFh 61aa 1e-04 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! PTZ PTZ00184 69aa 2e-04 in ref transcript
    • calmodulin; Provisional.

CA12

  • refseq_CA12.F2 refseq_CA12.R2 143 176
  • NCBIGene 36.3 771
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001218

  • cd alpha_CA_XII_XIV 251aa 1e-120 in ref transcript
    • Carbonic anhydrase alpha, isozymes XII and XIV. Carbonic anhydrases (CAs) are zinc-containing enzymes that catalyze the reversible hydration of carbon dioxide in a two-step mechanism: a nucleophilic attack of a zinc-bound hydroxide ion on carbon dioxide, followed by the regeneration of the active site by ionization of the zinc-bound water molecule and removal of a proton from the active site. They are ubiquitous enzymes involved in fundamental processes like photosynthesis, respiration, pH homeostasis and ion transport. There are three evolutionary distinct groups - alpha, beta and gamma carbonic anhydrases - which show no significant sequence identity or structural similarity. Most alpha CAs are monomeric enzymes. The zinc ion is complexed by three histidine residues. This sub-family comprises the membrane proteins CA XII and XIV.
  • pfam Carb_anhydrase 258aa 3e-66 in ref transcript
    • Eukaryotic-type carbonic anhydrase.
  • COG Cah 264aa 6e-29 in ref transcript
    • Carbonic anhydrase [Inorganic ion transport and metabolism].

CABP1

  • refseq_CABP1.F1 refseq_CABP1.R1 126 306
  • NCBIGene 36.3 9478
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031205

  • cd EFh 64aa 8e-13 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 54aa 5e-04 in ref transcript
  • smart EFh 29aa 7e-05 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • smart EFh 28aa 6e-04 in ref transcript
  • PTZ PTZ00184 132aa 4e-24 in ref transcript
    • calmodulin; Provisional.
  • Changed! smart EH 58aa 0.007 in modified transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.

CABP2

  • refseq_CABP2.F1 refseq_CABP2.R1 109 280
  • NCBIGene 36.3 51475
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016366

  • cd EFh 64aa 5e-14 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 59aa 6e-06 in ref transcript
  • pfam efhand 29aa 1e-04 in ref transcript
    • EF hand. The EF-hands can be divided into two classes: signaling proteins and buffering/transport proteins. The first group is the largest and includes the most well-known members of the family such as calmodulin, troponin C and S100B. These proteins typically undergo a calcium-dependent conformational change which opens a target binding site. The latter group is represented by calbindin D9k and do not undergo calcium dependent conformational changes.
  • smart EFh 28aa 5e-04 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • Changed! pfam efhand 29aa 0.006 in ref transcript
  • PTZ PTZ00184 145aa 1e-27 in ref transcript
    • calmodulin; Provisional.

CABYR

  • refseq_CABYR.F2 refseq_CABYR.R2 306 360
  • NCBIGene 36.3 26256
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012189

  • smart RIIa 38aa 3e-08 in ref transcript
    • RIIalpha, Regulatory subunit portion of type II PKA R-subunit. RIIalpha, Regulatory subunit portion of type II PKA R-subunit. Contains dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).

CACNA1G

  • refseq_CACNA1G.F2 refseq_CACNA1G.R2 106 175
  • NCBIGene 36.3 8913
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018896

  • pfam Ion_trans 187aa 4e-37 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam Ion_trans 206aa 6e-35 in ref transcript
  • pfam Ion_trans 223aa 2e-26 in ref transcript
  • pfam Ion_trans 112aa 2e-16 in ref transcript
  • pfam Ion_trans 79aa 4e-11 in ref transcript

CACNA1G

  • refseq_CACNA1G.F3 refseq_CACNA1G.R3 194 338
  • NCBIGene 36.3 8913
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018896

  • pfam Ion_trans 187aa 4e-37 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam Ion_trans 206aa 6e-35 in ref transcript
  • pfam Ion_trans 223aa 2e-26 in ref transcript
  • pfam Ion_trans 112aa 2e-16 in ref transcript
  • pfam Ion_trans 79aa 4e-11 in ref transcript

CACNA1G

  • refseq_CACNA1G.F4 refseq_CACNA1G.R4 187 322
  • NCBIGene 36.3 8913
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018896

  • pfam Ion_trans 187aa 4e-37 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam Ion_trans 206aa 6e-35 in ref transcript
  • pfam Ion_trans 223aa 2e-26 in ref transcript
  • pfam Ion_trans 112aa 2e-16 in ref transcript
  • pfam Ion_trans 79aa 4e-11 in ref transcript
  • Changed! pfam Atrophin-1 286aa 0.001 in modified transcript
    • Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.

CACNA1H

  • refseq_CACNA1H.F1 refseq_CACNA1H.R1 100 118
  • NCBIGene 36.3 8912
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021098

  • pfam Ion_trans 223aa 4e-33 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam Ion_trans 169aa 9e-32 in ref transcript
  • pfam Ion_trans 188aa 2e-29 in ref transcript
  • pfam Ion_trans 126aa 3e-15 in ref transcript
  • pfam Ion_trans 79aa 3e-11 in ref transcript

CACNA1I

  • refseq_CACNA1I.F1 refseq_CACNA1I.R1 294 399
  • NCBIGene 36.3 8911
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021096

  • pfam Ion_trans 223aa 4e-35 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam Ion_trans 209aa 3e-31 in ref transcript
  • pfam Ion_trans 188aa 4e-18 in ref transcript
  • pfam Ion_trans 89aa 7e-16 in ref transcript
  • pfam Ion_trans 79aa 2e-08 in ref transcript

CACNA2D4

  • refseq_CACNA2D4.F1 refseq_CACNA2D4.R1 104 149
  • NCBIGene 36.2 93589
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172364

  • cd vWA_VGCC_like 184aa 9e-66 in ref transcript
    • VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
  • pfam VWA_N 117aa 1e-46 in ref transcript
    • VWA N-terminal. This domain is found at the N-terminus of proteins containing von Willebrand factor type A (VWA, pfam00092) and Cache (pfam02743) domains. It has been found in vertebrates, Drosophila and Caenorhabditis elegans but has not yet been identified in other eukaryotes. It is probably involved in the function of some voltage-dependent calcium channel subunits.
  • pfam Cache_1 93aa 6e-15 in ref transcript
    • Cache domain.
  • smart VWA 180aa 4e-14 in ref transcript
    • von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
  • COG COG2425 346aa 5e-04 in ref transcript
    • Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only].

CACNA2D4

  • refseq_CACNA2D4.F3 refseq_CACNA2D4.R3 108 179
  • NCBIGene 36.2 93589
  • Single exon skipping, size difference: 71
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172364

  • cd vWA_VGCC_like 184aa 9e-66 in ref transcript
    • VWA Voltage gated Calcium channel like: Voltage-gated calcium channels are a complex of five proteins: alpha 1, beta 1, gamma, alpha 2 and delta. The alpha 2 and delta subunits result from proteolytic processing of a single gene product and carries at its N-terminus the VWA and cache domains, The alpha 2 delta gene family has orthologues in D. melanogaster and C. elegans but none have been detected in aither A. thaliana or yeast. The exact biochemical function of the VWA domain is not known but the alpha 2 delta complex has been shown to regulate various functional properties of the channel complex.
  • pfam VWA_N 117aa 1e-46 in ref transcript
    • VWA N-terminal. This domain is found at the N-terminus of proteins containing von Willebrand factor type A (VWA, pfam00092) and Cache (pfam02743) domains. It has been found in vertebrates, Drosophila and Caenorhabditis elegans but has not yet been identified in other eukaryotes. It is probably involved in the function of some voltage-dependent calcium channel subunits.
  • pfam Cache_1 93aa 6e-15 in ref transcript
    • Cache domain.
  • smart VWA 180aa 4e-14 in ref transcript
    • von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
  • Changed! COG COG2425 346aa 5e-04 in ref transcript
    • Uncharacterized protein containing a von Willebrand factor type A (vWA) domain [General function prediction only].
  • Changed! COG COG2425 325aa 0.003 in modified transcript

CACNB1

  • refseq_CACNB1.F1 refseq_CACNB1.R1 135 270
  • NCBIGene 36.3 782
  • Mutually exclusive exon skipping, size difference: 135
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000723

  • cd SH3 57aa 6e-04 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam Ca_channel_B 187aa 1e-108 in ref transcript
    • Dihydropyridine sensitive L-type calcium channel (Beta subunit).
  • smart SH3 61aa 1e-04 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

CACNG6

  • refseq_CACNG6.F1 refseq_CACNG6.R1 133 271
  • NCBIGene 36.3 59285
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145814

CADPS

  • refseq_CADPS.F1 refseq_CADPS.R1 118 139
  • NCBIGene 36.3 8618
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003716

  • cd PH_CADPS 110aa 2e-57 in ref transcript
    • CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • Changed! pfam DUF1041 106aa 4e-13 in ref transcript
    • Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
  • smart PH 102aa 6e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • Changed! pfam DUF1041 99aa 6e-15 in modified transcript

CADPS

  • refseq_CADPS.F4 refseq_CADPS.R4 109 160
  • NCBIGene 36.3 8618
  • Alternative 5-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003716

  • cd PH_CADPS 110aa 2e-57 in ref transcript
    • CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam DUF1041 106aa 4e-13 in ref transcript
    • Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
  • smart PH 102aa 6e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

CADPS

  • refseq_CADPS.F6 refseq_CADPS.R6 170 317
  • NCBIGene 36.3 8618
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003716

  • cd PH_CADPS 110aa 2e-57 in ref transcript
    • CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam DUF1041 106aa 4e-13 in ref transcript
    • Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
  • smart PH 102aa 6e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

CADPS2

  • refseq_CADPS2.F1 refseq_CADPS2.R1 382 487
  • NCBIGene 36.3 93664
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017954

  • cd PH_CADPS 110aa 2e-52 in ref transcript
    • CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam DUF1041 95aa 1e-13 in ref transcript
    • Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
  • smart PH 102aa 2e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

CADPS2

  • refseq_CADPS2.F1 refseq_CADPS2.R2 116 236
  • NCBIGene 36.3 93664
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017954

  • cd PH_CADPS 110aa 2e-52 in ref transcript
    • CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam DUF1041 95aa 1e-13 in ref transcript
    • Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
  • smart PH 102aa 2e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

CAGE1

  • refseq_CAGE1.F1 refseq_CAGE1.R1 119 224
  • NCBIGene 36.2 285782
  • Single exon skipping, size difference: 105
  • Exclusion in 3'UTR
  • Reference transcript: NM_175745

  • pfam SMC_N 256aa 7e-06 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG Smc 189aa 2e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

CAGE1

  • refseq_CAGE1.F3 refseq_CAGE1.R3 112 248
  • NCBIGene 36.2 285782
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 136
  • Exclusion in the protein (no frameshift), Exclusion of the stop codon
  • Reference transcript: NM_175745

  • pfam SMC_N 256aa 7e-06 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG Smc 189aa 2e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

CALB2

  • refseq_CALB2.F2 refseq_CALB2.R2 114 154
  • NCBIGene 36.3 794
  • Single exon skipping, size difference: 40
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001740

  • Changed! cd EFh 70aa 4e-07 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 73aa 2e-05 in ref transcript
  • Changed! cd EFh 67aa 4e-05 in ref transcript
  • Changed! PTZ PTZ00184 169aa 4e-07 in ref transcript
    • calmodulin; Provisional.
  • Changed! cd EFh 69aa 3e-07 in modified transcript
  • Changed! PTZ PTZ00184 167aa 8e-07 in modified transcript

CALML4

  • refseq_CALML4.F1 refseq_CALML4.R1 104 434
  • NCBIGene 36.2 91860
  • Multiple exon skipping, size difference: 330
  • Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031733

  • Changed! cd EFh 63aa 1e-05 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! PTZ PTZ00184 107aa 2e-21 in ref transcript
    • calmodulin; Provisional.

CAMK2B

  • refseq_CAMK2B.F2 refseq_CAMK2B.R2 117 189
  • NCBIGene 36.3 816
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001220

  • cd S_TKc 260aa 2e-82 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 1e-85 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 1e-58 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 6e-40 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMK2B

  • refseq_CAMK2B.F4 refseq_CAMK2B.R4 100 472
  • NCBIGene 36.3 816
  • Multiple exon skipping, size difference: 372
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001220

  • cd S_TKc 260aa 2e-82 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 1e-85 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 1e-58 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • Changed! PTZ PTZ00263 249aa 6e-40 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! COG SPS1 265aa 4e-40 in modified transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

CAMK2B

  • refseq_CAMK2B.F5 refseq_CAMK2B.R5 171 249
  • NCBIGene 36.3 816
  • Alternative 3-prime, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001220

  • cd S_TKc 260aa 2e-82 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 1e-85 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! pfam CaMKII_AD 128aa 1e-58 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 6e-40 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! pfam CaMKII_AD 102aa 8e-40 in modified transcript

CAMK2G

  • refseq_CAMK2G.F2 refseq_CAMK2G.R2 187 257
  • NCBIGene 36.2 818
  • Alternative 5-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172171

  • cd S_TKc 260aa 5e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 2e-86 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 5e-59 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 3e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMK2G

  • refseq_CAMK2G.F3 refseq_CAMK2G.R3 215 329
  • NCBIGene 36.3 818
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172171

  • cd S_TKc 260aa 5e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 2e-86 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 5e-59 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 3e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMK2G

  • refseq_CAMK2G.F4 refseq_CAMK2G.R4 126 189
  • NCBIGene 36.3 818
  • Single exon skipping, size difference: 63
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_172171

  • cd S_TKc 260aa 5e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 2e-86 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 5e-59 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 3e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMK2G

  • refseq_CAMK2G.F6 refseq_CAMK2G.R1 106 134
  • NCBIGene 36.2 818
  • Alternative 3-prime, size difference: 28
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172171

  • cd S_TKc 260aa 5e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 2e-86 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! pfam CaMKII_AD 128aa 5e-59 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 3e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMKK1

  • refseq_CAMKK1.F1 refseq_CAMKK1.R1 133 247
  • NCBIGene 36.3 84254
  • Single exon skipping, size difference: 114
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_172206

  • Changed! cd S_TKc 283aa 9e-69 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 272aa 6e-73 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 359aa 2e-36 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd S_TKc 321aa 3e-62 in modified transcript
  • Changed! smart S_TKc 310aa 1e-65 in modified transcript
  • Changed! COG SPS1 397aa 4e-31 in modified transcript

CAMKK2

  • refseq_CAMKK2.F2 refseq_CAMKK2.R2 199 242
  • NCBIGene 36.3 10645
  • Single exon skipping, size difference: 43
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006549

  • cd S_TKc 283aa 5e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 272aa 4e-70 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00263 295aa 6e-38 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMKK2

  • refseq_CAMKK2.F3 refseq_CAMKK2.R3 134 263
  • NCBIGene 36.3 10645
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006549

  • Changed! cd S_TKc 283aa 5e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 272aa 4e-70 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00263 295aa 6e-38 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! cd S_TKc 278aa 4e-65 in modified transcript
  • Changed! smart S_TKc 267aa 3e-67 in modified transcript
  • Changed! PTZ PTZ00263 275aa 3e-37 in modified transcript

CANX

  • refseq_CANX.F1 refseq_CANX.R1 132 185
  • NCBIGene 36.3 821
  • Alternative 5-prime, size difference: 53
  • Exclusion in 5'UTR
  • Reference transcript: NM_001746

  • pfam Calreticulin 372aa 1e-143 in ref transcript
    • Calreticulin family.

CAPN9

  • refseq_CAPN9.F1 refseq_CAPN9.R1 167 245
  • NCBIGene 36.3 10753
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006615

  • Changed! cd CysPc 305aa 1e-105 in ref transcript
    • Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
  • cd Calpain_III 150aa 5e-61 in ref transcript
    • Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
  • cd EFh 56aa 0.002 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! smart CysPc 316aa 1e-148 in ref transcript
    • Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).
  • pfam Calpain_III 147aa 1e-69 in ref transcript
    • Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
  • Changed! cd CysPc 279aa 1e-94 in modified transcript
  • Changed! smart CysPc 290aa 1e-132 in modified transcript

CAPS

  • refseq_CAPS.F1 refseq_CAPS.R1 159 240
  • NCBIGene 36.3 828
  • Alternative 5-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004058

  • cd EFh 62aa 2e-08 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 58aa 6e-04 in ref transcript
  • Changed! COG FRQ1 146aa 6e-07 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! PTZ PTZ00184 129aa 4e-07 in modified transcript
    • calmodulin; Provisional.

CARS

  • refseq_CARS.F1 refseq_CARS.R1 115 162
  • NCBIGene 36.3 833
  • Alternative 3-prime, size difference: 47
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001014437

  • Changed! cd CysRS_core 102aa 1e-47 in ref transcript
    • This is the catalytic core domain of cysteinyl tRNA synthetase (CysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding.
  • Changed! cd CysRS_core 102aa 5e-37 in ref transcript
  • Changed! cd CysRS_core 34aa 4e-05 in ref transcript
  • Changed! pfam tRNA-synt_1e 246aa 9e-88 in ref transcript
    • tRNA synthetases class I (C) catalytic domain. This family includes only cysteinyl tRNA synthetases.
  • Changed! TIGR cysS 86aa 7e-35 in ref transcript
    • This model finds the cysteinyl-tRNA synthetase from most but not from all species. The enzyme from one archaeal species, Archaeoglobus fulgidus, is found but the equivalent enzymes from some other Archaea, including Methanococcus jannaschii, are not found, although biochemical evidence suggests that tRNA(Cys) in these species are charged directly with Cys rather than through a misacylation and correction pathway as for tRNA(Gln).
  • Changed! TIGR leuS_arch 169aa 2e-07 in ref transcript
    • The leucyl-tRNA synthetases belong to two families so broadly different that they are represented by separate models. This model includes both archaeal and cytosolic eukaryotic leucyl-tRNA synthetases; the eubacterial and mitochondrial forms differ so substantially that some other tRNA ligases score higher by this model than does any eubacterial LeuS.
  • Changed! COG CysS 424aa 2e-96 in ref transcript
    • Cysteinyl-tRNA synthetase [Translation, ribosomal structure and biogenesis].
  • Changed! PRK cysS 81aa 2e-36 in ref transcript
    • cysteinyl-tRNA synthetase; Validated.
  • Changed! cd GST_C_EFB1gamma 48aa 0.004 in modified transcript
    • GST_C family, Gamma subunit of Elongation Factor 1B (EFB1gamma) subfamily; EF1Bgamma is part of the eukaryotic translation elongation factor-1 (EF1) complex which plays a central role in the elongation cycle during protein biosynthesis. EF1 consists of two functionally distinct units, EF1A and EF1B. EF1A catalyzes the GTP-dependent binding of aminoacyl-tRNA to the ribosomal A site concomitant with the hydrolysis of GTP. The resulting inactive EF1A:GDP complex is recycled to the active GTP form by the guanine-nucleotide exchange factor EF1B, a complex composed of at least two subunits, alpha and gamma. Metazoan EFB1 contain a third subunit, beta. The EF1B gamma subunit contains a GST fold consisting of an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The GST-like domain of EF1Bgamma is believed to mediate the dimerization of the EF1 complex, which in yeast is a dimer of the heterotrimer EF1A:EF1Balpha:EF1Bgamma. In addition to its role in protein biosynthesis, EF1Bgamma may also display other functions. The recombinant rice protein has been shown to possess GSH conjugating activity. The yeast EF1Bgamma binds membranes in a calcium dependent manner and is also part of a complex that binds to the msrA (methionine sulfoxide reductase) promoter suggesting a function in the regulation of its gene expression. Also included in this subfamily is the GST_C-like domain at the N-terminus of human valyl-tRNA synthetase and its homologs from zebrafish and Xenopus. Although not included in the alignment, the GST_C-like domain containing a deletion present in some aminoacyl-tRNA synthetases is recognized by this model. This domain will be represented in the future by a deletion model of GST_C.

CARS

  • refseq_CARS.F3 refseq_CARS.R3 149 398
  • NCBIGene 36.3 833
  • Single exon skipping, size difference: 249
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001014437

  • cd CysRS_core 102aa 1e-47 in ref transcript
    • This is the catalytic core domain of cysteinyl tRNA synthetase (CysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding.
  • cd CysRS_core 102aa 5e-37 in ref transcript
  • cd CysRS_core 34aa 4e-05 in ref transcript
  • pfam tRNA-synt_1e 246aa 9e-88 in ref transcript
    • tRNA synthetases class I (C) catalytic domain. This family includes only cysteinyl tRNA synthetases.
  • TIGR cysS 86aa 7e-35 in ref transcript
    • This model finds the cysteinyl-tRNA synthetase from most but not from all species. The enzyme from one archaeal species, Archaeoglobus fulgidus, is found but the equivalent enzymes from some other Archaea, including Methanococcus jannaschii, are not found, although biochemical evidence suggests that tRNA(Cys) in these species are charged directly with Cys rather than through a misacylation and correction pathway as for tRNA(Gln).
  • TIGR leuS_arch 169aa 2e-07 in ref transcript
    • The leucyl-tRNA synthetases belong to two families so broadly different that they are represented by separate models. This model includes both archaeal and cytosolic eukaryotic leucyl-tRNA synthetases; the eubacterial and mitochondrial forms differ so substantially that some other tRNA ligases score higher by this model than does any eubacterial LeuS.
  • COG CysS 424aa 2e-96 in ref transcript
    • Cysteinyl-tRNA synthetase [Translation, ribosomal structure and biogenesis].
  • PRK cysS 81aa 2e-36 in ref transcript
    • cysteinyl-tRNA synthetase; Validated.

CASC5

  • refseq_CASC5.F1 refseq_CASC5.R1 416 494
  • NCBIGene 36.3 57082
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170589

CASP1

  • refseq_CASP1.F2 refseq_CASP1.R2 306 369
  • NCBIGene 36.3 834
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033292

  • cd CASc 250aa 1e-72 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • smart CASc 250aa 5e-94 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam CARD 88aa 5e-20 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.

CASP4

  • refseq_CASP4.F1 refseq_CASP4.R1 141 199
  • NCBIGene 36.2 837
  • Alternative 3-prime, size difference: 58
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001225

  • Changed! cd CASc 250aa 3e-69 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 250aa 2e-93 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • Changed! smart CARD 89aa 2e-10 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
  • Changed! smart CARD 89aa 3e-11 in modified transcript

CASP6

  • refseq_CASP6.F2 refseq_CASP6.R2 102 369
  • NCBIGene 36.3 839
  • Multiple exon skipping, size difference: 267
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001226

  • Changed! cd CASc 253aa 2e-82 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 254aa 7e-91 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • Changed! cd CASc 187aa 6e-60 in modified transcript
  • Changed! smart CASc 184aa 1e-64 in modified transcript

CASP7

  • refseq_CASP7.F1 refseq_CASP7.R1 192 226
  • NCBIGene 36.3 840
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 34
  • Inclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033338

  • Changed! cd CASc 242aa 4e-84 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 243aa 2e-90 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • Changed! cd CASc 105aa 2e-26 in modified transcript
  • Changed! smart CASc 90aa 5e-28 in modified transcript

CASP8

  • refseq_CASP8.F1 refseq_CASP8.R1 277 373
  • NCBIGene 36.3 841
  • Single exon skipping, size difference: 96
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001080125

  • cd CASc 253aa 3e-85 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • cd DED 75aa 3e-17 in ref transcript
    • Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
  • Changed! cd DED 78aa 7e-13 in ref transcript
  • smart CASc 252aa 4e-90 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam DED 84aa 5e-23 in ref transcript
    • Death effector domain.
  • Changed! pfam DED 82aa 2e-17 in ref transcript
  • Changed! cd DED 75aa 7e-12 in modified transcript
  • Changed! pfam DED 81aa 7e-17 in modified transcript

CAST

  • refseq_CAST.F2 refseq_CAST.R2 103 282
  • NCBIGene 36.2 831
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 179
  • Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001750

  • Changed! pfam Calpain_inhib 127aa 3e-19 in ref transcript
    • Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
  • Changed! pfam Calpain_inhib 118aa 2e-16 in ref transcript
  • Changed! pfam Calpain_inhib 99aa 1e-13 in ref transcript
  • pfam Calpain_inhib 126aa 2e-11 in ref transcript
  • Changed! pfam Calpain_inhib 86aa 2e-12 in modified transcript

CAV2

  • refseq_CAV2.F1 refseq_CAV2.R1 207 395
  • NCBIGene 36.3 858
  • Single exon skipping, size difference: 188
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001233

  • Changed! pfam Caveolin 149aa 4e-55 in ref transcript
    • Caveolin. All three known Caveolin forms have the FEDVIAEP caveolin 'signature motif' within their hydrophilic N-terminal domain. Caveolin 2 (Cav-2) is co-localised and co-expressed with Cav-1/VIP21, forms heterodimers with it and needs Cav-1 for proper membrane localisation. Cav-3 has greater protein sequence similarity to Cav-1 than to Cav-2. Cellular processes caveolins are involved in include vesicular transport, cholesterol homeostasis, signal transduction, and tumour suppression.
  • Changed! pfam Caveolin 41aa 2e-04 in modified transcript

CBFB

  • refseq_CBFB.F1 refseq_CBFB.R1 141 172
  • NCBIGene 36.3 865
  • Alternative 5-prime, size difference: 31
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_022845

  • Changed! pfam CBF_beta 178aa 4e-73 in ref transcript
    • Core binding factor beta subunit. Core binding factor (CBF) is a heterodimeric transcription factor essential for genetic regulation of hematopoiesis and osteogenesis. The beta subunit enhances DNA-binding ability of the alpha subunit in vitro, and has been show to have a structure related to the OB fold.
  • Changed! pfam CBF_beta 170aa 2e-70 in modified transcript

CCDC108

  • refseq_CCDC108.F1 refseq_CCDC108.R1 132 287
  • NCBIGene 36.3 255101
  • Single exon skipping, size difference: 155
  • Exclusion of the protein initiation site
  • Reference transcript: NM_194302

CCDC55

  • refseq_CCDC55.F1 refseq_CCDC55.R1 220 316
  • NCBIGene 36.2 84081
  • Single exon skipping, size difference: 96
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_032141

  • Changed! pfam DUF2040 126aa 7e-07 in ref transcript
    • Coiled-coil domain-containing protein 55 (DUF2040). This entry is a conserved domain of approximately 130 residues of proteins conserved from fungi to humans. The proteins do contain a coiled-coil domain, but the function is unknown.
  • Changed! TIGR U2AF_lg 110aa 0.009 in ref transcript
    • Members of this subfamily are found in plants, metazoa and fungi.

CCDC62

  • refseq_CCDC62.F1 refseq_CCDC62.R1 260 354
  • NCBIGene 36.3 84660
  • Single exon skipping, size difference: 94
  • Exclusion of the stop codon
  • Reference transcript: NM_201435

  • pfam SMC_N 256aa 5e-07 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG Smc 261aa 6e-06 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

CCDC7

  • refseq_CCDC7.F2 refseq_CCDC7.R2 164 398
  • NCBIGene 36.3 221016
  • Alternative 5-prime, size difference: 234
  • Exclusion in 5'UTR
  • Reference transcript: NM_145023

CCL14

  • refseq_CCL14.F1 refseq_CCL14.R1 100 148
  • NCBIGene 36.3 6358
  • Single exon skipping, size difference: 48
  • Exclusion in 3'UTR
  • Reference transcript: NM_032962

  • cd Chemokine_CC 57aa 4e-17 in ref transcript
    • Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
  • smart SCY 57aa 3e-19 in ref transcript
    • Intercrine alpha family (small cytokine C-X-C) (chemokine CXC). Family of cytokines involved in cell-specific chemotaxis, mediation of cell growth, and the inflammatory response.

CCL23

  • refseq_CCL23.F2 refseq_CCL23.R2 270 321
  • NCBIGene 36.3 6368
  • Alternative 3-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005064

  • cd Chemokine_CC 57aa 9e-15 in ref transcript
    • Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
  • smart SCY 59aa 3e-18 in ref transcript
    • Intercrine alpha family (small cytokine C-X-C) (chemokine CXC). Family of cytokines involved in cell-specific chemotaxis, mediation of cell growth, and the inflammatory response.

CCNL2

  • refseq_CCNL2.F1 refseq_CCNL2.R1 126 342
  • NCBIGene 36.3 81669
  • Multiple exon skipping, size difference: 216
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
  • Reference transcript: NM_030937

  • cd CYCLIN 90aa 5e-09 in ref transcript
    • Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
  • cd CYCLIN 74aa 2e-07 in ref transcript
  • smart CYCLIN 83aa 3e-12 in ref transcript
    • domain present in cyclins, TFIIB and Retinoblastoma. A helical domain present in cyclins and TFIIB (twice) and Retinoblastoma (once). A protein recognition domain functioning in cell-cycle and transcription control.
  • smart CYCLIN 65aa 8e-07 in ref transcript
  • TIGR ccl1 224aa 2e-04 in ref transcript
    • University).
  • COG CCL1 205aa 2e-27 in ref transcript
    • Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH/TFIIK, cyclin H subunit [Cell division and chromosome partitioning / Transcription / DNA replication, recombination, and repair].

CCR3

  • refseq_CCR3.F1 refseq_CCR3.R1 204 283
  • NCBIGene 36.3 1232
  • Single exon skipping, size difference: 79
  • Exclusion in 5'UTR
  • Reference transcript: NM_001837

  • pfam 7tm_1 245aa 4e-42 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

CCR9

  • refseq_CCR9.F2 refseq_CCR9.R2 246 295
  • NCBIGene 36.3 10803
  • Single exon skipping, size difference: 49
  • Exclusion of the protein initiation site
  • Reference transcript: NM_031200

  • pfam 7tm_1 247aa 3e-35 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

CCRK

  • refseq_CCRK.F1 refseq_CCRK.R1 176 239
  • NCBIGene 36.3 23552
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039803

  • Changed! cd S_TKc 286aa 2e-64 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 275aa 6e-63 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 292aa 4e-49 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 265aa 2e-52 in modified transcript
  • Changed! smart S_TKc 254aa 3e-50 in modified transcript
  • Changed! PTZ PTZ00024 271aa 6e-37 in modified transcript

CCRK

  • refseq_CCRK.F4 refseq_CCRK.R4 209 248
  • NCBIGene 36.3 23552
  • Alternative 3-prime, size difference: 39
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001039803

  • Changed! cd S_TKc 286aa 2e-64 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 275aa 6e-63 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 292aa 4e-49 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 299aa 7e-62 in modified transcript
  • Changed! smart S_TKc 288aa 3e-60 in modified transcript
  • Changed! PTZ PTZ00024 305aa 1e-47 in modified transcript

CCT3

  • refseq_CCT3.F1 refseq_CCT3.R1 129 243
  • NCBIGene 36.3 7203
  • Multiple exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005998

  • Changed! cd TCP1_gamma 520aa 0.0 in ref transcript
    • TCP-1 (CTT or eukaryotic type II) chaperonin family, gamma subunit. Chaperonins are involved in productive folding of proteins. They share a common general morphology, a double toroid of 2 stacked rings. In contrast to bacterial group I chaperonins (GroEL), each ring of the eukaryotic cytosolic chaperonin (CTT) consists of eight different, but homologous subunits. Their common function is to sequester nonnative proteins inside their central cavity and promote folding by using energy derived from ATP hydrolysis. The best studied in vivo substrates of CTT are actin and tubulin.
  • Changed! TIGR chap_CCT_gamma 524aa 0.0 in ref transcript
    • Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT gamma chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
  • Changed! COG GroL 521aa 1e-117 in ref transcript
    • Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd TCP1_gamma 482aa 0.0 in modified transcript
  • Changed! TIGR chap_CCT_gamma 486aa 0.0 in modified transcript
  • Changed! COG GroL 464aa 5e-99 in modified transcript

CCT6A

  • refseq_CCT6A.F1 refseq_CCT6A.R1 171 306
  • NCBIGene 36.3 908
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001762

  • Changed! cd TCP1_zeta 520aa 0.0 in ref transcript
    • TCP-1 (CTT or eukaryotic type II) chaperonin family, zeta subunit. Chaperonins are involved in productive folding of proteins. They share a common general morphology, a double toroid of 2 stacked rings. In contrast to bacterial group I chaperonins (GroEL), each ring of the eukaryotic cytosolic chaperonin (CTT) consists of eight different, but homologous subunits. Their common function is to sequester nonnative proteins inside their central cavity and promote folding by using energy derived from ATP hydrolysis. The best studied in vivo substrates of CTT are actin and tubulin.
  • Changed! TIGR chap_CCT_zeta 528aa 0.0 in ref transcript
    • Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT zeta chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
  • Changed! COG GroL 503aa 5e-93 in ref transcript
    • Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd TCP1_zeta 475aa 0.0 in modified transcript
  • Changed! TIGR chap_CCT_zeta 483aa 0.0 in modified transcript
  • Changed! COG GroL 458aa 8e-73 in modified transcript

CD151

  • refseq_CD151.F1 refseq_CD151.R1 101 163
  • NCBIGene 36.3 977
  • Single exon skipping, size difference: 62
  • Exclusion in 5'UTR
  • Reference transcript: NM_004357

  • cd CD151_like_LEL 109aa 3e-40 in ref transcript
    • Tetraspanin, extracellular domain or large extracellular loop (LEL), CD151_Like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". CD151strongly associates with integrins, especially alpha3beta1, alpha6beta1, alpha7beta1, and alpha6beta4; it may play roles in cell-cell adhesion, cell migration, platelet aggregation, and angiogenesis. For example, CD151 is is involved in regulation of migration of neutrophils, endothelial cells, and various tumor cell lines; it associates specifically with laminin-binding integrins and strengthens alpha6beta1 integrin-mediated adhesion to laminin-1; CD151 also specifically attenuates adhesion-dependent activation of Ras and correspdonding downstream effects, and is involved in epithelial cell-cell adhesion as a modulator of PKC- and Cdc42-dependent actin cytoskeletal reorganization.
  • pfam Tetraspannin 234aa 2e-31 in ref transcript
    • Tetraspanin family.

CD163

  • refseq_CD163.F2 refseq_CD163.R2 309 392
  • NCBIGene 36.3 9332
  • Alternative 3-prime, size difference: 83
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004244

  • smart SR 101aa 4e-39 in ref transcript
    • Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
  • smart SR 101aa 1e-35 in ref transcript
  • smart SR 102aa 6e-35 in ref transcript
  • smart SR 101aa 1e-34 in ref transcript
  • smart SR 101aa 5e-31 in ref transcript
  • smart SR 100aa 5e-30 in ref transcript
  • smart SR 101aa 1e-28 in ref transcript
  • smart SR 101aa 4e-27 in ref transcript
  • smart SR 102aa 1e-17 in ref transcript

CD200

  • refseq_CD200.F2 refseq_CD200.R2 280 355
  • NCBIGene 36.3 4345
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001004196

  • smart IGv 77aa 2e-07 in ref transcript
    • Immunoglobulin V-Type.
  • pfam ig 64aa 0.003 in ref transcript
    • Immunoglobulin domain. Members of the immunoglobulin superfamily are found in hundreds of proteins of different functions. Examples include antibodies, the giant muscle kinase titin and receptor tyrosine kinases. Immunoglobulin-like domains may be involved in protein-protein and protein-ligand interactions. The Pfam alignments do not include the first and last strand of the immunoglobulin-like domain.

CD200R1

  • refseq_CD200R1.F2 refseq_CD200R1.R2 318 387
  • NCBIGene 36.3 131450
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138806

  • pfam C2-set_2 70aa 0.002 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

CD34

  • refseq_CD34.F1 refseq_CD34.R1 151 346
  • NCBIGene 36.3 947
  • Alternative 3-prime, size difference: 195
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001025109

  • Changed! pfam CD34_antigen 196aa 3e-55 in ref transcript
    • CD34/Podocalyxin family. This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion.
  • Changed! pfam CD34_antigen 139aa 3e-38 in modified transcript

CD37

  • refseq_CD37.F1 refseq_CD37.R1 172 206
  • NCBIGene 36.3 951
  • Alternative 5-prime, size difference: 34
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001774

  • Changed! cd CD37_CD82_like_LEL 136aa 9e-38 in ref transcript
    • Tetraspanin, extracellular domain or large extracellular loop (LEL), CD37_CD82_Like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". CD37 is a leukocyte-specific protein, and its restricted expression pattern suggests a role in the immune system. A regulatory role in T-cell proliferation has been suggested. CD82 is a metastasis suppressor implicated in biological processes ranging from fusion, adhesion, and migration to apoptosis and alterations of cell morphology.
  • Changed! pfam Tetraspannin 262aa 5e-25 in ref transcript
    • Tetraspanin family.

CD46

  • refseq_CD46.F1 refseq_CD46.R1 197 239
  • NCBIGene 36.3 4179
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172359

  • cd CCP 60aa 4e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CCP 63aa 1e-06 in ref transcript
  • cd CCP 50aa 4e-05 in ref transcript
  • pfam Sushi 62aa 5e-09 in ref transcript
    • Sushi domain (SCR repeat).
  • pfam Sushi 59aa 2e-08 in ref transcript
  • smart CCP 50aa 6e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 54aa 5e-04 in ref transcript

CD46

  • refseq_CD46.F3 refseq_CD46.R3 120 213
  • NCBIGene 36.3 4179
  • Single exon skipping, size difference: 93
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_172359

  • cd CCP 60aa 4e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CCP 63aa 1e-06 in ref transcript
  • cd CCP 50aa 4e-05 in ref transcript
  • pfam Sushi 62aa 5e-09 in ref transcript
    • Sushi domain (SCR repeat).
  • pfam Sushi 59aa 2e-08 in ref transcript
  • smart CCP 50aa 6e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 54aa 5e-04 in ref transcript

CD46

  • refseq_CD46.F4 refseq_CD46.R4 228 273
  • NCBIGene 36.3 4179
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172359

  • cd CCP 60aa 4e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CCP 63aa 1e-06 in ref transcript
  • Changed! cd CCP 50aa 4e-05 in ref transcript
  • pfam Sushi 62aa 5e-09 in ref transcript
    • Sushi domain (SCR repeat).
  • pfam Sushi 59aa 2e-08 in ref transcript
  • Changed! smart CCP 50aa 6e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 54aa 5e-04 in ref transcript
  • Changed! cd CCP 56aa 6e-08 in modified transcript
  • Changed! smart CCP 56aa 5e-09 in modified transcript

CD68

  • refseq_CD68.F1 refseq_CD68.R1 234 315
  • NCBIGene 36.3 968
  • Alternative 3-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001251

  • pfam Lamp 201aa 2e-34 in ref transcript
    • Lysosome-associated membrane glycoprotein (Lamp).

CD74

  • refseq_CD74.F1 refseq_CD74.R1 118 310
  • NCBIGene 36.3 972
  • Single exon skipping, size difference: 192
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025159

  • Changed! cd TY 60aa 5e-14 in ref transcript
    • Thyroglobulin type I repeats.; The N-terminal region of human thyroglobulin contains 11 type-1 repeats TY repeats are proposed to be inhibitors of cysteine proteases.
  • pfam MHC2-interact 113aa 2e-35 in ref transcript
    • CLIP, MHC2 interacting. Members of this family are found in class II invariant chain-associated peptide (CLIP), and are required for association with class II major histocompatibility complex (MHC) in the MHC class II processing pathway.
  • pfam MHCassoc_trimer 72aa 1e-31 in ref transcript
    • Class II MHC-associated invariant chain trimerisation domain. The class II associated invariant chain peptide is required for folding and localisation of MHC class II heterodimers. This domain is involved in trimerisation of the ectoderm and interferes with DM/class II binding. The trimeric protein forms a cylindrical shape which is thought to be important for interactions between the invariant chain and class II molecules.
  • Changed! pfam Thyroglobulin_1 59aa 1e-16 in ref transcript
    • Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 and 5 with 6.

CD79A

  • refseq_CD79A.F1 refseq_CD79A.R1 202 316
  • NCBIGene 36.3 973
  • Alternative 5-prime, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001783

  • Changed! cd IGcam 91aa 2e-04 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! smart IG_like 88aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • Changed! pfam ITAM 21aa 0.010 in modified transcript
    • Immunoreceptor tyrosine-based activation motif.

CD79B

  • refseq_CD79B.F1 refseq_CD79B.R1 101 413
  • NCBIGene 36.3 974
  • Single exon skipping, size difference: 312
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039933

  • Changed! cd IGv 87aa 0.006 in ref transcript
    • Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! pfam V-set 101aa 5e-11 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

CD8A

  • refseq_CD8A.F2 refseq_CD8A.R2 155 266
  • NCBIGene 36.3 925
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001768

  • cd IGv 94aa 3e-05 in ref transcript
    • Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam V-set 107aa 4e-11 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

CD8B

  • refseq_CD8B.F2 refseq_CD8B.R2 192 250
  • NCBIGene 36.3 926
  • Alternative 3-prime, size difference: 58
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172099

  • cd IGv 105aa 1e-09 in ref transcript
    • Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam V-set 116aa 3e-19 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

CD8B

  • refseq_CD8B.F2 refseq_CD8B.R5 180 234
  • NCBIGene 36.3 926
  • Single exon skipping, size difference: 54
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_172213

  • cd IGv 105aa 4e-09 in ref transcript
    • Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam V-set 116aa 6e-19 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

CD8B

  • refseq_CD8B.F3 refseq_CD8B.R3 123 213
  • NCBIGene 36.3 926
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172099

  • cd IGv 105aa 1e-09 in ref transcript
    • Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam V-set 116aa 3e-19 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

CD96

  • refseq_CD96.F1 refseq_CD96.R1 191 239
  • NCBIGene 36.3 10225
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198196

  • COG COG5099 153aa 0.008 in ref transcript
    • RNA-binding protein of the Puf family, translational repressor [Translation, ribosomal structure and biogenesis].

CD97

  • refseq_CD97.F1 refseq_CD97.R1 204 351
  • NCBIGene 36.3 976
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_078481

  • Changed! cd EGF_CA 37aa 3e-05 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • cd EGF_CA 33aa 4e-05 in ref transcript
  • Changed! cd EGF_CA 34aa 6e-05 in ref transcript
  • cd EGF_CA 35aa 0.001 in ref transcript
  • pfam 7tm_2 250aa 1e-78 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • Changed! pfam EGF_CA 48aa 4e-10 in ref transcript
    • Calcium binding EGF domain.
  • smart GPS 51aa 7e-10 in ref transcript
    • G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • Changed! pfam EGF_CA 35aa 2e-09 in ref transcript
  • smart EGF_CA 33aa 3e-06 in ref transcript
    • Calcium-binding EGF-like domain.
  • pfam EGF_CA 51aa 9e-06 in ref transcript
  • COG PutP 125aa 0.004 in ref transcript
    • Na+/proline symporter [Amino acid transport and metabolism / General function prediction only].
  • Changed! cd EGF_CA 37aa 1e-05 in modified transcript
  • Changed! pfam EGF_CA 35aa 6e-10 in modified transcript
  • Changed! smart EGF_CA 44aa 1e-06 in modified transcript

CD99L2

  • refseq_CD99L2.F1 refseq_CD99L2.R1 209 356
  • NCBIGene 36.3 83692
  • Multiple exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031462

CDC14B

  • refseq_CDC14B.F2 refseq_CDC14B.R2 231 348
  • NCBIGene 36.3 8555
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033331

  • cd DSPc 108aa 1e-09 in ref transcript
    • Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
  • smart DSPc 116aa 7e-12 in ref transcript
    • Dual specificity phosphatase, catalytic domain.
  • PTZ PTZ00242 139aa 2e-20 in ref transcript
    • protein tyrosine phosphatase; Provisional.

CDC25A

  • refseq_CDC25A.F1 refseq_CDC25A.R1 145 265
  • NCBIGene 36.3 993
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001789

  • cd Cdc25 119aa 5e-50 in ref transcript
    • Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
  • Changed! pfam M-inducer_phosp 244aa 4e-65 in ref transcript
    • M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
  • pfam Rhodanese 108aa 4e-24 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • COG MIH1 233aa 1e-31 in ref transcript
    • Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
  • Changed! pfam M-inducer_phosp 204aa 2e-54 in modified transcript

CDC25B

  • refseq_CDC25B.F2 refseq_CDC25B.R2 129 171
  • NCBIGene 36.3 994
  • Alternative 3-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021873

  • cd Cdc25 120aa 9e-49 in ref transcript
    • Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
  • pfam M-inducer_phosp 272aa 1e-76 in ref transcript
    • M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
  • pfam Rhodanese 109aa 9e-25 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • COG MIH1 158aa 7e-34 in ref transcript
    • Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].

CDC25B

  • refseq_CDC25B.F3 refseq_CDC25B.R3 101 224
  • NCBIGene 36.3 994
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021873

  • cd Cdc25 120aa 9e-49 in ref transcript
    • Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
  • Changed! pfam M-inducer_phosp 272aa 1e-76 in ref transcript
    • M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
  • pfam Rhodanese 109aa 9e-25 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • COG MIH1 158aa 7e-34 in ref transcript
    • Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].
  • Changed! pfam M-inducer_phosp 231aa 9e-65 in modified transcript

CDC25C

  • refseq_CDC25C.F1 refseq_CDC25C.R1 133 228
  • NCBIGene 36.3 995
  • Single exon skipping, size difference: 95
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001790

  • Changed! cd Cdc25 120aa 4e-45 in ref transcript
    • Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
  • Changed! pfam M-inducer_phosp 180aa 1e-43 in ref transcript
    • M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
  • Changed! pfam Rhodanese 107aa 6e-22 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • Changed! COG MIH1 241aa 1e-35 in ref transcript
    • Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].

CDC25C

  • refseq_CDC25C.F2 refseq_CDC25C.R2 141 360
  • NCBIGene 36.3 995
  • Multiple exon skipping, size difference: 124
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001790

  • Changed! cd Cdc25 120aa 4e-45 in ref transcript
    • Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
  • Changed! pfam M-inducer_phosp 180aa 1e-43 in ref transcript
    • M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
  • Changed! pfam Rhodanese 107aa 6e-22 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • Changed! COG MIH1 241aa 1e-35 in ref transcript
    • Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].

CDC25C

  • refseq_CDC25C.F3 refseq_CDC25C.R3 281 500
  • NCBIGene 36.3 995
  • Multiple exon skipping, size difference: 124
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001790

  • Changed! cd Cdc25 120aa 4e-45 in ref transcript
    • Cdc25 phosphatases are members of the Rhodanese Homology Domain superfamily. They activate the cell division kinases throughout the cell cycle progression. Cdc25 phosphatases dephosphorylate phosphotyrosine and phosphothreonine residues, in order to activate their Cdk/cyclin substrates. Cdc25A phosphatase functions to regulate S phase entry and Cdc25B is required for G2/M phase transition of the cell cycle. The Cdc25 domain binds oxyanions at the catalytic site and has the signature motif (H/YCxxxxxR).
  • Changed! pfam M-inducer_phosp 180aa 1e-43 in ref transcript
    • M-phase inducer phosphatase. This family represents a region within eukaryotic M-phase inducer phosphatases (EC:3.1.3.48), which also contain the pfam00581 domain. These proteins are involved in the control of mitosis.
  • Changed! pfam Rhodanese 107aa 6e-22 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • Changed! COG MIH1 241aa 1e-35 in ref transcript
    • Mitotic inducer, protein phosphatase [Cell division and chromosome partitioning].

CDC2L1

  • refseq_CDC2L1.F2 refseq_CDC2L1.R5 134 167
  • NCBIGene 36.3 984
  • Alternative 5-prime and 3-prime, size difference: 33
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_033486

  • cd S_TKc 287aa 3e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 276aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00024 295aa 4e-55 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CDC2L1

  • refseq_CDC2L1.F3 refseq_CDC2L1.R3 106 133
  • NCBIGene 36.3 984
  • Alternative 3-prime, size difference: 27
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_033486

  • cd S_TKc 287aa 3e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 276aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00024 295aa 4e-55 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CDC2L2

  • refseq_CDC2L2.F3 refseq_CDC2L2.R3 128 244
  • NCBIGene 36.2 985
  • Alternative 5-prime, size difference: 116
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_033531

  • Changed! cd S_TKc 287aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 276aa 5e-69 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 295aa 4e-55 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CDC2L5

  • refseq_CDC2L5.F1 refseq_CDC2L5.R1 141 321
  • NCBIGene 36.3 8621
  • Alternative 3-prime, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003718

  • cd S_TKc 295aa 4e-72 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 294aa 2e-74 in ref transcript
    • Protein kinase domain.
  • PTZ PTZ00024 301aa 2e-47 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CDC42

  • refseq_CDC42.F1 refseq_CDC42.R1 235 361
  • NCBIGene 36.3 998
  • Single exon skipping, size difference: 126
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039802

  • cd Cdc42 175aa 1e-104 in ref transcript
    • Cdc42 subfamily. Cdc42 is an essential GTPase that belongs to the Rho family of Ras-like GTPases. These proteins act as molecular switches by responding to exogenous and/or endogenous signals and relaying those signals to activate downstream components of a biological pathway. Cdc42 transduces signals to the actin cytoskeleton to initiate and maintain polarized growth and to mitogen-activated protein morphogenesis. In the budding yeast Saccharomyces cerevisiae, Cdc42 plays an important role in multiple actin-dependent morphogenetic events such as bud emergence, mating-projection formation, and pseudohyphal growth. In mammalian cells, Cdc42 regulates a variety of actin-dependent events and induces the JNK/SAPK protein kinase cascade, which leads to the activation of transcription factors within the nucleus. Cdc42 mediates these processes through interactions with a myriad of downstream effectors, whose number and regulation we are just starting to understand. In addition, Cdc42 has been implicated in a number of human diseases through interactions with its regulators and downstream effectors. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • smart RHO 174aa 5e-94 in ref transcript
    • Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
  • COG COG1100 183aa 2e-30 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

CDC42SE1

  • refseq_CDC42SE1.F2 refseq_CDC42SE1.R2 100 236
  • NCBIGene 36.3 56882
  • Single exon skipping, size difference: 136
  • Exclusion in 5'UTR
  • Reference transcript: NM_001038707

CDC42SE2

  • refseq_CDC42SE2.F1 refseq_CDC42SE2.R1 232 401
  • NCBIGene 36.3 56990
  • Single exon skipping, size difference: 169
  • Exclusion in 5'UTR
  • Reference transcript: NM_020240

  • cd CRIB 41aa 1e-06 in ref transcript
    • PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. CRIB-containing effector proteins are functionally diverse and include serine/threonine kinases, tyrosine kinases, actin-binding proteins, and adapter molecules.
  • pfam PBD 15aa 0.002 in ref transcript
    • P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).

NUF2

  • refseq_CDCA1.F1 refseq_CDCA1.R1 243 389
  • NCBIGene 36.3 83540
  • Alternative 5-prime, size difference: 146
  • Exclusion in 5'UTR
  • Reference transcript: NM_145697

  • pfam Nuf2 148aa 3e-62 in ref transcript
    • Nuf2 family. Members of this family are components of the mitotic spindle. It has been shown that Nuf2 from yeast is part of a complex called the Ndc80p complex. This complex is thought to bind to the microtubules of the spindle. An arabidopsis protein has been included in this family that has previously not been identified as a member of this family. The match is not strong, but in common with other members of this family contains coiled-coil to the C terminus of this region.
  • pfam SMC_N 307aa 4e-10 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG Smc 316aa 4e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

CDCA7

  • refseq_CDCA7.F1 refseq_CDCA7.R1 152 389
  • NCBIGene 36.3 83879
  • Single exon skipping, size difference: 237
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031942

  • pfam zf-4CXXC_R1 101aa 6e-51 in ref transcript
    • Zinc-finger domain of monoamine-oxidase A repressor R1. R1 is a transcription factor repressor that inhibits monoamine oxidase A gene expression. This domain is a four-CXXC zinc finger putative DNA-binding domain found at the C-terminal end of R1. The domain carries 12 cysteines of which four pairs are of the CXXC type.

CDH24

  • refseq_CDH24.F1 refseq_CDH24.R1 112 226
  • NCBIGene 36.3 64403
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022478

  • Changed! cd CA 192aa 2e-37 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 186aa 2e-36 in ref transcript
  • cd CA 214aa 4e-34 in ref transcript
  • Changed! cd CA 237aa 2e-25 in ref transcript
  • pfam Cadherin_C 150aa 9e-23 in ref transcript
    • Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.
  • smart CA 79aa 1e-16 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • pfam Cadherin 103aa 1e-15 in ref transcript
    • Cadherin domain.
  • Changed! pfam Cadherin 77aa 3e-15 in ref transcript
  • pfam Cadherin 81aa 1e-11 in ref transcript
  • Changed! smart CA 116aa 1e-10 in ref transcript
  • pfam Cadherin 95aa 9e-05 in ref transcript
  • Changed! cd CA 208aa 3e-39 in modified transcript
  • Changed! cd CA 199aa 6e-30 in modified transcript
  • Changed! pfam Cadherin 90aa 2e-16 in modified transcript

CDK10

  • refseq_CDK10.F3 refseq_CDK10.R3 195 329
  • NCBIGene 36.3 8558
  • Alternative 3-prime, size difference: 134
  • Exclusion of the stop codon
  • Reference transcript: NM_052988

  • cd S_TKc 286aa 1e-62 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 285aa 5e-62 in ref transcript
    • Protein kinase domain.
  • Changed! PTZ PTZ00024 296aa 2e-52 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! PTZ PTZ00024 293aa 6e-50 in modified transcript

CDK10

  • refseq_CDK10.F5 refseq_CDK10.R6 109 257
  • NCBIGene 36.2 8558
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 148
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_003674

  • cd S_TKc 252aa 3e-49 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 251aa 2e-51 in ref transcript
    • Protein kinase domain.
  • PTZ PTZ00024 259aa 2e-44 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CDK2

  • refseq_CDK2.F2 refseq_CDK2.R2 187 289
  • NCBIGene 36.3 1017
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001798

  • Changed! cd S_TKc 284aa 1e-81 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! pfam Pkinase 283aa 2e-86 in ref transcript
    • Protein kinase domain.
  • Changed! PTZ PTZ00024 285aa 3e-57 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 250aa 4e-60 in modified transcript
  • Changed! pfam Pkinase 249aa 2e-62 in modified transcript
  • Changed! PTZ PTZ00024 251aa 1e-35 in modified transcript

CDK5RAP1

  • refseq_CDK5RAP1.F2 refseq_CDK5RAP1.R2 113 261
  • NCBIGene 36.3 51654
  • Alternative 3-prime, size difference: 148
  • Exclusion of the protein initiation site
  • Reference transcript: NM_016408

  • TIGR TIGR00089 472aa 1e-121 in ref transcript
    • This subfamily is aparrently a part of a larger superfamily of enzymes utilizing both a 4Fe4S cluster and S-adenosyl methionine (SAM) to initiate radical reactions. MiaB acts on a particular isoprenylated Adenine base of certain tRNAs causing thiolation at an aromatic carbon, and probably also transferring a methyl grouyp from SAM to the thiol. The particular substrate of the three other clades is unknown but may be very closely related.
  • COG MiaB 476aa 1e-114 in ref transcript
    • 2-methylthioadenine synthetase [Translation, ribosomal structure and biogenesis].

CDK5RAP2

  • refseq_CDK5RAP2.F1 refseq_CDK5RAP2.R1 113 350
  • NCBIGene 36.3 55755
  • Single exon skipping, size difference: 237
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018249

  • pfam Microtub_assoc 75aa 3e-15 in ref transcript
    • Microtubule associated. This presumed domain has been identified in two microtubule associated proteins in Schizosaccharomyces pombe, Mto1 and Pcp1. Mto1 has been identified in association with spindle pole body and non-spindle pole body microtubules. The pericentrin homolog Pcp1 is also associated with the fungal centrosome or spindle pole body (SPB).
  • TIGR SMC_prok_B 306aa 5e-10 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! TIGR SMC_prok_B 259aa 1e-05 in ref transcript
  • pfam SMC_N 625aa 3e-04 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG Smc 363aa 4e-12 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG Smc 270aa 0.001 in ref transcript
  • Changed! TIGR SMC_prok_B 182aa 0.001 in modified transcript

CDKN2B

  • refseq_CDKN2B.F2 refseq_CDKN2B.R2 170 293
  • NCBIGene 36.3 1030
  • Alternative 5-prime, size difference: 123
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004936

  • Changed! cd ANK 115aa 1e-15 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! PTZ PTZ00322 81aa 9e-06 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.
  • Changed! cd ANK 37aa 7e-04 in modified transcript

CEACAM1

  • refseq_CEACAM1.F1 refseq_CEACAM1.R1 115 168
  • NCBIGene 36.3 634
  • Single exon skipping, size difference: 53
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001712

  • cd IGcam 72aa 3e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 75aa 5e-05 in ref transcript
  • cd IGcam 71aa 0.003 in ref transcript
  • pfam V-set 106aa 3e-11 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • smart IGc2 62aa 2e-09 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 61aa 1e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 78aa 1e-05 in ref transcript

CENPM

  • refseq_CENPM.F2 refseq_CENPM.R2 145 237
  • NCBIGene 36.3 79019
  • Single exon skipping, size difference: 92
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024053

CENTG2

  • refseq_CENTG2.F2 refseq_CENTG2.R2 166 325
  • NCBIGene 36.3 116987
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037131

  • cd Centaurin_gamma 158aa 4e-90 in ref transcript
    • Centaurin gamma. The centaurins (alpha, beta, gamma, and delta) are large, multi-domain proteins that all contain an ArfGAP domain and ankyrin repeats, and in some cases, numerous additional domains. Centaurin gamma contains an additional GTPase domain near its N-terminus. The specific function of this GTPase domain has not been well characterized, but centaurin gamma 2 (CENTG2) may play a role in the development of autism. Centaurin gamma 1 is also called PIKE (phosphatidyl inositol (PI) 3-kinase enhancer) and centaurin gamma 2 is also known as AGAP (ArfGAP protein with a GTPase-like domain, ankyrin repeats and a Pleckstrin homology domain) or GGAP. Three isoforms of PIKE have been identified. PIKE-S (short) and PIKE-L (long) are brain-specific isoforms, with PIKE-S restricted to the nucleus and PIKE-L found in multiple cellular compartments. A third isoform, PIKE-A was identified in human glioblastoma brain cancers and has been found in various tissues. GGAP has been shown to have high GTPase activity due to a direct intramolecular interaction between the N-terminal GTPase domain and the C-terminal ArfGAP domain. In human tissue, AGAP mRNA was detected in skeletal muscle, kidney, placenta, brain, heart, colon, and lung. Reduced expression levels were also observed in the spleen, liver, and small intestine.
  • cd PH_centaurin 64aa 9e-16 in ref transcript
    • Centaurin Pleckstrin homology (PH) domain. Centaurin beta and gamma consist of a PH domain, an ArfGAP domain and three ankyrin repeats. Centaurain gamma also has an N-terminal Ras homology domain. Centaurin alpha has a different domain architecture and its PH domain is in a different subfamily. Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • cd ANK 90aa 3e-11 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd PH_centaurin 42aa 2e-05 in ref transcript
  • pfam ArfGap 117aa 7e-37 in ref transcript
    • Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • pfam Miro 108aa 3e-22 in ref transcript
    • Miro-like protein. Mitochondrial Rho proteins (Miro-1 and Miro-2), are atypical Rho GTPases. They have a unique domain organisation, with tandem GTP-binding domains and two EF hand domains (pfam00036), that may bind calcium. They are also larger than classical small GTPases. It has been proposed that they are involved in mitochondrial homeostasis and apoptosis.
  • pfam Ank 32aa 7e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • smart PH 65aa 9e-05 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • smart PH 40aa 0.001 in ref transcript
  • COG COG5347 118aa 3e-24 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
  • COG COG1100 170aa 6e-08 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].
  • PTZ PTZ00322 107aa 6e-07 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

CERKL

  • refseq_CERKL.F2 refseq_CERKL.R2 113 191
  • NCBIGene 36.3 375298
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001030311

  • Changed! pfam DAGK_cat 165aa 2e-15 in ref transcript
    • Diacylglycerol kinase catalytic domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues. YegS is the Escherichia coli protein in this family whose crystal structure reveals an active site in the inter-domain cleft formed by four conserved sequence motifs, revealing a novel metal-binding site. The residues of this site are conserved across the family.
  • Changed! COG LCB5 384aa 3e-16 in ref transcript
    • Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
  • Changed! pfam DAGK_cat 139aa 1e-11 in modified transcript
  • Changed! COG LCB5 358aa 7e-13 in modified transcript

CES2

  • refseq_CES2.F1 refseq_CES2.R1 117 165
  • NCBIGene 36.3 8824
  • Alternative 5-prime, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003869

  • Changed! cd Esterase_lipase 490aa 1e-134 in ref transcript
    • Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on carboxylic esters (EC: 3.1.1.-). The catalytic apparatus involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine.These catalytic residues are responsible for the nucleophilic attack on the carbonyl carbon atom of the ester bond. In contrast with other alpha/beta hydrolase fold family members, p-nitrobenzyl esterase and acetylcholine esterase have a Glu instead of Asp at the active site carboxylate.
  • Changed! pfam COesterase 510aa 1e-171 in ref transcript
    • Carboxylesterase.
  • Changed! COG PnbA 510aa 7e-90 in ref transcript
    • Carboxylesterase type B [Lipid metabolism].
  • Changed! cd Esterase_lipase 474aa 1e-128 in modified transcript
  • Changed! pfam COesterase 494aa 1e-164 in modified transcript
  • Changed! COG PnbA 494aa 3e-84 in modified transcript

CGGBP1

  • refseq_CGGBP1.F1 refseq_CGGBP1.R1 199 301
  • NCBIGene 36.3 8545
  • Single exon skipping, size difference: 102
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008390

TPPP3

  • refseq_CGI-38.F1 refseq_CGI-38.R1 149 240
  • NCBIGene 36.3 51673
  • Single exon skipping, size difference: 91
  • Exclusion in 5'UTR
  • Reference transcript: NM_016140

  • pfam p25-alpha 165aa 2e-60 in ref transcript
    • p25-alpha. This family encodes a 25 kDa protein that is phosphorylated by a Ser/Thr-Pro kinase. It has been described as a brain specific protein, but it is found in Tetrahymena thermophila.

CHAT

  • refseq_CHAT.F1 refseq_CHAT.R1 128 300
  • NCBIGene 36.3 1103
  • Single exon skipping, size difference: 172
  • Exclusion in 5'UTR
  • Reference transcript: NM_020985

  • pfam Carn_acyltransf 590aa 0.0 in ref transcript
    • Choline/Carnitine o-acyltransferase.

CHCHD7

  • refseq_CHCHD7.F1 refseq_CHCHD7.R1 105 125
  • NCBIGene 36.3 79145
  • Alternative 5-prime, size difference: 20
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001011668

  • Changed! pfam CHCH 36aa 0.008 in ref transcript
    • CHCH domain. we have identified a conserved motif in the LOC118487 protein that we have called the CHCH motif. Alignment of this protein with related members showed the presence of three subgroups of proteins, which are called the S (Small), N (N-terminal extended) and C (C-terminal extended) subgroups. All three sub-groups of proteins have in common that they contain a predicted conserved [coiled coil 1]-[helix 1]-[coiled coil 2]-[helix 2] domain (CHCH domain). Within each helix of the CHCH domain, there are two cysteines present in a C-X9-C motif. The N-group contains an additional double helix domain, and each helix contains the C-X9-C motif. This family contains a number of characterised proteins: Cox19 protein - a nuclear gene of Saccharomyces cerevisiae, codes for an 11-kDa protein (Cox19p) required for expression of cytochrome oxidase. Because cox19 mutants are able to synthesise the mitochondrial and nuclear gene products of cytochrome oxidase, Cox19p probably functions post-translationally during assembly of the enzyme. Cox19p is present in the cytoplasm and mitochondria, where it exists as a soluble intermembrane protein. This dual location is similar to what was previously reported for Cox17p, a low molecular weight copper protein thought to be required for maturation of the CuA centre of subunit 2 of cytochrome oxidase. Cox19p have four conserved potential metal ligands, these are three cysteines and one histidine. Mrp10 - belongs to the class of yeast mitochondrial ribosomal proteins that are essential for translation. Eukaryotic NADH-ubiquinone oxidoreductase 19 kDa (NDUFA8) subunit. The CHCH domain was previously called DUF657.

CHCHD7

  • refseq_CHCHD7.F3 refseq_CHCHD7.R3 133 177
  • NCBIGene 36.3 79145
  • Alternative 3-prime, size difference: 44
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001011668

  • Changed! pfam CHCH 36aa 0.008 in ref transcript
    • CHCH domain. we have identified a conserved motif in the LOC118487 protein that we have called the CHCH motif. Alignment of this protein with related members showed the presence of three subgroups of proteins, which are called the S (Small), N (N-terminal extended) and C (C-terminal extended) subgroups. All three sub-groups of proteins have in common that they contain a predicted conserved [coiled coil 1]-[helix 1]-[coiled coil 2]-[helix 2] domain (CHCH domain). Within each helix of the CHCH domain, there are two cysteines present in a C-X9-C motif. The N-group contains an additional double helix domain, and each helix contains the C-X9-C motif. This family contains a number of characterised proteins: Cox19 protein - a nuclear gene of Saccharomyces cerevisiae, codes for an 11-kDa protein (Cox19p) required for expression of cytochrome oxidase. Because cox19 mutants are able to synthesise the mitochondrial and nuclear gene products of cytochrome oxidase, Cox19p probably functions post-translationally during assembly of the enzyme. Cox19p is present in the cytoplasm and mitochondria, where it exists as a soluble intermembrane protein. This dual location is similar to what was previously reported for Cox17p, a low molecular weight copper protein thought to be required for maturation of the CuA centre of subunit 2 of cytochrome oxidase. Cox19p have four conserved potential metal ligands, these are three cysteines and one histidine. Mrp10 - belongs to the class of yeast mitochondrial ribosomal proteins that are essential for translation. Eukaryotic NADH-ubiquinone oxidoreductase 19 kDa (NDUFA8) subunit. The CHCH domain was previously called DUF657.

CHD3

  • refseq_CHD3.F1 refseq_CHD3.R1 160 262
  • NCBIGene 36.3 1107
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005271

  • cd HELICc 109aa 8e-21 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • cd DEXDc 159aa 1e-16 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • cd CHROMO 56aa 4e-06 in ref transcript
    • Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
  • cd CHROMO 45aa 4e-04 in ref transcript
  • cd BAH_plant_2 24aa 0.008 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • pfam CHDCT2 173aa 8e-92 in ref transcript
    • CHDCT2 (NUC038) domain. The CHDCT2 C-terminal domain is found in PHD/RING finger and chromo domain-associated CHD-like helicases.
  • pfam SNF2_N 287aa 1e-62 in ref transcript
    • SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
  • pfam DUF1086 133aa 5e-61 in ref transcript
    • Domain of Unknown Function (DUF1086). This family consists of several eukaryotic domains of unknown function which are present in chromodomain helicase DNA binding proteins. This domain is often found in conjunction with pfam00176, pfam00271, pfam06465, pfam00385 and pfam00628.
  • pfam CHDNT 55aa 7e-23 in ref transcript
    • CHDNT (NUC034) domain. The CHDNT domain is found in PHD/RING finger and chromo domain-associated helicases.
  • pfam Helicase_C 81aa 3e-19 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • pfam DUF1087 46aa 4e-16 in ref transcript
    • Domain of Unknown Function (DUF1087). Members of this family are found in various chromatin remodelling factors and transposases. Their exact function is, as yet, unknown.
  • pfam PHD 45aa 8e-12 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • pfam PHD 43aa 8e-12 in ref transcript
  • pfam Chromo 51aa 1e-08 in ref transcript
    • 'chromo' (CHRromatin Organisation MOdifier) domain.
  • smart CHROMO 77aa 2e-05 in ref transcript
    • Chromatin organization modifier domain.
  • COG HepA 501aa 2e-74 in ref transcript
    • Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
  • COG TNG2 51aa 0.004 in ref transcript
    • Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics].

CHI3L2

  • refseq_CHI3L2.F2 refseq_CHI3L2.R2 150 180
  • NCBIGene 36.3 1117
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004000

  • cd GH18_chitolectin_chitotriosidase 359aa 1e-152 in ref transcript
    • This conserved domain family includes a large number of catalytically inactive chitinase-like lectins (chitolectins) including YKL-39, YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic mammalian chitinase), as well as catalytically active chitotriosidases. The conserved domain is an eight-stranded alpha/beta barrel fold belonging to the family 18 glycosyl hydrolases. The fold has a pronounced active-site cleft at the C-terminal end of the beta-barrel. The chitolectins lack a key active site glutamate (the proton donor required for hydrolytic activity) but retain highly conserved residues involved in oligosaccharide binding. Chitotriosidase is a chitinolytic enzyme expressed in maturing macrophages, which suggests that it plays a part in antimicrobial defense. Chitotriosidase hydrolyzes chitotriose, as well as colloidal chitin to yield chitobiose and is therefore considered an exochitinase. Chitotriosidase occurs in two major forms, the large form being converted to the small form by either RNA or post-translational processing. Although the small form, containing the chitinase domain alone, is sufficient for the chitinolytic activity, the additional C-terminal chitin-binding domain of the large form plays a role in processing colloidal chitin. The chitotriosidase gene is nonessential in humans, as about 35% of the population are heterozygous and 6% homozygous for an inactivated form of the gene. HCGP39 is a 39-kDa human cartilage glycoprotein thought to play a role in connective tissue remodeling and defense against pathogens.
  • smart Glyco_18 338aa 2e-94 in ref transcript
    • Glycosyl hydrolase family 18.
  • COG ChiA 340aa 6e-39 in ref transcript
    • Chitinase [Carbohydrate transport and metabolism].

CHIA

  • refseq_CHIA.F2 refseq_CHIA.R2 100 162
  • NCBIGene 36.2 27159
  • Multiple exon skipping, size difference: 62
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_201653

  • cd GH18_chitolectin_chitotriosidase 279aa 1e-134 in ref transcript
    • This conserved domain family includes a large number of catalytically inactive chitinase-like lectins (chitolectins) including YKL-39, YKL-40 (HCGP39), YM1, oviductin, and AMCase (acidic mammalian chitinase), as well as catalytically active chitotriosidases. The conserved domain is an eight-stranded alpha/beta barrel fold belonging to the family 18 glycosyl hydrolases. The fold has a pronounced active-site cleft at the C-terminal end of the beta-barrel. The chitolectins lack a key active site glutamate (the proton donor required for hydrolytic activity) but retain highly conserved residues involved in oligosaccharide binding. Chitotriosidase is a chitinolytic enzyme expressed in maturing macrophages, which suggests that it plays a part in antimicrobial defense. Chitotriosidase hydrolyzes chitotriose, as well as colloidal chitin to yield chitobiose and is therefore considered an exochitinase. Chitotriosidase occurs in two major forms, the large form being converted to the small form by either RNA or post-translational processing. Although the small form, containing the chitinase domain alone, is sufficient for the chitinolytic activity, the additional C-terminal chitin-binding domain of the large form plays a role in processing colloidal chitin. The chitotriosidase gene is nonessential in humans, as about 35% of the population are heterozygous and 6% homozygous for an inactivated form of the gene. HCGP39 is a 39-kDa human cartilage glycoprotein thought to play a role in connective tissue remodeling and defense against pathogens.
  • smart Glyco_18 257aa 3e-80 in ref transcript
    • Glycosyl hydrolase family 18.
  • smart ChtBD2 48aa 3e-09 in ref transcript
    • Chitin-binding domain type 2.
  • COG ChiA 257aa 1e-37 in ref transcript
    • Chitinase [Carbohydrate transport and metabolism].

CHKA

  • refseq_CHKA.F1 refseq_CHKA.R1 121 175
  • NCBIGene 36.3 1119
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001277

  • Changed! cd ChoK_euk 317aa 1e-91 in ref transcript
    • Choline Kinase (ChoK) in eukaryotes. The ChoK subfamily is part of a larger superfamily that includes the catalytic domains of other kinases, such as the typical serine/threonine/tyrosine protein kinases (PKs), RIO kinases, actin-fragmin kinase (AFK), and phosphoinositide 3-kinase (PI3K). It is composed of bacterial and eukaryotic choline kinases, as well as eukaryotic ethanolamine kinase. ChoK catalyzes the transfer of the gamma-phosphoryl group from ATP (or CTP) to its substrate, choline, producing phosphorylcholine (PCho), a precursor to the biosynthesis of two major membrane phospholipids, phosphatidylcholine (PC) and sphingomyelin (SM). Although choline is the preferred substrate, ChoK also shows substantial activity towards ethanolamine and its N-methylated derivatives. ChoK plays an important role in cell signaling pathways and the regulation of cell growth. Along with PCho, it is involved in malignant transformation through Ras oncogenes in various human cancers such as breast, lung, colon, prostate, neuroblastoma, and hepatic lymphoma. In mammalian cells, there are three ChoK isoforms (A-1, A-2, and B) which are active in homo or heterodimeric forms.
  • Changed! pfam Choline_kinase 236aa 4e-62 in ref transcript
    • Choline/ethanolamine kinase. Choline kinase catalyses the committed step in the synthesis of phosphatidylcholine by the CDP-choline pathway. This alignment covers the protein kinase portion of the protein. The divergence of this family makes it very difficult to create a model that specifically predicts choline/ethanolamine kinases only.
  • Changed! PTZ PTZ00296 361aa 6e-36 in ref transcript
    • choline kinase; Provisional.
  • Changed! cd ChoK_euk 299aa 1e-93 in modified transcript
  • Changed! pfam Choline_kinase 218aa 3e-64 in modified transcript
  • Changed! PTZ PTZ00296 343aa 1e-36 in modified transcript

CHRFAM7A

  • refseq_CHRFAM7A.F2 refseq_CHRFAM7A.R2 146 210
  • NCBIGene 36.3 89832
  • Single exon skipping, size difference: 64
  • Exclusion of the protein initiation site
  • Reference transcript: NM_139320

  • Changed! TIGR LIC 373aa 1e-107 in ref transcript
    • selective while glycine receptors are anion selective).
  • Changed! TIGR LIC 307aa 3e-78 in modified transcript

CHRM2

  • refseq_CHRM2.F1 refseq_CHRM2.R1 130 186
  • NCBIGene 36.3 1129
  • Alternative 5-prime, size difference: 56
  • Exclusion in 5'UTR
  • Reference transcript: NM_001006632

  • pfam 7tm_1 174aa 7e-36 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • pfam 7tm_1 81aa 9e-06 in ref transcript

CHRM2

  • refseq_CHRM2.F3 refseq_CHRM2.R3 108 186
  • NCBIGene 36.3 1129
  • Single exon skipping, size difference: 78
  • Exclusion in 5'UTR
  • Reference transcript: NM_001006626

  • pfam 7tm_1 174aa 7e-36 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • pfam 7tm_1 81aa 9e-06 in ref transcript

CHRNA1

  • refseq_CHRNA1.F1 refseq_CHRNA1.R1 116 191
  • NCBIGene 36.3 1134
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039523

  • Changed! pfam Neur_chan_LBD 233aa 3e-70 in ref transcript
    • Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure.
  • Changed! pfam Neur_chan_memb 209aa 7e-45 in ref transcript
    • Neurotransmitter-gated ion-channel transmembrane region. This family includes the four transmembrane helices that form the ion channel.
  • Changed! pfam Neur_chan_LBD 208aa 4e-74 in modified transcript
  • Changed! TIGR LIC 427aa 1e-65 in modified transcript
    • selective while glycine receptors are anion selective).

NLRP3

  • refseq_CIAS1.F1 refseq_CIAS1.R1 125 296
  • NCBIGene 36.3 114548
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001079821

  • Changed! cd LRR_RI 317aa 6e-50 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! cd LRR_RI 207aa 1e-07 in ref transcript
  • pfam NACHT 170aa 1e-41 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam PAAD_DAPIN 84aa 6e-21 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • smart LRR_RI 28aa 0.001 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.
  • Changed! smart LRR_RI 28aa 0.002 in ref transcript
  • Changed! COG RNA1 230aa 0.003 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
  • Changed! cd LRR_RI 320aa 5e-54 in modified transcript

NLRP3

  • refseq_CIAS1.F3 refseq_CIAS1.R3 215 386
  • NCBIGene 36.3 114548
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001079821

  • Changed! cd LRR_RI 317aa 6e-50 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! cd LRR_RI 207aa 1e-07 in ref transcript
  • pfam NACHT 170aa 1e-41 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam PAAD_DAPIN 84aa 6e-21 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • smart LRR_RI 28aa 0.001 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.
  • smart LRR_RI 28aa 0.002 in ref transcript
  • Changed! COG RNA1 230aa 0.003 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
  • Changed! cd LRR_RI 320aa 2e-48 in modified transcript
  • Changed! COG RNA1 114aa 0.001 in modified transcript

CKAP5

  • refseq_CKAP5.F2 refseq_CKAP5.R2 183 363
  • NCBIGene 36.3 9793
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008938

CKLF

  • refseq_CKLF.F1 refseq_CKLF.R1 173 269
  • NCBIGene 36.3 51192
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016951

  • Changed! pfam MARVEL 118aa 3e-05 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.

CKLF

  • refseq_CKLF.F3 refseq_CKLF.R3 206 365
  • NCBIGene 36.3 51192
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016951

  • Changed! pfam MARVEL 118aa 3e-05 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.

CLCN3

  • refseq_CLCN3.F2 refseq_CLCN3.R2 286 362
  • NCBIGene 36.3 1182
  • Single exon skipping, size difference: 76
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_173872

  • cd ClC_3_like 435aa 1e-176 in ref transcript
    • ClC-3-like chloride channel proteins. This CD includes ClC-3, ClC-4, ClC-5 and ClC-Y1. ClC-3 was initially cloned from rat kidney. Expression of ClC-3 produces outwardly-rectifying Cl currents that are inhibited by protein kinase C activation. It has been suggested that ClC-3 may be a ubiquitous swelling-activated Cl channel that has very similar characteristics to those of native volume-regulated Cl currents. The function of ClC-4 is unclear. Studies of human ClC-4 have revealed that it gives rise to Cl currents that rapidly activate at positive voltages, and are sensitive to extracellular pH, with currents decreasing when pH falls below 6.5. ClC-4 is broadly distributed, especially in brain and heart. ClC-5 is predominantly expressed in the kidney, but can be found in the brain and liver. Mutations in the ClC-5 gene cause certain hereditary diseases, including Dent's disease, an X-chromosome linked syndrome characterised by proteinuria, hypercalciuria, and kidney stones (nephrolithiasis), leading to progressive renal failure. These proteins belong to the ClC superfamily of chloride ion channels, which share the unique double-barreled architecture and voltage-dependent gating mechanism. The gating is conferred by the permeating anion itself, acting as the gating charge. This domain is found in the eukaryotic halogen ion (Cl- and I-) channel proteins, that perform a variety of functions including cell volume regulation, the membrane potential stabilization, transepithelial chloride transport and charge compensation necessary for the acidification of intracellular organelles.
  • Changed! cd CBS_pair_EriC_assoc_euk_bac 139aa 4e-24 in ref transcript
    • This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in CLC chloride channel family members have been associated with classic Bartter syndrome, Osteopetrosis, Dent's disease, idiopathic generalized epilepsy, and myotonia.
  • pfam Voltage_CLC 403aa 1e-62 in ref transcript
    • Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. In terms of primary structure, they are unrelated to known cation channels or other types of anion channels. Three ClC subfamilies are found in animals. ClC-1 is involved in setting and restoring the resting membrane potential of skeletal muscle, while other channels play important parts in solute concentration mechanisms in the kidney. These proteins contain two pfam00571 domains.
  • Changed! pfam CBS 149aa 3e-10 in ref transcript
    • CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
  • COG EriC 455aa 3e-38 in ref transcript
    • Chloride channel protein EriC [Inorganic ion transport and metabolism].
  • Changed! COG TlyC 153aa 0.001 in ref transcript
    • Hemolysins and related proteins containing CBS domains [General function prediction only].
  • Changed! cd CBS_pair_EriC_assoc_euk_bac 139aa 3e-26 in modified transcript
  • Changed! pfam CBS 150aa 1e-11 in modified transcript
  • Changed! COG COG0517 54aa 3e-05 in modified transcript
    • FOG: CBS domain [General function prediction only].
  • Changed! COG COG4109 164aa 7e-05 in modified transcript
    • Predicted transcriptional regulator containing CBS domains [Transcription].

CLCN6

  • refseq_CLCN6.F1 refseq_CLCN6.R1 134 301
  • NCBIGene 36.3 1185
  • Single exon skipping, size difference: 167
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001286

  • Changed! cd ClC_6_like 322aa 1e-105 in ref transcript
    • ClC-6-like chloride channel proteins. This CD includes ClC-6, ClC-7 and ClC-B, C, D in plants. Proteins in this family are ubiquitous in eukarotes and their functions are unclear. They are expressed in intracellular organelles membranes. This family belongs to the ClC superfamily of chloride ion channels, which share the unique double-barreled architecture and voltage-dependent gating mechanism. The gating is conferred by the permeating anion itself, acting as the gating charge. ClC chloride ion channel superfamily perform a variety of functions including cellular excitability regulation, cell volume regulation, membrane potential stabilization, acidification of intracellular organelles, signal transduction, and transepithelial transport in animals.
  • Changed! cd ClC_6_like 112aa 5e-39 in ref transcript
  • Changed! cd CBS_pair_EriC_assoc_euk_bac 54aa 2e-13 in ref transcript
    • This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in CLC chloride channel family members have been associated with classic Bartter syndrome, Osteopetrosis, Dent's disease, idiopathic generalized epilepsy, and myotonia.
  • Changed! pfam Voltage_CLC 231aa 4e-28 in ref transcript
    • Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. In terms of primary structure, they are unrelated to known cation channels or other types of anion channels. Three ClC subfamilies are found in animals. ClC-1 is involved in setting and restoring the resting membrane potential of skeletal muscle, while other channels play important parts in solute concentration mechanisms in the kidney. These proteins contain two pfam00571 domains.
  • Changed! pfam Voltage_CLC 129aa 2e-15 in ref transcript
  • Changed! pfam CBS 58aa 3e-08 in ref transcript
    • CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
  • Changed! smart CBS 48aa 0.009 in ref transcript
    • Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
  • Changed! COG EriC 287aa 2e-13 in ref transcript
    • Chloride channel protein EriC [Inorganic ion transport and metabolism].
  • Changed! COG EriC 111aa 3e-09 in ref transcript
  • Changed! PRK PRK05567 58aa 3e-05 in ref transcript
    • inositol-5'-monophosphate dehydrogenase; Reviewed.
  • Changed! cd ClC_6_like 270aa 7e-85 in modified transcript
  • Changed! pfam Voltage_CLC 175aa 2e-20 in modified transcript
  • Changed! COG EriC 239aa 2e-09 in modified transcript

CLDN15

  • refseq_CLDN15.F1 refseq_CLDN15.R1 252 392
  • NCBIGene 36.2 24146
  • Single exon skipping, size difference: 140
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_014343

  • Changed! pfam PMP22_Claudin 169aa 5e-16 in ref transcript
    • PMP-22/EMP/MP20/Claudin family.

CLEC10A

  • refseq_CLEC10A.F1 refseq_CLEC10A.R1 156 228
  • NCBIGene 36.3 10462
  • Alternative 3-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182906

  • cd CLECT_DC-SIGN_like 125aa 8e-41 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • pfam Lectin_C 108aa 1e-28 in ref transcript
    • Lectin C-type domain. This family includes both long and short form C-type.
  • Changed! pfam Lectin_N 165aa 2e-26 in ref transcript
    • Hepatic lectin, N-terminal domain.
  • Changed! pfam Lectin_N 138aa 2e-30 in modified transcript

CLEC10A

  • refseq_CLEC10A.F1 refseq_CLEC10A.R3 109 190
  • NCBIGene 36.3 10462
  • Alternative 3-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182906

  • cd CLECT_DC-SIGN_like 125aa 8e-41 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • pfam Lectin_C 108aa 1e-28 in ref transcript
    • Lectin C-type domain. This family includes both long and short form C-type.
  • Changed! pfam Lectin_N 165aa 2e-26 in ref transcript
    • Hepatic lectin, N-terminal domain.
  • Changed! pfam Lectin_N 138aa 2e-30 in modified transcript

CLEC2D

  • refseq_CLEC2D.F1 refseq_CLEC2D.R1 161 243
  • NCBIGene 36.3 29121
  • Single exon skipping, size difference: 82
  • Exclusion in 5'UTR
  • Reference transcript: NM_001004419

  • pfam Nucleoplasmin 52aa 6e-17 in ref transcript
    • Nucleoplasmin. Nucleoplasmins are also known as chromatin decondensation proteins. They bind to core histones and transfer DNA to them in a reaction that requires ATP. This is thought to play a role in the assembly of regular nucleosomal arrays.

CLEC2D

  • refseq_CLEC2D.F3 refseq_CLEC2D.R3 107 182
  • NCBIGene 36.2 29121
  • Alternative 5-prime, size difference: 75
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001004419

  • Changed! cd CLECT_NK_receptors_like 97aa 9e-25 in ref transcript
    • CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
  • Changed! smart CLECT 78aa 2e-16 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
  • Changed! cd CLECT_NK_receptors_like 45aa 1e-11 in modified transcript
  • Changed! smart CLECT 45aa 2e-09 in modified transcript

CLEC4A

  • refseq_CLEC4A.F1 refseq_CLEC4A.R1 130 229
  • NCBIGene 36.3 50856
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016184

  • cd CLECT_DC-SIGN_like 127aa 8e-40 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • smart CLECT 126aa 7e-26 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

CLEC4A

  • refseq_CLEC4A.F3 refseq_CLEC4A.R3 122 242
  • NCBIGene 36.3 50856
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016184

  • cd CLECT_DC-SIGN_like 127aa 8e-40 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • smart CLECT 126aa 7e-26 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

CLEC4C

  • refseq_CLEC4C.F1 refseq_CLEC4C.R1 254 347
  • NCBIGene 36.3 170482
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130441

  • cd CLECT_DC-SIGN_like 126aa 3e-38 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • smart CLECT 125aa 2e-24 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

CLEC4M

  • refseq_CLEC4M.F1 refseq_CLEC4M.R1 156 219
  • NCBIGene 36.3 10332
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 63
  • Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_214675

  • cd CLECT_DC-SIGN_like 124aa 5e-42 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • smart CLECT 123aa 6e-30 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
  • TIGR SMC_prok_B 182aa 1e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! PRK PRK03918 172aa 4e-06 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! PRK PRK05771 186aa 2e-05 in modified transcript
    • V-type ATP synthase subunit I; Validated.

CLEC4M

  • refseq_CLEC4M.F3 refseq_CLEC4M.R3 183 296
  • NCBIGene 36.3 10332
  • Single exon skipping, size difference: 113
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_214675

  • Changed! cd CLECT_DC-SIGN_like 124aa 5e-42 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • Changed! smart CLECT 123aa 6e-30 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
  • TIGR SMC_prok_B 182aa 1e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • PRK PRK03918 172aa 4e-06 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! cd CLECT_DC-SIGN_like 45aa 5e-13 in modified transcript
  • Changed! smart CLECT 45aa 7e-11 in modified transcript

CLEC7A

  • refseq_CLEC7A.F1 refseq_CLEC7A.R1 247 366
  • NCBIGene 36.3 64581
  • Single exon skipping, size difference: 119
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_197947

  • Changed! cd CLECT_NK_receptors_like 124aa 2e-27 in ref transcript
    • CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
  • Changed! smart CLECT 123aa 1e-18 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
  • Changed! cd CLECT_NK_receptors_like 45aa 9e-10 in modified transcript
  • Changed! smart CLECT 45aa 2e-08 in modified transcript

CLEC7A

  • refseq_CLEC7A.F3 refseq_CLEC7A.R3 158 296
  • NCBIGene 36.3 64581
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_197947

  • cd CLECT_NK_receptors_like 124aa 2e-27 in ref transcript
    • CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
  • smart CLECT 123aa 1e-18 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

CLK1

  • refseq_CLK1.F1 refseq_CLK1.R1 109 200
  • NCBIGene 36.2 1195
  • Single exon skipping, size difference: 91
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004071

  • Changed! cd S_TKc 318aa 1e-52 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! pfam Pkinase 317aa 4e-60 in ref transcript
    • Protein kinase domain.
  • Changed! PTZ PTZ00284 332aa 1e-52 in ref transcript
    • protein kinase; Provisional.

CLK2

  • refseq_CLK2.F1 refseq_CLK2.R1 228 313
  • NCBIGene 36.2 1196
  • Single exon skipping, size difference: 85
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003993

  • Changed! cd S_TKc 318aa 1e-46 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! pfam Pkinase 317aa 3e-53 in ref transcript
    • Protein kinase domain.
  • Changed! PTZ PTZ00284 331aa 6e-49 in ref transcript
    • protein kinase; Provisional.

CLK3

  • refseq_CLK3.F2 refseq_CLK3.R2 197 294
  • NCBIGene 36.3 1198
  • Single exon skipping, size difference: 97
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003992

  • Changed! cd S_TKc 318aa 8e-45 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 307aa 4e-49 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00284 341aa 3e-46 in ref transcript
    • protein kinase; Provisional.

CLSTN1

  • refseq_CLSTN1.F1 refseq_CLSTN1.R1 165 195
  • NCBIGene 36.3 22883
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001009566

  • Changed! cd CA 218aa 7e-21 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • smart CA 76aa 1e-07 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • pfam Cadherin 89aa 3e-05 in ref transcript
    • Cadherin domain.
  • Changed! cd CA 208aa 6e-22 in modified transcript

CLTA

  • refseq_CLTA.F2 refseq_CLTA.R2 287 377
  • NCBIGene 36.3 1211
  • Multiple exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007096

  • Changed! pfam Clathrin_lg_ch 220aa 2e-69 in ref transcript
    • Clathrin light chain.
  • Changed! pfam Clathrin_lg_ch 190aa 7e-58 in modified transcript

CLTB

  • refseq_CLTB.F1 refseq_CLTB.R1 219 273
  • NCBIGene 36.3 1212
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007097

  • Changed! pfam Clathrin_lg_ch 201aa 2e-71 in ref transcript
    • Clathrin light chain.
  • Changed! pfam Clathrin_lg_ch 183aa 1e-59 in modified transcript

CLTCL1

  • refseq_CLTCL1.F1 refseq_CLTCL1.R1 186 357
  • NCBIGene 36.2 8218
  • Single exon skipping, size difference: 171
  • Exclusion in 3'UTR
  • Reference transcript: NM_007098

  • pfam Clathrin 143aa 1e-33 in ref transcript
    • Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain.
  • pfam Clathrin 140aa 8e-33 in ref transcript
  • pfam Clathrin 146aa 9e-32 in ref transcript
  • pfam Clathrin 143aa 8e-27 in ref transcript
  • pfam Clathrin 74aa 8e-11 in ref transcript
  • pfam Clathrin-link 24aa 2e-04 in ref transcript
    • Clathrin, heavy-chain linker. Members of this family adopt a structure consisting of alpha-alpha superhelix. They are predominantly found in clathrin, where they act as a heavy-chain linker domain.

CMTM3

  • refseq_CMTM3.F2 refseq_CMTM3.R2 139 245
  • NCBIGene 36.3 123920
  • Alternative 5-prime, size difference: 106
  • Exclusion in 5'UTR
  • Reference transcript: NM_144601

  • pfam MARVEL 55aa 2e-04 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.

CMTM5

  • refseq_CMTM5.F1 refseq_CMTM5.R1 204 297
  • NCBIGene 36.3 116173
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138460

  • Changed! pfam MARVEL 61aa 3e-05 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
  • Changed! pfam MARVEL 53aa 2e-04 in modified transcript

CMTM7

  • refseq_CMTM7.F2 refseq_CMTM7.R2 159 258
  • NCBIGene 36.3 112616
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138410

  • Changed! pfam MARVEL 124aa 7e-10 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
  • Changed! pfam MARVEL 82aa 6e-06 in modified transcript

CNIH

  • refseq_CNIH.F1 refseq_CNIH.R1 121 234
  • NCBIGene 36.2 10175
  • Single exon skipping, size difference: 113
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005776

  • Changed! pfam Cornichon 117aa 2e-39 in ref transcript
    • Cornichon protein.
  • Changed! pfam Cornichon 45aa 1e-07 in modified transcript

CNN2

  • refseq_CNN2.F1 refseq_CNN2.R1 226 343
  • NCBIGene 36.3 1265
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004368

  • Changed! cd CH 101aa 1e-16 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • Changed! pfam CH 102aa 8e-18 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • Changed! pfam Calponin 26aa 1e-05 in ref transcript
    • Calponin family repeat.
  • pfam Calponin 26aa 3e-04 in ref transcript
  • pfam Calponin 25aa 0.001 in ref transcript
  • Changed! COG SCP1 167aa 2e-20 in ref transcript
    • Calponin [Cytoskeleton].
  • Changed! cd CH 100aa 8e-17 in modified transcript
  • Changed! pfam CH 100aa 7e-18 in modified transcript
  • Changed! pfam Calponin 23aa 9e-04 in modified transcript
  • Changed! COG SCP1 168aa 5e-16 in modified transcript

CNNM2

  • refseq_CNNM2.F1 refseq_CNNM2.R1 151 217
  • NCBIGene 36.3 54805
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017649

  • cd CBS_pair_CorC_HlyC_assoc 106aa 2e-16 in ref transcript
    • This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. The second CBS domain in this CD is degenerate.
  • TIGR GldE 326aa 7e-29 in ref transcript
    • Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype.
  • COG TlyC 331aa 9e-37 in ref transcript
    • Hemolysins and related proteins containing CBS domains [General function prediction only].

CNNM3

  • refseq_CNNM3.F2 refseq_CNNM3.R2 276 420
  • NCBIGene 36.3 26505
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017623

  • Changed! cd CBS_pair_CorC_HlyC_assoc 122aa 8e-19 in ref transcript
    • This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains associated with the CorC_HlyC domain. CorC_HlyC is a transporter associated domain. This small domain is found in Na+/H+ antiporters, in proteins involved in magnesium and cobalt efflux, and in association with some proteins of unknown function. The function of the CorC_HlyC domain is uncertain but it might be involved in modulating transport of ion substrates. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. The second CBS domain in this CD is degenerate.
  • Changed! TIGR GldE 143aa 8e-20 in ref transcript
    • Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldC is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. GldE was discovered because of its adjacency to GldD in F. johnsonii. Overexpression of GldE partially supresses the effects of a GldB point mutant suggesting that GldB and GldE interact. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Not all Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility and in fact some do not appear to express the gliding phenotype.
  • Changed! COG TlyC 250aa 1e-22 in ref transcript
    • Hemolysins and related proteins containing CBS domains [General function prediction only].
  • Changed! cd CBS_pair_CorC_HlyC_assoc 87aa 5e-10 in modified transcript
  • Changed! TIGR GldE 95aa 5e-09 in modified transcript
  • Changed! COG TlyC 191aa 2e-12 in modified transcript

CNTFR

  • refseq_CNTFR.F1 refseq_CNTFR.R1 144 255
  • NCBIGene 36.3 1271
  • Single exon skipping, size difference: 111
  • Exclusion in 5'UTR
  • Reference transcript: NM_147164

  • cd FN3 97aa 1e-06 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • smart IGc2 58aa 2e-07 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 91aa 1e-04 in ref transcript
    • Fibronectin type III domain.

CNTN1

  • refseq_CNTN1.F1 refseq_CNTN1.R1 134 167
  • NCBIGene 36.3 1272
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001843

  • cd IGcam 88aa 2e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 86aa 8e-12 in ref transcript
  • cd IGcam 76aa 9e-11 in ref transcript
  • cd FN3 88aa 3e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 86aa 6e-10 in ref transcript
  • cd FN3 91aa 1e-07 in ref transcript
  • cd FN3 93aa 6e-07 in ref transcript
  • cd IGcam 96aa 4e-06 in ref transcript
  • cd IGcam 69aa 0.002 in ref transcript
  • cd FN3 85aa 0.005 in ref transcript
  • smart IGc2 60aa 2e-14 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 81aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 83aa 8e-11 in ref transcript
    • Fibronectin type III domain.
  • pfam I-set 78aa 3e-10 in ref transcript
    • Immunoglobulin I-set domain.
  • smart IG_like 68aa 5e-09 in ref transcript
  • smart IG_like 91aa 2e-07 in ref transcript
  • pfam fn3 84aa 3e-07 in ref transcript
  • smart FN3 91aa 2e-04 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam fn3 81aa 0.007 in ref transcript
  • smart IG_like 73aa 0.009 in ref transcript

CNTROB

  • refseq_CNTROB.F1 refseq_CNTROB.R1 183 249
  • NCBIGene 36.3 116840
  • Alternative 5-prime, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037144

COG5

  • refseq_COG5.F1 refseq_COG5.R1 183 246
  • NCBIGene 36.3 10466
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006348

  • pfam COG5 124aa 4e-26 in ref transcript
    • Golgi transport complex subunit 5. The COG complex, the peripheral membrane oligomeric protein complex involved in intra-Golgi protein trafficking, consists of eight subunits arranged in two lobes bridged by Cog1. Cog5 is in the smaller, B lobe, bound in with Cog6-8, and is itself bound to Cog1 as well as, strongly, to Cog7.

COL11A2

  • refseq_COL11A2.F1 refseq_COL11A2.R1 108 288
  • NCBIGene 36.3 1302
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080680

  • cd LamG 152aa 1e-05 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • smart COLFI 196aa 6e-85 in ref transcript
    • Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
  • smart TSPN 183aa 7e-56 in ref transcript
    • Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin.

COL13A1

  • refseq_COL13A1.F1 refseq_COL13A1.R1 126 213
  • NCBIGene 36.3 1305
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005203

  • Changed! pfam Collagen 63aa 0.007 in modified transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.

COL13A1

  • refseq_COL13A1.F3 refseq_COL13A1.R3 155 221
  • NCBIGene 36.3 1305
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005203

COL13A1

  • refseq_COL13A1.F4 refseq_COL13A1.R4 93 150
  • NCBIGene 36.3 1305
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005203

COL13A1

  • refseq_COL13A1.F6 refseq_COL13A1.R6 92 143
  • NCBIGene 36.3 1305
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005203

COL2A1

  • refseq_COL2A1.F1 refseq_COL2A1.R1 129 336
  • NCBIGene 36.3 1280
  • Single exon skipping, size difference: 207
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001844

  • smart COLFI 236aa 1e-115 in ref transcript
    • Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
  • Changed! pfam VWC 56aa 2e-18 in ref transcript
    • von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094.
  • pfam Collagen 59aa 1e-04 in ref transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.

COL4A3

  • refseq_COL4A3.F1 refseq_COL4A3.R1 182 355
  • NCBIGene 36.3 1285
  • Single exon skipping, size difference: 173
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000091

  • pfam C4 110aa 6e-54 in ref transcript
    • C-terminal tandem repeated domain in type 4 procollagen. Duplicated domain in C-terminus of type 4 collagens. Mutations in alpha-5 collagen IV are associated with X-linked Alport syndrome.
  • Changed! pfam C4 113aa 5e-47 in ref transcript
  • Changed! pfam C4 31aa 2e-10 in modified transcript

COL4A3BP

  • refseq_COL4A3BP.F1 refseq_COL4A3BP.R1 195 273
  • NCBIGene 36.3 10087
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005713

  • cd PH_GPBP 90aa 3e-42 in ref transcript
    • Goodpasture antigen binding protein (GPBP) Pleckstrin homology (PH) domain. The GPBP protein is a kinase that phosphorylates an N-terminal region of the alpha 3 chain of type IV collagen , which is commonly known as the goodpasture antigen. It has has an N-terminal PH domain and a C-terminal START domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • Changed! cd START 220aa 1e-25 in ref transcript
    • START(STeroidogenic Acute Regulatory (STAR) related lipid Transfer) Domain. These domains are 200-210 amino acid in length and occur in proteins involved in lipid transport (phosphatidylcholine) and metabolism, signal transduction, and transcriptional regulation. The most striking feature of the START domain structure is a predominantly hydrophobic tunnel extending nearly the entire protein and used to binding a single molecule of large lipophilic compounds, like cholesterol.
  • smart START 202aa 3e-18 in ref transcript
    • in StAR and phosphatidylcholine transfer protein. putative lipid-binding domain in StAR and phosphatidylcholine transfer protein.
  • pfam PH 92aa 3e-13 in ref transcript
    • PH domain. PH stands for pleckstrin homology.
  • Changed! cd START 224aa 7e-26 in modified transcript

COL6A2

  • refseq_COL6A2.F2 refseq_COL6A2.R2 142 435
  • NCBIGene 36.3 1292
  • Alternative 3-prime, size difference: 293
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_058174

  • cd vWA_collagen_alpha_1-VI-type 192aa 1e-51 in ref transcript
    • VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.
  • cd vWA_collagen_alpha_1-VI-type 189aa 1e-46 in ref transcript
  • pfam VWA 187aa 2e-29 in ref transcript
    • von Willebrand factor type A domain.
  • smart VWA 169aa 1e-18 in ref transcript
    • von Willebrand factor (vWF) type A domain. VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.
  • pfam Collagen 46aa 8e-06 in ref transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
  • pfam Collagen 62aa 4e-05 in ref transcript
  • pfam Collagen 88aa 6e-05 in ref transcript

COL8A1

  • refseq_COL8A1.F1 refseq_COL8A1.R1 119 181
  • NCBIGene 36.3 1295
  • Single exon skipping, size difference: 62
  • Exclusion in 5'UTR
  • Reference transcript: NM_001850

  • smart C1Q 136aa 9e-55 in ref transcript
    • Complement component C1q domain. Globular domain found in many collagens and eponymously in complement C1q. When part of full length proteins these domains form a 'bouquet' due to the multimerization of heterotrimers. The C1q fold is similar to that of tumour necrosis factor.

COLEC11

  • refseq_COLEC11.F1 refseq_COLEC11.R1 100 179
  • NCBIGene 36.3 78989
  • Single exon skipping, size difference: 79
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_024027

  • Changed! cd CLECT_collectin_like 116aa 5e-45 in ref transcript
    • CLECT_collectin_like: C-type lectin-like domain (CTLD) of the type found in human collectins including lung surfactant proteins A and D, mannose- or mannan binding lectin (MBL), and CL-L1 (collectin liver 1). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. The CTLDs of these collectins bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, or apoptotic cells) and mediate functions associated with killing and phagocytosis. MBPs recognize high mannose oligosaccharides in a calcium dependent manner, bind to a broad range of pathogens, and trigger cell killing by activating the complement pathway. MBP also acts directly as an opsonin. SP-A and SP-D in addition to functioning as host defense components, are components of pulmonary surfactant which play a role in surfactant homeostasis. Pulmonary surfactant is a phospholipid-protein complex which reduces the surface tension within the lungs. SP-A binds the major surfactant lipid: dipalmitoylphosphatidylcholine (DPPC). SP-D binds two minor components of surfactant that contain sugar moieties: glucosylceramide and phosphatidylinositol (PI). MBP and SP-A, -D monomers are homotrimers with an N-terminal collagen region and three CTLDs. Multiple homotrimeric units associate to form supramolecular complexes. MBL deficiency results in an increased susceptibility to a large number of different infections and to inflammatory disease, such as rheumatoid arthritis.
  • Changed! pfam Lectin_C 108aa 2e-19 in ref transcript
    • Lectin C-type domain. This family includes both long and short form C-type.

COLQ

  • refseq_COLQ.F1 refseq_COLQ.R1 142 169
  • NCBIGene 36.3 8292
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005677

  • TIGR myxo_disulf_rpt 25aa 5e-04 in ref transcript
    • This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
  • pfam Collagen 63aa 0.004 in ref transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.

COLQ

  • refseq_COLQ.F4 refseq_COLQ.R4 136 276
  • NCBIGene 36.2 8292
  • Single exon skipping, size difference: 140
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005677

  • Changed! TIGR myxo_disulf_rpt 25aa 5e-04 in ref transcript
    • This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
  • Changed! pfam Collagen 63aa 0.004 in ref transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.

COMMD6

  • refseq_COMMD6.F1 refseq_COMMD6.R1 147 186
  • NCBIGene 36.3 170622
  • Single exon skipping, size difference: 39
  • Exclusion in 5'UTR
  • Reference transcript: NM_203497

COP1

  • refseq_COP1.F1 refseq_COP1.R1 333 396
  • NCBIGene 36.3 114769
  • Single exon skipping, size difference: 63
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001017534

  • pfam CARD 88aa 7e-19 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.

COPE

  • refseq_COPE.F1 refseq_COPE.R1 222 378
  • NCBIGene 36.3 11316
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007263

  • Changed! cd TPR 83aa 0.007 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • Changed! pfam Coatomer_E 291aa 1e-109 in ref transcript
    • Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex.
  • Changed! pfam Coatomer_E 239aa 5e-75 in modified transcript

COPS8

  • refseq_COPS8.F1 refseq_COPS8.R1 191 256
  • NCBIGene 36.3 10920
  • Single exon skipping, size difference: 65
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006710

  • Changed! pfam PCI_Csn8 175aa 3e-60 in ref transcript
    • COP9 signalosome, subunit CSN8. This PCI_Csn8 domain is conserved from plants to humans. It is a signature protein motif found in components of CSN (COP9 signalosome). It functions as a structural scaffold for subunit-subunit interactions within the complex and is a key regulator of photomorphogenic development.

COQ6

  • refseq_COQ6.F1 refseq_COQ6.R1 168 227
  • NCBIGene 36.3 51004
  • Single exon skipping, size difference: 59
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_182476

  • Changed! TIGR COQ6 428aa 0.0 in ref transcript
    • This model represents the monooxygenase responsible for the 4-hydroxylateion of the phenol ring in the aerobic biosynthesis of ubiquinone.
  • Changed! PRK PRK08850 390aa 3e-61 in ref transcript
    • 2-octaprenyl-6-methoxyphenol hydroxylase; Validated.
  • Changed! TIGR COQ6 67aa 2e-14 in modified transcript

ENOX2

  • refseq_COVA1.F2 refseq_COVA1.R2 184 273
  • NCBIGene 36.3 10495
  • Single exon skipping, size difference: 89
  • Exclusion of the protein initiation site
  • Reference transcript: NM_182314

  • cd RRM 58aa 1e-06 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam RRM_1 58aa 3e-07 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
  • COG COG0724 79aa 0.001 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

CPB2

  • refseq_CPB2.F1 refseq_CPB2.R1 100 211
  • NCBIGene 36.3 1361
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001872

  • Changed! cd M14_CPB2 302aa 1e-170 in ref transcript
    • Peptidase M14 Carboxypeptidase (CP) B2 (CPB2, also known as plasma carboxypeptidase B, carboxypeptidase U, and CPU), belongs to the carboxpeptidase A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPB2 enzyme displays B-like activity; it only cleaves the basic residues lysine or arginine. It is produced and secreted by the liver as the inactive precursor, procarboxypeptidase U or PCPB2, commonly referred to as thrombin-activatable fibrinolysis inhibitor (TAFI). It circulates in plasma as a zymogen bound to plasminogen, and the active enzyme, TAFIa, inhibits fibrinolysis. It is highly regulated, increased TAFI concentrations are thought to increase the risk of thrombosis and coronary artery disease by reducing fibrinolytic activity while low TAFI levels have been correlated with chronic liver disease.
  • Changed! smart Zn_pept 283aa 2e-92 in ref transcript
    • Zn_pept.
  • pfam Propep_M14 78aa 2e-20 in ref transcript
    • Carboxypeptidase activation peptide. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase, and is responsible for modulation of folding and activity of the pro-enzyme.
  • Changed! COG COG2866 310aa 3e-08 in ref transcript
    • Predicted carboxypeptidase [Amino acid transport and metabolism].
  • Changed! cd M14_CPB2 265aa 1e-145 in modified transcript
  • Changed! smart Zn_pept 246aa 3e-76 in modified transcript

CPEB2

  • refseq_CPEB2.F2 refseq_CPEB2.R2 316 406
  • NCBIGene 36.3 132864
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182485

  • cd RRM 88aa 1e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 79aa 0.002 in ref transcript
  • smart RRM_2 63aa 9e-08 in ref transcript
    • RNA recognition motif.
  • Changed! COG COG0724 176aa 1e-04 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! COG COG0724 272aa 3e-04 in modified transcript

CPM

  • refseq_CPM.F1 refseq_CPM.R1 102 120
  • NCBIGene 36.3 1368
  • Alternative 5-prime, size difference: 18
  • Exclusion in 5'UTR
  • Reference transcript: NM_198320

  • cd M14_CPM 376aa 0.0 in ref transcript
    • Peptidase M14 Carboxypeptidase (CP) M (CPM) belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPM is an extracellular glycoprotein, bound to cell membranes via a glycosyl-phosphatidylinositol on the C-terminus of the protein. It specifically removes C-terminal basic residues such as lysine and arginine from peptides and proteins. The highest levels of CPM have been found in human lung and placenta, but significant amounts are present in kidney, blood vessels, intestine, brain, and peripheral nerves. CPM has also been found in soluble form in various body fluids, including amniotic fluid, seminal plasma and urine. Due to its wide distribution in a variety of tissues, it is believed that it plays an important role in the control of peptide hormones and growth factor activity on the cell surface and in the membrane-localized degradation of extracellular proteins, for example it hydrolyses the C-terminal arginine of epidermal growth factor (EGF) resulting in des-Arg-EGF which binds to the EGF receptor (EGFR) with an equal or greater affinity than native EGF. CPM is a required processing enzyme that generates specific agonists for the B1 receptor.
  • smart Zn_pept 273aa 8e-69 in ref transcript
    • Zn_pept.
  • COG COG2866 138aa 1e-04 in ref transcript
    • Predicted carboxypeptidase [Amino acid transport and metabolism].
  • COG PcaH 71aa 0.002 in ref transcript
    • Protocatechuate 3,4-dioxygenase beta subunit [Secondary metabolites biosynthesis, transport, and catabolism].

CPNE1

  • refseq_CPNE1.F1 refseq_CPNE1.R1 152 295
  • NCBIGene 36.3 8904
  • Single exon skipping, size difference: 143
  • Exclusion in 5'UTR
  • Reference transcript: NM_152930

  • cd vWA_copine_like 259aa 4e-88 in ref transcript
    • VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
  • cd C2 109aa 7e-15 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd C2 99aa 2e-10 in ref transcript
  • pfam Copine 148aa 4e-65 in ref transcript
    • Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth.
  • pfam C2 85aa 3e-14 in ref transcript
    • C2 domain.
  • smart C2 97aa 3e-10 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • COG COG5038 94aa 3e-07 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].

CPNE1

  • refseq_CPNE1.F3 refseq_CPNE1.R3 180 212
  • NCBIGene 36.3 8904
  • Alternative 3-prime, size difference: 32
  • Inclusion in 5'UTR
  • Reference transcript: NM_152931

  • cd vWA_copine_like 259aa 4e-88 in ref transcript
    • VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
  • cd C2 109aa 7e-15 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd C2 99aa 2e-10 in ref transcript
  • pfam Copine 148aa 4e-65 in ref transcript
    • Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth.
  • pfam C2 85aa 3e-14 in ref transcript
    • C2 domain.
  • smart C2 97aa 3e-10 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • COG COG5038 94aa 3e-07 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].

CPNE7

  • refseq_CPNE7.F1 refseq_CPNE7.R1 140 365
  • NCBIGene 36.3 27132
  • Multiple exon skipping, size difference: 225
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014427

  • cd vWA_copine_like 258aa 4e-84 in ref transcript
    • VWA Copine: Copines are phospholipid-binding proteins originally identified in paramecium. They are found in human and orthologues have been found in C. elegans and Arabidopsis Thaliana. None have been found in D. Melanogaster or S. Cereviciae. Phylogenetic distribution suggests that copines have been lost in some eukaryotes. No functional properties have been assigned to the VWA domains present in copines. The members of this subgroup contain a functional MIDAS motif based on their preferential binding to magnesium and manganese. However, the MIDAS motif is not totally conserved, in most cases the MIDAS consists of the sequence DxTxS instead of the motif DxSxS that is found in most cases. The C2 domains present in copines mediate phospholipid binding.
  • Changed! cd C2 98aa 5e-11 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd C2 87aa 9e-09 in ref transcript
  • pfam Copine 148aa 2e-70 in ref transcript
    • Copine. This family represents a conserved region approximately 180 residues long within eukaryotic copines. Copines are Ca(2+)-dependent phospholipid-binding proteins that are thought to be involved in membrane-trafficking, and may also be involved in cell division and growth.
  • Changed! pfam C2 87aa 2e-11 in ref transcript
    • C2 domain.
  • pfam C2 86aa 1e-10 in ref transcript
  • Changed! COG COG5038 151aa 6e-04 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
  • Changed! cd C2 100aa 3e-11 in modified transcript
  • Changed! smart C2 98aa 2e-11 in modified transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • Changed! COG COG5038 112aa 0.001 in modified transcript

CPSF3L

  • refseq_CPSF3L.F2 refseq_CPSF3L.R2 211 277
  • NCBIGene 36.2 54973
  • Alternative 3-prime, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017871

  • Changed! TIGR arCOG00543 443aa 5e-86 in ref transcript
    • This family of proteins is universal in the archaea and consistsof an N-terminal type-1 KH-domain (pfam00013) a central beta-lactamase-domain (pfam00753) with a C-terminal motif associated with RNA metabolism (pfam07521). KH-domains are associated with RNA-binding, so taken together, this protein is a likely metal-dependent RNAase. This family was defined in as arCOG01782.
  • Changed! COG YSH1 443aa 1e-94 in ref transcript
    • Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].
  • Changed! TIGR arCOG00543 421aa 9e-80 in modified transcript
  • Changed! COG YSH1 421aa 3e-83 in modified transcript

CPVL

  • refseq_CPVL.F2 refseq_CPVL.R2 185 273
  • NCBIGene 36.3 54504
  • Alternative 5-prime, size difference: 88
  • Exclusion in 5'UTR
  • Reference transcript: NM_019029

  • pfam Peptidase_S10 400aa 2e-97 in ref transcript
    • Serine carboxypeptidase.
  • COG COG2939 383aa 1e-35 in ref transcript
    • Carboxypeptidase C (cathepsin A) [Amino acid transport and metabolism].

CR2

  • refseq_CR2.F1 refseq_CR2.R1 145 322
  • NCBIGene 36.3 1380
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001006658

  • cd CCP 58aa 1e-09 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • Changed! cd CCP 58aa 9e-09 in ref transcript
  • cd CCP 68aa 1e-08 in ref transcript
  • cd CCP 58aa 2e-08 in ref transcript
  • cd CCP 57aa 6e-08 in ref transcript
  • cd CCP 61aa 6e-08 in ref transcript
  • cd CCP 58aa 9e-08 in ref transcript
  • cd CCP 57aa 2e-07 in ref transcript
  • cd CCP 61aa 2e-07 in ref transcript
  • cd CCP 68aa 6e-07 in ref transcript
  • cd CCP 57aa 4e-06 in ref transcript
  • cd CCP 57aa 1e-05 in ref transcript
  • cd CCP 39aa 5e-05 in ref transcript
  • smart CCP 57aa 1e-11 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 67aa 6e-11 in ref transcript
  • Changed! smart CCP 57aa 1e-10 in ref transcript
  • smart CCP 57aa 8e-10 in ref transcript
  • smart CCP 61aa 1e-09 in ref transcript
  • smart CCP 57aa 2e-09 in ref transcript
  • smart CCP 51aa 7e-09 in ref transcript
  • pfam Sushi 60aa 9e-09 in ref transcript
    • Sushi domain (SCR repeat).
  • smart CCP 56aa 2e-08 in ref transcript
  • smart CCP 67aa 2e-08 in ref transcript
  • smart CCP 56aa 6e-07 in ref transcript
  • pfam Sushi 56aa 1e-06 in ref transcript
  • smart CCP 57aa 1e-05 in ref transcript
  • pfam Sushi 57aa 5e-04 in ref transcript
  • Changed! cd CCP 58aa 0.010 in modified transcript

CRAT

  • refseq_CRAT.F2 refseq_CRAT.R2 139 385
  • NCBIGene 36.2 1384
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 246
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000755

  • Changed! pfam Carn_acyltransf 582aa 0.0 in ref transcript
    • Choline/Carnitine o-acyltransferase.
  • Changed! pfam Carn_acyltransf 250aa 1e-107 in modified transcript
  • Changed! pfam Carn_acyltransf 248aa 2e-94 in modified transcript

CREB1

  • refseq_CREB1.F1 refseq_CREB1.R1 151 193
  • NCBIGene 36.3 1385
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_134442

  • pfam pKID 29aa 3e-10 in ref transcript
    • pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
  • pfam bZIP_1 57aa 2e-06 in ref transcript
    • bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.

CRELD1

  • refseq_CRELD1.F1 refseq_CRELD1.R1 168 362
  • NCBIGene 36.3 78987
  • Single exon skipping, size difference: 194
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031717

  • smart EGF_CA 32aa 0.003 in ref transcript
    • Calcium-binding EGF-like domain.
  • pfam EGF_CA 28aa 0.006 in ref transcript
    • Calcium binding EGF domain.

CREM

  • refseq_CREM.F1 refseq_CREM.R1 306 495
  • NCBIGene 36.3 1390
  • Single exon skipping, size difference: 189
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182769

  • Changed! pfam pKID 41aa 6e-11 in ref transcript
    • pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
  • pfam bZIP_1 53aa 8e-07 in ref transcript
    • bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
  • Changed! pfam pKID 37aa 4e-10 in modified transcript

CREM

  • refseq_CREM.F2 refseq_CREM.R2 106 204
  • NCBIGene 36.3 1390
  • Single exon skipping, size difference: 98
  • Exclusion of the protein initiation site
  • Reference transcript: NM_183013

  • pfam pKID 41aa 2e-10 in ref transcript
    • pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
  • pfam bZIP_1 53aa 1e-04 in ref transcript
    • bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.

CREM

  • refseq_CREM.F5 refseq_CREM.R4 185 221
  • NCBIGene 36.3 1390
  • Single exon skipping, size difference: 36
  • Exclusion of the protein initiation site
  • Reference transcript: NM_182723

  • pfam bZIP_1 53aa 3e-04 in ref transcript
    • bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.

CREM

  • refseq_CREM.F7 refseq_CREM.R5 153 251
  • NCBIGene 36.3 1390
  • Single exon skipping, size difference: 98
  • Exclusion of the protein initiation site
  • Reference transcript: NM_181571

  • pfam pKID 41aa 9e-11 in ref transcript
    • pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
  • pfam bZIP_1 53aa 4e-06 in ref transcript
    • bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.

CREM

  • refseq_CREM.F8 refseq_CREM.R6 98 500
  • NCBIGene 36.3 1390
  • Alternative 3-prime, size difference: 402
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_181571

  • pfam pKID 41aa 9e-11 in ref transcript
    • pKID domain. CBP and P300 bind to the pKID (phosphorylated kinase-inducible-domain) domain of CREB.
  • Changed! pfam bZIP_1 53aa 4e-06 in ref transcript
    • bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.
  • Changed! pfam bZIP_1 53aa 3e-06 in modified transcript

CRISP1

  • refseq_CRISP1.F1 refseq_CRISP1.R1 258 347
  • NCBIGene 36.3 167
  • Single exon skipping, size difference: 89
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001131

  • cd SCP_CRISP 139aa 3e-51 in ref transcript
    • SCP_CRISP: SCP-like extracellular protein domain, CRISP-like sub-family. The wider family of SCP containing proteins includes plant pathogenesis-related protein 1 (PR-1), CRISPs, mammalian cysteine-rich secretory proteins, which combine SCP with a C-terminal cysteine rich domain, and allergen 5 from vespid venom. Involvement of CRISP in response to pathogens, fertilization, and sperm maturation have been proposed. One member, Tex31 from the venom duct of Conus textile, has been shown to possess proteolytic activity sensitive to serine protease inhibitors. SCP has also been proposed to be a Ca++ chelating serine protease. The Ca++-chelating function would fit with various signaling processes that members of this family, such as the CRISPs, are involved in, and is supported by sequence and structural evidence of a conserved pocket containing two histidines and a glutamate. It also may explain how helothermine, a toxic peptide secreted by the beaded lizard, blocks Ca++ transporting ryanodine receptors. One member, DE or CRISP-1, has been shown to mediate gamete fusion by binding to the egg surface; a sequence motif in the SCP domain plays a role in that binding.
  • smart SCP 139aa 1e-31 in ref transcript
    • SCP / Tpx-1 / Ag5 / PR-1 / Sc7 family of extracellular domains. Human glioma pathogenesis-related protein GliPR and the plant pathogenesis-related protein represent functional links between plant defense systems and human immune system. This family has no known function.
  • Changed! pfam Crisp 55aa 3e-18 in ref transcript
    • Crisp. This domain is found on Crisp proteins which contain pfam00188 and has been termed the Crisp domain. It is found in the mammalian reproductive tract and the venom of reptiles, and has been shown to regulate ryanodine receptor Ca2+ signalling. It contains 10 conserved cysteines which are all involved in disulphide bonds and is structurally related to the ion channel inhibitor toxins BgK and ShK.
  • Changed! cd SCP_CRISP 140aa 1e-51 in modified transcript

CRK

  • refseq_CRK.F1 refseq_CRK.R1 156 326
  • NCBIGene 36.3 1398
  • Alternative 5-prime, size difference: 170
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016823

  • cd SH2 106aa 2e-17 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • cd SH3 54aa 1e-12 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! cd SH3 55aa 0.009 in ref transcript
  • smart SH2 97aa 9e-18 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
  • smart SH3 58aa 1e-14 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! pfam SH3_2 55aa 5e-11 in ref transcript
    • Variant SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.

MED23

  • refseq_CRSP3.F1 refseq_CRSP3.R1 102 120
  • NCBIGene 36.3 9439
  • Single exon skipping, size difference: 18
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_004830

CRTC1

  • refseq_CRTC1.F2 refseq_CRTC1.R2 100 187
  • NCBIGene 36.2 23373
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015321

CS

  • refseq_CS.F1 refseq_CS.R1 149 232
  • NCBIGene 36.3 1431
  • Single exon skipping, size difference: 83
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004077

  • Changed! cd ScCit1-2_like 427aa 0.0 in ref transcript
    • Saccharomyces cerevisiae (Sc) citrate synthases Cit1-2_like. Citrate synthases (CS) catalyzes the condensation of acetyl coenzyme A (AcCoA) with oxaloacetate (OAA) to form citrate and coenzyme A (CoA), the first step in the citric acid cycle (TCA or Krebs cycle). Some CS proteins function as 2-methylcitrate synthase (2MCS). 2MCS catalyzes the condensation of propionyl-coenzyme A (PrCoA) and OAA to form 2-methylcitrate and CoA during propionate metabolism. The overall CS reaction is thought to proceed through three partial reactions and involves both closed and open conformational forms of the enzyme: a) the carbanion or equivalent is generated from AcCoA by base abstraction of a proton, b) the nucleophilic attack of this carbanion on OAA to generate citryl-CoA, and c) the hydrolysis of citryl-CoA to produce citrate and CoA. There are two types of CSs: type I CS and type II CSs. Type I CSs are found in eukarya, gram-positive bacteria, archaea, and in some gram-negative bacteria and are homodimers with both subunits participating in the active site. Type II CSs are unique to gram-negative bacteria and are homohexamers of identical subunits (approximated as a trimer of dimers). ScCit1 is a nuclear-encoded mitochondrial CS with highly specificity for AcCoA. In addition to its CS function, ScCit1 plays a part in the construction of the TCA cycle metabolon. Yeast cells deleted for Cit1 are hyper-susceptible to apoptosis induced by heat and aging stress. ScCit2 is a peroxisomal CS involved in the glyoxylate cycle; in addition to having activity with AcCoA, it may have activity with PrCoA. Chicken and pig heart CS, two Arabidopsis thaliana (Ath) CSs, CSY4 and -5, and Aspergillus niger (An) CS also belong to this group. Ath CSY4 has a mitochondrial targeting sequence; AthCSY5 has no identifiable targeting sequence. AnCS encoded by the citA gene has both an N-terminal mitochondrial import signal and a C-terminal peroxisiomal target sequence; it is not known if both these signals are functional in vivo. This group contains proteins which functions exclusively as either a CS or a 2MCS, as well as those with relaxed specificity which have dual functions as both a CS and a 2MCS.
  • Changed! TIGR cit_synth_euk 428aa 0.0 in ref transcript
    • This model includes both mitochondrial and peroxisomal forms of citrate synthase. Citrate synthase is the entry point to the TCA cycle from acetyl-CoA. Peroxisomal forms, such as a member from yeast (recognized by the C-terminal targeting motif SKL) act in the glyoxylate cycle. Eukaryotic homologs excluded by the high trusted cutoff of this model include a Tetrahymena thermophila citrate synthase that doubles as a filament protein, a putative citrate synthase from Plasmodium falciparum (no TCA cycle), and a methylcitrate synthase from Aspergillus nidulans.
  • Changed! PRK PRK09569 428aa 1e-148 in ref transcript
    • type I citrate synthase; Reviewed.

CSAG1

  • refseq_CSAG1.F1 refseq_CSAG1.R1 100 316
  • NCBIGene 36.3 158511
  • Single exon skipping, size difference: 216
  • Exclusion in 5'UTR
  • Reference transcript: NM_153478

CSDE1

  • refseq_CSDE1.F1 refseq_CSDE1.R1 260 353
  • NCBIGene 36.3 7812
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001007553

  • cd CSP_CDS 61aa 6e-08 in ref transcript
    • Cold-Shock Protein (CSP) contains an S1-like cold-shock domain (CSD) that is found in eukaryotes, prokaryotes, and archaea. CSP's include the major cold-shock proteins CspA and CspB in bacteria and the eukaryotic gene regulatory factor Y-box protein. CSP expression is up-regulated by an abrupt drop in growth temperature. CSP's are also expressed under normal condition at lower level. The function of cold-shock proteins is not fully understood. They preferentially bind poly-pyrimidine region of single-stranded RNA and DNA. CSP's are thought to bind mRNA and regulate ribosomal translation, mRNA degradation, and the rate of transcription termination. The human Y-box protein, which contains a CSD, regulates transcription and translation of genes that contain the Y-box sequence in their promoters. This specific ssDNA-binding properties of CSD are required for the binding of Y-box protein to the promoter's Y-box sequence, thereby regulating transcription.
  • cd CSP_CDS 50aa 1e-06 in ref transcript
  • cd CSP_CDS 53aa 0.003 in ref transcript
  • Changed! cd CSP_CDS 54aa 0.010 in ref transcript
  • pfam CSD 63aa 3e-13 in ref transcript
    • 'Cold-shock' DNA-binding domain.
  • smart CSP 64aa 5e-12 in ref transcript
    • Cold shock protein domain. RNA-binding domain that functions as a RNA-chaperone in bacteria and is involved in regulating translation in eukaryotes. Contains sub-family of RNA-binding domains in the Rho transcription termination factor.
  • pfam CSD 65aa 4e-11 in ref transcript
  • Changed! pfam CSD 62aa 3e-10 in ref transcript
  • smart CSP 64aa 9e-10 in ref transcript
  • TIGR 3_prime_RNase 174aa 3e-04 in ref transcript
    • This model is defined to identify a pair of paralogous 3-prime exoribonucleases in E. coli, plus the set of proteins apparently orthologous to one or the other in other eubacteria. VacB was characterized originally as required for the expression of virulence genes, but is now recognized as the exoribonuclease RNase R (Rnr). Its paralog in E. coli and H. influenzae is designated exoribonuclease II (Rnb). Both are involved in the degradation of mRNA, and consequently have strong pleiotropic effects that may be difficult to disentangle. Both these proteins share domain-level similarity (RNB, S1) with a considerable number of other proteins, and full-length similarity scoring below the trusted cutoff to proteins associated with various phenotypes but uncertain biochemistry; it may be that these latter proteins are also 3-prime exoribonucleases.
  • COG CspC 64aa 7e-06 in ref transcript
    • Cold shock proteins [Transcription].
  • Changed! smart CSP 61aa 3e-10 in modified transcript

CSF3R

  • refseq_CSF3R.F1 refseq_CSF3R.R1 170 251
  • NCBIGene 36.3 1441
  • Alternative 3-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_156039

  • cd FN3 81aa 1e-04 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 91aa 0.002 in ref transcript
  • cd FN3 79aa 0.007 in ref transcript
  • pfam Lep_receptor_Ig 90aa 5e-27 in ref transcript
    • Ig-like C2-type domain. This domain is a ligand-binding immunoglobulin-like domain. The two cysteine residues form a disulphide bridge.
  • smart FN3 81aa 7e-06 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam fn3 85aa 0.002 in ref transcript
    • Fibronectin type III domain.

CSF3R

  • refseq_CSF3R.F4 refseq_CSF3R.R4 122 216
  • NCBIGene 36.3 1441
  • Single exon skipping, size difference: 94
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_156039

  • cd FN3 81aa 1e-04 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 91aa 0.002 in ref transcript
  • cd FN3 79aa 0.007 in ref transcript
  • pfam Lep_receptor_Ig 90aa 5e-27 in ref transcript
    • Ig-like C2-type domain. This domain is a ligand-binding immunoglobulin-like domain. The two cysteine residues form a disulphide bridge.
  • smart FN3 81aa 7e-06 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam fn3 85aa 0.002 in ref transcript
    • Fibronectin type III domain.

CSMD3

  • refseq_CSMD3.F1 refseq_CSMD3.R1 100 412
  • NCBIGene 36.3 114788
  • Single exon skipping, size difference: 312
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198123

  • cd CUB 108aa 3e-26 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 108aa 3e-25 in ref transcript
  • cd CUB 108aa 2e-23 in ref transcript
  • cd CUB 107aa 4e-23 in ref transcript
  • cd CUB 108aa 6e-23 in ref transcript
  • cd CUB 109aa 1e-21 in ref transcript
  • cd CUB 107aa 1e-20 in ref transcript
  • cd CUB 108aa 7e-20 in ref transcript
  • cd CUB 109aa 8e-18 in ref transcript
  • cd CUB 109aa 1e-17 in ref transcript
  • cd CUB 107aa 4e-17 in ref transcript
  • cd CUB 108aa 4e-16 in ref transcript
  • Changed! cd CUB 85aa 2e-14 in ref transcript
  • cd CCP 59aa 6e-09 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CCP 55aa 7e-09 in ref transcript
  • cd CCP 55aa 1e-08 in ref transcript
  • cd CCP 55aa 2e-08 in ref transcript
  • cd CCP 56aa 2e-08 in ref transcript
  • cd CCP 57aa 4e-08 in ref transcript
  • cd CCP 54aa 4e-08 in ref transcript
  • cd CCP 54aa 4e-08 in ref transcript
  • cd CCP 55aa 6e-08 in ref transcript
  • cd CCP 54aa 2e-07 in ref transcript
  • cd CCP 55aa 4e-07 in ref transcript
  • cd CCP 59aa 5e-07 in ref transcript
  • cd CCP 55aa 5e-07 in ref transcript
  • cd CUB 108aa 1e-06 in ref transcript
  • cd CCP 57aa 1e-06 in ref transcript
  • cd CCP 57aa 1e-06 in ref transcript
  • cd CCP 59aa 2e-06 in ref transcript
  • cd CCP 54aa 3e-06 in ref transcript
  • cd CCP 58aa 8e-06 in ref transcript
  • cd CCP 56aa 9e-06 in ref transcript
  • cd CCP 62aa 2e-05 in ref transcript
  • cd CCP 59aa 3e-05 in ref transcript
  • cd CCP 58aa 3e-05 in ref transcript
  • cd CCP 59aa 4e-05 in ref transcript
  • cd CCP 57aa 1e-04 in ref transcript
  • cd CCP 60aa 3e-04 in ref transcript
  • cd CCP 58aa 4e-04 in ref transcript
  • cd CCP 59aa 4e-04 in ref transcript
  • cd CCP 58aa 0.001 in ref transcript
  • pfam CUB 106aa 3e-26 in ref transcript
    • CUB domain.
  • pfam CUB 106aa 1e-25 in ref transcript
  • pfam CUB 106aa 4e-23 in ref transcript
  • pfam CUB 106aa 4e-23 in ref transcript
  • pfam CUB 101aa 3e-22 in ref transcript
  • pfam CUB 106aa 8e-21 in ref transcript
  • pfam CUB 107aa 4e-20 in ref transcript
  • pfam CUB 109aa 9e-20 in ref transcript
  • pfam CUB 106aa 5e-19 in ref transcript
  • pfam CUB 108aa 7e-18 in ref transcript
  • smart CUB 98aa 2e-17 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • pfam CUB 106aa 5e-16 in ref transcript
  • pfam CUB 102aa 5e-14 in ref transcript
  • smart CCP 54aa 7e-10 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 58aa 3e-09 in ref transcript
  • smart CCP 54aa 6e-09 in ref transcript
  • smart CCP 55aa 7e-09 in ref transcript
  • smart CCP 56aa 1e-08 in ref transcript
  • smart CCP 54aa 1e-08 in ref transcript
  • smart CCP 54aa 1e-08 in ref transcript
  • smart CCP 55aa 6e-08 in ref transcript
  • smart CCP 54aa 8e-08 in ref transcript
  • pfam CUB 106aa 1e-07 in ref transcript
  • smart CCP 54aa 1e-07 in ref transcript
  • smart CCP 56aa 1e-07 in ref transcript
  • pfam Sushi 58aa 1e-07 in ref transcript
    • Sushi domain (SCR repeat).
  • smart CCP 54aa 2e-07 in ref transcript
  • smart CCP 61aa 2e-07 in ref transcript
  • smart CCP 54aa 3e-07 in ref transcript
  • smart CCP 54aa 4e-07 in ref transcript
  • smart CCP 56aa 6e-07 in ref transcript
  • smart CCP 58aa 1e-06 in ref transcript
  • smart CCP 58aa 1e-06 in ref transcript
  • smart CCP 56aa 3e-06 in ref transcript
  • smart CCP 57aa 4e-06 in ref transcript
  • pfam Sushi 58aa 7e-06 in ref transcript
  • pfam Sushi 57aa 3e-05 in ref transcript
  • smart CCP 58aa 5e-05 in ref transcript
  • smart CCP 56aa 7e-05 in ref transcript
  • smart CCP 59aa 2e-04 in ref transcript
  • smart CCP 58aa 4e-04 in ref transcript
  • smart CCP 58aa 0.001 in ref transcript
  • Changed! cd CUB 86aa 2e-14 in modified transcript

CSMD3

  • refseq_CSMD3.F2 refseq_CSMD3.R2 155 350
  • NCBIGene 36.3 114788
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198123

  • cd CUB 108aa 3e-26 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 108aa 3e-25 in ref transcript
  • cd CUB 108aa 2e-23 in ref transcript
  • cd CUB 107aa 4e-23 in ref transcript
  • cd CUB 108aa 6e-23 in ref transcript
  • cd CUB 109aa 1e-21 in ref transcript
  • cd CUB 107aa 1e-20 in ref transcript
  • cd CUB 108aa 7e-20 in ref transcript
  • cd CUB 109aa 8e-18 in ref transcript
  • cd CUB 109aa 1e-17 in ref transcript
  • cd CUB 107aa 4e-17 in ref transcript
  • cd CUB 108aa 4e-16 in ref transcript
  • cd CUB 85aa 2e-14 in ref transcript
  • cd CCP 59aa 6e-09 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CCP 55aa 7e-09 in ref transcript
  • cd CCP 55aa 1e-08 in ref transcript
  • cd CCP 55aa 2e-08 in ref transcript
  • cd CCP 56aa 2e-08 in ref transcript
  • cd CCP 57aa 4e-08 in ref transcript
  • cd CCP 54aa 4e-08 in ref transcript
  • cd CCP 54aa 4e-08 in ref transcript
  • cd CCP 55aa 6e-08 in ref transcript
  • cd CCP 54aa 2e-07 in ref transcript
  • cd CCP 55aa 4e-07 in ref transcript
  • cd CCP 59aa 5e-07 in ref transcript
  • cd CCP 55aa 5e-07 in ref transcript
  • cd CUB 108aa 1e-06 in ref transcript
  • cd CCP 57aa 1e-06 in ref transcript
  • cd CCP 57aa 1e-06 in ref transcript
  • cd CCP 59aa 2e-06 in ref transcript
  • cd CCP 54aa 3e-06 in ref transcript
  • cd CCP 58aa 8e-06 in ref transcript
  • cd CCP 56aa 9e-06 in ref transcript
  • Changed! cd CCP 62aa 2e-05 in ref transcript
  • cd CCP 59aa 3e-05 in ref transcript
  • cd CCP 58aa 3e-05 in ref transcript
  • cd CCP 59aa 4e-05 in ref transcript
  • cd CCP 57aa 1e-04 in ref transcript
  • cd CCP 60aa 3e-04 in ref transcript
  • cd CCP 58aa 4e-04 in ref transcript
  • cd CCP 59aa 4e-04 in ref transcript
  • cd CCP 58aa 0.001 in ref transcript
  • pfam CUB 106aa 3e-26 in ref transcript
    • CUB domain.
  • pfam CUB 106aa 1e-25 in ref transcript
  • pfam CUB 106aa 4e-23 in ref transcript
  • pfam CUB 106aa 4e-23 in ref transcript
  • pfam CUB 101aa 3e-22 in ref transcript
  • pfam CUB 106aa 8e-21 in ref transcript
  • pfam CUB 107aa 4e-20 in ref transcript
  • pfam CUB 109aa 9e-20 in ref transcript
  • pfam CUB 106aa 5e-19 in ref transcript
  • pfam CUB 108aa 7e-18 in ref transcript
  • smart CUB 98aa 2e-17 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • pfam CUB 106aa 5e-16 in ref transcript
  • pfam CUB 102aa 5e-14 in ref transcript
  • smart CCP 54aa 7e-10 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 58aa 3e-09 in ref transcript
  • smart CCP 54aa 6e-09 in ref transcript
  • smart CCP 55aa 7e-09 in ref transcript
  • smart CCP 56aa 1e-08 in ref transcript
  • smart CCP 54aa 1e-08 in ref transcript
  • smart CCP 54aa 1e-08 in ref transcript
  • smart CCP 55aa 6e-08 in ref transcript
  • smart CCP 54aa 8e-08 in ref transcript
  • pfam CUB 106aa 1e-07 in ref transcript
  • smart CCP 54aa 1e-07 in ref transcript
  • smart CCP 56aa 1e-07 in ref transcript
  • pfam Sushi 58aa 1e-07 in ref transcript
    • Sushi domain (SCR repeat).
  • smart CCP 54aa 2e-07 in ref transcript
  • Changed! smart CCP 61aa 2e-07 in ref transcript
  • smart CCP 54aa 3e-07 in ref transcript
  • smart CCP 54aa 4e-07 in ref transcript
  • smart CCP 56aa 6e-07 in ref transcript
  • smart CCP 58aa 1e-06 in ref transcript
  • smart CCP 58aa 1e-06 in ref transcript
  • smart CCP 56aa 3e-06 in ref transcript
  • smart CCP 57aa 4e-06 in ref transcript
  • pfam Sushi 58aa 7e-06 in ref transcript
  • pfam Sushi 57aa 3e-05 in ref transcript
  • smart CCP 58aa 5e-05 in ref transcript
  • smart CCP 56aa 7e-05 in ref transcript
  • smart CCP 59aa 2e-04 in ref transcript
  • smart CCP 58aa 4e-04 in ref transcript
  • smart CCP 58aa 0.001 in ref transcript

CSNK1A1

  • refseq_CSNK1A1.F2 refseq_CSNK1A1.R2 138 222
  • NCBIGene 36.3 1452
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025105

  • Changed! cd S_TKc 292aa 1e-31 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! pfam Pkinase 240aa 2e-30 in ref transcript
    • Protein kinase domain.
  • Changed! COG SPS1 239aa 5e-22 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd S_TKc 264aa 1e-34 in modified transcript
  • Changed! pfam Pkinase 212aa 3e-32 in modified transcript
  • Changed! COG SPS1 211aa 2e-25 in modified transcript

CSNK1D

  • refseq_CSNK1D.F2 refseq_CSNK1D.R2 238 302
  • NCBIGene 36.3 1453
  • Single exon skipping, size difference: 64
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001893

  • cd S_TKc 264aa 7e-42 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 253aa 4e-42 in ref transcript
    • Protein kinase domain.
  • COG SPS1 277aa 1e-26 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

CSNK1G3

  • refseq_CSNK1G3.F1 refseq_CSNK1G3.R1 122 146
  • NCBIGene 36.3 1456
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001044723

  • cd S_TKc 265aa 6e-37 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 257aa 3e-37 in ref transcript
    • Protein kinase domain.
  • COG SPS1 325aa 4e-27 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

CSNK1G3

  • refseq_CSNK1G3.F3 refseq_CSNK1G3.R3 333 405
  • NCBIGene 36.3 1456
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001044723

  • cd S_TKc 265aa 6e-37 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 257aa 3e-37 in ref transcript
    • Protein kinase domain.
  • Changed! COG SPS1 325aa 4e-27 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! COG SPS1 349aa 2e-27 in modified transcript

CSNK2A1

  • refseq_CSNK2A1.F2 refseq_CSNK2A1.R2 225 342
  • NCBIGene 36.3 1457
  • Single exon skipping, size difference: 117
  • Exclusion in 5'UTR
  • Reference transcript: NM_177559

  • cd S_TKc 287aa 4e-46 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 286aa 1e-57 in ref transcript
    • Protein kinase domain.
  • COG SPS1 287aa 3e-26 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

CST11

  • refseq_CST11.F1 refseq_CST11.R1 227 332
  • NCBIGene 36.3 140880
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130794

  • Changed! cd CY 92aa 3e-09 in ref transcript
    • Cystatin-like domain; Cystatins are a family of cysteine protease inhibitors that occur mainly as single domain proteins. However some extracellular proteins such as kininogen, His-rich glycoprotein and fetuin also contain these domains.
  • Changed! smart CY 83aa 5e-11 in ref transcript
    • Cystatin-like domain. Cystatins are a family of cysteine protease inhibitors that occur mainly as single domain proteins. However some extracellular proteins such as kininogen, His-rich glycoprotein and fetuin also contain these domains.
  • Changed! smart CY 26aa 0.003 in modified transcript

CSTF1

  • refseq_CSTF1.F2 refseq_CSTF1.R2 117 219
  • NCBIGene 36.3 1477
  • Alternative 5-prime, size difference: 102
  • Exclusion in 5'UTR
  • Reference transcript: NM_001033522

  • cd WD40 325aa 1e-44 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam WD40 36aa 2e-06 in ref transcript
    • WD domain, G-beta repeat.
  • pfam WD40 30aa 7e-04 in ref transcript
  • COG COG2319 345aa 5e-21 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

CSTF1

  • refseq_CSTF1.F3 refseq_CSTF1.R1 149 324
  • NCBIGene 36.3 1477
  • Alternative 5-prime, size difference: 175
  • Exclusion in 5'UTR
  • Reference transcript: NM_001324

  • cd WD40 325aa 1e-44 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam WD40 36aa 2e-06 in ref transcript
    • WD domain, G-beta repeat.
  • pfam WD40 30aa 7e-04 in ref transcript
  • COG COG2319 345aa 5e-21 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

CTBP1

  • refseq_CTBP1.F1 refseq_CTBP1.R1 186 381
  • NCBIGene 36.3 1487
  • Single exon skipping, size difference: 195
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001328

  • Changed! pfam 2-Hacid_dh 291aa 2e-58 in ref transcript
    • D-isomer specific 2-hydroxyacid dehydrogenase, catalytic domain. This family represents the largest portion of the catalytic domain of 2-hydroxyacid dehydrogenases as the NAD binding domain is inserted within the structural domain.
  • Changed! COG LdhA 296aa 3e-61 in ref transcript
    • Lactate dehydrogenase and related dehydrogenases [Energy production and conversion / Coenzyme metabolism / General function prediction only].

CTDSPL

  • refseq_CTDSPL.F2 refseq_CTDSPL.R2 144 177
  • NCBIGene 36.3 10217
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008392

  • Changed! TIGR HIF-SF_euk 160aa 5e-72 in ref transcript
    • This domain is related to domains found in FCP1-like phosphatases (TIGR02250), and together both are detected by the pfam03031.
  • COG FCP1 176aa 7e-47 in ref transcript
    • TFIIF-interacting CTD phosphatases, including NLI-interacting factor [Transcription].
  • Changed! pfam NIF 165aa 6e-72 in modified transcript
    • NLI interacting factor-like phosphatase. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase. This region has been identified as the minimal phosphatase domain.

CTF8

  • refseq_CTF8.F1 refseq_CTF8.R1 149 207
  • NCBIGene 36.3 54921
  • Single exon skipping, size difference: 58
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039690

CTLA4

  • refseq_CTLA4.F2 refseq_CTLA4.R2 213 323
  • NCBIGene 36.3 1493
  • Single exon skipping, size difference: 110
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005214

  • smart IGv 74aa 2e-06 in ref transcript
    • Immunoglobulin V-Type.

CTNNBIP1

  • refseq_CTNNBIP1.F1 refseq_CTNNBIP1.R1 187 221
  • NCBIGene 36.3 56998
  • Single exon skipping, size difference: 34
  • Exclusion in 5'UTR
  • Reference transcript: NM_020248

CTSE

  • refseq_CTSE.F1 refseq_CTSE.R1 109 251
  • NCBIGene 36.3 1510
  • Single exon skipping, size difference: 142
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001910

  • Changed! cd Cathespin_E 316aa 1e-176 in ref transcript
    • Cathepsin E, non-lysosomal aspartic protease. Cathepsin E is an intracellular, non-lysosomal aspartic protease expressed in a variety of cells and tissues. The protease has proposed physiological roles in antigen presentation by the MHC class II system, in the biogenesis of the vasoconstrictor peptide endothelin, and in neurodegeneration associated with brain ischemia and aging. Cathepsin E is the only A1 aspartic protease that exists as a homodimer with a disulfide bridge linking the two monomers. Like many other aspartic proteases, it is synthesized as a zymogen which is catalytically inactive towards its natural substrates at neutral pH and which auto-activates in an acidic environment. The overall structure follows the general fold of aspartic proteases of the A1 family, it is composed of two structurally similar beta barrel lobes, each lobe contributing an aspartic acid residue to form a catalytic dyad that acts to cleave the substrate peptide bond. The catalytic Asp residues are contained in an Asp-Thr-Gly-Ser/thr motif in both N- and C-terminal lobes of the enzyme. The aspartic acid residues act together to allow a water molecule to attack the peptide bond. One aspartic acid residue (in its deprotonated form) activates the attacking water molecule, whereas the other aspartic acid residue (in its protonated form) polarizes the peptide carbonyl, increasing its susceptibility to attack. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
  • Changed! pfam Asp 318aa 1e-128 in ref transcript
    • Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
  • pfam A1_Propeptide 29aa 5e-05 in ref transcript
    • A1 Propeptide. Most eukaryotic endopeptidases (Merops Family A1) are synthesised with signal and propeptides. The animal pepsin-like endopeptidase propeptides form a distinct family of propeptides, which contain a conserved motif approximately 30 residues long. In pepsinogen A, the first 11 residues of the mature pepsin sequence are displaced by residues of the propeptide. The propeptide contains two helices that block the active site cleft, in particular the conserved Asp11 residue, in pepsin, hydrogen bonds to a conserved Arg residues in the propeptide. This hydrogen bond stabilises the propeptide conformation and is probably responsible for triggering the conversion of pepsinogen to pepsin under acidic conditions.
  • Changed! PTZ PTZ00165 368aa 2e-67 in ref transcript
    • aspartyl protease; Provisional.
  • Changed! cd Cathespin_E 186aa 1e-101 in modified transcript
  • Changed! pfam Asp 185aa 3e-77 in modified transcript
  • Changed! PTZ PTZ00165 238aa 4e-40 in modified transcript

CTSL1

  • refseq_CTSL.F2 refseq_CTSL.R2 224 369
  • NCBIGene 36.3 1514
  • Alternative 5-prime, size difference: 145
  • Exclusion in 5'UTR
  • Reference transcript: NM_001912

  • cd Peptidase_C1A 217aa 4e-83 in ref transcript
    • Peptidase C1A subfamily (MEROPS database nomenclature); composed of cysteine peptidases (CPs) similar to papain, including the mammalian CPs (cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain is an endopeptidase with specific substrate preferences, primarily for bulky hydrophobic or aromatic residues at the S2 subsite, a hydrophobic pocket in papain that accommodates the P2 sidechain of the substrate (the second residue away from the scissile bond). Most members of the papain subfamily are endopeptidases. Some exceptions to this rule can be explained by specific details of the catalytic domains like the occluding loop in cathepsin B which confers an additional carboxydipeptidyl activity and the mini-chain of cathepsin H resulting in an N-terminal exopeptidase activity. Papain-like CPs have different functions in various organisms. Plant CPs are used to mobilize storage proteins in seeds. Parasitic CPs act extracellularly to help invade tissues and cells, to hatch or to evade the host immune system. Mammalian CPs are primarily lysosomal enzymes with the exception of cathepsin W, which is retained in the endoplasmic reticulum. They are responsible for protein degradation in the lysosome. Papain-like CPs are synthesized as inactive proenzymes with N-terminal propeptide regions, which are removed upon activation. In addition to its inhibitory role, the propeptide is required for proper folding of the newly synthesized enzyme and its stabilization in denaturing pH conditions. Residues within the propeptide region also play a role in the transport of the proenzyme to lysosomes or acidified vesicles. Also included in this subfamily are proteins classified as non-peptidase homologs, which lack peptidase activity or have missing active site residues.
  • pfam Peptidase_C1 219aa 9e-99 in ref transcript
    • Papain family cysteine protease.
  • pfam Inhibitor_I29 60aa 1e-14 in ref transcript
    • Cathepsin propeptide inhibitor domain (I29). This domain is found at the N-terminus of some C1 peptidases such as Cathepsin L where it acts as a propeptide. There are also a number of proteins that are composed solely of multiple copies of this domain such as the peptidase inhibitor salarin. This family is classified as I29 by MEROPS.
  • PTZ PTZ00203 288aa 1e-44 in ref transcript
    • cathepsin L protease; Provisional.

CTTN

  • refseq_CTTN.F1 refseq_CTTN.R1 123 234
  • NCBIGene 36.3 2017
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005231

  • cd SH3 53aa 7e-14 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart SH3 53aa 3e-16 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • pfam HS1_rep 37aa 5e-12 in ref transcript
    • Repeat in HS1/Cortactin. The function of this repeat is unknown. Seven copies are found in cortactin and four copies are found in HS1. The repeats are always found amino terminal to an SH3 domain pfam00018.
  • pfam HS1_rep 37aa 2e-11 in ref transcript
  • pfam HS1_rep 37aa 6e-11 in ref transcript
  • Changed! pfam HS1_rep 37aa 2e-10 in ref transcript
  • Changed! pfam HS1_rep 35aa 4e-10 in ref transcript
  • pfam HS1_rep 37aa 1e-09 in ref transcript
  • Changed! pfam HS1_rep 37aa 3e-11 in modified transcript

CYB5A

  • refseq_CYB5A.F1 refseq_CYB5A.R1 124 148
  • NCBIGene 36.3 1528
  • Single exon skipping, size difference: 24
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_148923

  • pfam Cyt-b5 75aa 1e-24 in ref transcript
    • Cytochrome b5-like Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anchored by an N-terminal transmembrane alpha helix. This family also includes a domain in some chitin synthases. There is no known ligand for this domain in the chitin synthases.
  • COG CYB5 81aa 6e-16 in ref transcript
    • Cytochrome b involved in lipid metabolism [Energy production and conversion / Lipid metabolism].

CLIP2

  • refseq_CYLN2.F1 refseq_CYLN2.R1 257 362
  • NCBIGene 36.3 7461
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003388

  • pfam CAP_GLY 66aa 1e-26 in ref transcript
    • CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
  • pfam CAP_GLY 66aa 2e-24 in ref transcript
  • Changed! TIGR SMC_prok_B 354aa 2e-08 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! pfam Myosin_tail_1 592aa 2e-07 in ref transcript
    • Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
  • COG NIP100 61aa 5e-13 in ref transcript
    • Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
  • COG NIP100 62aa 5e-12 in ref transcript
  • Changed! COG Smc 657aa 6e-10 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 313aa 8e-09 in modified transcript
  • Changed! TIGR SMC_prok_B 217aa 2e-07 in modified transcript
  • Changed! pfam Myosin_tail_1 557aa 2e-07 in modified transcript
  • Changed! COG Smc 399aa 2e-08 in modified transcript
  • Changed! COG Smc 350aa 2e-06 in modified transcript

CYP11B1

  • refseq_CYP11B1.F2 refseq_CYP11B1.R2 173 371
  • NCBIGene 36.3 1584
  • Single exon skipping, size difference: 198
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000497

  • Changed! pfam p450 448aa 2e-93 in ref transcript
    • Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
  • Changed! COG CypX 460aa 5e-29 in ref transcript
    • Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam p450 359aa 1e-66 in modified transcript
  • Changed! COG CypX 360aa 1e-15 in modified transcript

CYP19A1

  • refseq_CYP19A1.F1 refseq_CYP19A1.R1 232 341
  • NCBIGene 36.3 1588
  • Single exon skipping, size difference: 109
  • Exclusion in 5'UTR
  • Reference transcript: NM_031226

  • pfam p450 424aa 1e-90 in ref transcript
    • Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
  • COG CypX 415aa 2e-29 in ref transcript
    • Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].

CYP20A1

  • refseq_CYP20A1.F1 refseq_CYP20A1.R1 303 382
  • NCBIGene 36.2 57404
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_177538

  • Changed! pfam p450 375aa 1e-26 in ref transcript
    • Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
  • Changed! COG CypX 238aa 4e-13 in ref transcript
    • Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam p450 46aa 2e-05 in modified transcript

CYP2A7

  • refseq_CYP2A7.F1 refseq_CYP2A7.R1 147 300
  • NCBIGene 36.3 1549
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 153
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000764

  • Changed! pfam p450 458aa 1e-131 in ref transcript
    • Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
  • Changed! COG CypX 429aa 2e-32 in ref transcript
    • Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam p450 378aa 1e-105 in modified transcript
  • Changed! COG CypX 233aa 2e-27 in modified transcript

CYP2D6

  • refseq_CYP2D6.F1 refseq_CYP2D6.R1 218 371
  • NCBIGene 36.3 1565
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000106

  • Changed! pfam p450 436aa 1e-120 in ref transcript
    • Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
  • Changed! COG CypX 415aa 1e-31 in ref transcript
    • Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam p450 385aa 1e-101 in modified transcript
  • Changed! COG CypX 263aa 1e-26 in modified transcript

DAP3

  • refseq_DAP3.F2 refseq_DAP3.R2 202 271
  • NCBIGene 36.3 7818
  • Alternative 5-prime, size difference: 69
  • Exclusion in 5'UTR
  • Reference transcript: NM_033657

  • pfam DAP3 296aa 8e-93 in ref transcript
    • Mitochondrial ribosomal death-associated protein 3. This is a family of conserved proteins which were originally described as death-associated-protein-3 (DAP-3). The proteins carry a P-loop DNA-binding motif, and induce apoptosis. DAP3 has been shown to be a pro-apoptotic factor in the mitochondrial matrix and to be crucial for mitochondrial biogenesis and so has also been designated as MRP-S29 (mitochondrial ribosomal protein subunit 29).

DAZAP1

  • refseq_DAZAP1.F1 refseq_DAZAP1.R1 189 278
  • NCBIGene 36.3 26528
  • Single exon skipping, size difference: 89
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_018959

  • cd RRM 74aa 1e-17 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 70aa 1e-15 in ref transcript
  • smart RRM 70aa 8e-18 in ref transcript
    • RNA recognition motif.
  • smart RRM 70aa 2e-16 in ref transcript
  • TIGR SF-CC1 174aa 4e-15 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • COG COG0724 172aa 2e-14 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

DCHS2

  • refseq_DCHS2.F2 refseq_DCHS2.R2 100 136
  • NCBIGene 36.2 54798
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017639

  • cd CA 199aa 1e-48 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 197aa 4e-45 in ref transcript
  • cd CA 198aa 2e-40 in ref transcript
  • cd CA 194aa 2e-39 in ref transcript
  • cd CA 209aa 4e-39 in ref transcript
  • cd CA 206aa 1e-38 in ref transcript
  • cd CA 196aa 2e-38 in ref transcript
  • cd CA 200aa 2e-37 in ref transcript
  • cd CA 239aa 7e-34 in ref transcript
  • cd CA 195aa 2e-33 in ref transcript
  • cd CA 189aa 3e-33 in ref transcript
  • cd CA 201aa 1e-32 in ref transcript
  • cd CA 192aa 3e-32 in ref transcript
  • cd CA 201aa 2e-30 in ref transcript
  • Changed! cd CA 167aa 1e-26 in ref transcript
  • cd CA 207aa 5e-25 in ref transcript
  • cd CA 197aa 2e-24 in ref transcript
  • cd CA 193aa 2e-20 in ref transcript
  • smart CA 79aa 7e-23 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • smart CA 79aa 5e-21 in ref transcript
  • smart CA 78aa 1e-19 in ref transcript
  • smart CA 79aa 1e-18 in ref transcript
  • pfam Cadherin 91aa 3e-17 in ref transcript
    • Cadherin domain.
  • smart CA 77aa 3e-16 in ref transcript
  • pfam Cadherin 89aa 6e-16 in ref transcript
  • smart CA 90aa 1e-15 in ref transcript
  • smart CA 73aa 8e-15 in ref transcript
  • smart CA 68aa 1e-13 in ref transcript
  • smart CA 75aa 2e-12 in ref transcript
  • pfam Cadherin 79aa 4e-12 in ref transcript
  • Changed! smart CA 94aa 4e-11 in ref transcript
  • pfam Cadherin 95aa 2e-10 in ref transcript
  • pfam Cadherin 89aa 5e-10 in ref transcript
  • pfam Cadherin 90aa 6e-10 in ref transcript
  • smart CA 121aa 7e-10 in ref transcript
  • smart CA 70aa 3e-09 in ref transcript
  • smart CA 67aa 1e-07 in ref transcript
  • pfam Cadherin 81aa 2e-06 in ref transcript
  • pfam Cadherin 51aa 1e-04 in ref transcript
  • Changed! cd CA 155aa 6e-28 in modified transcript
  • Changed! smart CA 82aa 2e-12 in modified transcript

DCHS2

  • refseq_DCHS2.F4 refseq_DCHS2.R4 103 388
  • NCBIGene 36.2 54798
  • Multiple exon skipping, size difference: 285
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017639

  • cd CA 199aa 1e-48 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 197aa 4e-45 in ref transcript
  • cd CA 198aa 2e-40 in ref transcript
  • cd CA 194aa 2e-39 in ref transcript
  • cd CA 209aa 4e-39 in ref transcript
  • cd CA 206aa 1e-38 in ref transcript
  • cd CA 196aa 2e-38 in ref transcript
  • cd CA 200aa 2e-37 in ref transcript
  • cd CA 239aa 7e-34 in ref transcript
  • cd CA 195aa 2e-33 in ref transcript
  • cd CA 189aa 3e-33 in ref transcript
  • cd CA 201aa 1e-32 in ref transcript
  • cd CA 192aa 3e-32 in ref transcript
  • cd CA 201aa 2e-30 in ref transcript
  • Changed! cd CA 167aa 1e-26 in ref transcript
  • cd CA 207aa 5e-25 in ref transcript
  • cd CA 197aa 2e-24 in ref transcript
  • cd CA 193aa 2e-20 in ref transcript
  • smart CA 79aa 7e-23 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • smart CA 79aa 5e-21 in ref transcript
  • smart CA 78aa 1e-19 in ref transcript
  • smart CA 79aa 1e-18 in ref transcript
  • pfam Cadherin 91aa 3e-17 in ref transcript
    • Cadherin domain.
  • smart CA 77aa 3e-16 in ref transcript
  • pfam Cadherin 89aa 6e-16 in ref transcript
  • smart CA 90aa 1e-15 in ref transcript
  • smart CA 73aa 8e-15 in ref transcript
  • smart CA 68aa 1e-13 in ref transcript
  • smart CA 75aa 2e-12 in ref transcript
  • pfam Cadherin 79aa 4e-12 in ref transcript
  • smart CA 94aa 4e-11 in ref transcript
  • pfam Cadherin 95aa 2e-10 in ref transcript
  • pfam Cadherin 89aa 5e-10 in ref transcript
  • pfam Cadherin 90aa 6e-10 in ref transcript
  • smart CA 121aa 7e-10 in ref transcript
  • smart CA 70aa 3e-09 in ref transcript
  • smart CA 67aa 1e-07 in ref transcript
  • pfam Cadherin 81aa 2e-06 in ref transcript
  • Changed! pfam Cadherin 51aa 1e-04 in ref transcript
  • Changed! cd CA 207aa 2e-32 in modified transcript
  • Changed! cd CA 197aa 3e-31 in modified transcript
  • Changed! smart CA 71aa 3e-09 in modified transcript

DCHS2

  • refseq_DCHS2.F6 refseq_DCHS2.R6 162 294
  • NCBIGene 36.2 54798
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017639

  • cd CA 199aa 1e-48 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 197aa 4e-45 in ref transcript
  • cd CA 198aa 2e-40 in ref transcript
  • cd CA 194aa 2e-39 in ref transcript
  • cd CA 209aa 4e-39 in ref transcript
  • cd CA 206aa 1e-38 in ref transcript
  • cd CA 196aa 2e-38 in ref transcript
  • cd CA 200aa 2e-37 in ref transcript
  • Changed! cd CA 239aa 7e-34 in ref transcript
  • cd CA 195aa 2e-33 in ref transcript
  • cd CA 189aa 3e-33 in ref transcript
  • cd CA 201aa 1e-32 in ref transcript
  • cd CA 192aa 3e-32 in ref transcript
  • cd CA 201aa 2e-30 in ref transcript
  • cd CA 167aa 1e-26 in ref transcript
  • cd CA 207aa 5e-25 in ref transcript
  • cd CA 197aa 2e-24 in ref transcript
  • cd CA 193aa 2e-20 in ref transcript
  • smart CA 79aa 7e-23 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • smart CA 79aa 5e-21 in ref transcript
  • smart CA 78aa 1e-19 in ref transcript
  • smart CA 79aa 1e-18 in ref transcript
  • pfam Cadherin 91aa 3e-17 in ref transcript
    • Cadherin domain.
  • smart CA 77aa 3e-16 in ref transcript
  • pfam Cadherin 89aa 6e-16 in ref transcript
  • smart CA 90aa 1e-15 in ref transcript
  • smart CA 73aa 8e-15 in ref transcript
  • smart CA 68aa 1e-13 in ref transcript
  • smart CA 75aa 2e-12 in ref transcript
  • pfam Cadherin 79aa 4e-12 in ref transcript
  • smart CA 94aa 4e-11 in ref transcript
  • pfam Cadherin 95aa 2e-10 in ref transcript
  • pfam Cadherin 89aa 5e-10 in ref transcript
  • pfam Cadherin 90aa 6e-10 in ref transcript
  • Changed! smart CA 121aa 7e-10 in ref transcript
  • smart CA 70aa 3e-09 in ref transcript
  • smart CA 67aa 1e-07 in ref transcript
  • pfam Cadherin 81aa 2e-06 in ref transcript
  • pfam Cadherin 51aa 1e-04 in ref transcript
  • Changed! cd CA 195aa 8e-40 in modified transcript
  • Changed! cd CA 199aa 3e-32 in modified transcript
  • Changed! smart CA 77aa 3e-16 in modified transcript

DCLRE1C

  • refseq_DCLRE1C.F1 refseq_DCLRE1C.R1 116 172
  • NCBIGene 36.3 64421
  • Single exon skipping, size difference: 56
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001033855

  • Changed! pfam DRMBL 99aa 2e-16 in ref transcript
    • DNA repair metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in DNA repair.
  • Changed! smart Lactamase_B 115aa 3e-08 in ref transcript
    • Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor.
  • Changed! COG YSH1 175aa 1e-11 in ref transcript
    • Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].
  • Changed! smart Lactamase_B 62aa 0.002 in modified transcript
  • Changed! COG COG0595 79aa 0.009 in modified transcript
    • Predicted hydrolase of the metallo-beta-lactamase superfamily [General function prediction only].

DCLRE1C

  • refseq_DCLRE1C.F3 refseq_DCLRE1C.R3 172 257
  • NCBIGene 36.3 64421
  • Single exon skipping, size difference: 85
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001033855

  • Changed! pfam DRMBL 99aa 2e-16 in ref transcript
    • DNA repair metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in DNA repair.
  • Changed! smart Lactamase_B 115aa 3e-08 in ref transcript
    • Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor.
  • Changed! COG YSH1 175aa 1e-11 in ref transcript
    • Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].

DCN

  • refseq_DCN.F2 refseq_DCN.R2 102 429
  • NCBIGene 36.3 1634
  • Multiple exon skipping, size difference: 327
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001920

  • Changed! cd LRR_RI 217aa 0.002 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! pfam LRRNT 24aa 1e-05 in ref transcript
    • Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.
  • Changed! COG COG4886 150aa 5e-05 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • Changed! smart LRRNT 24aa 9e-04 in modified transcript
    • Leucine rich repeat N-terminal domain.
  • Changed! COG COG4886 134aa 6e-05 in modified transcript

DCTD

  • refseq_DCTD.F2 refseq_DCTD.R2 101 124
  • NCBIGene 36.3 1635
  • Alternative 5-prime, size difference: 23
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001012732

  • Changed! cd deoxycytidylate_deaminase 122aa 1e-37 in ref transcript
    • Deoxycytidylate deaminase domain. Deoxycytidylate deaminase catalyzes the deamination of dCMP to dUMP, providing the nucleotide substrate for thymidylate synthase. The enzyme binds Zn++, which is required for catalytic activity. The activity of the enzyme is allosterically regulated by the ratio of dCTP to dTTP not only in eukaryotic cells but also in T-even phage-infected Escherichia coli, with dCTP acting as an activator and dTTP as an inhibitor.
  • Changed! pfam dCMP_cyt_deam_1 115aa 4e-24 in ref transcript
    • Cytidine and deoxycytidylate deaminase zinc-binding region.
  • Changed! COG ComEB 152aa 1e-32 in ref transcript
    • Deoxycytidylate deaminase [Nucleotide transport and metabolism].

DCTN3

  • refseq_DCTN3.F1 refseq_DCTN3.R1 303 400
  • NCBIGene 36.3 11258
  • Alternative 3-prime, size difference: 97
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_007234

  • Changed! pfam Dynactin_p22 172aa 1e-82 in ref transcript
    • Dynactin subunit p22. This family contains p22, the smallest subunit of dynactin, a complex that binds to cytoplasmic dynein and is a required activator for cytoplasmic dynein-mediated vesicular transport. Dynactin localises to the cleavage furrow and to the midbodies of dividing cells, suggesting that it may function in cytokinesis. Family members are approximately 170 residues long and seem to be restricted to mammals.
  • Changed! pfam Dynactin_p22 137aa 6e-70 in modified transcript

DCUN1D4

  • refseq_DCUN1D4.F1 refseq_DCUN1D4.R1 374 479
  • NCBIGene 36.3 23142
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040402

  • Changed! pfam DUF298 113aa 3e-33 in ref transcript
    • Domain of unknown function (DUF298). Members of this family contain a basic helix-loop-helix leucine zipper motif. This domain is implicated in some aspect of neddylation of the cullin 3 family and has a possible role in the regulation of the protein modifier Nedd8 E3 ligase. Neddylation is the process by which the C-terminal glycine of the ubiquitin-like protein Nedd8 is covalently linked to lysine residues in a protein through an isopeptide bond.
  • Changed! pfam DUF298 78aa 2e-18 in modified transcript

DDO

  • refseq_DDO.F1 refseq_DDO.R1 165 342
  • NCBIGene 36.3 8528
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003649

  • TIGR thiamin_ThiO 152aa 2e-09 in ref transcript
    • This family consists of the homotetrameric, FAD-dependent glycine oxidase ThiO, from species such as Bacillus subtilis that use glycine in thiamine biosynthesis. In general, members of this family will not be found in species such as E. coli that instead use tyrosine and the ThiH protein.
  • Changed! pfam DAO 309aa 2e-09 in ref transcript
    • FAD dependent oxidoreductase. This family includes various FAD dependent oxidoreductases: Glycerol-3-phosphate dehydrogenase EC:1.1.99.5, Sarcosine oxidase beta subunit EC:1.5.3.1, D-alanine oxidase EC:1.4.99.1, D-aspartate oxidase EC:1.4.3.1.
  • Changed! COG DadA 150aa 1e-04 in ref transcript
    • Glycine/D-amino acid oxidases (deaminating) [Amino acid transport and metabolism].
  • Changed! COG DadA 193aa 1e-04 in modified transcript

DDR2

  • refseq_DDR2.F2 refseq_DDR2.R2 171 251
  • NCBIGene 36.3 4921
  • Single exon skipping, size difference: 80
  • Exclusion in 5'UTR
  • Reference transcript: NM_001014796

  • cd PTKc_DDR2 295aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Discoidin Domain Receptor 2. Protein Tyrosine Kinase (PTK) family; mammalian Discoidin domain receptor 2 (DDR2) and homologs; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. DDR2 is a member of the DDR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular discoidin homology domain, a transmembrane segment, an extended juxtamembrane region, and an intracellular catalytic domain. The binding of the ligand, collagen, to DDRs results in a slow but sustained receptor activation. DDR2 binds mostly to fibrillar collagens. More recently, it has been reported to also bind collagen X. DDR2 is widely expressed in many tissues with the highest levels found in skeletal muscle, skin, kidney and lung. It is important in cell proliferation and development. Mice, with a deletion of DDR2, suffer from dwarfism and delayed healing of epidermal wounds. DDR2 also contributes to collagen (type I) regulation by inhibiting fibrillogenesis and altering the morphology of collagen fibers. It is also expressed in immature dendritic cells (DCs), where it plays a role in DC activation and function.
  • cd FA58C 153aa 2e-37 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • smart TyrKc 287aa 1e-105 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • smart FA58C 156aa 2e-32 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • COG SPS1 289aa 1e-20 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

DDX11

  • refseq_DDX11.F1 refseq_DDX11.R1 122 272
  • NCBIGene 36.3 1663
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152438

  • Changed! TIGR rad3 596aa 1e-101 in ref transcript
    • All proteins in this family for which funcitons are known are DNA-DNA helicases that funciton in the initiation of transcription and nucleotide excision repair as part of the TFIIH complex. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • smart DEXDc3 68aa 6e-25 in ref transcript
    • DEAD-like helicases superfamily.
  • Changed! COG DinG 590aa 3e-29 in ref transcript
    • Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair].
  • COG DinG 53aa 8e-10 in ref transcript
  • Changed! TIGR rad3 546aa 7e-88 in modified transcript
  • Changed! COG DinG 540aa 2e-23 in modified transcript

DDX19B

  • refseq_DDX19B.F1 refseq_DDX19B.R1 264 357
  • NCBIGene 36.3 11269
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007242

  • Changed! cd DEADc 203aa 2e-51 in ref transcript
    • DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
  • cd HELICc 136aa 2e-26 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! pfam DEAD 166aa 4e-32 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • pfam Helicase_C 83aa 2e-22 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • TIGR recQ_fam 201aa 6e-07 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! COG SrmB 399aa 8e-84 in ref transcript
    • Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].
  • Changed! cd DEADc 166aa 7e-40 in modified transcript
  • Changed! pfam DEAD 159aa 2e-30 in modified transcript
  • Changed! COG SrmB 348aa 1e-73 in modified transcript

DDX19B

  • refseq_DDX19B.F3 refseq_DDX19B.R3 219 322
  • NCBIGene 36.3 11269
  • Multiple exon skipping, size difference: 103
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007242

  • Changed! cd DEADc 203aa 2e-51 in ref transcript
    • DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
  • Changed! cd HELICc 136aa 2e-26 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! pfam DEAD 166aa 4e-32 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • Changed! pfam Helicase_C 83aa 2e-22 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • Changed! TIGR recQ_fam 201aa 6e-07 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! COG SrmB 399aa 8e-84 in ref transcript
    • Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].

DDX42

  • refseq_DDX42.F1 refseq_DDX42.R1 113 184
  • NCBIGene 36.3 11325
  • Single exon skipping, size difference: 71
  • Exclusion in 5'UTR
  • Reference transcript: NM_007372

  • cd DEADc 205aa 2e-73 in ref transcript
    • DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
  • cd HELICc 126aa 5e-28 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • pfam DEAD 172aa 1e-56 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • smart HELICc 82aa 2e-24 in ref transcript
    • helicase superfamily c-terminal domain.
  • TIGR recG 309aa 1e-06 in ref transcript
  • PTZ PTZ00110 445aa 1e-116 in ref transcript
    • helicase; Provisional.

DDX47

  • refseq_DDX47.F2 refseq_DDX47.R2 162 309
  • NCBIGene 36.3 51202
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016355

  • cd DEADc 202aa 5e-66 in ref transcript
    • DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
  • Changed! cd HELICc 130aa 3e-28 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • pfam DEAD 168aa 3e-48 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • Changed! pfam Helicase_C 78aa 2e-23 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • Changed! TIGR recQ_fam 373aa 4e-18 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! COG SrmB 424aa 1e-103 in ref transcript
    • Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].
  • Changed! cd HELICc 67aa 9e-17 in modified transcript
  • Changed! pfam Helicase_C 60aa 9e-17 in modified transcript
  • Changed! COG SrmB 375aa 2e-84 in modified transcript

DDX52

  • refseq_DDX52.F1 refseq_DDX52.R1 247 362
  • NCBIGene 36.3 11056
  • Single exon skipping, size difference: 115
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_007010

  • Changed! cd DEADc 208aa 1e-41 in ref transcript
    • DEAD-box helicases. A diverse family of proteins involved in ATP-dependent RNA unwinding, needed in a variety of cellular processes including splicing, ribosome biogenesis and RNA degradation. The name derives from the sequence of the Walker B motif (motif II). This domain contains the ATP- binding region.
  • Changed! cd HELICc 124aa 2e-27 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! pfam DEAD 173aa 2e-37 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • Changed! pfam Helicase_C 77aa 3e-24 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • Changed! TIGR recQ 349aa 1e-09 in ref transcript
    • The ATP-dependent DNA helicase RecQ of E. coli is about 600 residues long. This model represents bacterial proteins with a high degree of similarity in domain architecture and in primary sequence to E. coli RecQ. The model excludes eukaryotic and archaeal proteins with RecQ-like regions, as well as more distantly related bacterial helicases related to RecQ.
  • Changed! COG SrmB 398aa 7e-80 in ref transcript
    • Superfamily II DNA and RNA helicases [DNA replication, recombination, and repair / Transcription / Translation, ribosomal structure and biogenesis].

DEDD

  • refseq_DEDD.F1 refseq_DEDD.R1 168 201
  • NCBIGene 36.3 9191
  • Single exon skipping, size difference: 33
  • Exclusion in 5'UTR
  • Reference transcript: NM_032998

  • cd DED 78aa 7e-11 in ref transcript
    • Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
  • pfam DED 85aa 3e-15 in ref transcript
    • Death effector domain.

DEFB119

  • refseq_DEFB119.F1 refseq_DEFB119.R1 224 264
  • NCBIGene 36.3 245932
  • Single exon skipping, size difference: 40
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_153289

DGKH

  • refseq_DGKH.F2 refseq_DGKH.R2 109 240
  • NCBIGene 36.3 160851
  • Single exon skipping, size difference: 131
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_178009

  • Changed! cd SAM 62aa 2e-14 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • cd PH 89aa 5e-13 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • cd C1 50aa 5e-12 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • cd C1 51aa 4e-06 in ref transcript
  • smart DAGKa 158aa 3e-68 in ref transcript
    • Diacylglycerol kinase accessory domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain might either be an accessory domain or else contribute to the catalytic domain. Bacterial homologues are known.
  • smart DAGKc 123aa 2e-41 in ref transcript
    • Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain is presumed to be the catalytic domain. Bacterial homologues areknown.
  • smart PH 91aa 2e-14 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • Changed! smart SAM 64aa 7e-14 in ref transcript
    • Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
  • smart C1 50aa 6e-12 in ref transcript
    • Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
  • smart C1 45aa 2e-06 in ref transcript
  • COG LCB5 122aa 3e-07 in ref transcript
    • Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].

DGUOK

  • refseq_DGUOK.F1 refseq_DGUOK.R1 120 384
  • NCBIGene 36.3 1716
  • Multiple exon skipping, size difference: 264
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_080916

  • Changed! cd dNK 215aa 2e-48 in ref transcript
    • Deoxyribonucleoside kinase (dNK) catalyzes the phosphorylation of deoxyribonucleosides to yield corresponding monophosphates (dNMPs). This family consists of various deoxynucleoside kinases including deoxyribo- cytidine (EC 2.7.1.74), guanosine (EC 2.7.1.113), adenosine (EC 2.7.1.76), and thymidine (EC 2.7.1.21) kinases. They are key enzymes in the salvage of deoxyribonucleosides originating from extra- or intracellular breakdown of DNA.
  • Changed! pfam dNK 158aa 2e-44 in ref transcript
    • Deoxynucleoside kinase. This family consists of various deoxynucleoside kinases cytidine EC:2.7.1.74, guanosine EC:2.7.1.113, adenosine EC:2.7.1.76 and thymidine kinase EC:2.7.1.21 (which also phosphorylates deoxyuridine and deoxycytosine.) These enzymes catalyse the production of deoxynucleotide 5'-monophosphate from a deoxynucleoside. Using ATP and yielding ADP in the process.
  • Changed! COG COG1428 239aa 3e-22 in ref transcript
    • Deoxynucleoside kinases [Nucleotide transport and metabolism].
  • Changed! cd dNK 117aa 4e-21 in modified transcript
  • Changed! pfam dNK 47aa 6e-06 in modified transcript
  • Changed! COG COG1428 144aa 6e-08 in modified transcript

DHODH

  • refseq_DHODH.F1 refseq_DHODH.R1 100 402
  • NCBIGene 36.2 1723
  • Multiple exon skipping, size difference: 302
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001361

  • Changed! cd DHOD_2_like 331aa 1e-125 in ref transcript
    • Dihydroorotate dehydrogenase (DHOD) class 2. DHOD catalyzes the oxidation of (S)-dihydroorotate to orotate. This is the fourth step and the only redox reaction in the de novo biosynthesis of UMP, the precursor of all pyrimidine nucleotides. DHOD requires FMN as co-factor. DHOD divides into class 1 and class 2 based on their amino acid sequences, their cellular location and their natural electron acceptor used to reoxidize the flavin group. Members of class 1 are cytosolic enzymes and multimers, while class 2 enzymes are membrane associated, monomeric and use respiratory quinones as their physiological electron acceptors.
  • Changed! TIGR pyrD_sub2 335aa 1e-136 in ref transcript
    • The subfamilies 1 and 2 share extensive homology, particularly toward the C-terminus. This subfamily has a longer N-terminal region.
  • Changed! PRK PRK05286 330aa 1e-123 in ref transcript
    • dihydroorotate dehydrogenase 2; Reviewed.
  • Changed! cd DHOD_2_like 125aa 2e-49 in modified transcript
  • Changed! TIGR pyrD_sub2 124aa 1e-54 in modified transcript
  • Changed! PRK PRK05286 120aa 1e-47 in modified transcript

DHPS

  • refseq_DHPS.F2 refseq_DHPS.R2 111 279
  • NCBIGene 36.3 1725
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 168
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001930

  • Changed! pfam DS 315aa 1e-149 in ref transcript
    • Deoxyhypusine synthase. Eukaryotic initiation factor 5A (eIF-5A) contains an unusual amino acid, hypusine [N epsilon-(4-aminobutyl-2-hydroxy)lysine]. The first step in the post-translational formation of hypusine is catalysed by the enzyme deoxyhypusine synthase (DS) EC:1.1.1.249. The modified version of eIF-5A, and DS, are required for eukaryotic cell proliferation.
  • Changed! COG DYS1 331aa 1e-101 in ref transcript
    • Deoxyhypusine synthase [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam DS 259aa 1e-105 in modified transcript
  • Changed! COG DYS1 275aa 7e-67 in modified transcript

DHX30

  • refseq_DHX30.F1 refseq_DHX30.R1 163 192
  • NCBIGene 36.3 22907
  • Single exon skipping, size difference: 29
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_138615

  • Changed! cd DEXDc 142aa 2e-16 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • Changed! cd HELICc 128aa 1e-05 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! TIGR DEAH_box_HrpA 541aa 8e-86 in ref transcript
    • This model represents HrpA, one of two related but uncharacterized DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterized, is about 800 amino acids long. Related characterized eukarotic proteins are RNA helicases associated with pre-mRNA processing.
  • Changed! pfam DUF1605 123aa 1e-04 in ref transcript
    • Domain of unknown function (DUF1605). This family is found towards the C-terminus of the DEAD-box helicases (pfam00270). In these helicases it apparently always found in association with pfam04408. There do seem to be a couple of instances where it occurs by itself.
  • Changed! COG HrpA 650aa 1e-122 in ref transcript
    • HrpA-like helicases [DNA replication, recombination, and repair].

DHX34

  • refseq_DHX34.F1 refseq_DHX34.R1 103 178
  • NCBIGene 36.2 9704
  • Single exon skipping, size difference: 75
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_014681

  • cd DEXDc 136aa 1e-18 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • cd HELICc 94aa 2e-05 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • TIGR DEAH_box_HrpA 474aa 1e-101 in ref transcript
    • This model represents HrpA, one of two related but uncharacterized DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterized, is about 800 amino acids long. Related characterized eukarotic proteins are RNA helicases associated with pre-mRNA processing.
  • Changed! pfam DUF1605 100aa 0.001 in ref transcript
    • Domain of unknown function (DUF1605). This family is found towards the C-terminus of the DEAD-box helicases (pfam00270). In these helicases it apparently always found in association with pfam04408. There do seem to be a couple of instances where it occurs by itself.
  • COG HrpA 505aa 1e-131 in ref transcript
    • HrpA-like helicases [DNA replication, recombination, and repair].

DIABLO

  • refseq_DIABLO.F1 refseq_DIABLO.R1 130 262
  • NCBIGene 36.3 56616
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_019887

  • Changed! pfam Smac_DIABLO 234aa 1e-100 in ref transcript
    • Second Mitochondria-derived Activator of Caspases. Second Mitochondria-derived Activator of Caspases promotes apoptosis by activating caspases in the cytochrome c/Apaf-1/caspase-9 pathway, and by opposing the inhibitory activity of inhibitor of apoptosis proteins (XIAP-BIR3). The protein assumes an elongated three-helix bundle structure, and forms a dimer in solution.
  • Changed! pfam Smac_DIABLO 190aa 3e-73 in modified transcript

DIO1

  • refseq_DIO1.F1 refseq_DIO1.R1 189 333
  • NCBIGene 36.3 1733
  • Single exon skipping, size difference: 144
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001039715

  • Changed! pfam T4_deiodinase 168aa 5e-55 in ref transcript
    • Iodothyronine deiodinase. Iodothyronine deiodinase converts thyroxine (T4) to 3,5,3'-triiodothyronine (T3).
  • Changed! pfam T4_deiodinase 93aa 2e-28 in modified transcript

DIO1

  • refseq_DIO1.F3 refseq_DIO1.R3 154 346
  • NCBIGene 36.3 1733
  • Alternative 5-prime, size difference: 192
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000792

  • Changed! pfam T4_deiodinase 93aa 2e-28 in ref transcript
    • Iodothyronine deiodinase. Iodothyronine deiodinase converts thyroxine (T4) to 3,5,3'-triiodothyronine (T3).
  • Changed! pfam T4_deiodinase 13aa 0.005 in modified transcript

DIO1

  • refseq_DIO1.F4 refseq_DIO1.R4 102 302
  • NCBIGene 36.3 1733
  • Single exon skipping, size difference: 200
  • Exclusion in 3'UTR
  • Reference transcript: NM_000792

  • pfam T4_deiodinase 93aa 2e-28 in ref transcript
    • Iodothyronine deiodinase. Iodothyronine deiodinase converts thyroxine (T4) to 3,5,3'-triiodothyronine (T3).

DIO2

  • refseq_DIO2.F1 refseq_DIO2.R1 180 288
  • NCBIGene 36.3 1734
  • Single exon skipping, size difference: 108
  • Exclusion in 5'UTR
  • Reference transcript: NM_001007023

DISC1

  • refseq_DISC1.F2 refseq_DISC1.R2 208 274
  • NCBIGene 36.3 27185
  • Alternative 5-prime, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018662

STAG3L1

  • refseq_DKFZP434A0131.F1 refseq_DKFZP434A0131.R1 112 236
  • NCBIGene 36.3 54441
  • Single exon skipping, size difference: 124
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_018991

  • Changed! COG IRR1 114aa 1e-08 in ref transcript
    • Cohesin [Cell division and chromosome partitioning].

FAM149A

  • refseq_DKFZP564J102.F2 refseq_DKFZP564J102.R2 102 129
  • NCBIGene 36.3 25854
  • Alternative 3-prime, size difference: 27
  • Inclusion in 5'UTR
  • Reference transcript: NM_001006655

DKFZP586H2123

  • refseq_DKFZP586H2123.F2 refseq_DKFZP586H2123.R2 130 181
  • NCBIGene 36.3 25891
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015430

  • cd Tryp_SPc 258aa 3e-28 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • cd CUB 107aa 5e-25 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CCP 64aa 2e-07 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd EGF_CA 33aa 7e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart Tryp_SPc 255aa 1e-27 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • smart CUB 92aa 5e-23 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • smart CCP 63aa 4e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart EGF_CA 33aa 0.002 in ref transcript
    • Calcium-binding EGF-like domain.
  • smart CCP 35aa 0.003 in ref transcript
  • COG COG5640 259aa 5e-07 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].

DKFZP686M0199

  • refseq_DKFZP686M0199.F1 refseq_DKFZP686M0199.R1 152 250
  • NCBIGene 36.2 653238
  • Single exon skipping, size difference: 98
  • Exclusion in 5'UTR
  • Reference transcript: XM_001130651

  • cd vWA_transcription_factor_IIH_type 181aa 1e-75 in ref transcript
    • Transcription factors IIH type: TFIIH is a multiprotein complex that is one of the five general transcription factors that binds RNA polymerase II holoenzyme. Orthologues of these genes are found in all completed eukaryotic genomes and all these proteins contain a VWA domain. The p44 subunit of TFIIH functions as a DNA helicase in RNA polymerase II transcription initiation and DNA repair, and its transcriptional activity is dependent on its C-terminal Zn-binding domains. The function of the vWA domain is unclear, but may be involved in complex assembly. The MIDAS motif is not conserved in this sub-group.
  • pfam Ssl1 249aa 1e-117 in ref transcript
    • Ssl1-like. Ssl1-like proteins are 40kDa subunits of the Transcription factor II H complex.
  • TIGR ssl1 101aa 3e-41 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • COG SSL1 375aa 6e-76 in ref transcript
    • RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription / DNA replication, recombination, and repair].

ZMIZ2

  • refseq_DKFZp761I2123.F2 refseq_DKFZp761I2123.R2 261 339
  • NCBIGene 36.3 83637
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031449

  • pfam zf-MIZ 49aa 6e-21 in ref transcript
    • MIZ/SP-RING zinc finger. This domain has SUMO (small ubiquitin-like modifier) ligase activity and is involved in DNA repair and chromosome organisation.
  • TIGR PABP-1234 81aa 5e-04 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.

DLEC1

  • refseq_DLEC1.F1 refseq_DLEC1.R1 141 265
  • NCBIGene 36.3 9940
  • Single exon skipping, size difference: 124
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_007337

DLX1

  • refseq_DLX1.F2 refseq_DLX1.R2 161 361
  • NCBIGene 36.3 1745
  • Single exon skipping, size difference: 200
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_178120

  • Changed! cd homeodomain 57aa 3e-15 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • Changed! pfam Homeobox 57aa 9e-21 in ref transcript
    • Homeobox domain.
  • Changed! COG COG5576 78aa 1e-06 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

DMD

  • refseq_DMD.F1 refseq_DMD.R1 219 362
  • NCBIGene 36.3 1756
  • Alternative 5-prime, size difference: 143
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004011

  • Changed! cd ZZ_dystrophin 49aa 2e-23 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • Changed! cd SPEC 217aa 8e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • Changed! cd SPEC 216aa 4e-11 in ref transcript
  • Changed! cd SPEC 244aa 5e-10 in ref transcript
  • Changed! cd SPEC 209aa 4e-09 in ref transcript
  • Changed! cd SPEC 111aa 1e-06 in ref transcript
  • Changed! cd WW 30aa 1e-05 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • Changed! cd SPEC 189aa 2e-05 in ref transcript
  • Changed! cd SPEC 210aa 3e-05 in ref transcript
  • Changed! cd SPEC 226aa 4e-04 in ref transcript
  • Changed! pfam efhand_1 121aa 8e-45 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • Changed! pfam efhand_2 92aa 4e-40 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • Changed! pfam ZZ 46aa 1e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
  • Changed! smart SPEC 101aa 9e-09 in ref transcript
    • Spectrin repeats.
  • Changed! pfam Spectrin 107aa 2e-08 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • Changed! smart SPEC 102aa 5e-08 in ref transcript
  • Changed! smart SPEC 110aa 6e-07 in ref transcript
  • Changed! TIGR SMC_prok_B 582aa 6e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! smart WW 32aa 4e-06 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • Changed! pfam Spectrin 117aa 2e-05 in ref transcript
  • Changed! TIGR SMC_prok_B 278aa 8e-05 in ref transcript
  • Changed! COG SbcC 548aa 1e-08 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! PRK PRK00409 138aa 4e-04 in ref transcript
    • recombination and DNA strand exchange inhibitor protein; Reviewed.

DMD

  • refseq_DMD.F3 refseq_DMD.R3 166 198
  • NCBIGene 36.3 1756
  • Single exon skipping, size difference: 32
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004006

  • cd ZZ_dystrophin 49aa 3e-23 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • cd CH 106aa 6e-16 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 217aa 7e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd SPEC 218aa 5e-13 in ref transcript
  • cd SPEC 216aa 3e-11 in ref transcript
  • cd SPEC 244aa 5e-10 in ref transcript
  • cd SPEC 209aa 4e-09 in ref transcript
  • cd CH 104aa 6e-09 in ref transcript
  • cd SPEC 207aa 7e-09 in ref transcript
  • cd SPEC 215aa 1e-07 in ref transcript
  • cd SPEC 111aa 1e-06 in ref transcript
  • cd SPEC 183aa 5e-06 in ref transcript
  • cd SPEC 207aa 5e-06 in ref transcript
  • cd SPEC 189aa 2e-05 in ref transcript
  • cd SPEC 210aa 3e-05 in ref transcript
  • cd WW 30aa 4e-05 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd SPEC 226aa 4e-04 in ref transcript
  • cd SPEC 208aa 0.004 in ref transcript
  • pfam efhand_1 121aa 1e-44 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 92aa 7e-40 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam ZZ 46aa 2e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
  • smart CH 100aa 4e-18 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • smart CH 101aa 4e-11 in ref transcript
  • TIGR SMC_prok_B 595aa 6e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 101aa 1e-08 in ref transcript
    • Spectrin repeats.
  • pfam Spectrin 107aa 2e-08 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • smart SPEC 103aa 5e-08 in ref transcript
  • smart SPEC 102aa 6e-08 in ref transcript
  • pfam Spectrin 105aa 1e-07 in ref transcript
  • smart SPEC 110aa 8e-07 in ref transcript
  • smart WW 32aa 1e-05 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • pfam Spectrin 117aa 3e-05 in ref transcript
  • TIGR SMC_prok_B 278aa 8e-05 in ref transcript
  • smart SPEC 101aa 3e-04 in ref transcript
  • smart SPEC 98aa 0.003 in ref transcript
  • pfam Spectrin 66aa 0.007 in ref transcript
  • TIGR SMC_prok_B 391aa 0.010 in ref transcript
  • COG SAC6 225aa 6e-25 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG SbcC 643aa 3e-09 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • PRK PRK00409 138aa 5e-04 in ref transcript
    • recombination and DNA strand exchange inhibitor protein; Reviewed.

DMD

  • refseq_DMD.F5 refseq_DMD.R5 270 354
  • NCBIGene 36.3 1756
  • Alternative 5-prime, size difference: 84
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004009

  • Changed! cd ZZ_dystrophin 49aa 3e-23 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • Changed! cd CH 106aa 5e-16 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • Changed! cd SPEC 217aa 8e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • Changed! cd SPEC 218aa 5e-13 in ref transcript
  • Changed! cd SPEC 216aa 3e-11 in ref transcript
  • Changed! cd SPEC 244aa 5e-10 in ref transcript
  • Changed! cd SPEC 209aa 4e-09 in ref transcript
  • Changed! cd CH 104aa 6e-09 in ref transcript
  • Changed! cd SPEC 207aa 8e-09 in ref transcript
  • Changed! cd SPEC 215aa 1e-07 in ref transcript
  • Changed! cd SPEC 111aa 1e-06 in ref transcript
  • Changed! cd SPEC 183aa 5e-06 in ref transcript
  • Changed! cd SPEC 207aa 5e-06 in ref transcript
  • Changed! cd SPEC 189aa 2e-05 in ref transcript
  • Changed! cd SPEC 210aa 3e-05 in ref transcript
  • Changed! cd WW 30aa 3e-05 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • Changed! cd SPEC 226aa 4e-04 in ref transcript
  • Changed! cd SPEC 208aa 0.004 in ref transcript
  • Changed! pfam efhand_1 121aa 1e-44 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • Changed! pfam efhand_2 92aa 6e-40 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • Changed! pfam ZZ 46aa 2e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
  • Changed! smart CH 100aa 4e-18 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • Changed! smart CH 101aa 4e-11 in ref transcript
  • Changed! TIGR SMC_prok_B 595aa 6e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! smart SPEC 101aa 1e-08 in ref transcript
    • Spectrin repeats.
  • Changed! pfam Spectrin 107aa 2e-08 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • Changed! smart SPEC 103aa 5e-08 in ref transcript
  • Changed! smart SPEC 102aa 6e-08 in ref transcript
  • Changed! pfam Spectrin 105aa 1e-07 in ref transcript
  • Changed! smart SPEC 110aa 8e-07 in ref transcript
  • Changed! smart WW 32aa 1e-05 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • Changed! pfam Spectrin 117aa 3e-05 in ref transcript
  • Changed! TIGR SMC_prok_B 278aa 8e-05 in ref transcript
  • Changed! smart SPEC 101aa 3e-04 in ref transcript
  • Changed! smart SPEC 98aa 0.003 in ref transcript
  • Changed! pfam Spectrin 66aa 0.007 in ref transcript
  • Changed! COG SAC6 225aa 6e-25 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • Changed! COG SbcC 643aa 3e-09 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! PRK PRK00409 138aa 5e-04 in ref transcript
    • recombination and DNA strand exchange inhibitor protein; Reviewed.

DMD

  • refseq_DMD.F7 refseq_DMD.R7 116 155
  • NCBIGene 36.3 1756
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004006

  • cd ZZ_dystrophin 49aa 3e-23 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • cd CH 106aa 6e-16 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 217aa 7e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd SPEC 218aa 5e-13 in ref transcript
  • cd SPEC 216aa 3e-11 in ref transcript
  • cd SPEC 244aa 5e-10 in ref transcript
  • cd SPEC 209aa 4e-09 in ref transcript
  • cd CH 104aa 6e-09 in ref transcript
  • cd SPEC 207aa 7e-09 in ref transcript
  • cd SPEC 215aa 1e-07 in ref transcript
  • cd SPEC 111aa 1e-06 in ref transcript
  • cd SPEC 183aa 5e-06 in ref transcript
  • cd SPEC 207aa 5e-06 in ref transcript
  • cd SPEC 189aa 2e-05 in ref transcript
  • cd SPEC 210aa 3e-05 in ref transcript
  • cd WW 30aa 4e-05 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd SPEC 226aa 4e-04 in ref transcript
  • cd SPEC 208aa 0.004 in ref transcript
  • pfam efhand_1 121aa 1e-44 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 92aa 7e-40 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam ZZ 46aa 2e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin, CBP/p300. ZZ in dystrophin binds calmodulin. Putative zinc finger; binding not yet shown. Four to six cysteine residues in its sequence are responsible for coordinating zinc ions, to reinforce the structure.
  • smart CH 100aa 4e-18 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • smart CH 101aa 4e-11 in ref transcript
  • TIGR SMC_prok_B 595aa 6e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 101aa 1e-08 in ref transcript
    • Spectrin repeats.
  • pfam Spectrin 107aa 2e-08 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • smart SPEC 103aa 5e-08 in ref transcript
  • smart SPEC 102aa 6e-08 in ref transcript
  • pfam Spectrin 105aa 1e-07 in ref transcript
  • smart SPEC 110aa 8e-07 in ref transcript
  • smart WW 32aa 1e-05 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • pfam Spectrin 117aa 3e-05 in ref transcript
  • TIGR SMC_prok_B 278aa 8e-05 in ref transcript
  • smart SPEC 101aa 3e-04 in ref transcript
  • smart SPEC 98aa 0.003 in ref transcript
  • pfam Spectrin 66aa 0.007 in ref transcript
  • Changed! TIGR SMC_prok_B 391aa 0.010 in ref transcript
  • COG SAC6 225aa 6e-25 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG SbcC 643aa 3e-09 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • PRK PRK00409 138aa 5e-04 in ref transcript
    • recombination and DNA strand exchange inhibitor protein; Reviewed.

DNAJC21

  • refseq_DNAJA5.F1 refseq_DNAJA5.R1 216 351
  • NCBIGene 36.3 134218
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_194283

  • cd DnaJ 56aa 1e-17 in ref transcript
    • DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
  • TIGR DnaJ_bact 88aa 2e-25 in ref transcript
    • This model represents bacterial forms of DnaJ, part of the DnaK-DnaJ-GrpE chaperone system. The three components typically are encoded by consecutive genes. DnaJ homologs occur in many genomes, typically not near DnaK and GrpE-like genes; most such genes are not included by this family. Eukaryotic (mitochondrial and chloroplast) forms are not included in the scope of this family.
  • smart ZnF_U1 30aa 8e-04 in ref transcript
    • U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
  • COG DnaJ 75aa 8e-23 in ref transcript
    • DnaJ-class molecular chaperone with C-terminal Zn finger domain [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG ZUO1 264aa 4e-12 in ref transcript
    • Ribosome-associated chaperone zuotin [Translation, ribosomal structure and biogenesis / Posttranslational modification, protein turnover, chaperones].
  • Changed! COG ZUO1 245aa 9e-12 in modified transcript

DNAJC19

  • refseq_DNAJC19.F1 refseq_DNAJC19.R1 137 189
  • NCBIGene 36.2 131118
  • Single exon skipping, size difference: 52
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_145261

  • Changed! cd DnaJ 45aa 5e-07 in ref transcript
    • DnaJ domain or J-domain. DnaJ/Hsp40 (heat shock protein 40) proteins are highly conserved and play crucial roles in protein translation, folding, unfolding, translocation, and degradation. They act primarily by stimulating the ATPase activity of Hsp70s, an important chaperonine family. Hsp40 proteins are characterized by the presence of a J domain, which mediates the interaction with Hsp70. They may contain other domains as well, and the architectures provide a means of classification.
  • Changed! smart DnaJ 46aa 2e-07 in ref transcript
    • DnaJ molecular chaperone homology domain.
  • Changed! PTZ PTZ00100 58aa 4e-20 in ref transcript
    • DnaJ chaperone protein; Provisional.

DNASE1L1

  • refseq_DNASE1L1.F1 refseq_DNASE1L1.R1 111 369
  • NCBIGene 36.3 1774
  • Single exon skipping, size difference: 258
  • Exclusion in 5'UTR
  • Reference transcript: NM_001009932

  • smart DNaseIc 272aa 1e-114 in ref transcript
    • deoxyribonuclease I. Deoxyribonuclease I catalyzes the endonucleolytic cleavage of double-stranded DNA. The enzyme is secreted outside the cell and also involved in apoptosis in the nucleus.

DNM1

  • refseq_DNM1.F2 refseq_DNM1.R2 159 196
  • NCBIGene 36.3 1759
  • Single exon skipping, size difference: 37
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004408

  • cd PH_dynamin 110aa 1e-50 in ref transcript
    • Dynamin pleckstrin homology (PH) domain. Dynamin is a GTPase that regulates endocytic vesicle formation. It has an N-terminal GTPase domain, followed by a PH domain, a GTPase effector domain and a C-terminal proline arginine rich domain. Dynamin-like proteins, which are found in metazoa, plants and yeast have the same domain architecture as dynamin, but lack the PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • smart DYNc 240aa 1e-130 in ref transcript
    • Dynamin, GTPase. Large GTPases that mediate vesicle trafficking. Dynamin participates in the endocytic uptake of receptors, associated ligands, and plasma membrane following an exocytic event.
  • pfam Dynamin_M 294aa 1e-118 in ref transcript
    • Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169.
  • pfam GED 92aa 2e-22 in ref transcript
    • Dynamin GTPase effector domain.
  • smart PH 104aa 2e-05 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • COG COG0699 424aa 5e-34 in ref transcript
    • Predicted GTPases (dynamin-related) [General function prediction only].

DPAGT1

  • refseq_DPAGT1.F1 refseq_DPAGT1.R1 194 315
  • NCBIGene 36.3 1798
  • Single exon skipping, size difference: 121
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001382

  • Changed! cd GT_GPT_euk 289aa 1e-113 in ref transcript
    • UDP-GlcNAc:dolichol-P GlcNAc-1-P transferase (GPT) catalyzes the transfer of GlcNAc-1-P from UDP-GlcNAc to dolichol-P to form GlcNAc-P-P-dolichol. The reaction is the first step in the assembly of dolichol-linked oligosaccharide intermediates and is essential for eukaryotic N-glycosylation. GPT activity has been identified in all eukaryotic cells examined to date. A series of six conserved motifs designated A through F, ranging in length from 5 to 13 amino acid residues, has been identified in this family. They have been determined to be important for stable expression, substrate binding, or catalytic activities.
  • Changed! pfam Glycos_transf_4 178aa 1e-44 in ref transcript
    • Glycosyl transferase family 4.
  • Changed! COG Rfe 196aa 1e-18 in ref transcript
    • UDP-N-acetylmuramyl pentapeptide phosphotransferase/UDP-N- acetylglucosamine-1-phosphate transferase [Cell envelope biogenesis, outer membrane].
  • Changed! cd GT_GPT_euk 26aa 0.008 in modified transcript

DPP8

  • refseq_DPP8.F1 refseq_DPP8.R1 218 354
  • NCBIGene 36.3 54878
  • Single exon skipping, size difference: 136
  • Inclusion in 5'UTR
  • Reference transcript: NM_130434

  • pfam DPPIV_N 421aa 7e-72 in ref transcript
    • Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site. The Prosite motif does not correspond to this Pfam entry.
  • pfam Peptidase_S9 206aa 9e-53 in ref transcript
    • Prolyl oligopeptidase family.
  • COG DAP2 330aa 7e-48 in ref transcript
    • Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism].

DPP8

  • refseq_DPP8.F3 refseq_DPP8.R3 184 337
  • NCBIGene 36.3 54878
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_197960

  • pfam DPPIV_N 421aa 4e-72 in ref transcript
    • Dipeptidyl peptidase IV (DPP IV) N-terminal region. This family is an alignment of the region to the N-terminal side of the active site. The Prosite motif does not correspond to this Pfam entry.
  • Changed! pfam Peptidase_S9 206aa 6e-53 in ref transcript
    • Prolyl oligopeptidase family.
  • Changed! COG DAP2 330aa 4e-48 in ref transcript
    • Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism].
  • Changed! pfam Peptidase_S9 155aa 7e-25 in modified transcript
  • Changed! COG DAP2 279aa 1e-27 in modified transcript

DRD2

  • refseq_DRD2.F1 refseq_DRD2.R1 276 363
  • NCBIGene 36.3 1813
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000795

  • pfam 7tm_1 131aa 8e-23 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • pfam 7tm_1 68aa 7e-17 in ref transcript

DSC1

  • refseq_DSC1.F2 refseq_DSC1.R2 251 297
  • NCBIGene 36.3 1823
  • Single exon skipping, size difference: 46
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_024421

  • cd CA 201aa 1e-34 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 209aa 2e-34 in ref transcript
  • cd CA 182aa 6e-23 in ref transcript
  • cd CA 212aa 4e-21 in ref transcript
  • pfam Cadherin 97aa 4e-20 in ref transcript
    • Cadherin domain.
  • pfam Cadherin_pro 81aa 4e-17 in ref transcript
    • Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
  • pfam Cadherin 95aa 9e-16 in ref transcript
  • pfam Cadherin 83aa 3e-13 in ref transcript
  • pfam Cadherin 91aa 2e-11 in ref transcript
  • pfam Cadherin 86aa 0.002 in ref transcript

DSC2

  • refseq_DSC2.F1 refseq_DSC2.R1 217 263
  • NCBIGene 36.3 1824
  • Single exon skipping, size difference: 46
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_024422

  • cd CA 210aa 5e-32 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 209aa 2e-24 in ref transcript
  • cd CA 196aa 2e-22 in ref transcript
  • pfam Cadherin 104aa 8e-14 in ref transcript
    • Cadherin domain.
  • smart CA 86aa 9e-14 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • pfam Cadherin_pro 81aa 2e-13 in ref transcript
    • Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
  • pfam Cadherin 93aa 6e-13 in ref transcript
  • pfam Cadherin 95aa 1e-11 in ref transcript
  • Changed! pfam Cadherin_C 45aa 0.002 in ref transcript
    • Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.

DSCAM

  • refseq_DSCAM.F1 refseq_DSCAM.R1 141 332
  • NCBIGene 36.2 1826
  • Alternative 5-prime and 3-prime, size difference: 191
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001389

  • cd FN3 91aa 1e-12 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 94aa 1e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 77aa 3e-12 in ref transcript
  • cd IGcam 91aa 2e-11 in ref transcript
  • cd IGcam 88aa 2e-11 in ref transcript
  • cd IGcam 70aa 1e-09 in ref transcript
  • cd FN3 98aa 3e-09 in ref transcript
  • cd IGcam 90aa 1e-08 in ref transcript
  • cd IGcam 95aa 5e-08 in ref transcript
  • cd FN3 86aa 6e-08 in ref transcript
  • cd FN3 94aa 2e-07 in ref transcript
  • cd FN3 74aa 1e-05 in ref transcript
  • cd IGcam 64aa 3e-04 in ref transcript
  • cd IGcam 90aa 0.002 in ref transcript
  • cd FN3 73aa 0.005 in ref transcript
  • pfam I-set 76aa 5e-13 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 84aa 1e-12 in ref transcript
    • Fibronectin type III domain.
  • pfam I-set 95aa 1e-11 in ref transcript
  • pfam I-set 79aa 3e-11 in ref transcript
  • pfam I-set 85aa 5e-11 in ref transcript
  • smart IGc2 65aa 1e-09 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart FN3 88aa 2e-09 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart IGc2 58aa 8e-09 in ref transcript
  • pfam fn3 87aa 2e-08 in ref transcript
  • smart IG_like 90aa 2e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 79aa 8e-08 in ref transcript
  • pfam fn3 69aa 9e-06 in ref transcript

PSMG1

  • refseq_DSCR2.F1 refseq_DSCR2.R1 214 277
  • NCBIGene 36.3 8624
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003720

DSTN

  • refseq_DSTN.F1 refseq_DSTN.R1 152 284
  • NCBIGene 36.3 11034
  • Single exon skipping, size difference: 132
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006870

  • Changed! cd ADF 150aa 6e-32 in ref transcript
    • Actin depolymerisation factor/cofilin -like domains; present in a family of essential eukaryotic actin regulatory proteins; these proteins enhance the turnover rate of actin and interact with actin monomers as well as actin filaments.
  • Changed! smart ADF 135aa 4e-35 in ref transcript
    • Actin depolymerisation factor/cofilin -like domains. Severs actin filaments and binds to actin monomers.
  • Changed! PTZ PTZ00152 131aa 1e-05 in ref transcript
    • actin depolymerizing factor; Provisional.

DTNA

  • refseq_DTNA.F1 refseq_DTNA.R1 273 351
  • NCBIGene 36.3 1837
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032980

  • PRK PRK05431 71aa 7e-05 in ref transcript
    • seryl-tRNA synthetase; Provisional.
  • COG COG3264 191aa 0.005 in ref transcript
    • Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane].

DTNA

  • refseq_DTNA.F3 refseq_DTNA.R3 142 226
  • NCBIGene 36.3 1837
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_001390

  • cd ZZ_dystrophin 49aa 1e-20 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • pfam efhand_1 127aa 6e-50 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 89aa 2e-35 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • smart ZnF_ZZ 45aa 3e-13 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
  • pfam Cast 103aa 0.005 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • PRK PRK05431 71aa 1e-04 in ref transcript
    • seryl-tRNA synthetase; Provisional.
  • COG COG3264 191aa 0.004 in ref transcript
    • Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane].

DTNA

  • refseq_DTNA.F4 refseq_DTNA.R1 100 193
  • NCBIGene 36.3 1837
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001390

  • cd ZZ_dystrophin 49aa 1e-20 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • pfam efhand_1 127aa 6e-50 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 89aa 2e-35 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • smart ZnF_ZZ 45aa 3e-13 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
  • Changed! pfam Cast 103aa 0.005 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • PRK PRK05431 71aa 1e-04 in ref transcript
    • seryl-tRNA synthetase; Provisional.
  • COG COG3264 191aa 0.004 in ref transcript
    • Small-conductance mechanosensitive channel [Cell envelope biogenesis, outer membrane].
  • Changed! TIGR SMC_prok_B 77aa 0.006 in modified transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.

DTNB

  • refseq_DTNB.F1 refseq_DTNB.R1 114 204
  • NCBIGene 36.3 1838
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021907

  • cd ZZ_dystrophin 49aa 4e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • pfam efhand_1 127aa 3e-47 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 89aa 3e-37 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • smart ZnF_ZZ 45aa 3e-11 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
  • TIGR SMC_prok_B 66aa 0.008 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • PRK PRK05431 65aa 0.003 in ref transcript
    • seryl-tRNA synthetase; Provisional.

DTNB

  • refseq_DTNB.F2 refseq_DTNB.R2 115 205
  • NCBIGene 36.3 1838
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021907

  • cd ZZ_dystrophin 49aa 4e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • pfam efhand_1 127aa 3e-47 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 89aa 3e-37 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • smart ZnF_ZZ 45aa 3e-11 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
  • TIGR SMC_prok_B 66aa 0.008 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • PRK PRK05431 65aa 0.003 in ref transcript
    • seryl-tRNA synthetase; Provisional.

DTNB

  • refseq_DTNB.F5 refseq_DTNB.R5 110 164
  • NCBIGene 36.3 1838
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021907

  • cd ZZ_dystrophin 49aa 4e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • pfam efhand_1 127aa 3e-47 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 89aa 3e-37 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • smart ZnF_ZZ 45aa 3e-11 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
  • TIGR SMC_prok_B 66aa 0.008 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • PRK PRK05431 65aa 0.003 in ref transcript
    • seryl-tRNA synthetase; Provisional.

DTNBP1

  • refseq_DTNBP1.F1 refseq_DTNBP1.R1 226 271
  • NCBIGene 36.3 84062
  • Alternative 3-prime, size difference: 45
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_032122

  • Changed! pfam Dysbindin 121aa 7e-16 in ref transcript
    • Dysbindin (Dystrobrevin binding protein 1). Dysbindin is an evolutionary conserved 40-kDa coiled-coil-containing protein that binds to alpha- and beta-dystrobrevin in muscle and brain. Dystrophin and alpha-dystrobrevin are co-immunoprecipitated with dysbindin, indicating that dysbindin is DPC-associated in muscle. Dysbindin co-localises with alpha-dystrobrevin at the sarcolemma and is up-regulated in dystrophin-deficient muscle. In the brain, dysbindin is found primarily in axon bundles and especially in certain axon terminals, notably mossy fibre synaptic terminals in the cerebellum and hippocampus. Dysbindin may have implications for the molecular pathology of Duchenne muscular dystrophy and may provide an alternative route for anchoring dystrobrevin and the DPC to the muscle membrane. Genetic variation in the human dysbindin gene is also thought to be associated with Schizophrenia.

DUOX1

  • refseq_DUOX1.F2 refseq_DUOX1.R2 127 319
  • NCBIGene 36.3 53905
  • Single exon skipping, size difference: 192
  • Exclusion in 5'UTR
  • Reference transcript: NM_017434

  • cd NOX_Duox_like_FAD_NADP 180aa 9e-45 in ref transcript
    • NADPH oxidase (NOX) catalyzes the generation of reactive oxygen species (ROS) such as superoxide and hydrogen peroxide. ROS were originally identified as bactericidal agents in phagocytes, but are now also implicated in cell signaling and metabolism. NOX has a 6-alpha helix heme-binding transmembrane domain fused to a flavoprotein with the nucleotide binding domain located in the cytoplasm. Duox enzymes link a peroxidase domain to the NOX domain via a single transmembrane and EF-hand Ca2+ binding sites. The flavoprotein module has a ferredoxin like FAD/NADPH binding domain. In classical phagocytic NOX2, electron transfer occurs from NADPH to FAD to the heme of cytb to oxygen leading to superoxide formation.
  • cd EFh 62aa 1e-11 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 62aa 8e-05 in ref transcript
  • cd NOX_Duox_like_FAD_NADP 36aa 0.007 in ref transcript
  • pfam An_peroxidase 528aa 1e-139 in ref transcript
    • Animal haem peroxidase.
  • pfam NAD_binding_6 155aa 2e-27 in ref transcript
    • Ferric reductase NAD binding domain.
  • pfam FAD_binding_8 97aa 4e-17 in ref transcript
    • FAD-binding domain.
  • pfam Ferric_reduct 70aa 7e-08 in ref transcript
    • Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mouse-ear cress. This may be a family of flavocytochromes capable of moving electrons across the plasma membrane. The Frp1 protein from S. pombe is a ferric reductase component and is required for cell surface ferric reductase activity, mutants in frp1 are deficient in ferric iron uptake. Cytochrome B-245 heavy chain is a FAD-dependent dehydrogenase it is also has electron transferase activity which reduces molecular oxygen to superoxide anion, a precursor in the production of microbicidal oxidants. Mutations in the sequence of cytochrome B-245 heavy chain (gp91-phox) lead to the X-linked chronic granulomatous disease. The bacteriocidal ability of phagocytic cells is reduced and is characterised by the absence of a functional plasma membrane associated NADPH oxidase. The chronic granulomatous disease gene codes for the beta chain of cytochrome B-245 and cytochrome B-245 is missing from patients with the disease.
  • smart EFh 25aa 4e-04 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • smart EFh 25aa 0.002 in ref transcript
  • COG FRQ1 171aa 2e-11 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • COG UbiB 195aa 3e-10 in ref transcript
    • 2-polyprenylphenol hydroxylase and related flavodoxin oxidoreductases [Coenzyme metabolism / Energy production and conversion].

DUSP13

  • refseq_DUSP13.F1 refseq_DUSP13.R1 144 352
  • NCBIGene 36.2 51207
  • Single exon skipping, size difference: 208
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001007271

  • cd DSPc 144aa 2e-32 in ref transcript
    • Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
  • smart DSPc 143aa 4e-26 in ref transcript
    • Dual specificity phosphatase, catalytic domain.
  • COG CDC14 140aa 1e-07 in ref transcript
    • Predicted protein-tyrosine phosphatase [Signal transduction mechanisms].

DUT

  • refseq_DUT.F1 refseq_DUT.R1 103 419
  • NCBIGene 36.3 1854
  • Alternative 5-prime, size difference: 316
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001025248

  • TIGR dut 138aa 5e-60 in ref transcript
    • Changed role from 132 to 123. RTD.
  • COG Dut 134aa 1e-36 in ref transcript
    • dUTPase [Nucleotide transport and metabolism].
  • Changed! PRK PRK07764 117aa 0.006 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.

DYRK1A

  • refseq_DYRK1A.F2 refseq_DYRK1A.R2 143 170
  • NCBIGene 36.3 1859
  • Alternative 3-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001396

  • cd S_TKc 322aa 6e-56 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 230aa 2e-54 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart S_TKc 28aa 4e-04 in ref transcript
  • PTZ PTZ00284 225aa 9e-30 in ref transcript
    • protein kinase; Provisional.

DYRK1A

  • refseq_DYRK1A.F3 refseq_DYRK1A.R3 133 264
  • NCBIGene 36.3 1859
  • Alternative 5-prime, size difference: 131
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001396

  • Changed! cd S_TKc 322aa 6e-56 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 230aa 2e-54 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart S_TKc 28aa 4e-04 in ref transcript
  • PTZ PTZ00284 225aa 9e-30 in ref transcript
    • protein kinase; Provisional.
  • Changed! cd S_TKc 241aa 9e-54 in modified transcript

DYRK1A

  • refseq_DYRK1A.F5 refseq_DYRK1A.R5 101 273
  • NCBIGene 36.3 1859
  • Alternative 3-prime, size difference: 172
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_101395

  • cd S_TKc 241aa 9e-54 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 230aa 2e-52 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart S_TKc 28aa 7e-04 in ref transcript
  • PTZ PTZ00284 225aa 4e-30 in ref transcript
    • protein kinase; Provisional.

DYRK2

  • refseq_DYRK2.F1 refseq_DYRK2.R1 198 347
  • NCBIGene 36.3 8445
  • Single exon skipping, size difference: 149
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006482

  • Changed! cd S_TKc 237aa 4e-60 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! cd STKc_Cdc7_like 30aa 3e-04 in ref transcript
    • Serine/threonine kinases (STKs), cell division control protein 7 (Cdc7)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The Cdc7-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily include Schizosaccharomyces pombe Cdc7, Saccharomyces cerevisiae Cdc15, Arabidopsis thaliana mitogen-activated protein kinase (MAPK) kinase kinase (MAPKKK) epsilon, and related proteins. MAPKKKs phosphorylate and activate MAPK kinases (MAPKKs or MKKs or MAP2Ks), which in turn phosphorylate and activate MAPKs during signaling cascades that are important in mediating cellular responses to extracellular signals. Fission yeast Cdc7 is essential for cell division by playing a key role in the initiation of septum formation and cytokinesis. Budding yeast Cdc15 functions to coordinate mitotic exit with cytokinesis. Arabidopsis MAPKKK epsilon is required for pollen development in the plasma membrane.
  • Changed! smart S_TKc 304aa 2e-63 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 335aa 6e-31 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

DYRK3

  • refseq_DYRK3.F1 refseq_DYRK3.R1 260 354
  • NCBIGene 36.3 8444
  • Single exon skipping, size difference: 94
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_003582

  • Changed! cd S_TKc 315aa 1e-60 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 304aa 8e-65 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 315aa 4e-28 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

DYX1C1

  • refseq_DYX1C1.F1 refseq_DYX1C1.R1 181 287
  • NCBIGene 36.3 161582
  • Single exon skipping, size difference: 106
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_130810

  • cd p23_DYX1C1_like 78aa 3e-31 in ref transcript
    • p23_like domain found in proteins similar to dyslexia susceptibility 1 (DYX1) candidate 1 (C1) protein, DYX1C1. The human gene encoding this protein is a positional candidate gene for developmental dyslexia (DD), it is located on 15q21.3 by the DYX1 DD susceptibility locus (15q15-21). Independent association studies have reported conflicting results. However, association of short-term memory, which plays a role in DD, with a variant within the DYX1C1 gene has been reported. Most proteins belonging to this group contain a C-terminal tetratricopeptide repeat (TPR) protein binding region.
  • Changed! cd TPR 106aa 5e-05 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • pfam CS 70aa 7e-12 in ref transcript
    • CS domain. The CS and CHORD (pfam04968) are fused into a single polypeptide chain in metazoans but are found in separate proteins in plants; this is thought to be indicative of an interaction between CS and CHORD. It has been suggested that the CS domain is a binding module for HSP90, implying that CS domain-containing proteins are involved in recruiting heat shock proteins to multiprotein assemblies.
  • Changed! TIGR 3a0801s09 121aa 4e-06 in ref transcript
  • Changed! TIGR 3a0801s09 75aa 0.009 in modified transcript

DZIP1

  • refseq_DZIP1.F1 refseq_DZIP1.R1 148 205
  • NCBIGene 36.3 22873
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198968

EBF2

  • refseq_EBF2.F1 refseq_EBF2.R1 109 178
  • NCBIGene 36.2 64641
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_927203

  • cd IPT_COE 85aa 3e-43 in ref transcript
    • IPT domain of the COE family (Col/Olf-1/EBF) of non-basic, helix-loop-helix (HLH)-containing transcription factors. COE family proteins are all transcription factors and play an important role in variety of developmental processes. Mouse EBF is involved in the regulation of the early stages of B-cell differentiation, Drosophila collier is a regulator of the head patterning, and a related protein in Xenopus is involved in primary neurogenesis. All COE family members have a well conserved DNA binding domain that contains an atypical Zn finger motif. The function of the IPT domain is unknown.
  • pfam TIG 83aa 5e-09 in ref transcript
    • IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.

ECM1

  • refseq_ECM1.F2 refseq_ECM1.R2 100 475
  • NCBIGene 36.3 1893
  • Single exon skipping, size difference: 375
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004425

  • Changed! pfam ECM1 279aa 1e-120 in ref transcript
    • Extracellular matrix protein 1 (ECM1). This family consists of several eukaryotic extracellular matrix protein 1 (ECM1) sequences. ECM1 has been shown to regulate endochondral bone formation, stimulate the proliferation of endothelial cells and induce angiogenesis. Mutations in the ECM1 gene can cause lipoid proteinosis, a disorder which causes generalised thickening of skin, mucosae and certain viscera. Classical features include beaded eyelid papules and laryngeal infiltration leading to hoarseness.
  • Changed! pfam ECM1 242aa 1e-97 in ref transcript
  • Changed! pfam ECM1 415aa 0.0 in modified transcript

EFEMP1

  • refseq_EFEMP1.F1 refseq_EFEMP1.R1 165 206
  • NCBIGene 36.3 2202
  • Single exon skipping, size difference: 41
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039348

  • cd vWA_Matrilin 40aa 2e-04 in ref transcript
    • VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
  • cd vWA_Matrilin 42aa 0.001 in ref transcript
  • cd vWA_Matrilin 33aa 0.004 in ref transcript
  • cd vWA_Matrilin 40aa 0.004 in ref transcript
  • pfam EGF_CA 34aa 3e-06 in ref transcript
    • Calcium binding EGF domain.
  • smart EGF_CA 31aa 4e-05 in ref transcript
    • Calcium-binding EGF-like domain.
  • smart EGF_CA 40aa 0.002 in ref transcript

EFNA1

  • refseq_EFNA1.F1 refseq_EFNA1.R1 320 386
  • NCBIGene 36.3 1942
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004428

  • Changed! pfam Ephrin 135aa 4e-61 in ref transcript
    • Ephrin.
  • Changed! pfam Ephrin 119aa 4e-53 in modified transcript

EFNA4

  • refseq_EFNA4.F1 refseq_EFNA4.R1 143 289
  • NCBIGene 36.3 1945
  • Alternative 3-prime, size difference: 146
  • Exclusion of the stop codon
  • Reference transcript: NM_005227

  • pfam Ephrin 131aa 3e-43 in ref transcript
    • Ephrin.

EFS

  • refseq_EFS.F1 refseq_EFS.R1 101 380
  • NCBIGene 36.3 10278
  • Single exon skipping, size difference: 279
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005864

  • Changed! cd SH3 56aa 9e-11 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! smart SH3 57aa 6e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

EGFLAM

  • refseq_EGFLAM.F1 refseq_EGFLAM.R1 199 389
  • NCBIGene 36.3 133584
  • Single exon skipping, size difference: 190
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_152403

  • Changed! cd LamG 154aa 5e-26 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • Changed! cd LamG 138aa 9e-25 in ref transcript
  • Changed! cd LamG 151aa 3e-23 in ref transcript
  • cd FN3 98aa 3e-13 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 93aa 2e-12 in ref transcript
  • Changed! cd EGF_CA 29aa 1e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • Changed! cd EGF_CA 43aa 2e-04 in ref transcript
  • Changed! pfam Laminin_G_2 126aa 6e-29 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • Changed! smart LamG 133aa 4e-27 in ref transcript
    • Laminin G domain.
  • Changed! smart LamG 135aa 2e-22 in ref transcript
  • pfam fn3 87aa 8e-13 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 91aa 1e-09 in ref transcript
  • Changed! pfam EGF 28aa 7e-04 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
  • Changed! pfam EGF 32aa 0.004 in ref transcript
  • Changed! cd LamG 112aa 3e-15 in modified transcript
  • Changed! smart LamG 91aa 2e-12 in modified transcript

EHMT2

  • refseq_EHMT2.F1 refseq_EHMT2.R1 117 219
  • NCBIGene 36.3 10919
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006709

  • cd ANK 120aa 3e-29 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 126aa 2e-28 in ref transcript
  • pfam SET 127aa 2e-36 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • smart PreSET 100aa 4e-27 in ref transcript
    • N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
  • TIGR trp 177aa 1e-08 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • pfam Ank 31aa 4e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • COG COG2940 242aa 3e-20 in ref transcript
    • Proteins containing SET domain [General function prediction only].
  • COG Arp 156aa 1e-17 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 158aa 1e-13 in ref transcript

EI24

  • refseq_EI24.F1 refseq_EI24.R1 159 271
  • NCBIGene 36.3 9538
  • Single exon skipping, size difference: 112
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001007277

  • Changed! pfam EI24 154aa 6e-44 in ref transcript
    • Etoposide-induced protein 2.4 (EI24). This family contains a number of eukaryotic etoposide-induced 2.4 (EI24) proteins approximately 350 residues long. In cells treated with the cytotoxic drug etoposide, EI24 is induced by p53. It has been suggested to play an important role in negative cell growth control.
  • Changed! pfam EI24 212aa 2e-66 in modified transcript

EIF2C3

  • refseq_EIF2C3.F1 refseq_EIF2C3.R1 187 324
  • NCBIGene 36.3 192669
  • Single exon skipping, size difference: 137
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024852

  • Changed! cd Piwi_ago-like 426aa 1e-173 in ref transcript
    • Piwi_ago-like: PIWI domain, Argonaute-like subfamily. Argonaute is the central component of the RNA-induced silencing complex (RISC) and related complexes. The PIWI domain is the C-terminal portion of Argonaute and consists of two subdomains, one of which provides the 5' anchoring of the guide RNA and the other, the catalytic site for slicing.
  • Changed! cd PAZ_argonaute_like 121aa 1e-32 in ref transcript
    • PAZ domain, argonaute_like subfamily. Argonaute is part of the RNA-induced silencing complex (RISC), and is an endonuclease that plays a key role in the RNA interference pathway. The PAZ domain has been named after the proteins Piwi,Argonaut, and Zwille. PAZ is found in two families of proteins that are essential components of RNA-mediated gene-silencing pathways, including RNA interference, the Piwi and Dicer families. PAZ functions as a nucleic acid binding domain, with a strong preference for single-stranded nucleic acids (RNA or DNA) or RNA duplexes with single-stranded 3' overhangs. It has been suggested that the PAZ domain provides a unique mode for the recognition of the two 3'-terminal nucleotides in single-stranded nucleic acids and buries the 3' OH group, and that it might recognize characteristic 3' overhangs in siRNAs within RISC (RNA-induced silencing) and other complexes.
  • Changed! pfam Piwi 302aa 1e-103 in ref transcript
    • Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is the dsRNA guided hydrolysis of ssRNA. Determination of the crystal structure of Argonaute reveals that PIWI is an RNase H domain, and identifies Argonaute as Slicer, the enzyme that cleaves mRNA in the RNAi RISC complex. In addition, Mg+2 dependence and production of 3'-OH and 5' phosphate products are shared characteristics of RNaseH and RISC. The PIWI domain core has a tertiary structure belonging to the RNase H family of enzymes. RNase H fold proteins all have a five-stranded mixed beta-sheet surrounded by helices. By analogy to RNase H enzymes which cleave single-stranded RNA guided by the DNA strand in an RNA/DNA hybrid, the PIWI domain can be inferred to cleave single-stranded RNA, for example mRNA, guided by double stranded siRNA.
  • Changed! pfam PAZ 117aa 4e-32 in ref transcript
    • PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the Carpel factory protein. The function of the domains is unknown but has been suggested to mediate complex formation between proteins of the Piwi and Dicer families by hetero-dimerisation. The three-dimensional structure of this domain has been solved. The PAZ domain is composed of two subdomains. One subdomain is similar to the OB fold, albeit with a different topology. The OB-fold is well known as a single-stranded nucleic acid binding fold. The second subdomain is composed of a beta-hairpin followed by an alpha-helix. The PAZ domains shows low-affinity nucleic acid binding and appears to interact with the 3' ends of single-stranded regions of RNA in the cleft between the two subdomains. PAZ can bind the characteristic two-base 3' overhangs of siRNAs, indicating that although PAZ may not be a primary nucleic acid binding site in Dicer or RISC, it may contribute to the specific and productive incorporation of siRNAs and miRNAs into the RNAi pathway.
  • Changed! pfam DUF1785 53aa 3e-16 in ref transcript
    • Domain of unknown function (DUF1785). This region is found in argonaute proteins and often co-occurs with pfam02179 and pfam02171.

EIF2C3

  • refseq_EIF2C3.F4 refseq_EIF2C3.R4 233 354
  • NCBIGene 36.3 192669
  • Single exon skipping, size difference: 121
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024852

  • Changed! cd Piwi_ago-like 426aa 1e-173 in ref transcript
    • Piwi_ago-like: PIWI domain, Argonaute-like subfamily. Argonaute is the central component of the RNA-induced silencing complex (RISC) and related complexes. The PIWI domain is the C-terminal portion of Argonaute and consists of two subdomains, one of which provides the 5' anchoring of the guide RNA and the other, the catalytic site for slicing.
  • Changed! cd PAZ_argonaute_like 121aa 1e-32 in ref transcript
    • PAZ domain, argonaute_like subfamily. Argonaute is part of the RNA-induced silencing complex (RISC), and is an endonuclease that plays a key role in the RNA interference pathway. The PAZ domain has been named after the proteins Piwi,Argonaut, and Zwille. PAZ is found in two families of proteins that are essential components of RNA-mediated gene-silencing pathways, including RNA interference, the Piwi and Dicer families. PAZ functions as a nucleic acid binding domain, with a strong preference for single-stranded nucleic acids (RNA or DNA) or RNA duplexes with single-stranded 3' overhangs. It has been suggested that the PAZ domain provides a unique mode for the recognition of the two 3'-terminal nucleotides in single-stranded nucleic acids and buries the 3' OH group, and that it might recognize characteristic 3' overhangs in siRNAs within RISC (RNA-induced silencing) and other complexes.
  • Changed! pfam Piwi 302aa 1e-103 in ref transcript
    • Piwi domain. This domain is found in the protein Piwi and its relatives. The function of this domain is the dsRNA guided hydrolysis of ssRNA. Determination of the crystal structure of Argonaute reveals that PIWI is an RNase H domain, and identifies Argonaute as Slicer, the enzyme that cleaves mRNA in the RNAi RISC complex. In addition, Mg+2 dependence and production of 3'-OH and 5' phosphate products are shared characteristics of RNaseH and RISC. The PIWI domain core has a tertiary structure belonging to the RNase H family of enzymes. RNase H fold proteins all have a five-stranded mixed beta-sheet surrounded by helices. By analogy to RNase H enzymes which cleave single-stranded RNA guided by the DNA strand in an RNA/DNA hybrid, the PIWI domain can be inferred to cleave single-stranded RNA, for example mRNA, guided by double stranded siRNA.
  • Changed! pfam PAZ 117aa 4e-32 in ref transcript
    • PAZ domain. This domain is named PAZ after the proteins Piwi Argonaut and Zwille. This domain is found in two families of proteins that are involved in post-transcriptional gene silencing. These are the Piwi family and the Dicer family, that includes the Carpel factory protein. The function of the domains is unknown but has been suggested to mediate complex formation between proteins of the Piwi and Dicer families by hetero-dimerisation. The three-dimensional structure of this domain has been solved. The PAZ domain is composed of two subdomains. One subdomain is similar to the OB fold, albeit with a different topology. The OB-fold is well known as a single-stranded nucleic acid binding fold. The second subdomain is composed of a beta-hairpin followed by an alpha-helix. The PAZ domains shows low-affinity nucleic acid binding and appears to interact with the 3' ends of single-stranded regions of RNA in the cleft between the two subdomains. PAZ can bind the characteristic two-base 3' overhangs of siRNAs, indicating that although PAZ may not be a primary nucleic acid binding site in Dicer or RISC, it may contribute to the specific and productive incorporation of siRNAs and miRNAs into the RNAi pathway.
  • Changed! pfam DUF1785 53aa 3e-16 in ref transcript
    • Domain of unknown function (DUF1785). This region is found in argonaute proteins and often co-occurs with pfam02179 and pfam02171.

EIF3C

  • refseq_EIF3S8.F1 refseq_EIF3S8.R1 189 238
  • NCBIGene 36.3 8663
  • Alternative 5-prime, size difference: 49
  • Exclusion in 5'UTR
  • Reference transcript: NM_001037808

  • pfam eIF-3c_N 389aa 1e-168 in ref transcript
    • Eukaryotic translation initiation factor 3 subunit 8 N-terminus (eIF3c_N). The largest of the mammalian translation initiation factors, eIF3, consists of at least eight subunits ranging in mass from 35 to 170 kDa. eIF3 binds to the 40 S ribosome in an early step of translation initiation and promotes the binding of methionyl-tRNAi and mRNA.
  • pfam eIF-3c_N 146aa 3e-52 in ref transcript
  • smart PAM 89aa 8e-16 in ref transcript
    • PCI/PINT associated module.

EIF3B

  • refseq_EIF3S9.F2 refseq_EIF3S9.R2 323 398
  • NCBIGene 36.3 8662
  • Alternative 3-prime, size difference: 75
  • Inclusion in 3'UTR
  • Reference transcript: NM_003751

  • cd RRM 53aa 3e-06 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam eIF2A 196aa 4e-69 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • smart RRM_2 73aa 1e-08 in ref transcript
    • RNA recognition motif.
  • COG COG5354 459aa 2e-79 in ref transcript
    • Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only].

EIF4G1

  • refseq_EIF4G1.F1 refseq_EIF4G1.R1 132 370
  • NCBIGene 36.3 1981
  • Multiple exon skipping, size difference: 238
  • Exclusion in 5'UTR, Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198241

  • pfam MIF4G 229aa 3e-45 in ref transcript
    • MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
  • pfam MA3 102aa 6e-27 in ref transcript
    • MA3 domain. Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.
  • pfam W2 62aa 5e-16 in ref transcript
    • eIF4-gamma/eIF5/eIF2-epsilon. This domain of unknown function is found at the C-terminus of several translation initiation factors.

ELAVL3

  • refseq_ELAVL3.F1 refseq_ELAVL3.R1 119 140
  • NCBIGene 36.3 1995
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001420

  • cd RRM 75aa 3e-18 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 75aa 1e-16 in ref transcript
  • cd RRM 77aa 1e-12 in ref transcript
  • Changed! TIGR ELAV_HUD_SF 331aa 1e-151 in ref transcript
    • These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
  • COG COG0724 160aa 2e-14 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 81aa 2e-11 in ref transcript
  • Changed! TIGR ELAV_HUD_SF 324aa 1e-150 in modified transcript
  • Changed! COG COG0724 271aa 6e-11 in modified transcript

ELF2

  • refseq_ELF2.F1 refseq_ELF2.R1 137 173
  • NCBIGene 36.3 1998
  • Alternative 3-prime, size difference: 36
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_201999

  • smart ETS 83aa 7e-39 in ref transcript
    • erythroblast transformation specific domain. variation of the helix-turn-helix motif.

ELMO2

  • refseq_ELMO2.F1 refseq_ELMO2.R1 132 236
  • NCBIGene 36.3 63916
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 104
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_133171

  • cd PH_PLC 68aa 1e-04 in ref transcript
    • Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam ELMO_CED12 184aa 3e-49 in ref transcript
    • ELMO/CED-12 family. This family represents a conserved domain which is found in a number of eukaryotic proteins including CED-12, ELMO I and ELMO II. ELMO1 is a component of signalling pathways that regulate phagocytosis and cell migration and is the mammalian orthologue of the Caenorhabditis elegans gene, ced-12. CED-12 is required for the engulfment of dying cells and cell migration. In mammalian cells, ELMO1 interacts with Dock180 as part of the CrkII/Dock180/Rac pathway responsible for phagocytosis and cell migration. ELMO1 is ubiquitously expressed, although its expression is highest in the spleen, an organ rich in immune cells. ELMO1 has a PH domain and a polyproline sequence motif at its C terminus which are not present in this alignment.

EML1

  • refseq_EML1.F1 refseq_EML1.R1 129 186
  • NCBIGene 36.3 2009
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008707

  • cd WD40 389aa 3e-22 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 197aa 2e-09 in ref transcript
  • pfam HELP 78aa 6e-36 in ref transcript
    • HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localise to membranes. A preliminary study has shown that the N terminus of Sea urchin EMAP containing HELP is sufficient for microtubule binding in vitro (Eichenmuller et al In press).
  • pfam WD40 49aa 0.009 in ref transcript
    • WD domain, G-beta repeat.
  • COG COG2319 404aa 2e-16 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

EMR2

  • refseq_EMR2.F2 refseq_EMR2.R2 244 391
  • NCBIGene 36.3 30817
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013447

  • cd EGF_CA 36aa 2e-05 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • Changed! cd EGF_CA 37aa 2e-05 in ref transcript
  • Changed! cd EGF_CA 34aa 5e-05 in ref transcript
  • cd EGF_CA 35aa 0.002 in ref transcript
  • pfam 7tm_2 253aa 9e-46 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • smart GPS 51aa 9e-12 in ref transcript
    • G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • Changed! pfam EGF_CA 48aa 4e-10 in ref transcript
    • Calcium binding EGF domain.
  • Changed! pfam EGF_CA 35aa 1e-09 in ref transcript
  • smart EGF_CA 33aa 3e-06 in ref transcript
    • Calcium-binding EGF-like domain.
  • pfam EGF_CA 51aa 3e-05 in ref transcript
  • Changed! cd EGF_CA 37aa 9e-06 in modified transcript
  • Changed! pfam EGF_CA 35aa 5e-10 in modified transcript

EMR2

  • refseq_EMR2.F3 refseq_EMR2.R3 175 208
  • NCBIGene 36.3 30817
  • Alternative 3-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013447

  • cd EGF_CA 36aa 2e-05 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • cd EGF_CA 37aa 2e-05 in ref transcript
  • cd EGF_CA 34aa 5e-05 in ref transcript
  • cd EGF_CA 35aa 0.002 in ref transcript
  • pfam 7tm_2 253aa 9e-46 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • smart GPS 51aa 9e-12 in ref transcript
    • G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • pfam EGF_CA 48aa 4e-10 in ref transcript
    • Calcium binding EGF domain.
  • pfam EGF_CA 35aa 1e-09 in ref transcript
  • smart EGF_CA 33aa 3e-06 in ref transcript
    • Calcium-binding EGF-like domain.
  • pfam EGF_CA 51aa 3e-05 in ref transcript

EMR3

  • refseq_EMR3.F1 refseq_EMR3.R1 212 253
  • NCBIGene 36.2 84658
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 41
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032571

  • Changed! cd EGF_CA 38aa 3e-05 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • Changed! pfam 7tm_2 253aa 6e-50 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • Changed! pfam GPS 46aa 2e-09 in ref transcript
    • Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • smart EGF_CA 35aa 2e-05 in ref transcript
    • Calcium-binding EGF-like domain.

ENAH

  • refseq_ENAH.F1 refseq_ENAH.R1 133 196
  • NCBIGene 36.3 55740
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008493

  • cd Ena-Vasp 110aa 5e-54 in ref transcript
    • Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
  • pfam WH1 107aa 2e-38 in ref transcript
    • WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homologue. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.
  • pfam VASP_tetra 40aa 3e-13 in ref transcript
    • VASP tetramerisation domain. Vasodilator-stimulated phosphoprotein (VASP) is an actin cytoskeletal regulatory protein. This region corresponds to the tetramerisation domain which forms a right handed alpha helical coiled coil structure.

ENO3

  • refseq_ENO3.F1 refseq_ENO3.R1 109 151
  • NCBIGene 36.3 2027
  • Alternative 5-prime, size difference: 42
  • Exclusion in 5'UTR
  • Reference transcript: NM_001976

  • cd enolase 411aa 0.0 in ref transcript
    • Enolase: Enolases are homodimeric enzymes that catalyse the reversible dehydration of 2-phospho-D-glycerate to phosphoenolpyruvate as part of the glycolytic and gluconeogenesis pathways. The reaction is facilitated by the presence of metal ions.
  • TIGR eno 427aa 1e-161 in ref transcript
    • Alternate name: enolase.
  • PTZ PTZ00081 429aa 0.0 in ref transcript
    • enolase (2-phospho-D-glycerate hydrolase); Provisional.

ENSA

  • refseq_ENSA.F1 refseq_ENSA.R1 239 287
  • NCBIGene 36.3 2029
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207042

  • Changed! pfam Endosulfine 64aa 2e-05 in ref transcript
    • cAMP-regulated phosphoprotein/endosulfine conserved region. Conserved region found in both cAMP-regulated phosphoprotein 19 (ARPP-19) and Alpha/Beta endosulfine. No function has yet been assigned to ARPP-19. Endosulfine is the endogenous ligand for the ATP-dependent potassium (K ATP) channels which occupy a key position in the control of insulin release from the pancreatic beta cell by coupling cell polarity to metabolism. In both cases the region occupies the majority of the protein.
  • Changed! pfam Endosulfine 48aa 1e-06 in modified transcript

ENTPD2

  • refseq_ENTPD2.F1 refseq_ENTPD2.R1 133 202
  • NCBIGene 36.3 954
  • Alternative 3-prime, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203468

  • Changed! pfam GDA1_CD39 419aa 2e-82 in ref transcript
    • GDA1/CD39 (nucleoside phosphatase) family.
  • COG COG5371 212aa 1e-12 in ref transcript
    • Golgi nucleoside diphosphatase [Carbohydrate transport and metabolism / Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam GDA1_CD39 396aa 9e-77 in modified transcript

EPB41

  • refseq_EPB41.F1 refseq_EPB41.R1 204 306
  • NCBIGene 36.3 2035
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203343

  • cd FERM_C 94aa 1e-32 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • smart B41 157aa 1e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • Changed! pfam 4_1_CTD 115aa 3e-26 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • pfam FERM_C 85aa 2e-20 in ref transcript
    • FERM C-terminal PH-like domain.
  • pfam SAB 47aa 5e-14 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
  • pfam FA 45aa 5e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
  • Changed! pfam 4_1_CTD 81aa 7e-29 in modified transcript

EPB41

  • refseq_EPB41.F3 refseq_EPB41.R3 313 393
  • NCBIGene 36.3 2035
  • Single exon skipping, size difference: 80
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_203343

  • Changed! cd FERM_C 94aa 1e-32 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • Changed! smart B41 157aa 1e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • Changed! pfam 4_1_CTD 115aa 3e-26 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • Changed! pfam FERM_C 85aa 2e-20 in ref transcript
    • FERM C-terminal PH-like domain.
  • Changed! pfam SAB 47aa 5e-14 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
  • Changed! pfam FA 45aa 5e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.

EPB41

  • refseq_EPB41.F4 refseq_EPB41.R4 253 358
  • NCBIGene 36.3 2035
  • Single exon skipping, size difference: 105
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_203343

  • cd FERM_C 94aa 1e-32 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • Changed! smart B41 157aa 1e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam 4_1_CTD 115aa 3e-26 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • pfam FERM_C 85aa 2e-20 in ref transcript
    • FERM C-terminal PH-like domain.
  • pfam SAB 47aa 5e-14 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
  • pfam FA 45aa 5e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
  • Changed! smart B41 192aa 2e-55 in modified transcript

EPB41

  • refseq_EPB41.F5 refseq_EPB41.R5 96 113
  • NCBIGene 36.3 2035
  • Alternative 3-prime, size difference: 17
  • Exclusion of the protein initiation site
  • Reference transcript: NM_203343

  • cd FERM_C 94aa 1e-32 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • smart B41 157aa 1e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam 4_1_CTD 115aa 3e-26 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • pfam FERM_C 85aa 2e-20 in ref transcript
    • FERM C-terminal PH-like domain.
  • pfam SAB 47aa 5e-14 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
  • pfam FA 45aa 5e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.

EPB41L1

  • refseq_EPB41L1.F2 refseq_EPB41L1.R2 186 222
  • NCBIGene 36.3 2036
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012156

  • cd FERM_C 94aa 1e-31 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • smart B41 190aa 1e-54 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam 4_1_CTD 93aa 1e-24 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • pfam FERM_C 89aa 1e-22 in ref transcript
    • FERM C-terminal PH-like domain.
  • pfam FA 47aa 9e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
  • Changed! pfam SAB 44aa 4e-09 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
  • Changed! pfam SAB 43aa 2e-08 in modified transcript

EPB41L1

  • refseq_EPB41L1.F3 refseq_EPB41L1.R3 117 201
  • NCBIGene 36.3 2036
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012156

  • cd FERM_C 94aa 1e-31 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • smart B41 190aa 1e-54 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam 4_1_CTD 93aa 1e-24 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • pfam FERM_C 89aa 1e-22 in ref transcript
    • FERM C-terminal PH-like domain.
  • pfam FA 47aa 9e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
  • pfam SAB 44aa 4e-09 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.

EPHA5

  • refseq_EPHA5.F1 refseq_EPHA5.R1 187 253
  • NCBIGene 36.3 2044
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004439

  • cd PTKc_EphR_A 267aa 1e-169 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinases, Class EphA Ephrin Receptors. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; most class EphA receptors including EphA3, EphA4, EphA5, and EphA7, but excluding EphA1, EphA2 and EphA10; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). In general, class EphA receptors bind GPI-anchored ephrin-A ligands. There are ten vertebrate EphA receptors (EphA1-10), which display promiscuous interactions with six ephrin-A ligands. One exception is EphA4, which also binds ephrins-B2/B3. EphRs contain an ephrin-binding domain and two fibronectin repeats extracellularly, a transmembrane segment, and a cytoplasmic tyr kinase domain. Binding of the ephrin ligand to EphR requires cell-cell contact since both are anchored to the plasma membrane. The resulting downstream signals occur bidirectionally in both EphR-expressing cells (forward signaling) and ephrin-expressing cells (reverse signaling). Ephrin/EphR interaction mainly results in cell-cell repulsion or adhesion, making it important in neural development and plasticity, cell morphogenesis, cell-fate determination, embryonic development, tissue patterning, and angiogenesis. EphARs and ephrin-A ligands are expressed in multiple areas of the developing brain, especially in the retina and tectum. They are part of a system controlling retinotectal mapping.
  • cd SAM 60aa 7e-16 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • cd FN3 91aa 9e-14 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 95aa 3e-05 in ref transcript
  • pfam Pkinase_Tyr 258aa 1e-114 in ref transcript
    • Protein tyrosine kinase.
  • pfam Ephrin_lbd 173aa 6e-97 in ref transcript
    • Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand.
  • pfam SAM_1 61aa 6e-19 in ref transcript
    • SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
  • smart FN3 81aa 1e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam fn3 90aa 2e-07 in ref transcript
    • Fibronectin type III domain.
  • COG SPS1 295aa 3e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

EPN2

  • refseq_EPN2.F1 refseq_EPN2.R1 213 384
  • NCBIGene 36.3 22905
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014964

  • cd ENTH_epsin 123aa 2e-61 in ref transcript
    • ENTH domain, Epsin family; The epsin (Eps15 interactor) N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that E/ANTH domains are universal components of the machinery for clathrin-mediated membrane budding.
  • pfam ENTH 124aa 1e-62 in ref transcript
    • ENTH domain. The ENTH (Epsin N-terminal homology) domain is found in proteins involved in endocytosis and cytoskeletal machinery. The function of the ENTH domain is unknown.

EPS8L3

  • refseq_EPS8L3.F1 refseq_EPS8L3.R1 129 219
  • NCBIGene 36.3 79574
  • Alternative 3-prime, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139053

  • cd EPS8 126aa 4e-46 in ref transcript
    • Epidermal growth factor receptor kinase substrate (EPS8) Phosphotyrosine-binding (PTB) domain. EPS8 is a regulator of Rac signaling. It consists of a PTB and an SH3 domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
  • cd SH3 53aa 1e-09 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam PTB 120aa 2e-37 in ref transcript
    • Phosphotyrosine-binding domain. The phosphotyrosine-binding domain (PTB, also phosphotyrosine-interaction or PI domain) in the protein tensin tends to be found at the C-terminus. Tensin is a multi-domain protein that binds to actin filaments and functions as a focal-adhesion molecule (focal adhesions are regions of plasma membrane through which cells attach to the extracellular matrix). Human tensin has actin-binding sites, an SH2 (pfam00017) domain and a region similar to the tumour suppressor PTEN. The PTB domain interacts with the cytoplasmic tails of beta integrin by binding to an NPXY motif.
  • smart SH3 57aa 3e-11 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

EPSTI1

  • refseq_EPSTI1.F2 refseq_EPSTI1.R2 164 196
  • NCBIGene 36.3 94240
  • Single exon skipping, size difference: 32
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001002264

ERBB2IP

  • refseq_ERBB2IP.F1 refseq_ERBB2IP.R1 178 385
  • NCBIGene 36.3 55914
  • Single exon skipping, size difference: 207
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018695

  • cd PDZ_signaling 60aa 3e-09 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd LRR_RI 162aa 2e-04 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! smart PDZ 90aa 4e-10 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG COG4886 348aa 3e-18 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • COG Prc 60aa 4e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • Changed! smart PDZ 90aa 6e-10 in modified transcript

ERG

  • refseq_ERG.F1 refseq_ERG.R1 100 172
  • NCBIGene 36.3 2078
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182918

  • pfam Ets 84aa 1e-36 in ref transcript
    • Ets-domain.
  • smart SAM_PNT 83aa 1e-23 in ref transcript
    • SAM / Pointed domain. A subfamily of the SAM domain.

ERP29

  • refseq_ERP29.F1 refseq_ERP29.R1 187 326
  • NCBIGene 36.3 10961
  • Single exon skipping, size difference: 139
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006817

  • Changed! cd PDI_a_ERp29_N 114aa 3e-46 in ref transcript
    • PDIa family, endoplasmic reticulum protein 29 (ERp29) subfamily; ERp29 is a ubiquitous ER-resident protein expressed in high levels in secretory cells. It forms homodimers and higher oligomers in vitro and in vivo. It contains a redox inactive TRX-like domain at the N-terminus, which is homologous to the redox active TRX (a) domains of PDI, and a C-terminal helical domain similar to the C-terminal domain of P5. The expression profile of ERp29 suggests a role in secretory protein production distinct from that of PDI. It has also been identified as a member of the thyroglobulin folding complex. The Drosophila homolog, Wind, is the product of windbeutel, an essential gene in the development of dorsal-ventral patterning. Wind is required for correct targeting of Pipe, a Golgi-resident type II transmembrane protein with homology to 2-O-sulfotransferase.
  • Changed! cd ERp29c 94aa 3e-20 in ref transcript
    • ERp29 and ERp38, C-terminal domain; composed of the protein disulfide isomerase (PDI)-like proteins ERp29 and ERp38. ERp29 (also called ERp28) is a ubiquitous endoplasmic reticulum (ER)-resident protein expressed in high levels in secretory cells. It contains a redox inactive TRX-like domain at the N-terminus. The expression profile of ERp29 suggests a role in secretory protein production, distinct from that of PDI. It has also been identified as a member of the thyroglobulin folding complex and is essential in regulating the secretion of thyroglobulin. The Drosophila homolog, Wind, is the product of windbeutel, an essential gene in the development of dorsal-ventral patterning. Wind is required for correct targeting of Pipe, a Golgi-resident type II transmembrane protein with homology to 2-O-sulfotransferase. ERp38 is a P5-like protein, first isolated from alfalfa (the cDNA clone was named G1), which contains two redox active TRX domains at the N-terminus, like human P5. However, unlike human P5, ERp38 also contains a C-terminal domain with homology to the C-terminal domain of ERp29. It may be a glucose-regulated protein. The function of the all-helical C-terminal domain of ERp29 and ERp38 remains unclear. The C-terminal domain of Wind is thought to provide a distinct site required for interaction with its substrate, Pipe.
  • Changed! pfam ERp29_N 123aa 1e-48 in ref transcript
    • ERp29, N-terminal domain. ERp29 is a ubiquitously expressed endoplasmic reticulum protein, and is involved in the processes of protein maturation and protein secretion in this organelle. The protein exists as a homodimer, with each monomer being composed of two domains. The N-terminal domain featured in this family is organised into a thioredoxin-like fold that resembles the a domain of human protein disulphide isomerase (PDI). However, this domain lacks the C-X-X-C motif required for the redox function of PDI; it is therefore thought that ERp29's function is similar to the chaperone function of PDI. The N-terminal domain is exclusively responsible for the homodimerisation of the protein, without covalent linkages or additional contacts with other domains.
  • Changed! pfam ERp29 97aa 3e-23 in ref transcript
    • Endoplasmic reticulum protein ERp29, C-terminal domain. ERp29 is a ubiquitously expressed endoplasmic reticulum protein found in mammals. ERp29 is comprised of two domains. This domain, the C-terminal domain, has an all helical fold. ERp29 is thought to form part of the thyroglobulin folding complex.

ESRRG

  • refseq_ESRRG.F1 refseq_ESRRG.R1 218 336
  • NCBIGene 36.3 2104
  • Single exon skipping, size difference: 118
  • Exclusion in 5'UTR
  • Reference transcript: NM_206594

  • cd NR_LBD_ERR 220aa 1e-111 in ref transcript
    • The ligand binding domain of estrogen receptor-related nuclear receptors. The ligand binding domain of estrogen receptor-related receptors (ERRs): The family of estrogen receptor-related receptors (ERRs), a subfamily of nuclear receptors, is closely related to the estrogen receptor (ER) family, but it lacks the ability to bind estrogen. ERRs can interfere with the classic ER-mediated estrogen signaling pathway, positively or negatively. ERRs share target genes, co-regulators and promoters with the estrogen receptor (ER) family. There are three subtypes of ERRs: alpha, beta and gamma. ERRs bind at least two types of DNA sequence, the estrogen response element and another site, originally characterized as SF-1 (steroidogenic factor 1) response element. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, ERR has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam Hormone_recep 181aa 2e-42 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • pfam zf-C4 76aa 9e-37 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.

EVI2A

  • refseq_EVI2A.F2 refseq_EVI2A.R2 144 285
  • NCBIGene 36.3 2123
  • Single exon skipping, size difference: 141
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001003927

  • pfam EVI2A 227aa 1e-104 in ref transcript
    • Ectropic viral integration site 2A protein (EVI2A). This family contains several mammalian ectropic viral integration site 2A (EVI2A) proteins. The function of this protein is unknown although it is thought to be a membrane protein and may function as an oncogene in retrovirus induced myeloid tumours.

EWSR1

  • refseq_EWSR1.F1 refseq_EWSR1.R1 174 393
  • NCBIGene 36.3 2130
  • Multiple exon skipping, size difference: 219
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005243

  • cd RRM 82aa 3e-13 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 81aa 1e-14 in ref transcript
    • RNA recognition motif.
  • pfam zf-RanBP 31aa 2e-07 in ref transcript
    • Zn-finger in Ran binding protein and others.
  • COG COG0724 93aa 2e-07 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

EXO1

  • refseq_EXO1.F1 refseq_EXO1.R1 290 395
  • NCBIGene 36.3 9156
  • Single exon skipping, size difference: 105
  • Exclusion in 5'UTR
  • Reference transcript: NM_130398

  • cd XPG 308aa 1e-73 in ref transcript
    • Xeroderma pigmentosum G N- and I-regions (XPGN, XPGI); contains the HhH2 motif; domain in nucleases. XPG is a eukaryotic enzyme that functions in nucleotide-excision repair and transcription-coupled repair of oxidative DNA damage. Functionally/structurally related to FEN-1; divalent metal ion-dependent exo- and endonuclease, and bacterial and bacteriophage 5'3' exonucleases.
  • smart XPGN 83aa 1e-24 in ref transcript
    • Xeroderma pigmentosum G N-region. domain in nucleases.
  • pfam XPG_I 92aa 1e-23 in ref transcript
    • XPG I-region.
  • TIGR fen_arch 308aa 7e-17 in ref transcript
    • Endonuclease that cleaves the 5'-overhanging flap structure that is generated by displacement synthesis when DNA polymerase encounters the 5'-end of a downstream Okazaki fragment. Has 5'-endo-/exonuclease and 5'-pseudo-Y-endonuclease activities. Cleaves the junction between single and double-stranded regions of flap DNA.
  • PTZ PTZ00217 290aa 2e-27 in ref transcript
    • flap endonuclease-1; Provisional.

EXOC1

  • refseq_EXOC1.F2 refseq_EXOC1.R2 161 317
  • NCBIGene 36.3 55763
  • Alternative 5-prime, size difference: 156
  • Exclusion in 5'UTR
  • Reference transcript: NM_001024924

  • pfam Sec3 703aa 0.0 in ref transcript
    • Exocyst complex component Sec3. This entry is the conserved middle and C-terminus of the Sec3 protein. Sec3 binds to the C-terminal cytoplasmic domain of GLYT1 (glycine transporter protein 1). Sec3 is the exocyst component that is closest to the plasma membrane docking site and it serves as a spatial landmark in the plasma membrane for incoming secretory vesicles. Sec3 is recruited to the sites of polarised membrane growth through its interaction with Rho1p, a small GTP-binding protein.

EXOC7

  • refseq_EXOC7.F2 refseq_EXOC7.R2 198 291
  • NCBIGene 36.3 23265
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001013839

  • pfam Exo70 367aa 2e-74 in ref transcript
    • Exo70 exocyst complex subunit. The Exo70 protein forms one subunit of the exocyst complex. First discovered in Saccharomyces cerevisiae, Exo70 and other exocyst proteins have been observed in several other eukaryotes, including humans. In Saccharomyces cerevisiae, the exocyst complex is involved in the late stages of exocytosis, and is localised at the tip of the bud, the major site of exocytosis in yeast. Exo70 interacts with the Rho3 GTPase. This interaction mediates one of the three known functions of Rho3 in cell polarity: vesicle docking and fusion with the plasma membrane (the other two functions are regulation of actin polarity and transport of exocytic vesicles from the mother cell to the bud). In humans, the functions of Exo70 and the exocyst complex are less well characterised: Exo70 is expressed in several tissues and is thought to also be involved in exocytosis.

EXOSC10

  • refseq_EXOSC10.F1 refseq_EXOSC10.R1 159 234
  • NCBIGene 36.3 5394
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001998

  • cd Rrp6p_like_exo 191aa 1e-108 in ref transcript
    • Yeast Rrp6p and its human homolog, the Polymyositis/scleroderma autoantigen 100kDa (PM/Scl-100), are exosome-associated proteins involved in the degradation and processing of precursors to stable RNAs. Both proteins contain a DEDDy-type DnaQ-like 3'-5' exonuclease domain possessing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific YX(3)D pattern at ExoIII. The motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. PM/Scl-100, an autoantigen present in the nucleolar compartment of the cell, reacts with autoantibodies produced by about 50% of patients with polymyositis-scleroderma overlap syndrome.
  • pfam 3_5_exonuc 169aa 8e-54 in ref transcript
    • 3'-5' exonuclease. This domain is responsible for the 3'-5' exonuclease proofreading activity of Escherichia coli DNA polymerase I (polI) and other enzymes, it catalyses the hydrolysis of unpaired or mismatched nucleotides. This domain consists of the amino-terminal half of the Klenow fragment in Escherichia coli polI it is also found in the Werner syndrome helicase (WRN), focus forming activity 1 protein (FFA-1) and ribonuclease D (RNase D). Werner syndrome is a human genetic disorder causing premature aging; the WRN protein has helicase activity in the 3'-5' direction. The FFA-1 protein is required for formation of a replication foci and also has helicase activity; it is a homologue of the WRN protein. RNase D is a 3'-5' exonuclease involved in tRNA processing. Also found in this family is the autoantigen PM/Scl thought to be involved in polymyositis-scleroderma overlap syndrome.
  • pfam PMC2NT 92aa 1e-26 in ref transcript
    • PMC2NT (NUC016) domain. This domain is found at the N-terminus of 3'-5' exonucleases with HRDC domains, and also in putative exosome components.
  • smart HRDC 81aa 5e-18 in ref transcript
    • Helicase and RNase D C-terminal. Hypothetical role in nucleic acid binding. Mutations in the HRDC domain cause human disease.
  • COG Rnd 290aa 5e-49 in ref transcript
    • Ribonuclease D [Translation, ribosomal structure and biogenesis].

EXOSC3

  • refseq_EXOSC3.F1 refseq_EXOSC3.R1 226 378
  • NCBIGene 36.3 51010
  • Single exon skipping, size difference: 152
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016042

  • Changed! cd S1_Rrp40 86aa 3e-38 in ref transcript
    • S1_Rrp40: Rrp40 S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. Rrp4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In Saccharomyces cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure".
  • Changed! COG RRP4 207aa 2e-24 in ref transcript
    • RNA-binding protein Rrp4 and related proteins (contain S1 domain and KH domain) [Translation, ribosomal structure and biogenesis].
  • Changed! cd S1_Rrp40 52aa 1e-20 in modified transcript
  • Changed! COG RRP4 91aa 1e-07 in modified transcript

EXOSC9

  • refseq_EXOSC9.F1 refseq_EXOSC9.R1 208 259
  • NCBIGene 36.3 5393
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034194

  • pfam RNase_PH 133aa 9e-10 in ref transcript
    • 3' exoribonuclease family, domain 1. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase) contains two tandem copies of the domain. PNPase is involved in mRNA degradation in a 3'-5' direction. The exosome is a 3'-5' exoribonuclease complex that is required for 3' processing of the 5.8S rRNA. Three of its five protein components contain a copy of this domain. A hypothetical protein from S. pombe appears to belong to an uncharacterised subfamily. This subfamily is found in both eukaryotes and archaebacteria.
  • pfam RNase_PH_C 66aa 0.002 in ref transcript
    • 3' exoribonuclease family, domain 2. This family includes 3'-5' exoribonucleases. Ribonuclease PH contains a single copy of this domain, and removes nucleotide residues following the -CCA terminus of tRNA. Polyribonucleotide nucleotidyltransferase (PNPase) contains two tandem copies of the domain. PNPase is involved in mRNA degradation in a 3'-5' direction. The exosome is a 3'-5' exoribonuclease complex that is required for 3' processing of the 5.8S rRNA. Three of its five protein components contain a copy of this domain. A hypothetical protein from S. pombe appears to belong to an uncharacterised subfamily. This subfamily is found in both eukaryotes and archaebacteria.
  • COG COG2123 271aa 1e-55 in ref transcript
    • RNase PH-related exoribonuclease [Translation, ribosomal structure and biogenesis].

EXT2

  • refseq_EXT2.F1 refseq_EXT2.R1 143 296
  • NCBIGene 36.3 2132
  • Multiple exon skipping, size difference: 153
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_000401

  • pfam Glyco_transf_64 246aa 1e-101 in ref transcript
    • Glycosyl transferase family 64 domain. Members of this family catalyse the transfer reaction of N-acetylglucosamine and N-acetylgalactosamine from the respective UDP-sugars to the non-reducing end of [glucuronic acid]beta 1-3[galactose]beta 1-O-naphthalenemethanol, an acceptor substrate analog of the natural common linker of various glycosylaminoglycans. They are also required for the biosynthesis of heparan-sulphate.
  • pfam Exostosin 281aa 5e-54 in ref transcript
    • Exostosin family. The EXT family is a family of tumour suppressor genes. Mutations of EXT1 on 8q24.1, EXT2 on 11p11-13, and EXT3 on 19p have been associated with the autosomal dominant disorder known as hereditary multiple exostoses (HME). This is the most common known skeletal dysplasia. The chromosomal locations of other EXT genes suggest association with other forms of neoplasia. EXT1 and EXT2 have both been shown to encode a heparan sulphate polymerase with both D-glucuronyl (GlcA) and N-acetyl-D-glucosaminoglycan (GlcNAC) transferase activities. The nature of the defect in heparan sulphate biosynthesis in HME is unclear.

EXTL2

  • refseq_EXTL2.F2 refseq_EXTL2.R2 183 219
  • NCBIGene 36.3 2135
  • Alternative 3-prime, size difference: 36
  • Inclusion in 5'UTR
  • Reference transcript: NM_001033025

  • pfam Glyco_transf_64 256aa 3e-86 in ref transcript
    • Glycosyl transferase family 64 domain. Members of this family catalyse the transfer reaction of N-acetylglucosamine and N-acetylgalactosamine from the respective UDP-sugars to the non-reducing end of [glucuronic acid]beta 1-3[galactose]beta 1-O-naphthalenemethanol, an acceptor substrate analog of the natural common linker of various glycosylaminoglycans. They are also required for the biosynthesis of heparan-sulphate.

EYA1

  • refseq_EYA1.F2 refseq_EYA1.R2 187 277
  • NCBIGene 36.3 2138
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000503

  • Changed! TIGR EYA-cons_domain 271aa 1e-126 in ref transcript
    • This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.
  • Changed! TIGR EYA-cons_domain 241aa 1e-107 in modified transcript

EYA1

  • refseq_EYA1.F4 refseq_EYA1.R4 171 299
  • NCBIGene 36.3 2138
  • Single exon skipping, size difference: 128
  • Exclusion of the protein initiation site
  • Reference transcript: NM_172058

  • TIGR EYA-cons_domain 271aa 1e-126 in ref transcript
    • This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.

EYA4

  • refseq_EYA4.F1 refseq_EYA4.R1 113 182
  • NCBIGene 36.3 2070
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172105

  • TIGR EYA-cons_domain 272aa 1e-129 in ref transcript
    • This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.

EZH2

  • refseq_EZH2.F1 refseq_EZH2.R1 156 273
  • NCBIGene 36.3 2146
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004456

  • pfam SET 124aa 3e-36 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • Changed! COG COG2940 208aa 4e-15 in ref transcript
    • Proteins containing SET domain [General function prediction only].
  • Changed! COG COG2940 180aa 7e-15 in modified transcript

F11

  • refseq_F11.F1 refseq_F11.R1 141 301
  • NCBIGene 36.2 2160
  • Single exon skipping, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000128

  • Changed! cd Tryp_SPc 231aa 4e-76 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! cd APPLE_Factor_XI_like 84aa 3e-09 in ref transcript
    • Subfamily of PAN/APPLE-like domains; present in plasma prekallikrein/coagulation factor XI, microneme antigen proteins, and a few prokaryotic proteins. PAN/APPLE domains fulfill diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
  • cd APPLE_Factor_XI_like 85aa 4e-09 in ref transcript
  • Changed! cd APPLE_Factor_XI_like 85aa 9e-09 in ref transcript
  • Changed! cd APPLE_Factor_XI_like 85aa 2e-08 in ref transcript
  • Changed! smart Tryp_SPc 232aa 3e-81 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! smart APPLE 84aa 7e-25 in ref transcript
    • APPLE domain. Four-fold repeat in plasma kallikrein and coagulation factor XI. Factor XI apple 3 mediates binding to platelets. Factor XI apple 1 binds high-molecular-mass kininogen. Apple 4 in factor XI mediates dimer formation and binds to factor XIIa. Mutations in apple 4 cause factor XI deficiency, an inherited bleeding disorder.
  • smart APPLE 84aa 1e-23 in ref transcript
  • Changed! smart APPLE 84aa 6e-23 in ref transcript
  • Changed! smart APPLE 84aa 4e-22 in ref transcript
  • Changed! COG COG5640 239aa 4e-29 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].

F11R

  • refseq_F11R.F1 refseq_F11R.R1 102 162
  • NCBIGene 36.2 50848
  • Alternative 5-prime and 3-prime, size difference: 60
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144503

  • cd IGcam 83aa 1e-09 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! cd IGcam 91aa 2e-05 in ref transcript
  • Changed! smart IG_like 90aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 79aa 3e-07 in ref transcript
  • Changed! cd IGcam 84aa 4e-05 in modified transcript
  • Changed! smart IG_like 84aa 2e-11 in modified transcript

F11R

  • refseq_F11R.F1 refseq_F11R.R4 159 279
  • NCBIGene 36.2 50848
  • Alternative 5-prime and 3-prime, size difference: 60
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_144503

  • cd IGcam 83aa 1e-09 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! cd IGcam 91aa 2e-05 in ref transcript
  • Changed! smart IG_like 90aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 79aa 3e-07 in ref transcript
  • Changed! cd IGcam 71aa 4e-04 in modified transcript
  • Changed! smart IG_like 70aa 3e-07 in modified transcript

F11R

  • refseq_F11R.F4 refseq_F11R.R5 143 177
  • NCBIGene 36.2 50848
  • Alternative 5-prime and 3-prime, size difference: 34
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_144503

  • cd IGcam 83aa 1e-09 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 91aa 2e-05 in ref transcript
  • smart IG_like 90aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 79aa 3e-07 in ref transcript

F11R

  • refseq_F11R.F6 refseq_F11R.R7 115 193
  • NCBIGene 36.2 50848
  • Single exon skipping, size difference: 78
  • Exclusion in 5'UTR
  • Reference transcript: NM_144503

  • cd IGcam 83aa 1e-09 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 91aa 2e-05 in ref transcript
  • smart IG_like 90aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 79aa 3e-07 in ref transcript

BPTF

  • refseq_FALZ.F1 refseq_FALZ.R1 100 478
  • NCBIGene 36.3 2186
  • Multiple exon skipping, size difference: 378
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_182641

  • cd Bromo_gcn5_like 101aa 1e-46 in ref transcript
    • Bromodomain; Gcn5_like subfamily. Gcn5p is a histone acetyltransferase (HAT) which mediates acetylation of histones at lysine residues; such acetylation is generally correlated with the activation of transcription. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BAH_plant_2 27aa 8e-04 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • smart BROMO 102aa 1e-27 in ref transcript
    • bromo domain.
  • pfam DDT 61aa 1e-20 in ref transcript
    • DDT domain. This domain is approximately 60 residues in length, and is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors). It is exclusively associated with nuclear domains, and is thought to be arranged into three alpha helices.
  • pfam PHD 45aa 6e-10 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • smart PHD 47aa 5e-08 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
  • COG COG5076 105aa 8e-14 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].

FAM111A

  • refseq_FAM111A.F1 refseq_FAM111A.R1 122 161
  • NCBIGene 36.3 63901
  • Alternative 3-prime, size difference: 39
  • Inclusion in 5'UTR
  • Reference transcript: NM_198847

FAM119B

  • refseq_FAM119B.F2 refseq_FAM119B.R2 155 294
  • NCBIGene 36.3 25895
  • Single exon skipping, size difference: 139
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_015433

  • Changed! pfam Methyltransf_16 159aa 5e-35 in ref transcript
    • Putative methyltransferase.
  • Changed! COG COG3897 173aa 8e-08 in ref transcript
    • Predicted methyltransferase [General function prediction only].
  • Changed! pfam Methyltransf_16 75aa 8e-12 in modified transcript
  • Changed! COG COG3897 93aa 2e-05 in modified transcript

FAM13C1

  • refseq_FAM13C1.F1 refseq_FAM13C1.R1 106 400
  • NCBIGene 36.3 220965
  • Multiple exon skipping, size difference: 294
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_198215

FAM14B

  • refseq_FAM14B.F1 refseq_FAM14B.R1 174 220
  • NCBIGene 36.3 122509
  • Alternative 3-prime, size difference: 46
  • Inclusion in 5'UTR
  • Reference transcript: NM_206949

FAM19A3

  • refseq_FAM19A3.F2 refseq_FAM19A3.R2 148 216
  • NCBIGene 36.3 284467
  • Alternative 3-prime, size difference: 68
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001004440

FAM19A4

  • refseq_FAM19A4.F1 refseq_FAM19A4.R1 266 348
  • NCBIGene 36.3 151647
  • Alternative 5-prime, size difference: 82
  • Exclusion in 5'UTR
  • Reference transcript: NM_182522

FAM3B

  • refseq_FAM3B.F1 refseq_FAM3B.R1 208 352
  • NCBIGene 36.3 54097
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_058186

FAM3C

  • refseq_FAM3C.F1 refseq_FAM3C.R1 174 214
  • NCBIGene 36.3 10447
  • Alternative 5-prime and 3-prime, size difference: 40
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_014888

FAM48A

  • refseq_FAM48A.F2 refseq_FAM48A.R2 111 215
  • NCBIGene 36.3 55578
  • Single exon skipping, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001014286

FAM48A

  • refseq_FAM48A.F4 refseq_FAM48A.R4 126 205
  • NCBIGene 36.3 55578
  • Alternative 5-prime, size difference: 79
  • Inclusion in 5'UTR
  • Reference transcript: NM_001014286

FAM86A

  • refseq_FAM86A.F1 refseq_FAM86A.R1 109 211
  • NCBIGene 36.3 196483
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201400

  • pfam Methyltransf_16 152aa 1e-12 in ref transcript
    • Putative methyltransferase.
  • COG COG3897 130aa 6e-04 in ref transcript
    • Predicted methyltransferase [General function prediction only].
  • Changed! PRK PRK03910 88aa 0.003 in ref transcript
    • D-cysteine desulfhydrase; Validated.

FAM96A

  • refseq_FAM96A.F2 refseq_FAM96A.R2 137 187
  • NCBIGene 36.3 84191
  • Single exon skipping, size difference: 50
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032231

  • Changed! pfam DUF59 78aa 0.001 in ref transcript
    • Domain of unknown function DUF59. This family includes prokaryotic proteins of unknown function. The family also includes PhaH from Pseudomonas putida. PhaH forms a complex with PhaF, PhaG and PhaI, which hydroxylates phenylacetic acid to 2-hydroxyphenylacetic acid. So members of this family may all be components of ring hydroxylating complexes.
  • Changed! COG COG5133 118aa 1e-24 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam DUF59 59aa 0.005 in modified transcript
  • Changed! COG COG5133 58aa 8e-07 in modified transcript

FANCB

  • refseq_FANCB.F2 refseq_FANCB.R2 142 263
  • NCBIGene 36.3 2187
  • Single exon skipping, size difference: 121
  • Exclusion in 5'UTR
  • Reference transcript: NM_001018113

FBF1

  • refseq_FBF1.F1 refseq_FBF1.R1 163 205
  • NCBIGene 36.2 85302
  • Single exon skipping, size difference: 42
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: XM_932831

  • TIGR SMC_prok_B 283aa 8e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 294aa 1e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

FBLIM1

  • refseq_FBLIM1.F1 refseq_FBLIM1.R1 121 412
  • NCBIGene 36.3 54751
  • Multiple exon skipping, size difference: 291
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001024215

  • pfam LIM 56aa 2e-10 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 56aa 5e-05 in ref transcript

FBLN2

  • refseq_FBLN2.F2 refseq_FBLN2.R2 124 265
  • NCBIGene 36.3 2199
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001004019

  • cd ANATO 61aa 2e-05 in ref transcript
    • Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to repeats in fibulins.
  • cd ANATO 40aa 0.003 in ref transcript
  • cd vWA_Matrilin 44aa 0.005 in ref transcript
    • VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
  • cd EGF_CA 34aa 0.005 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • pfam EGF_CA 49aa 2e-05 in ref transcript
    • Calcium binding EGF domain.
  • smart EGF_CA 43aa 8e-05 in ref transcript
    • Calcium-binding EGF-like domain.
  • pfam EGF_CA 45aa 2e-04 in ref transcript
  • smart EGF_CA 48aa 3e-04 in ref transcript
  • smart EGF_CA 42aa 0.002 in ref transcript
  • pfam ANATO 36aa 0.004 in ref transcript
    • Anaphylotoxin-like domain. C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.
  • Changed! pfam EGF_CA 43aa 0.005 in ref transcript
  • Changed! pfam EGF_CA 43aa 0.005 in modified transcript

FBXL10

  • refseq_FBXL10.F1 refseq_FBXL10.R1 193 307
  • NCBIGene 36.3 84678
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032590

  • pfam JmjC 107aa 6e-09 in ref transcript
    • JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
  • pfam zf-CXXC 31aa 1e-08 in ref transcript
    • CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
  • smart JmjC 76aa 5e-05 in ref transcript
    • A domain family that is part of the cupin metalloenzyme superfamily. Probable enzymes, but of unknown functions, that regulate chromatin reorganisation processes (Clissold and Ponting, in press).
  • pfam F-box 43aa 7e-04 in ref transcript
    • F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.

FBXL5

  • refseq_FBXL5.F1 refseq_FBXL5.R1 267 371
  • NCBIGene 36.3 26234
  • Single exon skipping, size difference: 104
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_012161

  • Changed! pfam F-box 45aa 5e-06 in ref transcript
    • F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
  • Changed! smart LRR_CC 25aa 0.005 in ref transcript
    • Leucine-rich repeat - CC (cysteine-containing) subfamily.

FBXL6

  • refseq_FBXL6.F1 refseq_FBXL6.R1 100 118
  • NCBIGene 36.3 26233
  • Alternative 5-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012162

FBXO21

  • refseq_FBXO21.F1 refseq_FBXO21.R1 119 140
  • NCBIGene 36.3 23014
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033624

  • TIGR yccV 98aa 1e-36 in ref transcript
    • This model describes the small protein from E. coli YccV and its homologs in other Proteobacteria. YccV is now described as a hemimethylated DNA binding protein. The model also describes a domain in longer eukaryotic proteins.
  • COG COG2912 189aa 9e-18 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • COG COG3785 96aa 2e-06 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

FBXO24

  • refseq_FBXO24.F1 refseq_FBXO24.R1 127 169
  • NCBIGene 36.3 26261
  • Alternative 5-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033506

  • pfam F-box 47aa 9e-05 in ref transcript
    • F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
  • COG ATS1 167aa 3e-04 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

FBXO24

  • refseq_FBXO24.F3 refseq_FBXO24.R3 225 347
  • NCBIGene 36.3 26261
  • Single exon skipping, size difference: 122
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_033506

  • pfam F-box 47aa 9e-05 in ref transcript
    • F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
  • Changed! COG ATS1 167aa 3e-04 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

FBXO25

  • refseq_FBXO25.F1 refseq_FBXO25.R1 142 169
  • NCBIGene 36.3 26260
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_183421

FBXO25

  • refseq_FBXO25.F4 refseq_FBXO25.R4 270 320
  • NCBIGene 36.3 26260
  • Single exon skipping, size difference: 50
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_183421

FBXO38

  • refseq_FBXO38.F2 refseq_FBXO38.R2 146 371
  • NCBIGene 36.3 81545
  • Alternative 5-prime, size difference: 225
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_205836

FBXO44

  • refseq_FBXO44.F1 refseq_FBXO44.R1 101 473
  • NCBIGene 36.3 93611
  • Single exon skipping, size difference: 372
  • Exclusion in 5'UTR
  • Reference transcript: NM_001014765

  • pfam FBA 185aa 2e-69 in ref transcript
    • F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. The human F-box only protein 2 is involved in binding to N-glycosylated proteins.

FBXO44

  • refseq_FBXO44.F3 refseq_FBXO44.R3 116 241
  • NCBIGene 36.3 93611
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 125
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001014765

  • Changed! pfam FBA 185aa 2e-69 in ref transcript
    • F-box associated region. Members of this family are associated with F-box domains, hence the name FBA. This domain is probably involved in binding other proteins that will be targeted for ubiquitination. The human F-box only protein 2 is involved in binding to N-glycosylated proteins.
  • Changed! pfam FBA 56aa 2e-13 in modified transcript

FBXW8

  • refseq_FBXW8.F1 refseq_FBXW8.R1 158 356
  • NCBIGene 36.3 26259
  • Alternative 5-prime, size difference: 198
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153348

  • cd WD40 288aa 6e-14 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam F-box 43aa 3e-06 in ref transcript
    • F-box domain. This domain is approximately 50 amino acids long, and is usually found in the N-terminal half of a variety of proteins. Two motifs that are commonly found associated with the F-box domain are the leucine rich repeats (LRRs; pfam00560 and pfam07723) and the WD repeat (pfam00400). The F-box domain has a role in mediating protein-protein interactions in a variety of contexts, such as polyubiquitination, transcription elongation, centromere binding and translational repression.
  • COG COG2319 302aa 6e-12 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

FCAR

  • refseq_FCAR.F4 refseq_FCAR.R4 166 202
  • NCBIGene 36.3 2204
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002000

FCGR2B

  • refseq_FCGR2B.F1 refseq_FCGR2B.R1 152 209
  • NCBIGene 36.3 2213
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004001

  • cd IGcam 54aa 0.002 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • smart IG_like 78aa 4e-06 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 50aa 5e-05 in ref transcript
    • Immunoglobulin C-2 Type.

FCGR2C

  • refseq_FCGR2C.F2 refseq_FCGR2C.R2 161 246
  • NCBIGene 36.2 9103
  • Single exon skipping, size difference: 85
  • Inclusion in 5'UTR
  • Reference transcript: NM_201563

FCN2

  • refseq_FCN2.F1 refseq_FCN2.R1 206 320
  • NCBIGene 36.3 2220
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004108

  • cd FReD 211aa 7e-82 in ref transcript
    • Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
  • smart FBG 212aa 3e-86 in ref transcript
    • Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.

FCN3

  • refseq_FCN3.F1 refseq_FCN3.R1 146 179
  • NCBIGene 36.3 8547
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003665

  • Changed! cd FReD 210aa 7e-77 in ref transcript
    • Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
  • Changed! smart FBG 210aa 3e-74 in ref transcript
    • Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
  • Changed! cd FReD 209aa 1e-75 in modified transcript
  • Changed! smart FBG 209aa 4e-73 in modified transcript

FDXR

  • refseq_FDXR.F2 refseq_FDXR.R2 102 120
  • NCBIGene 36.3 2232
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004110

  • TIGR gltA 158aa 3e-15 in ref transcript
    • This protein is homologous to the small subunit of NADPH and NADH forms of glutamate synthase as found in eukaryotes and some bacteria. This protein is found in numerous species having no homolog of the glutamate synthase large subunit. The prototype of the family, from Pyrococcus sp. KOD1, was shown to be active as a homotetramer and to require NADPH.
  • Changed! PTZ PTZ00188 236aa 1e-44 in ref transcript
    • adrenodoxin reductase; Provisional.
  • Changed! COG GltD 451aa 3e-36 in ref transcript
    • NADPH-dependent glutamate synthase beta chain and related oxidoreductases [Amino acid transport and metabolism / General function prediction only].
  • Changed! TIGR Se_ygfK 380aa 2e-09 in modified transcript
    • Members of this protein family are YgfK, predicted to be one subunit of a three-subunit, molybdopterin-containing selenate reductase. This enzyme is found, typically, in genomic regions associated with xanthine dehydrogenase homologs predicted to belong to the selenium-dependent molybdenum hydroxylases (SDMH). Therefore, the selenate reductase is suggested to play a role in furnishing selenide for SelD, the selenophosphate synthase.
  • Changed! PTZ PTZ00188 230aa 2e-46 in modified transcript
  • Changed! COG GltD 445aa 4e-38 in modified transcript

FECH

  • refseq_FECH.F1 refseq_FECH.R1 100 118
  • NCBIGene 36.3 2235
  • Alternative 5-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012515

  • cd Ferrochelatase_C 138aa 7e-44 in ref transcript
    • Ferrochelatase, C-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus.
  • cd Ferrochelatase_N 162aa 3e-41 in ref transcript
    • Ferrochelatase, N-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus.
  • pfam Ferrochelatase 321aa 1e-122 in ref transcript
    • Ferrochelatase.
  • Changed! PRK hemH 325aa 1e-89 in ref transcript
    • ferrochelatase; Reviewed.
  • Changed! PRK hemH 322aa 9e-89 in modified transcript

FER1L3

  • refseq_FER1L3.F1 refseq_FER1L3.R1 216 255
  • NCBIGene 36.3 26509
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013451

  • cd C2 103aa 3e-15 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd C2_1 114aa 5e-11 in ref transcript
    • Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
  • Changed! cd C2 114aa 1e-10 in ref transcript
  • cd C2 95aa 2e-07 in ref transcript
  • cd C2 103aa 3e-06 in ref transcript
  • cd C2 117aa 1e-04 in ref transcript
  • pfam FerB 76aa 9e-38 in ref transcript
    • FerB (NUC096) domain. This is central domain B in proteins of the Ferlin family.
  • pfam FerI 72aa 5e-32 in ref transcript
    • FerI (NUC094) domain. This domain is present in proteins of the Ferlin family. It is often located between two C2 domains.
  • pfam FerA 66aa 1e-20 in ref transcript
    • FerA (NUC095) domain. This is central domain A in proteins of the Ferlin family.
  • pfam C2 84aa 6e-15 in ref transcript
    • C2 domain.
  • smart DysFN 57aa 3e-13 in ref transcript
    • Dysferlin domain, N-terminal region. Domain of unknown function present in yeast peroxisomal proteins, dysferlin, myoferlin and hypothetical proteins. Due to an insertion of a dysferlin domain within a second dysferlin domain we have chosen to predict these domains in two parts: the N-terminal region and the C-terminal region.
  • pfam C2 89aa 4e-13 in ref transcript
  • pfam C2 97aa 2e-11 in ref transcript
  • smart DysFN 59aa 2e-11 in ref transcript
  • pfam C2 84aa 2e-09 in ref transcript
  • smart C2 94aa 1e-07 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • smart C2 96aa 2e-04 in ref transcript
  • COG COG5038 77aa 8e-07 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
  • COG COG5038 87aa 6e-05 in ref transcript
  • COG COG5038 90aa 0.009 in ref transcript
  • Changed! cd C2 106aa 5e-10 in modified transcript

FGF1

  • refseq_FGF1.F2 refseq_FGF1.R2 288 395
  • NCBIGene 36.3 2246
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 107
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000800

  • Changed! cd FGF 123aa 1e-38 in ref transcript
    • Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
  • Changed! smart FGF 127aa 1e-44 in ref transcript
    • Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
  • Changed! cd FGF 29aa 1e-05 in modified transcript
  • Changed! smart FGF 35aa 1e-06 in modified transcript

FGF5

  • refseq_FGF5.F2 refseq_FGF5.R2 256 360
  • NCBIGene 36.3 2250
  • Single exon skipping, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004464

  • Changed! cd FGF 129aa 6e-37 in ref transcript
    • Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
  • Changed! smart FGF 134aa 4e-44 in ref transcript
    • Acidic and basic fibroblast growth factor family. Mitogens that stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family play essential roles in patterning and differentiation during vertebrate embryogenesis, and have neurotrophic activities.
  • Changed! cd FGF 26aa 0.001 in modified transcript
  • Changed! pfam FGF 37aa 2e-06 in modified transcript
    • Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. They are found in a wide range of organisms, from nematodes to humans. Most share an internal core region of high similarity, conserved residues in which are involved in binding with their receptors. On binding, they cause dimerisation of their tyrosine kinase receptors leading to intracellular signalling. There are currently four known tyrosine kinase receptors for fibroblast growth factors. These receptors can each bind several different members of this family. Members of this family have a beta trefoil structure. Most have N-terminal signal peptides and are secreted. A few lack signal sequences but are secreted anyway; still others also lack the signal peptide but are found on the cell surface and within the extracellular matrix. A third group remain intracellular. They have central roles in development, regulating cell proliferation, migration and differentiation. On the other hand, they are important in tissue repair following injury in adult organisms.

FGFR1OP

  • refseq_FGFR1OP.F1 refseq_FGFR1OP.R1 194 254
  • NCBIGene 36.3 11116
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007045

  • pfam FOP_dimer 81aa 8e-35 in ref transcript
    • FOP N terminal dimerisation domain. Fibroblast growth factor receptor 1 (FGFR1) oncogene partner (FOP) is a centrosomal protein that is involved in anchoring microtubules to subcellular structures. This domain includes a Lis-homology motif. It forms an alpha helical bundle and is involved in dimerisation.

FGL1

  • refseq_FGL1.F2 refseq_FGL1.R2 216 343
  • NCBIGene 36.3 2267
  • Single exon skipping, size difference: 127
  • Exclusion in 5'UTR
  • Reference transcript: NM_201553

  • cd FReD 227aa 1e-83 in ref transcript
    • Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
  • smart FBG 226aa 2e-90 in ref transcript
    • Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.

FGL1

  • refseq_FGL1.F3 refseq_FGL1.R1 114 241
  • NCBIGene 36.3 2267
  • Single exon skipping, size difference: 127
  • Exclusion in 5'UTR
  • Reference transcript: NM_201552

  • cd FReD 227aa 1e-83 in ref transcript
    • Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
  • smart FBG 226aa 2e-90 in ref transcript
    • Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.

FHL2

  • refseq_FHL2.F1 refseq_FHL2.R1 205 377
  • NCBIGene 36.3 2274
  • Multiple exon skipping, size difference: 172
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001450

  • Changed! pfam LIM 56aa 7e-06 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • Changed! smart LIM 38aa 3e-04 in ref transcript
    • Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
  • Changed! pfam LIM 58aa 5e-04 in ref transcript
  • Changed! smart LIM 41aa 0.008 in ref transcript

FHL2

  • refseq_FHL2.F3 refseq_FHL2.R1 123 244
  • NCBIGene 36.3 2274
  • Single exon skipping, size difference: 121
  • Exclusion in 5'UTR
  • Reference transcript: NM_201555

  • pfam LIM 56aa 5e-05 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • smart LIM 38aa 0.001 in ref transcript
    • Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
  • pfam LIM 58aa 0.002 in ref transcript

FHL2

  • refseq_FHL2.F5 refseq_FHL2.R4 187 315
  • NCBIGene 36.3 2274
  • Alternative 5-prime, size difference: 128
  • Exclusion in 5'UTR
  • Reference transcript: NM_201555

  • pfam LIM 56aa 5e-05 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • smart LIM 38aa 0.001 in ref transcript
    • Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.
  • pfam LIM 58aa 0.002 in ref transcript

FIBP

  • refseq_FIBP.F1 refseq_FIBP.R1 116 137
  • NCBIGene 36.3 9158
  • Alternative 5-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198897

  • Changed! pfam FIBP 363aa 1e-162 in ref transcript
    • Acidic fibroblast growth factor binding (FIBP). Acidic fibroblast growth factor (aFGF) intracellular binding protein (FIBP) is a protein found mainly in the nucleus that is thought to be involved in the intracellular function of aFGF.
  • Changed! pfam FIBP 356aa 1e-164 in modified transcript

FKBP1B

  • refseq_FKBP1B.F2 refseq_FKBP1B.R2 144 189
  • NCBIGene 36.3 2281
  • Alternative 3-prime, size difference: 45
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004116

  • Changed! pfam FKBP_C 95aa 6e-33 in ref transcript
    • FKBP-type peptidyl-prolyl cis-trans isomerase.
  • Changed! COG FkpA 104aa 3e-30 in ref transcript
    • FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam FKBP_C 53aa 3e-17 in modified transcript
  • Changed! COG FkpA 66aa 9e-14 in modified transcript

FKBP7

  • refseq_FKBP7.F1 refseq_FKBP7.R1 113 227
  • NCBIGene 36.2 51661
  • Single exon skipping, size difference: 114
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_181342

  • Changed! pfam FKBP_C 98aa 2e-28 in ref transcript
    • FKBP-type peptidyl-prolyl cis-trans isomerase.
  • Changed! COG FkpA 97aa 1e-17 in ref transcript
    • FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam FKBP_C 81aa 3e-20 in modified transcript
  • Changed! COG FkpA 103aa 1e-11 in modified transcript

FKRP

  • refseq_FKRP.F2 refseq_FKRP.R2 173 392
  • NCBIGene 36.3 79147
  • Alternative 5-prime, size difference: 219
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039885

  • pfam LicD 141aa 2e-16 in ref transcript
    • LICD Protein Family. The LICD family of proteins show high sequence similarity and are involved in phosphorylcholine metabolism. There is evidence to show that LicD2 mutants have a reduced ability to take up choline, have decreased ability to adhere to host cells and are less virulent.
  • COG COG3475 48aa 6e-08 in ref transcript
    • LPS biosynthesis protein [Cell envelope biogenesis, outer membrane].
  • PRK ispDF 95aa 0.006 in ref transcript
    • bifunctional 2-C-methyl-D-erythritol 4-phosphate cytidylyltransferase/2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase protein; Provisional.

FLJ12949

  • refseq_FLJ12949.F1 refseq_FLJ12949.R1 125 199
  • NCBIGene 36.2 65095
  • Single exon skipping, size difference: 74
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_023008

  • Changed! pfam Kri1 202aa 5e-52 in ref transcript
    • KRI1-like family. The yeast member of this family (Kri1p) is found to be required for 40S ribosome biogenesis in the nucleolus.

FLJ20105

  • refseq_FLJ20105.F2 refseq_FLJ20105.R2 241 359
  • NCBIGene 36.2 54821
  • Single exon skipping, size difference: 118
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_017669

  • Changed! cd DEXDc 144aa 8e-17 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • Changed! cd HELICc 134aa 3e-14 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! pfam SNF2_N 286aa 3e-60 in ref transcript
    • SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
  • Changed! smart HELICc 84aa 5e-09 in ref transcript
    • helicase superfamily c-terminal domain.
  • Changed! COG HepA 571aa 2e-66 in ref transcript
    • Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].

DEF8

  • refseq_FLJ20186.F1 refseq_FLJ20186.R1 139 264
  • NCBIGene 36.3 54849
  • Alternative 5-prime, size difference: 125
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_207514

  • Changed! cd C1 51aa 0.001 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • Changed! pfam C1_1 54aa 0.002 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.

FLJ21839

  • refseq_FLJ21839.F1 refseq_FLJ21839.R1 200 247
  • NCBIGene 36.2 60509
  • Single exon skipping, size difference: 47
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001035506

  • cd M14_AGBL5_like 167aa 2e-83 in ref transcript
    • Peptidase M14-like domain of ATP/GTP binding protein_like (AGBL)-5, and related proteins. The Peptidase M14 family of metallocarboxypeptidases are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. This eukaryotic subgroup includes the human AGBL5 and the mouse cytosolic carboxypeptidase (CCP)-5. ATP/GTP binding protein (AGTPBP-1/Nna1)-like proteins are active metallopeptidases that are thought to act on cytosolic proteins such as alpha-tubulin, to remove a C-terminal tyrosine. Mutations in AGTPBP-1/Nna1 cause Purkinje cell degeneration (pcd). AGTPBP-1/Nna1 however does not belong to this subgroup. AGTPBP-1/Nna1-like proteins from the different phyla are highly diverse, but they all contain a unique N-terminal conserved domain right before the CP domain. It has been suggested that this N-terminal domain might act as a folding domain.
  • cd M14_AGBL5_like 145aa 8e-65 in ref transcript
  • pfam Peptidase_M14 79aa 1e-16 in ref transcript
    • Zinc carboxypeptidase.

FLJ22222

  • refseq_FLJ22222.F2 refseq_FLJ22222.R2 193 406
  • NCBIGene 36.3 79701
  • Single exon skipping, size difference: 213
  • Exclusion of the stop codon
  • Reference transcript: NM_175902

  • Changed! smart P4Hc 115aa 5e-05 in ref transcript
    • Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
  • Changed! smart P4Hc 145aa 1e-14 in modified transcript

FLJ31033

  • refseq_FLJ31033.F2 refseq_FLJ31033.R2 228 276
  • NCBIGene 36.2 91351
  • Alternative 5-prime, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_930988

  • cd DEXDc 143aa 3e-11 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • cd HELICc 103aa 1e-07 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • pfam DEAD 155aa 3e-15 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • smart HELICc 77aa 8e-09 in ref transcript
    • helicase superfamily c-terminal domain.
  • COG COG4581 123aa 1e-24 in ref transcript
    • Superfamily II RNA helicase [DNA replication, recombination, and repair].
  • COG COG4581 176aa 2e-22 in ref transcript

FLJ31033

  • refseq_FLJ31033.F4 refseq_FLJ31033.R4 111 207
  • NCBIGene 36.2 91351
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_930988

  • Changed! cd DEXDc 143aa 3e-11 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • cd HELICc 103aa 1e-07 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! pfam DEAD 155aa 3e-15 in ref transcript
    • DEAD/DEAH box helicase. Members of this family include the DEAD and DEAH box helicases. Helicases are involved in unwinding nucleic acids. The DEAD box helicases are involved in various aspects of RNA metabolism, including nuclear transcription, pre mRNA splicing, ribosome biogenesis, nucleocytoplasmic transport, translation, RNA decay and organellar gene expression.
  • smart HELICc 77aa 8e-09 in ref transcript
    • helicase superfamily c-terminal domain.
  • Changed! COG COG4581 123aa 1e-24 in ref transcript
    • Superfamily II RNA helicase [DNA replication, recombination, and repair].
  • Changed! COG COG4581 176aa 2e-22 in ref transcript
  • Changed! cd DEXDc 61aa 6e-04 in modified transcript
  • Changed! smart DEXDc 74aa 1e-05 in modified transcript
    • DEAD-like helicases superfamily.
  • Changed! COG COG4581 299aa 9e-25 in modified transcript
  • Changed! COG COG4581 144aa 3e-13 in modified transcript

FLJ44379

  • refseq_FLJ44379.F1 refseq_FLJ44379.R1 311 401
  • NCBIGene 36.2 132203
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_059578

  • Changed! cd S-100 115aa 2e-10 in ref transcript
    • S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence showing that many S100 proteins have multiple binding partners with diverse mode of interaction with different targets. In addition to S100 proteins (such as S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes the ''fused'' gene family, a group of calcium binding S100-related proteins. The ''fused'' gene family includes multifunctional epidermal differentiation proteins - profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally these proteins are associated with keratin intermediate filaments and partially crosslinked to the cell envelope. These ''fused'' gene proteins contain N-terminal sequence with two Ca-binding EF-hands motif, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinary high number of almost perfect peptide repeats with regular array of polar and charged residues similar to many known cell envelope proteins.
  • Changed! cd S-100 85aa 7e-15 in modified transcript

FLRT3

  • refseq_FLRT3.F1 refseq_FLRT3.R1 191 385
  • NCBIGene 36.3 23767
  • Single exon skipping, size difference: 194
  • Exclusion in 5'UTR
  • Reference transcript: NM_198391

  • cd LRR_RI 271aa 1e-04 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • TIGR PCC 78aa 2e-08 in ref transcript
    • Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
  • smart LRRNT 31aa 0.005 in ref transcript
    • Leucine rich repeat N-terminal domain.
  • COG COG4886 241aa 9e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

FMNL3

  • refseq_FMNL3.F1 refseq_FMNL3.R1 127 280
  • NCBIGene 36.3 91010
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_175736

  • pfam FH2 366aa 1e-77 in ref transcript
    • Formin Homology 2 Domain.
  • pfam Drf_FH3 202aa 8e-42 in ref transcript
    • Diaphanous FH3 Domain. This region is found in the Formin-like and and diaphanous proteins.
  • Changed! pfam Drf_GBD 116aa 6e-26 in ref transcript
    • Diaphanous GTPase-binding Domain. This domain is bound to by GTP-attached Rho proteins, leading to activation of the Drf protein.
  • Changed! pfam Drf_GBD 49aa 1e-12 in ref transcript
  • Changed! pfam Drf_GBD 183aa 7e-44 in modified transcript

FMO3

  • refseq_FMO3.F2 refseq_FMO3.R2 96 113
  • NCBIGene 36.3 2328
  • Alternative 5-prime, size difference: 17
  • Exclusion in 5'UTR
  • Reference transcript: NM_001002294

  • pfam FMO-like 531aa 0.0 in ref transcript
    • Flavin-binding monooxygenase-like. This family includes FMO proteins and cyclohexanone monooxygenase.
  • COG TrkA 340aa 4e-34 in ref transcript
    • Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism].

FOLH1

  • refseq_FOLH1.F1 refseq_FOLH1.R1 249 342
  • NCBIGene 36.3 2346
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004476

  • cd PA_GCPII_like 227aa 9e-72 in ref transcript
    • PA_GCPII_like: Protease-associated domain containing protein, glutamate carboxypeptidase II (GCPII)-like. This group contains various PA domain-containing proteins similar to GCPII including, GCPIII (NAALADase2) and NAALADase L. These proteins belong to the peptidase M28 family. GCPII is also known N-acetylated-alpha-linked acidic dipeptidase (NAALDase1), folate hydrolase or prostate-specific membrane antigen (PSMA). GCPII is found in various human tissues including prostate, small intestine, and the central nervous system. In the brain, GCPII is known as NAALDase1, it functions as a NAALDase hydrolyzing the neuropeptide N-acetyl-L-aspartyl-L-glutamate (alpha-NAAG), to release free glutamate. In the small intestine, GCPII releases the terminal glutamate from poly-gamma-glutamated folates. GCPII (PSMA) is a useful cancer marker; its expression is markedly increased in prostate cancer and in tumor-associated neovasculature. GCPIII hydrolyzes alpha-NAAG with a lower efficiency than does GCPII; NAALADase L is not able to hydrolyze alpha-NAAG. The GCPII PA domain (referred to as the apical domain) participates in substrate binding and may act as a protein-protein interaction domain.
  • Changed! pfam TFR_dimer 138aa 8e-48 in ref transcript
    • Transferrin receptor-like dimerisation domain. This domain is involved in dimerisation of the transferrin receptor as shown in its crystal structure.
  • pfam PA 93aa 5e-15 in ref transcript
    • PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
  • pfam Peptidase_M28 85aa 8e-13 in ref transcript
    • Peptidase family M28.
  • Changed! COG Iap 219aa 2e-09 in ref transcript
    • Predicted aminopeptidases [General function prediction only].
  • Changed! pfam TFR_dimer 107aa 8e-26 in modified transcript
  • Changed! COG Iap 238aa 2e-09 in modified transcript

FOLR1

  • refseq_FOLR1.F1 refseq_FOLR1.R1 157 223
  • NCBIGene 36.3 2348
  • Single exon skipping, size difference: 66
  • Exclusion in 5'UTR
  • Reference transcript: NM_016724

  • pfam Folate_rec 185aa 4e-78 in ref transcript
    • Folate receptor family. This family includes the folate receptor which binds to folate and reduced folic acid derivatives and mediates delivery of 5-methyltetrahydrofolate to the interior of cells. These proteins are attached to the membrane by a GPI-anchor. The proteins contain 16 conserved cysteines that form eight disulphide bridges.

FOXI1

  • refseq_FOXI1.F1 refseq_FOXI1.R1 100 385
  • NCBIGene 36.3 2299
  • Alternative 3-prime, size difference: 285
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012188

  • Changed! cd FH 78aa 2e-36 in ref transcript
    • Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
  • Changed! smart FH 89aa 7e-43 in ref transcript
    • FORKHEAD. FORKHEAD, also known as a "winged helix".
  • Changed! COG COG5025 92aa 2e-15 in ref transcript
    • Transcription factor of the Forkhead/HNF3 family [Transcription].
  • Changed! cd FH 69aa 4e-30 in modified transcript
  • Changed! smart FH 69aa 3e-31 in modified transcript
  • Changed! COG COG5025 121aa 1e-12 in modified transcript

FOXM1

  • refseq_FOXM1.F2 refseq_FOXM1.R2 100 145
  • NCBIGene 36.3 2305
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_202002

  • cd FH 76aa 2e-28 in ref transcript
    • Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
  • smart FH 81aa 2e-30 in ref transcript
    • FORKHEAD. FORKHEAD, also known as a "winged helix".
  • COG COG5025 78aa 9e-10 in ref transcript
    • Transcription factor of the Forkhead/HNF3 family [Transcription].

FOXM1

  • refseq_FOXM1.F3 refseq_FOXM1.R3 284 398
  • NCBIGene 36.3 2305
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_202002

  • cd FH 76aa 2e-28 in ref transcript
    • Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
  • smart FH 81aa 2e-30 in ref transcript
    • FORKHEAD. FORKHEAD, also known as a "winged helix".
  • COG COG5025 78aa 9e-10 in ref transcript
    • Transcription factor of the Forkhead/HNF3 family [Transcription].

FRMPD2

  • refseq_FRMPD2.F1 refseq_FRMPD2.R1 169 235
  • NCBIGene 36.3 143162
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001018071

  • cd PDZ_signaling 82aa 1e-12 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd FERM_C 93aa 9e-11 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • cd PDZ_signaling 87aa 8e-10 in ref transcript
  • cd PDZ_signaling 85aa 2e-08 in ref transcript
  • Changed! smart KIND 173aa 6e-39 in ref transcript
    • kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
  • smart B41 205aa 2e-32 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam FERM_C 88aa 2e-25 in ref transcript
    • FERM C-terminal PH-like domain.
  • smart PDZ 88aa 4e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 90aa 3e-13 in ref transcript
  • TIGR degP_htrA_DO 191aa 2e-06 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • smart PDZ 86aa 6e-05 in ref transcript
  • COG Prc 62aa 3e-06 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 68aa 2e-05 in ref transcript
  • Changed! smart KIND 151aa 6e-28 in modified transcript

FSHB

  • refseq_FSHB.F1 refseq_FSHB.R1 144 174
  • NCBIGene 36.3 2488
  • Alternative 5-prime, size difference: 30
  • Exclusion in 5'UTR
  • Reference transcript: NM_000510

  • cd GHB 102aa 5e-34 in ref transcript
    • Glycoprotein hormone beta chain homologues. Gonadotropins; reproductive hormones consisting of two glycosylated chains (alpha and beta) of similar topology with Cysteine-knot motifs.
  • smart GHB 107aa 2e-39 in ref transcript
    • Glycoprotein hormone beta chain homologues. Also called gonadotropins. Glycoprotein hormones consist of two glycosylated chains (alpha and beta) of similar topology.

FST

  • refseq_FST.F1 refseq_FST.R1 134 398
  • NCBIGene 36.3 10468
  • Alternative 3-prime, size difference: 264
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_013409

  • cd FSL_SPARC 56aa 2e-05 in ref transcript
    • Follistatin-like SPARC (secreted protein, acidic, and rich in cysteines) domain; SPARC/BM-40/osteonectin is a multifunctional glycoprotein which modulates cellular interaction with the extracellular matrix by its binding to structural matrix proteins such as collagen and vitronectin. The protein it composed of an N-terminal acidic region, a follistatin (FS) domain and an EF-hand calcium binding domain. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a small hydrophobic core of alpha/beta structure (Kazal domain) and has five disulfide bonds and a conserved N-glycosylation site. The FSL_SPARC domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.
  • cd KAZAL_FS 33aa 5e-04 in ref transcript
    • Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as, trypsin, chyomotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor. Follistatin-like family members include SPARC, also known as, BM-40 or osteonectin, the Gallus gallus Flik protein, as well as, agrin which has a long array of FS domains. The kazal-type inhibitor domain has also been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters) , which may regulate the specificity of anion uptake. The distant homolog, Ascidian trypsin inhibitor, is included in this CD.
  • cd KAZAL_FS 40aa 7e-04 in ref transcript
  • Changed! cd FSL_SPARC 60aa 0.009 in ref transcript
  • smart KAZAL 48aa 7e-07 in ref transcript
    • Kazal type serine protease inhibitors. Kazal type serine protease inhibitors and follistatin-like domains.
  • smart KAZAL 47aa 4e-06 in ref transcript
  • smart KAZAL 48aa 8e-06 in ref transcript
  • pfam PRKCSH 55aa 3e-05 in ref transcript
    • Glucosidase II beta subunit-like protein. The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II, which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown. This family also includes an ER sensor for misfolded glycoproteins and is therefore likely to be a generic sugar binding domain.
  • pfam FOLN 22aa 0.004 in ref transcript
    • Follistatin/Osteonectin-like EGF domain. Members of this family are predominantly found in osteonectin and follistatin and adopt an EGF-like structure.

AKTIP

  • refseq_FTS.F2 refseq_FTS.R2 142 171
  • NCBIGene 36.3 64400
  • Alternative 5-prime, size difference: 29
  • Exclusion in 5'UTR
  • Reference transcript: NM_001012398

  • cd UBCc 135aa 4e-11 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • smart UBCc 142aa 2e-24 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • COG COG5078 140aa 1e-12 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

FTSJ1

  • refseq_FTSJ1.F2 refseq_FTSJ1.R2 254 360
  • NCBIGene 36.3 24140
  • Single exon skipping, size difference: 106
  • Inclusion in 5'UTR
  • Reference transcript: NM_012280

  • pfam FtsJ 179aa 2e-58 in ref transcript
    • FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in complex with its cofactor S-adenosylmethionine revealed that FtsJ has a methyltransferase fold. This family also includes the N terminus of flaviviral NS5 protein. It has been hypothesised that the N-terminal domain of NS5 is a methyltransferase involved in viral RNA capping.
  • COG FtsJ 200aa 1e-56 in ref transcript
    • 23S rRNA methylase [Translation, ribosomal structure and biogenesis].

FUS

  • refseq_FUS.F2 refseq_FUS.R2 155 190
  • NCBIGene 36.2 2521
  • Single exon skipping, size difference: 35
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004960

  • Changed! cd RRM 83aa 2e-11 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM_2 82aa 3e-11 in ref transcript
    • RNA recognition motif.
  • Changed! pfam zf-RanBP 24aa 0.007 in ref transcript
    • Zn-finger in Ran binding protein and others.
  • Changed! COG COG0724 107aa 1e-06 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

FUT8

  • refseq_FUT8.F1 refseq_FUT8.R1 264 366
  • NCBIGene 36.3 2530
  • Single exon skipping, size difference: 102
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_178155

  • cd SH3 53aa 5e-05 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart SH3 54aa 1e-06 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

FUT8

  • refseq_FUT8.F3 refseq_FUT8.R3 186 284
  • NCBIGene 36.3 2530
  • Single exon skipping, size difference: 98
  • Exclusion in 5'UTR
  • Reference transcript: NM_178155

  • cd SH3 53aa 5e-05 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart SH3 54aa 1e-06 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

FXR1

  • refseq_FXR1.F1 refseq_FXR1.R1 103 447
  • NCBIGene 36.3 8087
  • Single exon skipping, size difference: 344
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_005087

  • Changed! cd KH-I 67aa 1e-05 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • Changed! pfam Agenet 60aa 1e-07 in ref transcript
    • Agenet domain. This domain is related to the TUDOR domain pfam00567. The function of the agenet domain is unknown. This family currently only matches one of the two Agenet domains in the FMR proteins.
  • Changed! pfam KH_1 59aa 9e-06 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • Changed! TIGR polynuc_phos 89aa 0.002 in ref transcript
    • Members of this protein family are polyribonucleotide nucleotidyltransferase, also called polynucleotide phosphorylase. Some members have been shown also to have additional functions as guanosine pentaphosphate synthetase and as poly(A) polymerase (see model TIGR02696 for an exception clade, within this family).
  • Changed! PRK PRK11824 59aa 7e-04 in ref transcript
    • polynucleotide phosphorylase/polyadenylase; Provisional.
  • Changed! COG Pnp 156aa 0.002 in ref transcript
    • Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) [Translation, ribosomal structure and biogenesis].

FXR1

  • refseq_FXR1.F3 refseq_FXR1.R3 270 362
  • NCBIGene 36.3 8087
  • Single exon skipping, size difference: 92
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005087

  • cd KH-I 67aa 1e-05 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • pfam Agenet 60aa 1e-07 in ref transcript
    • Agenet domain. This domain is related to the TUDOR domain pfam00567. The function of the agenet domain is unknown. This family currently only matches one of the two Agenet domains in the FMR proteins.
  • pfam KH_1 59aa 9e-06 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • Changed! TIGR polynuc_phos 89aa 0.002 in ref transcript
    • Members of this protein family are polyribonucleotide nucleotidyltransferase, also called polynucleotide phosphorylase. Some members have been shown also to have additional functions as guanosine pentaphosphate synthetase and as poly(A) polymerase (see model TIGR02696 for an exception clade, within this family).
  • PRK PRK11824 59aa 7e-04 in ref transcript
    • polynucleotide phosphorylase/polyadenylase; Provisional.
  • COG Pnp 156aa 0.002 in ref transcript
    • Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) [Translation, ribosomal structure and biogenesis].
  • Changed! TIGR nusA_arch 87aa 0.001 in modified transcript
    • This model represents a family of archaeal proteins found in a single copy per genome. It contains two KH domains (pfam00013) and is most closely related to the central region bacterial NusA, a transcription termination factor named for its iteraction with phage lambda protein N in E. coli. The proteins required for antitermination by N include NusA, NusB, nusE (ribosomal protein S10), and nusG. This system, on the whole, appears not to be present in the Archaea.

FXYD3

  • refseq_FXYD3.F1 refseq_FXYD3.R1 102 186
  • NCBIGene 36.3 5349
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_021910

  • pfam ATP1G1_PLM_MAT8 74aa 2e-15 in ref transcript
    • ATP1G1/PLM/MAT8 family.

FYB

  • refseq_FYB.F1 refseq_FYB.R1 93 231
  • NCBIGene 36.3 2533
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001465

  • cd SH3 54aa 0.003 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart SH3 58aa 5e-05 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

FYTTD1

  • refseq_FYTTD1.F1 refseq_FYTTD1.R1 149 322
  • NCBIGene 36.3 84248
  • Single exon skipping, size difference: 173
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_032288

  • Changed! pfam FYTT 316aa 1e-143 in ref transcript
    • Forty-two-three protein. This family consists of several mammalian proteins of around 320 residues in length called 40-2-3 proteins. The function of this family is unknown.
  • Changed! pfam FYTT 76aa 2e-29 in modified transcript

G3BP1

  • refseq_G3BP.F2 refseq_G3BP.R2 131 162
  • NCBIGene 36.3 10146
  • Alternative 3-prime, size difference: 31
  • Inclusion in 5'UTR
  • Reference transcript: NM_198395

  • cd NTF2 129aa 8e-32 in ref transcript
    • Nuclear transport factor 2 (NTF2) domain plays an important role in the trafficking of macromolecules, ions and small molecules between the cytoplasm and nucleus. This bi-directional transport of macromolecules across the nuclear envelope requires many soluble factors that includes GDP-binding protein Ran (RanGDP). RanGDP is required for both import and export of proteins and poly(A) RNA. RanGDP also has been implicated in cell cycle control, specifically in mitotic spindle assembly. In interphase cells, RanGDP is predominately nuclear and thought to be GTP bound, but it is also present in the cytoplasm, probably in the GDP-bound state. NTF2 mediates the nuclear import of RanGDP. NTF2 binds to both RanGDP and FxFG repeat-containing nucleoporins.
  • cd RRM 70aa 9e-11 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam NTF2 123aa 1e-24 in ref transcript
    • Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins.
  • pfam RRM_1 56aa 1e-10 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
  • COG COG0724 105aa 2e-05 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

GAB1

  • refseq_GAB1.F2 refseq_GAB1.R2 275 365
  • NCBIGene 36.3 2549
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207123

  • cd PH_Gab 108aa 2e-52 in ref transcript
    • Gab (Grb2-associated binder) pleckstrin homology (PH) domain. The Gab subfamily includes several Gab proteins, Drosophila DOS and C. elegans SOC-1. They are scaffolding adaptor proteins, which possess N-terminal PH domains and a C-terminus with proline-rich regions and multiple phosphorylation sites. Following activation of growth factor receptors, Gab proteins are tyrosine phosphorylated and activate PI3K, which generates 3-phosphoinositide lipids. By binding to these lipids via the PH domain, Gab proteins remain in proximity to the receptor, leading to further signaling. While not all Gab proteins depend on the PH domain for recruitment, it is required for Gab activity. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam PH 110aa 8e-17 in ref transcript
    • PH domain. PH stands for pleckstrin homology.

GABBR1

  • refseq_GABBR1.F1 refseq_GABBR1.R1 129 315
  • NCBIGene 36.3 2550
  • Single exon skipping, size difference: 186
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001470

  • cd PBP1_GABAb_receptor 393aa 1e-118 in ref transcript
    • Ligand-binding domain of GABAb receptors, which are metabotropic transmembrane receptors for gamma-aminobutyric acid (GABA). GABA is the major inhibitory neurotransmitter in the mammalian CNS and, like glutamate and other transmitters, acts via both ligand gated ion channels (GABAa receptors) and G-protein coupled receptors (GABAb). GABAa receptors are members of the ionotropic receptor superfamily which includes alpha-adrenergic and glycine receptors. The GABAb receptor is a member of a receptor superfamily which includes the mGlu receptors. The GABAb receptor is coupled to G alpha_i proteins, and activation causes a decrease in calcium, an increase in potassium membrane conductance, and inhibition of cAMP formation. The response is thus inhibitory and leads to hyperpolarization and decreased neurotransmitter release, for example.
  • Changed! cd CCP 53aa 1e-05 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • pfam ANF_receptor 357aa 5e-63 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • pfam 7tm_3 259aa 5e-38 in ref transcript
    • 7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
  • Changed! smart CCP 52aa 7e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • COG LivK 386aa 5e-27 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].

GABBR1

  • refseq_GABBR1.F3 refseq_GABBR1.R3 144 295
  • NCBIGene 36.3 2550
  • Single exon skipping, size difference: 151
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001470

  • cd PBP1_GABAb_receptor 393aa 1e-118 in ref transcript
    • Ligand-binding domain of GABAb receptors, which are metabotropic transmembrane receptors for gamma-aminobutyric acid (GABA). GABA is the major inhibitory neurotransmitter in the mammalian CNS and, like glutamate and other transmitters, acts via both ligand gated ion channels (GABAa receptors) and G-protein coupled receptors (GABAb). GABAa receptors are members of the ionotropic receptor superfamily which includes alpha-adrenergic and glycine receptors. The GABAb receptor is a member of a receptor superfamily which includes the mGlu receptors. The GABAb receptor is coupled to G alpha_i proteins, and activation causes a decrease in calcium, an increase in potassium membrane conductance, and inhibition of cAMP formation. The response is thus inhibitory and leads to hyperpolarization and decreased neurotransmitter release, for example.
  • cd CCP 53aa 1e-05 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • pfam ANF_receptor 357aa 5e-63 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • Changed! pfam 7tm_3 259aa 5e-38 in ref transcript
    • 7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
  • smart CCP 52aa 7e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • COG LivK 386aa 5e-27 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].

GABPB2

  • refseq_GABPB2.F1 refseq_GABPB2.R1 135 159
  • NCBIGene 36.3 2553
  • Single exon skipping, size difference: 24
  • Inclusion in 5'UTR
  • Reference transcript: NM_005254

  • cd ANK 123aa 4e-29 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Ank 30aa 1e-07 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • TIGR trp 156aa 1e-06 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • COG Arp 132aa 5e-14 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

GABPB2

  • refseq_GABPB2.F3 refseq_GABPB2.R3 199 235
  • NCBIGene 36.3 2553
  • Alternative 5-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005254

  • cd ANK 123aa 4e-29 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Ank 30aa 1e-07 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • TIGR trp 156aa 1e-06 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • COG Arp 132aa 5e-14 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

GABRB2

  • refseq_GABRB2.F1 refseq_GABRB2.R1 285 399
  • NCBIGene 36.3 2561
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021911

  • Changed! TIGR LIC 502aa 1e-129 in ref transcript
    • selective while glycine receptors are anion selective).
  • Changed! TIGR LIC 464aa 1e-131 in modified transcript

GABRG2

  • refseq_GABRG2.F1 refseq_GABRG2.R1 101 125
  • NCBIGene 36.3 2566
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198904

  • Changed! TIGR LIC 404aa 2e-73 in ref transcript
    • selective while glycine receptors are anion selective).
  • Changed! TIGR LIC 396aa 7e-74 in modified transcript

GABRG2

  • refseq_GABRG2.F2 refseq_GABRG2.R2 127 189
  • NCBIGene 36.3 2566
  • Alternative 3-prime, size difference: 62
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_198904

  • Changed! TIGR LIC 404aa 2e-73 in ref transcript
    • selective while glycine receptors are anion selective).
  • Changed! pfam Neur_chan_LBD 41aa 2e-05 in modified transcript
    • Neurotransmitter-gated ion-channel ligand binding domain. This family is the extracellular ligand binding domain of these ion channels. This domain forms a pentameric arrangement in the known structure.

GANAB

  • refseq_GANAB.F1 refseq_GANAB.R1 169 235
  • NCBIGene 36.3 23193
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198335

  • cd GH31_GANC_GANAB_alpha 339aa 0.0 in ref transcript
    • This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
  • pfam Glyco_hydro_31 446aa 1e-171 in ref transcript
    • Glycosyl hydrolases family 31. Glycosyl hydrolases are key enzymes of carbohydrate metabolism. Family 31 comprises of enzymes that are, or similar to, alpha- galactosidases.
  • COG COG1501 634aa 1e-147 in ref transcript
    • Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism].

GARNL1

  • refseq_GARNL1.F1 refseq_GARNL1.R1 121 152
  • NCBIGene 36.3 253959
  • Single exon skipping, size difference: 31
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_194301

  • pfam Rap_GAP 180aa 2e-19 in ref transcript
    • Rap/ran-GAP.

GCDH

  • refseq_GCDH.F1 refseq_GCDH.R1 160 392
  • NCBIGene 36.3 2639
  • Alternative 3-prime, size difference: 232
  • Exclusion of the stop codon
  • Reference transcript: NM_000159

  • Changed! cd GCD 387aa 0.0 in ref transcript
    • Glutaryl-CoA dehydrogenase (GCD). GCD is an acyl-CoA dehydrogenase, which catalyzes the oxidative decarboxylation of glutaryl-CoA to crotonyl-CoA and carbon dioxide in the catabolism of lysine, hydroxylysine, and tryptophan. It uses electron transfer flavoprotein (ETF) as an electron acceptor. GCD is a homotetramer. GCD deficiency leads to a severe neurological disorder in humans.
  • pfam Acyl-CoA_dh_N 112aa 4e-32 in ref transcript
    • Acyl-CoA dehydrogenase, N-terminal domain. The N-terminal domain of Acyl-CoA dehydrogenase is an all-alpha domain.
  • Changed! TIGR cyc_hxne_CoA_dh 367aa 1e-25 in ref transcript
    • Cyclohex-1-ene-1carboxyl-CoA is an intermediate in the anaerobic degradation of benzoyl-CoA derived from varioius aromatic compounds, in Rhodopseudomonas palustris but not Thauera aromatica. The aliphatic compound cyclohexanecarboxylate, can be converted to the same intermediate in two steps. The first step is its ligation to coenzyme A. The second is the action of this enzyme, cyclohexanecarboxyl-CoA dehydrogenase.
  • Changed! COG CaiA 375aa 2e-70 in ref transcript
    • Acyl-CoA dehydrogenases [Lipid metabolism].
  • Changed! cd GCD 367aa 1e-180 in modified transcript
  • Changed! TIGR cyc_hxne_CoA_dh 300aa 1e-25 in modified transcript
  • Changed! COG CaiA 355aa 1e-65 in modified transcript

GCET2

  • refseq_GCET2.F2 refseq_GCET2.R2 318 485
  • NCBIGene 36.3 257144
  • Alternative 3-prime, size difference: 167
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_152785

GCH1

  • refseq_GCH1.F1 refseq_GCH1.R1 101 375
  • NCBIGene 36.3 2643
  • Alternative 3-prime, size difference: 274
  • Exclusion of the stop codon
  • Reference transcript: NM_000161

  • Changed! cd GTP_cyclohydro1 168aa 7e-83 in ref transcript
    • GTP cyclohydrolase I (GTP-CH-I) catalyzes the conversion of GTP into dihydroneopterin triphosphate. The enzyme product is the precursor of tetrahydrofolate in eubacteria, fungi, and plants and of the folate analogs in methanogenic bacteria. In vertebrates and insects it is the biosynthtic precursor of tetrahydrobiopterin (BH4) which is involved in the formation of catacholamines, nitric oxide, and the stimulation of T lymphocytes. The biosynthetic reaction of BH4 is controlled by a regulatory protein GFRP which mediates feedback inhibition of GTP-CH-I by BH4. This inhibition is reversed by phenylalanine. The decameric GTP-CH-I forms a complex with two pentameric GFRP in the presence of phenylalanine or a combination of GTP and BH4, respectively.
  • Changed! TIGR folE 168aa 1e-73 in ref transcript
    • GTP cyclohydrolase I (EC 3.5.4.16) catalyzes the biosynthesis of formic acid and dihydroneopterin triphosphate from GTP. This reaction is the first step in the biosynthesis of tetrahydrofolate in prokaryotes, of tetrahydrobiopterin in vertebrates, and of pteridine-containing pigments in insects.
  • Changed! PRK folE 168aa 4e-81 in ref transcript
    • GTP cyclohydrolase I; Provisional.
  • Changed! cd GTP_cyclohydro1 127aa 9e-62 in modified transcript
  • Changed! TIGR folE 127aa 2e-53 in modified transcript
  • Changed! PRK folE 127aa 3e-59 in modified transcript
  • Changed! PRK PRK01297 54aa 0.010 in modified transcript
    • ATP-dependent RNA helicase RhlB; Provisional.

GEM

  • refseq_GEM.F2 refseq_GEM.R2 110 225
  • NCBIGene 36.3 2669
  • Alternative 5-prime, size difference: 115
  • Exclusion in 5'UTR
  • Reference transcript: NM_005261

  • cd RGK 221aa 1e-99 in ref transcript
    • RGK subfamily. The RGK (Rem, Rem2, Rad, Gem/Kir) subfamily of Ras GTPases are expressed in a tissue-specific manner and are dynamically regulated by transcriptional and posttranscriptional mechanisms in response to environmental cues. RGK proteins bind to the beta subunit of L-type calcium channels, causing functional down-regulation of these voltage-dependent calcium channels, and either termination of calcium-dependent secretion or modulation of electrical conduction and contractile function. Inhibition of L-type calcium channels by Rem2 may provide a mechanism for modulating calcium-triggered exocytosis in hormone-secreting cells, and has been proposed to influence the secretion of insulin in pancreatic beta cells. RGK proteins also interact with and inhibit the Rho/Rho kinase pathway to modulate remodeling of the cytoskeleton. Two characteristics of RGK proteins cited in the literature are N-terminal and C-terminal extensions beyond the GTPase domain typical of Ras superfamily members. The N-terminal extension is not conserved among family members; the C-terminal extension is reported to be conserved among the family and lack the CaaX prenylation motif typical of membrane-associated Ras proteins. However, a putative CaaX motif has been identified in the alignment of the C-terminal residues of this CD.
  • smart RAS 165aa 2e-34 in ref transcript
    • Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
  • COG COG1100 191aa 1e-13 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

GEMIN7

  • refseq_GEMIN7.F2 refseq_GEMIN7.R2 272 395
  • NCBIGene 36.3 79760
  • Single exon skipping, size difference: 123
  • Exclusion in 5'UTR
  • Reference transcript: NM_024707

GFM2

  • refseq_GFM2.F1 refseq_GFM2.R1 168 309
  • NCBIGene 36.3 84340
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032380

  • cd EF-G 281aa 1e-125 in ref transcript
    • Elongation factor G (EF-G) subfamily. Translocation is mediated by EF-G (also called translocase). The structure of EF-G closely resembles that of the complex between EF-Tu and tRNA. This is an example of molecular mimicry; a protein domain evolved so that it mimics the shape of a tRNA molecule. EF-G in the GTP form binds to the ribosome, primarily through the interaction of its EF-Tu-like domain with the 50S subunit. The binding of EF-G to the ribosome in this manner stimulates the GTPase activity of EF-G. On GTP hydrolysis, EF-G undergoes a conformational change that forces its arm deeper into the A site on the 30S subunit. To accommodate this domain, the peptidyl-tRNA in the A site moves to the P site, carrying the mRNA and the deacylated tRNA with it. The ribosome may be prepared for these rearrangements by the initial binding of EF-G as well. The dissociation of EF-G leaves the ribosome ready to accept the next aminoacyl-tRNA into the A site. This group contains both eukaryotic and bacterial members.
  • cd mtEFG2_like_IV 121aa 1e-44 in ref transcript
    • mtEF-G2 domain IV. This subfamily is a part the of mitochondrial transcriptional elongation factor, mtEF-G2. Mitochondrial translation is crucial for maintaining mitochondrial function and mutations in this system lead to a breakdown in the respiratory chain-oxidative phosphorylation system and to impaired maintenance of mitochondrial DNA. In complex with GTP, EF-G promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site of the small subunit of ribosome and the mRNA is shifted one codon relative to the ribosome.
  • Changed! cd mtEFG2_II_like 82aa 3e-35 in ref transcript
    • mtEFG2_C: C-terminus of mitochondrial Elongation factor G2 (mtEFG2)-like proteins found in eukaryotes. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. Eukaryotic EF-2 operates in the cytosolic protein synthesis machinery of eukaryotes, EF-Gs in protein synthesis in bacteria. Eukaryotic mtEFG1 proteins show significant homology to bacterial EF-Gs. No clear phenotype has been found for mutants in the yeast homologue of mtEFG2, MEF2. There are two forms of mtEFG present in mammals (designated mtEFG1s and mtEFG2s) mtEFG1s are not present in this group.
  • cd EFG_mtEFG_C 77aa 1e-21 in ref transcript
    • EFG_mtEFG_C: domains similar to the C-terminal domain of the bacterial translational elongation factor (EF) EF-G. Included in this group is the C-terminus of mitochondrial Elongation factor G1 (mtEFG1) and G2 (mtEFG2) proteins. Eukaryotic cells harbor 2 protein synthesis systems: one localized in the cytoplasm, the other in the mitochondria. Most factors regulating mitochondrial protein synthesis are encoded by nuclear genes, translated in the cytoplasm, and then transported to the mitochondria. The eukaryotic system of elongation factor (EF) components is more complex than that in prokaryotes, with both cytoplasmic and mitochondrial elongation factors and multiple isoforms being expressed in certain species. During the process of peptide synthesis and tRNA site changes, the ribosome is moved along the mRNA a distance equal to one codon with the addition of each amino acid. In bacteria this translocation step is catalyzed by EF-G_GTP, which is hydrolyzed to provide the required energy. Thus, this action releases the uncharged tRNA from the P site and transfers the newly formed peptidyl-tRNA from the A site to the P site. Eukaryotic mtEFG1 proteins show significant homology to bacterial EF-Gs. Mutants in yeast mtEFG1 have impaired mitochondrial protein synthesis, respiratory defects and a tendency to lose mitochondrial DNA. No clear phenotype has been found for mutants in the yeast homologue of mtEFG2, MEF2.
  • Changed! TIGR EF-G 695aa 1e-157 in ref transcript
    • After peptide bond formation, this elongation factor of bacteria and organelles catalyzes the translocation of the tRNA-mRNA complex, with its attached nascent polypeptide chain, from the A-site to the P-site of the ribosome. Every completed bacterial genome has at least one copy, but some species have additional EF-G-like proteins. The closest homolog to canonical (e.g. E. coli) EF-G in the spirochetes clusters as if it is derived from mitochondrial forms, while a more distant second copy is also present. Synechocystis PCC6803 has a few proteins more closely related to EF-G than to any other characterized protein. Two of these resemble E. coli EF-G more closely than does the best match from the spirochetes; it may be that both function as authentic EF-G.
  • Changed! PRK PRK12739 712aa 0.0 in ref transcript
    • elongation factor G; Reviewed.
  • Changed! cd mtEFG2_II_like 41aa 1e-13 in modified transcript
  • Changed! TIGR EF-G 289aa 4e-89 in modified transcript
  • Changed! TIGR EF-G 353aa 6e-55 in modified transcript
  • Changed! COG FusA 663aa 1e-132 in modified transcript
    • Translation elongation factors (GTPases) [Translation, ribosomal structure and biogenesis].

GFRA4

  • refseq_GFRA4.F2 refseq_GFRA4.R2 288 378
  • NCBIGene 36.3 64096
  • Intron retention, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_145762

  • Changed! pfam GDNF 80aa 0.001 in ref transcript
    • GDNF/GAS1 domain. This cysteine rich domain is found in multiple copies in GNDF and GAS1 proteins. GDNF and neurturin (NTN) receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuronal survival by signaling through similar multicomponent receptors that consist of a common receptor tyrosine kinase and a member of a GPI-linked family of receptors that determines ligand specificity.
  • Changed! pfam GDNF 60aa 7e-04 in modified transcript

GGA1

  • refseq_GGA1.F1 refseq_GGA1.R1 100 361
  • NCBIGene 36.3 26088
  • Multiple exon skipping, size difference: 261
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013365

  • cd VHS_GGA 139aa 3e-64 in ref transcript
    • VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
  • pfam VHS 135aa 1e-38 in ref transcript
    • VHS domain. Domain present in VPS-27, Hrs and STAM.
  • Changed! pfam GAT 97aa 6e-27 in ref transcript
    • GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilises membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins.
  • pfam Alpha_adaptinC2 124aa 7e-24 in ref transcript
    • Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found in alpha, beta and gamma adaptins.
  • Changed! pfam GAT 73aa 2e-16 in modified transcript

GGA3

  • refseq_GGA3.F1 refseq_GGA3.R1 127 226
  • NCBIGene 36.3 23163
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138619

  • Changed! cd VHS_GGA 139aa 9e-69 in ref transcript
    • VHS domain family, GGA subfamily; GGA (Golgi-localized, Gamma-ear-containing, Arf-binding) comprise a subfamily of ubiquitously expressed, monomeric, motif-binding cargo/clathrin adaptor proteins. The VHS domain has a superhelical structure similar to the structure of the ARM (Armadillo) repeats and is present at the N-termini of proteins. GGA proteins have a multidomain structure consisting of an N-terminal VHS domain linked by a short proline-rich linker to a GAT (GGA and TOM) domain, which is followed by a long flexible linker to the C-terminal appendage, GAE (gamma-adaptin ear) domain. The VHS domain of GGA proteins binds to the acidic-cluster dileucine (DxxLL) motif found on the cytoplasmic tails of cargo proteins trafficked between the trans-Golgi network and the endosomal system.
  • Changed! pfam VHS 135aa 3e-48 in ref transcript
    • VHS domain. Domain present in VPS-27, Hrs and STAM.
  • pfam GAT 95aa 4e-30 in ref transcript
    • GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilises membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins.
  • pfam Alpha_adaptinC2 124aa 3e-26 in ref transcript
    • Adaptin C-terminal domain. Alpha adaptin is a heterotetramer which regulates clathrin-bud formation. The carboxyl-terminal appendage of the alpha subunit regulates translocation of endocytic accessory proteins to the bud site. This ig-fold domain is found in alpha, beta and gamma adaptins.
  • Changed! cd VHS_GGA 106aa 4e-43 in modified transcript
  • Changed! smart VHS 132aa 5e-29 in modified transcript
    • Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.

GGPS1

  • refseq_GGPS1.F1 refseq_GGPS1.R1 235 328
  • NCBIGene 36.3 9453
  • Single exon skipping, size difference: 93
  • Exclusion of the protein initiation site
  • Reference transcript: NM_004837

  • Changed! cd Trans_IPPS_HT 212aa 4e-52 in ref transcript
    • Trans-Isoprenyl Diphosphate Synthases (Trans_IPPS), head-to-tail (HT) (1'-4) condensation reactions. This CD includes all-trans (E)-isoprenyl diphosphate synthases which synthesis various chain length (C10, C15, C20, C25, C30, C35, C40, C45, and C50) linear isoprenyl diphosphates from precursors, isopentenyl diphosphate (IPP) and dimethylallyl diphosphate (DMAPP). They catalyze the successive 1'-4 condensation of the 5-carbon IPP to allylic substrates geranyl-, farnesyl-, or geranylgeranyl-diphosphate. Isoprenoid chain elongation reactions proceed via electrophilic alkylations in which a new carbon-carbon single bond is generated through interaction between a highly reactive electron-deficient allylic carbocation and an electron-rich carbon-carbon double bond. The catalytic site consists of a large central cavity formed by mostly antiparallel alpha helices with two aspartate-rich regions (DDXX(XX)D) located on opposite walls. These residues mediate binding of prenyl phosphates via bridging Mg2+ ions, inducing proposed conformational changes that close the active site to solvent, protecting and stabilizing reactive carbocation intermediates. Farnesyl diphosphate synthases produce the precursors of steroids, cholesterol, sesquiterpenes, farnsylated proteins, heme, and vitamin K12; and geranylgeranyl diphosphate and longer chain synthases produce the precursors of carotenoids, retinoids, diterpenes, geranylgeranylated chlorophylls, ubiquinone, and archaeal ether linked lipids. Isoprenyl diphosphate synthases are widely distributed among archaea, bacteria, and eukareya.
  • Changed! pfam polyprenyl_synt 251aa 4e-43 in ref transcript
    • Polyprenyl synthetase.
  • Changed! COG IspA 277aa 1e-38 in ref transcript
    • Geranylgeranyl pyrophosphate synthase [Coenzyme metabolism].
  • Changed! cd Trans_IPPS_HT 167aa 2e-43 in modified transcript
  • Changed! pfam polyprenyl_synt 208aa 5e-36 in modified transcript
  • Changed! COG IspA 233aa 8e-34 in modified transcript

GGT1

  • refseq_GGT1.F1 refseq_GGT1.R1 364 472
  • NCBIGene 36.3 2678
  • Alternative 3-prime, size difference: 108
  • Inclusion in 5'UTR
  • Reference transcript: NM_001032364

  • pfam G_glu_transpept 508aa 1e-174 in ref transcript
    • Gamma-glutamyltranspeptidase.
  • COG Ggt 543aa 1e-124 in ref transcript
    • Gamma-glutamyltransferase [Amino acid transport and metabolism].

GH2

  • refseq_GH2.F1 refseq_GH2.R1 207 252
  • NCBIGene 36.3 2689
  • Alternative 3-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022557

  • Changed! pfam Hormone_1 131aa 3e-26 in ref transcript
    • Somatotropin hormone family.
  • Changed! pfam Hormone_1 116aa 8e-23 in modified transcript

GIMAP6

  • refseq_GIMAP6.F1 refseq_GIMAP6.R1 100 324
  • NCBIGene 36.3 474344
  • Alternative 3-prime, size difference: 224
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024711

  • Changed! cd AIG1 196aa 1e-78 in ref transcript
    • AIG1 (avrRpt2-induced gene 1). This represents Arabidoposis protein AIG1 that appears to be involved in plant resistance to bacteria. The Arabidopsis disease resistance gene RPS2 is involved in recognition of bacterial pathogens carrying the avirulence gene avrRpt2. AIG1 exhibits RPS2- and avrRpt1-dependent induction early after infection with Pseudomonas syringae carrying avrRpt2. This subfamily also includes IAN-4 protein, which has GTP-binding activity and shares sequence homology with a novel family of putative GTP-binding proteins: the immuno-associated nucleotide (IAN) family. The evolutionary conservation of the IAN family provides a unique example of a plant pathogen response gene conserved in animals. The IAN/IMAP subfamily has been proposed to regulate apoptosis in vertebrates and angiosperm plants, particularly in relation to cancer, diabetes, and infections. The human IAN genes were renamed GIMAP (GTPase of the immunity associated proteins).
  • Changed! pfam AIG1 209aa 4e-73 in ref transcript
    • AIG1 family. Arabidopsis protein AIG1 appears to be involved in plant resistance to bacteria.
  • Changed! PRK rbgA 87aa 3e-04 in ref transcript
    • ribosomal biogenesis GTPase; Reviewed.

GIPC1

  • refseq_GIPC1.F1 refseq_GIPC1.R1 100 418
  • NCBIGene 36.3 10755
  • Single exon skipping, size difference: 318
  • Exclusion of the protein initiation site
  • Reference transcript: NM_202468

  • cd PDZ_signaling 79aa 8e-08 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 82aa 5e-09 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.

GIT2

  • refseq_GIT2.F1 refseq_GIT2.R1 249 339
  • NCBIGene 36.3 9815
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_057169

  • cd ANK 112aa 1e-11 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart ArfGap 119aa 4e-39 in ref transcript
    • Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • pfam GIT_SHD 31aa 1e-07 in ref transcript
    • Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
  • pfam GIT_SHD 22aa 7e-04 in ref transcript
  • COG COG5347 115aa 2e-15 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
  • PTZ PTZ00322 102aa 2e-06 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

GIT2

  • refseq_GIT2.F3 refseq_GIT2.R2 101 485
  • NCBIGene 36.3 9815
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 384
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_057169

  • cd ANK 112aa 1e-11 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart ArfGap 119aa 4e-39 in ref transcript
    • Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • pfam GIT_SHD 31aa 1e-07 in ref transcript
    • Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
  • pfam GIT_SHD 22aa 7e-04 in ref transcript
  • COG COG5347 115aa 2e-15 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
  • PTZ PTZ00322 102aa 2e-06 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

GIT2

  • refseq_GIT2.F4 refseq_GIT2.R3 180 330
  • NCBIGene 36.3 9815
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_057169

  • cd ANK 112aa 1e-11 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart ArfGap 119aa 4e-39 in ref transcript
    • Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • pfam GIT_SHD 31aa 1e-07 in ref transcript
    • Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
  • pfam GIT_SHD 22aa 7e-04 in ref transcript
  • COG COG5347 115aa 2e-15 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
  • PTZ PTZ00322 102aa 2e-06 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

GIYD2

  • refseq_GIYD1.F1 refseq_GIYD1.R1 99 441
  • NCBIGene 36.3 79008
  • Single exon skipping, size difference: 342
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024044

  • Changed! pfam GIY-YIG 77aa 3e-10 in ref transcript
    • GIY-YIG catalytic domain. This domain called GIY-YIG is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE; it is also found in putative endonucleases encoded by group I introns of fungi and phage. The structure of I-TevI a GIY-YIG endonuclease, reveals a novel alpha/beta-fold with a central three-stranded antiparallel beta-sheet flanked by three helices. The most conserved and putative catalytic residues are located on a shallow, concave surface and include a metal coordination site.
  • Changed! PRK PRK00329 69aa 1e-05 in ref transcript
    • GIY-YIG nuclease superfamily protein; Validated.
  • Changed! pfam GIY-YIG 62aa 2e-07 in modified transcript
  • Changed! PRK PRK00329 67aa 3e-05 in modified transcript

GK

  • refseq_GK.F1 refseq_GK.R1 102 120
  • NCBIGene 36.3 2710
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203391

  • Changed! TIGR glycerol_kin 509aa 0.0 in ref transcript
    • This model describes glycerol kinase, a member of the FGGY family of carbohydrate kinases.
  • Changed! PRK glpK 508aa 0.0 in ref transcript
    • glycerol kinase; Provisional.
  • Changed! TIGR glycerol_kin 503aa 0.0 in modified transcript
  • Changed! PRK glpK 502aa 0.0 in modified transcript

GLT8D1

  • refseq_GLT8D1.F1 refseq_GLT8D1.R1 176 405
  • NCBIGene 36.3 55830
  • Single exon skipping, size difference: 229
  • Exclusion in 5'UTR
  • Reference transcript: NM_001010983

  • cd GT8_like_1 286aa 2e-89 in ref transcript
    • GT8_like_1 represents a subfamily of GT8 with unknown function. A subfamily of glycosyltransferase family 8 with unknown function: Glycosyltransferase family 8 comprises enzymes with a number of known activities; lipopolysaccharide galactosyltransferase lipopolysaccharide glucosyltransferase 1, glycogenin glucosyltransferase and inositol 1-alpha-galactosyltransferase. It is classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed.
  • pfam Glyco_transf_8 274aa 2e-45 in ref transcript
    • Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
  • COG RfaJ 286aa 3e-15 in ref transcript
    • Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases [Cell envelope biogenesis, outer membrane].

GLUL

  • refseq_GLUL.F1 refseq_GLUL.R1 100 456
  • NCBIGene 36.3 2752
  • Single exon skipping, size difference: 356
  • Exclusion in 5'UTR
  • Reference transcript: NM_002065

  • pfam Gln-synt_C 252aa 1e-69 in ref transcript
    • Glutamine synthetase, catalytic domain.
  • pfam Gln-synt_N 81aa 2e-26 in ref transcript
    • Glutamine synthetase, beta-Grasp domain.
  • COG GlnA 349aa 3e-61 in ref transcript
    • Glutamine synthetase [Amino acid transport and metabolism].

GMEB1

  • refseq_GMEB1.F1 refseq_GMEB1.R1 141 171
  • NCBIGene 36.3 10691
  • Alternative 3-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006582

  • pfam SAND 81aa 2e-30 in ref transcript
    • SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation.

GMPPA

  • refseq_GMPPA.F1 refseq_GMPPA.R1 106 417
  • NCBIGene 36.3 29926
  • Alternative 5-prime and 3-prime, size difference: 311
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_205847

  • cd M1P_guanylylT_A_like_N 258aa 1e-136 in ref transcript
    • N-terminal domain of M1P_guanylyl_A_ like proteins are likely to be a isoform of GDP-mannose pyrophosphorylase. N-terminal domain of the M1P-guanylyltransferase A-isoform like proteins: The proteins of this family are likely to be a isoform of GDP-mannose pyrophosphorylase. Their sequences are highly conserved with mannose-1-phosphate guanyltransferase, but generally about 40-60 bases longer. GDP-mannose pyrophosphorylase (GTP: alpha-d-mannose-1-phosphate guanyltransferase) catalyzes the formation of GDP-d-mannose from GTP and alpha-d-mannose-1-Phosphate. It contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain. GDP-d-mannose is the activated form of mannose for formation of cell wall lipoarabinomannan and various mannose-containing glycolipids and polysaccharides. The function of GDP-mannose pyrophosphorylase is essential for cell wall integrity, morphogenesis and viability. Repression of GDP-mannose pyrophosphorylase in yeast leads to phenotypes including cell lysis, defective cell wall, and failure of polarized growth and cell separation.
  • cd LbH_M1P_guanylylT_C 67aa 1e-19 in ref transcript
    • Mannose-1-phosphate guanylyltransferase, C-terminal Left-handed parallel beta helix (LbH) domain: Mannose-1-phosphate guanylyltransferase is also known as GDP-mannose pyrophosphorylase. It catalyzes the synthesis of GDP-mannose from GTP and mannose-1-phosphate, and is involved in the maintenance of cell wall integrity and glycosylation. Similar to ADP-glucose pyrophosphorylase, it contains an N-terminal catalytic domain that resembles a dinucleotide-binding Rossmann fold and a C-terminal LbH fold domain, presumably with 4 turns, each containing three imperfect tandem repeats of a hexapeptide repeat motif (X-[STAV]-X-[LIV]-[GAED]-X). Proteins containing hexapeptide repeats are often enzymes showing acyltransferase activity.
  • pfam NTP_transferase 189aa 2e-25 in ref transcript
    • Nucleotidyl transferase. This family includes a wide range of enzymes which transfer nucleotides onto phosphosugars.
  • TIGR lipid_A_lpxA 53aa 3e-07 in ref transcript
    • This model describes LpxA, an enzyme for the biosynthesis of lipid A, a component oflipopolysaccharide (LPS) in the outer membrane outer leaflet of most Gram-negative bacteria. Some differences are found between lipid A of different species, but this protein represents the first step (from UDP-N-acetyl-D-glucosamine) and appears to be conserved in function. Proteins from this family contain many copies of the bacterial transferase hexapeptide repeat (pfam00132).
  • COG GCD1 349aa 1e-52 in ref transcript
    • Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) [Cell envelope biogenesis, outer membrane / Translation, ribosomal structure and biogenesis].

GMPR2

  • refseq_GMPR2.F1 refseq_GMPR2.R1 193 252
  • NCBIGene 36.3 51292
  • Alternative 5-prime, size difference: 59
  • Exclusion in 5'UTR
  • Reference transcript: NM_001002000

  • cd IMPDH 327aa 5e-98 in ref transcript
    • IMPDH: The catalytic domain of the inosine monophosphate dehydrogenase. IMPDH catalyzes the NAD-dependent oxidation of inosine 5'-monophosphate (IMP) to xanthosine 5' monophosphate (XMP). It is a rate-limiting step in the de novo synthesis of the guanine nucleotides. There is often a CBS domain inserted in the middle of this domain, which is proposed to play a regulatory role. IMPDH is a key enzyme in the regulation of cell proliferation and differentiation. It has been identified as an attractive target for developing chemotherapeutic agents.
  • TIGR GMP_reduct_1 341aa 1e-179 in ref transcript
    • A deep split separates two families of GMP reductase. This family includes both eukaryotic and some proteobacterial sequences, while the other family contains other bacterial sequences.
  • PRK PRK05096 342aa 0.0 in ref transcript
    • guanosine 5'-monophosphate oxidoreductase; Provisional.

GNAS

  • refseq_GNAS.F1 refseq_GNAS.R1 238 280
  • NCBIGene 36.3 2778
  • Mutually exclusive exon skipping, size difference: 42
  • Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000516

  • Changed! cd G-alpha 350aa 1e-138 in ref transcript
    • G protein alpha subunit. The alpha subunit of G proteins contains the guanine nucleotide binding site. The heterotrimeric GNP-binding proteins are signal transducers that communicate signals from many hormones, neurotransmitters, chemokines, and autocrine and paracrine factors. Extracellular signals are received by receptors, which activate the G proteins, which in turn route the signals to several distinct intracellular signaling pathways. The alpha subunit of G proteins is a weak GTPase. In the resting state, heterotrimeric G proteins are associated at the cytosolic face of the plasma membrane and the alpha subunit binds to GDP. Upon activation by a receptor GDP is replaced with GTP, and the G-alpha/GTP complex dissociates from the beta and gamma subunits. This results in activation of downstream signaling pathways, such as cAMP synthesis by adenylyl cyclase, which is terminated when GTP is hydrolized and the heterotrimers reconstitute.
  • Changed! pfam G-alpha 372aa 1e-143 in ref transcript
    • G-protein alpha subunit. G proteins couple receptors of extracellular signals to intracellular signaling pathways. The G protein alpha subunit binds guanyl nucleotide and is a weak GTPase.
  • PTZ PTZ00133 80aa 1e-04 in ref transcript
    • ADP-ribosylation factor; Provisional.
  • Changed! cd G-alpha 336aa 1e-139 in modified transcript
  • Changed! pfam G-alpha 358aa 1e-143 in modified transcript

GNLY

  • refseq_GNLY.F1 refseq_GNLY.R1 112 356
  • NCBIGene 36.3 10578
  • Single exon skipping, size difference: 244
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006433

  • Changed! smart SapB 71aa 4e-08 in ref transcript
    • Saposin (B) Domains. Present in multiple copies in prosaposin and in pulmonary surfactant-associated protein B. In plant aspartic proteinases, a saposin domain is circularly permuted. This causes the prediction algorithm to predict two such domains, where only one is truly present.

GNRHR

  • refseq_GNRHR.F1 refseq_GNRHR.R1 229 357
  • NCBIGene 36.3 2798
  • Alternative 3-prime, size difference: 128
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000406

  • Changed! pfam 7tm_1 218aa 2e-22 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 70aa 3e-09 in modified transcript

GOPC

  • refseq_GOPC.F1 refseq_GOPC.R1 136 160
  • NCBIGene 36.3 57120
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020399

  • cd PDZ_signaling 83aa 2e-17 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 84aa 8e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG COG3975 207aa 0.001 in ref transcript
    • Predicted protease with the C-terminal PDZ domain [General function prediction only].
  • Changed! PRK PRK09039 178aa 0.006 in ref transcript
    • hypothetical protein; Validated.
  • Changed! PRK PRK09039 170aa 2e-04 in modified transcript

GPATCH4

  • refseq_GPATC4.F2 refseq_GPATC4.R2 118 192
  • NCBIGene 36.3 54865
  • Alternative 3-prime, size difference: 74
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015590

  • Changed! pfam G-patch 44aa 7e-06 in ref transcript
    • G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.

GPHN

  • refseq_GPHN.F2 refseq_GPHN.R2 296 395
  • NCBIGene 36.3 10243
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020806

  • cd MoeA 407aa 1e-118 in ref transcript
    • MoeA family. Members of this family are involved in biosynthesis of the molybdenum cofactor (MoCF), an essential cofactor of a diverse group of redox enzymes. MoCF biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. MoCF contains a tricyclic pyranopterin, termed molybdopterin (MPT). MoeA, together with MoaB, is responsible for the metal incorporation into MPT, the third step in MoCF biosynthesis. The plant homolog Cnx1 is a MoeA-MogA fusion protein. The mammalian homolog gephyrin is a MogA-MoeA fusion protein, that plays a critical role in postsynaptic anchoring of inhibitory glycine receptors and major GABAa receptor subtypes.
  • cd MogA_MoaB 155aa 5e-51 in ref transcript
    • MogA_MoaB family. Members of this family are involved in biosynthesis of the molybdenum cofactor (MoCF) an essential cofactor of a diverse group of redox enzymes. MoCF biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea, and eukaryotes. MoCF contains a tricyclic pyranopterin, termed molybdopterin (MPT). MogA, together with MoeA, is responsible for the metal incorporation into MPT, the third step in MoCF biosynthesis. The plant homolog Cnx1 is a MoeA-MogA fusion protein. The mammalian homolog gephyrin is a MogA-MoeA fusion protein, that plays a critical role in postsynaptic anchoring of inhibitory glycine receptors and major GABAa receptor subtypes. In contrast, MoaB shows high similarity to MogA, but little is known about its physiological role. All well studied members of this family form highly stable trimers.
  • pfam MoeA_N 167aa 2e-51 in ref transcript
    • MoeA N-terminal region (domain I and II). This family contains two structural domains. One of these contains the conserved DGXA motif. This region is found in proteins involved in biosynthesis of molybdopterin cofactor however the exact molecular function of this region is uncertain.
  • TIGR molyb_syn 146aa 1e-33 in ref transcript
    • The Drosophila protein cinnamon, the Arabidopsis protein cnx1, and rat protein gephyrin each have one domain like MoeA and one like MoaB and Mog. These domains are, however, distantly related to each other, as captured by this HMM. Gephyrin is unusual in that it seems to be a tubulin-binding neuroprotein involved in the clustering of both blycine receptors and GABA receptors, rather than a protein of molybdenum cofactor biosynthesis.
  • TIGR molyb_syn 143aa 1e-30 in ref transcript
  • pfam MoeA_C 77aa 1e-13 in ref transcript
    • MoeA C-terminal region (domain IV). This domain is found in proteins involved in biosynthesis of molybdopterin cofactor however the exact molecular function of this domain is uncertain. The structure of this domain is known and forms an incomplete beta barrel.
  • COG MoeA 410aa 1e-92 in ref transcript
    • Molybdopterin biosynthesis enzyme [Coenzyme metabolism].
  • PRK moaC 158aa 2e-37 in ref transcript
    • bifunctional molybdenum cofactor biosynthesis protein C/molybdopterin-binding protein; Provisional.

GPM6A

  • refseq_GPM6A.F2 refseq_GPM6A.R2 333 392
  • NCBIGene 36.3 2823
  • Single exon skipping, size difference: 59
  • Exclusion of the protein initiation site
  • Reference transcript: NM_005277

  • Changed! pfam Myelin_PLP 242aa 3e-89 in ref transcript
    • Myelin proteolipid protein (PLP or lipophilin).
  • Changed! pfam Myelin_PLP 242aa 2e-87 in modified transcript

GPM6B

  • refseq_GPM6B.F1 refseq_GPM6B.R1 220 340
  • NCBIGene 36.3 2824
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001995

  • Changed! pfam Myelin_PLP 245aa 1e-127 in ref transcript
    • Myelin proteolipid protein (PLP or lipophilin).
  • Changed! pfam Myelin_PLP 245aa 1e-123 in modified transcript

GPNMB

  • refseq_GPNMB.F1 refseq_GPNMB.R1 181 217
  • NCBIGene 36.3 10457
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005340

  • cd PKD 64aa 2e-05 in ref transcript
    • polycystic kidney disease I (PKD) domain; similar to other cell-surface modules, with an IG-like fold; domain probably functions as a ligand binding site in protein-protein or protein-carbohydrate interactions; a single instance of the repeat is presented here. The domain is also found in microbial collagenases and chitinases.
  • smart PKD 58aa 6e-06 in ref transcript
    • Repeats in polycystic kidney disease 1 (PKD1) and other proteins. Polycystic kidney disease 1 protein contains 14 repeats, present elsewhere such as in microbial collagenases.
  • TIGR PCC 114aa 0.004 in ref transcript
    • Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.

GPR108

  • refseq_GPR108.F1 refseq_GPR108.R1 257 399
  • NCBIGene 36.2 56927
  • Alternative 3-prime, size difference: 142
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_933929

  • pfam Lung_7-TM_R 289aa 5e-85 in ref transcript
    • Lung seven transmembrane receptor. This family represents a conserved region with eukaryotic lung seven transmembrane receptors and related proteins.

GPR126

  • refseq_GPR126.F2 refseq_GPR126.R2 148 232
  • NCBIGene 36.3 57211
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198569

  • cd CUB 104aa 8e-25 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd PTX 185aa 1e-07 in ref transcript
    • Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
  • pfam 7tm_2 259aa 9e-32 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • smart CUB 97aa 9e-26 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • pfam GPS 49aa 3e-10 in ref transcript
    • Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • smart PTX 185aa 3e-08 in ref transcript
    • Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.

GPR126

  • refseq_GPR126.F3 refseq_GPR126.R3 169 215
  • NCBIGene 36.3 57211
  • Single exon skipping, size difference: 46
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_198569

  • cd CUB 104aa 8e-25 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd PTX 185aa 1e-07 in ref transcript
    • Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
  • pfam 7tm_2 259aa 9e-32 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • smart CUB 97aa 9e-26 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • pfam GPS 49aa 3e-10 in ref transcript
    • Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • smart PTX 185aa 3e-08 in ref transcript
    • Pentraxin / C-reactive protein / pentaxin family. This family form a doscoid pentameric structure. Human serum amyloid P demonstrates calcium-mediated ligand-binding.

GPR155

  • refseq_GPR155.F2 refseq_GPR155.R2 223 372
  • NCBIGene 36.3 151556
  • Single exon skipping, size difference: 149
  • Exclusion in 5'UTR
  • Reference transcript: NM_001033045

  • cd DEP_GPR155 83aa 4e-41 in ref transcript
    • DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in GPR155-like proteins. GRP155-like proteins, also known as PGR22, contain an N-terminal permease domain, a central transmembrane region and a C-terminal DEP domain. They are orphan receptors of the class B G protein-coupled receptors. Their function is unknown.
  • TIGR 2a69 329aa 2e-17 in ref transcript
  • smart DEP 68aa 6e-10 in ref transcript
    • Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signalling proteins that contain PH, rasGEF, rhoGEF, rhoGAP, RGS, PDZ domains. DEP domain in Drosophila dishevelled is essential to rescue planar polarity defects and induce JNK signalling (Cell 94, 109-118).
  • COG COG0679 329aa 4e-19 in ref transcript
    • Predicted permeases [General function prediction only].

GPER

  • refseq_GPR30.F2 refseq_GPR30.R2 276 400
  • NCBIGene 36.3 2852
  • Single exon skipping, size difference: 124
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039966

  • pfam 7tm_1 243aa 1e-34 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

GPR56

  • refseq_GPR56.F1 refseq_GPR56.R1 275 397
  • NCBIGene 36.3 9289
  • Single exon skipping, size difference: 122
  • Exclusion in 5'UTR
  • Reference transcript: NM_201524

  • pfam 7tm_2 250aa 3e-25 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • pfam GPS 47aa 3e-09 in ref transcript
    • Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.

GPR56

  • refseq_GPR56.F3 refseq_GPR56.R3 100 118
  • NCBIGene 36.3 9289
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005682

  • Changed! pfam 7tm_2 256aa 1e-24 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • pfam GPS 47aa 3e-09 in ref transcript
    • Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • Changed! pfam 7tm_2 250aa 3e-25 in modified transcript

GPRASP2

  • refseq_GPRASP2.F1 refseq_GPRASP2.R1 128 216
  • NCBIGene 36.3 114928
  • Single exon skipping, size difference: 88
  • Exclusion in 5'UTR
  • Reference transcript: NM_001004051

  • pfam DUF634 245aa 1e-53 in ref transcript
    • Protein of unknown function (DUF634). Mammalian protein of unknown function.
  • PRK PRK12323 129aa 0.008 in ref transcript
    • DNA polymerase III subunits gamma and tau; Provisional.

GPX4

  • refseq_GPX4.F2 refseq_GPX4.R2 102 124
  • NCBIGene 36.3 2879
  • Alternative 3-prime, size difference: 22
  • Inclusion in 3'UTR
  • Reference transcript: NM_001039848

  • cd GSH_Peroxidase 33aa 2e-08 in ref transcript
    • Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate. GSH peroxidase contains one selenocysteine residue per subunit, which is involved in catalysis. Different isoenzymes are known in mammals,which are involved in protection against reactive oxygen species, redox regulation of many metabolic processes, peroxinitrite scavenging, and modulation of inflammatory processes.
  • pfam GSHPx 32aa 1e-10 in ref transcript
    • Glutathione peroxidase.
  • COG BtuE 33aa 2e-05 in ref transcript
    • Glutathione peroxidase [Posttranslational modification, protein turnover, chaperones].

GPX5

  • refseq_GPX5.F2 refseq_GPX5.R2 128 246
  • NCBIGene 36.3 2880
  • Single exon skipping, size difference: 118
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001509

  • Changed! cd GSH_Peroxidase 173aa 5e-45 in ref transcript
    • Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate. GSH peroxidase contains one selenocysteine residue per subunit, which is involved in catalysis. Different isoenzymes are known in mammals,which are involved in protection against reactive oxygen species, redox regulation of many metabolic processes, peroxinitrite scavenging, and modulation of inflammatory processes.
  • Changed! pfam GSHPx 114aa 4e-42 in ref transcript
    • Glutathione peroxidase.
  • Changed! COG BtuE 179aa 2e-33 in ref transcript
    • Glutathione peroxidase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd GSH_Peroxidase 52aa 9e-09 in modified transcript
  • Changed! pfam GSHPx 51aa 4e-11 in modified transcript
  • Changed! COG BtuE 52aa 2e-07 in modified transcript

GRB10

  • refseq_GRB10.F1 refseq_GRB10.R1 142 280
  • NCBIGene 36.3 2887
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005311

  • cd GRB7_RA 83aa 2e-32 in ref transcript
    • Grb7_RA The RA (RAS-associated like) domain of Grb7. Grb7 is an adaptor molecule that mediates signal transduction from multiple cell surface receptors to various downstream signaling pathways. Grb7 and its related family members Grb10 and Grb14 share a conserved domain architecture that includes an amino-terminal proline-rich region, a central segment termed the GM region (for Grb and Mig) which includes the RA, PIR, and PH domains, and a carboxyl-terminal SH2 domain. Grb7/10/14 family proteins are phosphorylated on serine/threonine as well as tyrosine residues and are mainly localized to the cytoplasm.
  • Changed! cd PH_Apbb1ip 113aa 8e-20 in ref transcript
    • Apbb1ip (Amyloid beta (A4) Precursor protein-Binding, family B, member 1 Interacting Protein) pleckstrin homology (PH) domain. Apbb1ip consists of a Ras-associated domain and a PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • cd SH2 97aa 9e-18 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • smart SH2 90aa 2e-21 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
  • pfam BPS 48aa 5e-19 in ref transcript
    • BPS (Between PH and SH2). The BPS (Between PH and SH2) domain, comprised of 2 beta strands and a C-terminal helix, is an approximately 45 residue region found in the adaptor proteins Grb7/10/14 that mediates inhibition of the tyrosine kinase domain of the insulin receptor by binding of the N-terminal portion of the BPS domain to the substrate peptide groove of the kinase, acting as a pseudosubstrate inhibitor.
  • smart RA 84aa 1e-13 in ref transcript
    • Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
  • Changed! smart PH 109aa 6e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • Changed! cd PH_Apbb1ip 74aa 8e-08 in modified transcript

GRB2

  • refseq_GRB2.F2 refseq_GRB2.R2 240 363
  • NCBIGene 36.3 2885
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002086

  • Changed! cd SH2 91aa 2e-21 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • cd SH3 53aa 1e-13 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 52aa 3e-11 in ref transcript
  • Changed! pfam SH2 76aa 9e-27 in ref transcript
    • SH2 domain.
  • smart SH3 57aa 4e-16 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • pfam SH3_1 56aa 1e-13 in ref transcript
    • SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
  • Changed! cd SH2 49aa 6e-04 in modified transcript
  • Changed! pfam SH2 35aa 1e-05 in modified transcript

GRHL1

  • refseq_GRHL1.F1 refseq_GRHL1.R1 211 288
  • NCBIGene 36.3 29841
  • Single exon skipping, size difference: 77
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014552

  • Changed! pfam CP2 229aa 1e-108 in ref transcript
    • CP2 transcription factor. This family represents a conserved region in the CP2 transcription factor family.

GRHL3

  • refseq_GRHL3.F1 refseq_GRHL3.R1 107 294
  • NCBIGene 36.3 57822
  • Single exon skipping, size difference: 187
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_021180

  • Changed! pfam CP2 227aa 2e-96 in ref transcript
    • CP2 transcription factor. This family represents a conserved region in the CP2 transcription factor family.

GRIK1

  • refseq_GRIK1.F1 refseq_GRIK1.R1 140 185
  • NCBIGene 36.3 2897
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000830

  • Changed! cd PBP1_iGluR_Kainate_GluR5_7 396aa 0.0 in ref transcript
    • N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR5-7 subunits of Kainate receptor. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. There are five types of kainate receptors, GluR5, GluR6, GluR7, KA1, and KA2, which are structurally similar to AMPA and NMDA subunits of ionotropic glutamate receptors. KA1 and KA2 subunits can only form functional receptors with one of the GluR5-7 subunits. Moreover, GluR5-7 can also form functional homomeric receptor channels activated by kainate and glutamate when expressed in heterologous systems. Kainate receptors are involved in excitatory neurotransmission by activating postsynaptic receptors and in inhibitory neurotransmission by modulating release of the inhibitory neurotransmitter GABA through a presynaptic mechanism. Kainate receptors are closely related to AMAP receptors. In contrast of AMPA receptors, kainate receptors play only a minor role in signaling at synapses and their function is not well defined.
  • cd PBPb 117aa 1e-10 in ref transcript
    • Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
  • pfam Lig_chan 282aa 1e-105 in ref transcript
    • Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
  • pfam ANF_receptor 341aa 8e-51 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • pfam Lig_chan-Glu_bd 65aa 9e-27 in ref transcript
    • Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
  • COG HisJ 91aa 2e-05 in ref transcript
    • ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
  • Changed! COG LivK 300aa 3e-04 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
  • Changed! cd PBP1_iGluR_Kainate_GluR5_7 381aa 0.0 in modified transcript
  • Changed! COG LivK 307aa 1e-04 in modified transcript

GRIK2

  • refseq_GRIK2.F1 refseq_GRIK2.R1 315 402
  • NCBIGene 36.3 2898
  • Single exon skipping, size difference: 87
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_021956

  • cd PBP1_iGluR_Kainate_GluR5_7 381aa 0.0 in ref transcript
    • N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR5-7 subunits of Kainate receptor. While this N-terminal domain belongs to the periplasmic-binding fold type I superfamily, the glutamate-binding domain of the iGluR is structurally homologous to the periplasmic-binding fold type II. The LIVBP-like domain of iGluRs is thought to play a role in the initial assembly of iGluR subunits, but it is not well understood how this domain is arranged and functions in intact iGluR. There are five types of kainate receptors, GluR5, GluR6, GluR7, KA1, and KA2, which are structurally similar to AMPA and NMDA subunits of ionotropic glutamate receptors. KA1 and KA2 subunits can only form functional receptors with one of the GluR5-7 subunits. Moreover, GluR5-7 can also form functional homomeric receptor channels activated by kainate and glutamate when expressed in heterologous systems. Kainate receptors are involved in excitatory neurotransmission by activating postsynaptic receptors and in inhibitory neurotransmission by modulating release of the inhibitory neurotransmitter GABA through a presynaptic mechanism. Kainate receptors are closely related to AMAP receptors. In contrast of AMPA receptors, kainate receptors play only a minor role in signaling at synapses and their function is not well defined.
  • cd PBPb 118aa 8e-11 in ref transcript
    • Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
  • pfam Lig_chan 282aa 1e-106 in ref transcript
    • Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
  • pfam ANF_receptor 344aa 2e-55 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • pfam Lig_chan-Glu_bd 66aa 1e-26 in ref transcript
    • Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
  • COG LivK 235aa 9e-08 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
  • COG HisJ 121aa 4e-05 in ref transcript
    • ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].

GRIN1

  • refseq_GRIN1.F2 refseq_GRIN1.R2 190 301
  • NCBIGene 36.3 2902
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007327

  • cd PBP1_iGluR_NMDA_NR1 376aa 0.0 in ref transcript
    • N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the NR1, an essential channel-forming subunit of the NMDA receptor. The ionotropic N-methyl-d-asparate (NMDA) subtype of glutamate receptor serves critical functions in neuronal development, functioning, and degeneration in the mammalian central nervous system. The functional NMDA receptor is a heterotetramer ccomposed of two NR1 and two NR2 (A, B, C, and D) or of NR3 (A and B) subunits. The receptor controls a cation channel that is highly permeable to monovalent ions and calcium and exhibits voltage-dependent inhibition by magnesium. Dual agonists, glutamate and glycine, are required for efficient activation of the NMDA receptor. When co-expressed with NR1, the NR3 subunits form receptors that are activated by glycine alone and therefore can be classified as excitatory glycine receptors. NR1/NR3 receptors are calcium-impermeable and unaffected by ligands acting at the NR2 glutamate-binding site.
  • cd PBPb 96aa 2e-09 in ref transcript
    • Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
  • cd PBPb 111aa 6e-04 in ref transcript
  • pfam Lig_chan 276aa 7e-79 in ref transcript
    • Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
  • pfam ANF_receptor 316aa 5e-28 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • pfam Lig_chan-Glu_bd 58aa 6e-16 in ref transcript
    • Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
  • pfam CaM_bdg_C0 29aa 9e-08 in ref transcript
    • Calmodulin-binding domain C0 of NMDA receptor NR1 subunit. This is a very short highly conserved domain that is C-terminal to the cytosolic transmembrane region IV of the NMDA-receptor 1. It has been shown to bind Calmodulin-Calcium with high affinity. The ionotropic N-methyl-D-aspartate receptor (NMDAR) is a major source of calcium flux into neurons in the brain and plays a critical role in learning, memory, neural development, and synaptic plasticity. Calmodulin (CaM) regulates NMDARs by binding tightly to the C0 and C1 regions of their NR1 subunit. The conserved tryptophan is considered to be the anchor residue.
  • COG LivK 320aa 1e-10 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
  • COG HisJ 89aa 1e-05 in ref transcript
    • ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
  • COG HisJ 82aa 0.002 in ref transcript

GRINA

  • refseq_GRINA.F2 refseq_GRINA.R2 164 276
  • NCBIGene 36.3 2907
  • Alternative 5-prime, size difference: 112
  • Exclusion in 5'UTR
  • Reference transcript: NM_000837

  • cd BI-1-like 162aa 6e-40 in ref transcript
    • BAX inhibitor (BI)-1 like protein family. Mammalian members of this family of small transmembrane proteins have been shown to have an antiapoptotic effect either by stimulating the antiapoptotic function of Bcl-2, a well characterized oncogene, or inhibiting the proapoptotic effect of Bax, another member of the Bcl-2 family. Their broad tissue distribution and high degree of conservation suggests an important regulatory role. In plants, BI-1 like proteins play a role in pathogen resistance. A prokaryotic member, E.coli YccA, has been shown to interact with ATP-dependent protease FtsH, which degrades abnormal membrane proteins as part of a quality control mechanism to keep the integrity of biological membranes.
  • pfam UPF0005 167aa 8e-28 in ref transcript
    • Uncharacterised protein family UPF0005. The Pfam entry finds members not in the Prosite definition.
  • COG COG0670 114aa 2e-10 in ref transcript
    • Integral membrane protein, interacts with FtsH [General function prediction only].

GRINL1A

  • refseq_GRINL1A.F1 refseq_GRINL1A.R1 198 384
  • NCBIGene 36.3 81488
  • Alternative 5-prime, size difference: 186
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015532

GRIP1

  • refseq_GRIP1.F1 refseq_GRIP1.R1 180 225
  • NCBIGene 36.2 23426
  • Single exon skipping, size difference: 45
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: XM_001133924

  • cd PDZ_signaling 84aa 2e-14 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 82aa 1e-13 in ref transcript
  • cd PDZ_signaling 85aa 8e-12 in ref transcript
  • cd PDZ_signaling 81aa 2e-11 in ref transcript
  • cd PDZ_signaling 71aa 1e-10 in ref transcript
  • cd PDZ_signaling 84aa 3e-10 in ref transcript
  • cd PDZ_signaling 81aa 2e-09 in ref transcript
  • smart PDZ 89aa 2e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 82aa 6e-15 in ref transcript
  • smart PDZ 86aa 2e-11 in ref transcript
  • smart PDZ 87aa 3e-11 in ref transcript
  • smart PDZ 77aa 5e-10 in ref transcript
  • smart PDZ 85aa 5e-10 in ref transcript
  • smart PDZ 71aa 8e-09 in ref transcript
  • COG Prc 78aa 7e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 127aa 1e-04 in ref transcript

GRIP1

  • refseq_GRIP1.F3 refseq_GRIP1.R3 179 332
  • NCBIGene 36.2 23426
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 153
  • Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001133924

  • cd PDZ_signaling 84aa 2e-14 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 82aa 1e-13 in ref transcript
  • cd PDZ_signaling 85aa 8e-12 in ref transcript
  • cd PDZ_signaling 81aa 2e-11 in ref transcript
  • cd PDZ_signaling 71aa 1e-10 in ref transcript
  • cd PDZ_signaling 84aa 3e-10 in ref transcript
  • cd PDZ_signaling 81aa 2e-09 in ref transcript
  • smart PDZ 89aa 2e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 82aa 6e-15 in ref transcript
  • smart PDZ 86aa 2e-11 in ref transcript
  • smart PDZ 87aa 3e-11 in ref transcript
  • smart PDZ 77aa 5e-10 in ref transcript
  • smart PDZ 85aa 5e-10 in ref transcript
  • smart PDZ 71aa 8e-09 in ref transcript
  • COG Prc 78aa 7e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 127aa 1e-04 in ref transcript

GRIPAP1

  • refseq_GRIPAP1.F1 refseq_GRIPAP1.R1 125 284
  • NCBIGene 36.3 56850
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 159
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020137

GRIPAP1

  • refseq_GRIPAP1.F3 refseq_GRIPAP1.R3 102 180
  • NCBIGene 36.3 56850
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020137

GRK4

  • refseq_GRK4.F1 refseq_GRK4.R1 234 372
  • NCBIGene 36.3 2868
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182982

  • cd STKc_GRK4 285aa 1e-144 in ref transcript
    • STKc_GRK4: Serine/Threonine Kinases (STKs), G protein-coupled Receptor Kinase (GRK) subfamily, GRK4 isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The GRK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. GRKs phosphorylate and regulate G protein-coupled receptors (GPCRs), the largest superfamily of cell surface receptors which regulate some part of nearly all physiological functions. Phosphorylated GPCRs bind to arrestins, which prevents further G protein signaling despite the presence of activating ligand. There are seven types of GRKs, named GRK1 to GRK7. GRK4 has a limited tissue distribution. It is mainly found in the testis, but is also present in the cerebellum and kidney. It is expressed as multiple splice variants with different domain architectures. It is post-translationally palmitoylated and localized in the membrane. GRK4 polymorphisms are associated with hypertension and salt sensitivity, as they cause hyperphosphorylation, desensitization, and internalization of the dopamine 1 (D1) receptor while increasing the expression of the angiotensin II type 1 receptor. GRK4 plays a crucial role in the D1 receptor regulation of sodium excretion and blood pressure.
  • smart S_TKc 253aa 6e-60 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart RGS 120aa 2e-20 in ref transcript
    • Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
  • PTZ PTZ00263 273aa 4e-40 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

GRK4

  • refseq_GRK4.F3 refseq_GRK4.R3 276 372
  • NCBIGene 36.3 2868
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182982

  • cd STKc_GRK4 285aa 1e-144 in ref transcript
    • STKc_GRK4: Serine/Threonine Kinases (STKs), G protein-coupled Receptor Kinase (GRK) subfamily, GRK4 isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The GRK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. GRKs phosphorylate and regulate G protein-coupled receptors (GPCRs), the largest superfamily of cell surface receptors which regulate some part of nearly all physiological functions. Phosphorylated GPCRs bind to arrestins, which prevents further G protein signaling despite the presence of activating ligand. There are seven types of GRKs, named GRK1 to GRK7. GRK4 has a limited tissue distribution. It is mainly found in the testis, but is also present in the cerebellum and kidney. It is expressed as multiple splice variants with different domain architectures. It is post-translationally palmitoylated and localized in the membrane. GRK4 polymorphisms are associated with hypertension and salt sensitivity, as they cause hyperphosphorylation, desensitization, and internalization of the dopamine 1 (D1) receptor while increasing the expression of the angiotensin II type 1 receptor. GRK4 plays a crucial role in the D1 receptor regulation of sodium excretion and blood pressure.
  • smart S_TKc 253aa 6e-60 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart RGS 120aa 2e-20 in ref transcript
    • Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
  • PTZ PTZ00263 273aa 4e-40 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

GRM7

  • refseq_GRM7.F2 refseq_GRM7.R2 256 348
  • NCBIGene 36.3 2917
  • Single exon skipping, size difference: 92
  • Exclusion of the stop codon
  • Reference transcript: NM_181874

  • cd PBP1_mGluR_groupIII 465aa 0.0 in ref transcript
    • Ligand-binding domain of the group III metabotropic glutamate receptor, a family which contains mGlu4R, mGluR6R, mGluR7, and mGluR8; all of which inhibit adenylyl cyclase. The metabotropic glutamate receptor is a member of the family C of G-protein-coupled receptors that transduce extracellular signals into G-protein activation and ultimately into intracellular responses. The mGluRs are classified into three groups which comprise eight subtypes.
  • pfam 7tm_3 262aa 3e-96 in ref transcript
    • 7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
  • pfam ANF_receptor 408aa 2e-67 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • pfam NCD3G 52aa 1e-17 in ref transcript
    • Nine Cysteines Domain of family 3 GPCR. This conserved sequence contains several highly-conserved Cys residues that are predicted to form disulphide bridges. It is predicted to lie outside the cell membrane, tethered to the pfam00003 in several receptor proteins.
  • COG LivK 310aa 9e-12 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].

GRP

  • refseq_GRP.F1 refseq_GRP.R1 99 120
  • NCBIGene 36.3 2922
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002091

GSG1

  • refseq_GSG1.F2 refseq_GSG1.R2 184 308
  • NCBIGene 36.3 83445
  • Alternative 5-prime, size difference: 124
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_153823

  • Changed! pfam GSG-1 117aa 6e-47 in ref transcript
    • GSG1-like protein. This family contains sequences bearing similarity to a region of GSG1, a protein specifically expressed in testicular germ cells. It is possible that overexpression of the human homolog may be involved in tumourigenesis of human testicular germ cell tumours. The region in question has four highly-conserved cysteine residues.
  • Changed! pfam GSG-1 118aa 3e-46 in modified transcript

GSTZ1

  • refseq_GSTZ1.F2 refseq_GSTZ1.R2 148 274
  • NCBIGene 36.3 2954
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145870

  • Changed! cd GST_C_Zeta 115aa 2e-42 in ref transcript
    • GST_C family, Class Zeta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Zeta GSTs, also known as maleylacetoacetate (MAA) isomerases, catalyze the isomerization of MAA to fumarylacetoacetate, the penultimate step in tyrosine/phenylalanine catabolism, using GSH as a cofactor. They show little GSH-conjugating activity towards traditional GST substrates, but display modest GSH peroxidase activity. They are also implicated in the detoxification of the carcinogen dichloroacetic acid by catalyzing its dechlorination to glyoxylic acid.
  • Changed! cd GST_N_Zeta 75aa 1e-31 in ref transcript
    • GST_N family, Class Zeta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Zeta GSTs, also known as maleylacetoacetate (MAA) isomerases, catalyze the isomerization of MAA to fumarylacetoacetate, the penultimate step in tyrosine/phenylalanine catabolism, using GSH as a cofactor. They show little GSH-conjugating activity towards traditional GST substrates but display modest GSH peroxidase activity. They are also implicated in the detoxification of the carcinogen dichloroacetic acid by catalyzing its dechlorination to glyoxylic acid.
  • Changed! TIGR maiA 205aa 4e-96 in ref transcript
    • Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (EC 5.2.1.2) is described as active also on maleylpyruvate, and the example from a Ralstonia sp. catabolic plasmid is described as a maleylpyruvate isomerase involved in gentisate catabolism.
  • Changed! COG Gst 195aa 8e-32 in ref transcript
    • Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd GST_C_Zeta 93aa 2e-33 in modified transcript
  • Changed! cd GST_N_Zeta 67aa 5e-27 in modified transcript
  • Changed! TIGR maiA 163aa 2e-69 in modified transcript
  • Changed! COG Gst 153aa 2e-17 in modified transcript

GTDC1

  • refseq_GTDC1.F1 refseq_GTDC1.R1 152 407
  • NCBIGene 36.3 79712
  • Multiple exon skipping, size difference: 255
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001006636

  • Changed! cd GT1_cap1E_like 90aa 1e-04 in ref transcript
    • This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides.
  • Changed! pfam Glycos_transf_1 110aa 0.001 in ref transcript
    • Glycosyl transferases group 1. Mutations in this domain of subunit A of phosphatidylinositol N-acetylglucosaminyltransferase lead to disease (Paroxysmal Nocturnal haemoglobinuria). Members of this family transfer activated sugars to a variety of substrates, including glycogen, Fructose-6-phosphate and lipopolysaccharides. Members of this family transfer UDP, ADP, GDP or CMP linked sugars. The eukaryotic glycogen synthases may be distant members of this family.
  • Changed! COG RfaG 175aa 1e-07 in ref transcript
    • Glycosyltransferase [Cell envelope biogenesis, outer membrane].

GTF2I

  • refseq_GTF2I.F2 refseq_GTF2I.R2 111 174
  • NCBIGene 36.3 2969
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032999

  • pfam GTF2I 73aa 6e-31 in ref transcript
    • GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain.
  • pfam GTF2I 73aa 1e-30 in ref transcript
  • pfam GTF2I 73aa 4e-30 in ref transcript
  • pfam GTF2I 73aa 1e-29 in ref transcript
  • pfam GTF2I 73aa 3e-26 in ref transcript
  • pfam GTF2I 73aa 3e-26 in ref transcript

GTF2IRD1

  • refseq_GTF2IRD1.F2 refseq_GTF2IRD1.R2 138 183
  • NCBIGene 36.3 9569
  • Alternative 3-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016328

  • pfam GTF2I 73aa 2e-29 in ref transcript
    • GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain.
  • pfam GTF2I 73aa 3e-29 in ref transcript
  • pfam GTF2I 73aa 3e-29 in ref transcript
  • pfam GTF2I 73aa 5e-28 in ref transcript
  • pfam GTF2I 73aa 2e-25 in ref transcript

GTF3C2

  • refseq_GTF3C2.F2 refseq_GTF3C2.R2 195 271
  • NCBIGene 36.3 2976
  • Single exon skipping, size difference: 76
  • Exclusion in 5'UTR
  • Reference transcript: NM_001521

  • cd WD40 90aa 1e-05 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • smart WD40 38aa 0.002 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.

GTPBP8

  • refseq_GTPBP8.F1 refseq_GTPBP8.R1 117 248
  • NCBIGene 36.2 29083
  • Single exon skipping, size difference: 131
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014170

  • Changed! cd YihA_EngB 169aa 1e-50 in ref transcript
    • The YihA (EngB) subfamily. This subfamily of GTPases is typified by the E. coli YihA, an essential protein involved in cell division control. YihA and its orthologs are small proteins that typically contain less than 200 amino acid residues and consists of the GTPase domain only (some of the eukaryotic homologs contain an N-terminal extension of about 120 residues that might be involved in organellar targeting). Homologs of yihA are found in most Gram-positive and Gram-negative pathogenic bacteria, with the exception of Mycobacterium tuberculosis. The broad-spectrum nature of YihA and its essentiality for cell viability in bacteria make it an attractive antibacterial target.
  • Changed! TIGR GTPase_YsxC 176aa 2e-47 in ref transcript
    • Members of this protein family are a GTPase associated with ribosome biogenesis, typified by YsxC from Bacillus subutilis. The family is widely but not universally distributed among bacteria. Members commonly are called EngB based on homology to EngA, one of several other GTPases of ribosome biogenesis. Cutoffs as set find essentially all bacterial members, but also identify large numbers of eukaryotic (probably organellar) sequences. This protein is found in about 80 percent of bacterial genomes.
  • Changed! PRK engB 188aa 6e-50 in ref transcript
    • GTPase EngB; Reviewed.
  • Changed! cd YihA_EngB 34aa 1e-09 in modified transcript
  • Changed! TIGR GTPase_YsxC 51aa 7e-13 in modified transcript
  • Changed! PRK engB 52aa 6e-14 in modified transcript

GTPBP8

  • refseq_GTPBP8.F3 refseq_GTPBP8.R3 129 228
  • NCBIGene 36.3 29083
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014170

  • Changed! cd YihA_EngB 169aa 1e-50 in ref transcript
    • The YihA (EngB) subfamily. This subfamily of GTPases is typified by the E. coli YihA, an essential protein involved in cell division control. YihA and its orthologs are small proteins that typically contain less than 200 amino acid residues and consists of the GTPase domain only (some of the eukaryotic homologs contain an N-terminal extension of about 120 residues that might be involved in organellar targeting). Homologs of yihA are found in most Gram-positive and Gram-negative pathogenic bacteria, with the exception of Mycobacterium tuberculosis. The broad-spectrum nature of YihA and its essentiality for cell viability in bacteria make it an attractive antibacterial target.
  • Changed! TIGR GTPase_YsxC 176aa 2e-47 in ref transcript
    • Members of this protein family are a GTPase associated with ribosome biogenesis, typified by YsxC from Bacillus subutilis. The family is widely but not universally distributed among bacteria. Members commonly are called EngB based on homology to EngA, one of several other GTPases of ribosome biogenesis. Cutoffs as set find essentially all bacterial members, but also identify large numbers of eukaryotic (probably organellar) sequences. This protein is found in about 80 percent of bacterial genomes.
  • Changed! PRK engB 188aa 6e-50 in ref transcript
    • GTPase EngB; Reviewed.
  • Changed! cd YihA_EngB 135aa 4e-35 in modified transcript
  • Changed! TIGR GTPase_YsxC 125aa 2e-28 in modified transcript
  • Changed! PRK engB 155aa 1e-30 in modified transcript

GUSBL2

  • refseq_GUSBL2.F1 refseq_GUSBL2.R1 163 316
  • NCBIGene 36.2 375513
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206910

GYPC

  • refseq_GYPC.F1 refseq_GYPC.R1 125 182
  • NCBIGene 36.3 2995
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002101

  • smart 4.1m 19aa 6e-04 in ref transcript
    • putative band 4.1 homologues' binding motif.

Gcom1

  • refseq_Gcom1.F1 refseq_Gcom1.R1 113 206
  • NCBIGene 36.2 145781
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001018090

  • Changed! TIGR SMC_prok_B 250aa 2e-08 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 266aa 4e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 235aa 2e-05 in modified transcript
  • Changed! COG Smc 206aa 4e-04 in modified transcript

Gcom1

  • refseq_Gcom1.F3 refseq_Gcom1.R3 138 222
  • NCBIGene 36.3 145781
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001018090

  • Changed! TIGR SMC_prok_B 250aa 2e-08 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 266aa 4e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 294aa 2e-09 in modified transcript
  • Changed! COG Smc 285aa 3e-06 in modified transcript

H2AFV

  • refseq_H2AFV.F1 refseq_H2AFV.R1 226 304
  • NCBIGene 36.3 94239
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012412

  • Changed! cd H2A 105aa 1e-37 in ref transcript
    • Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
  • Changed! smart H2A 106aa 2e-43 in ref transcript
    • Histone 2A.
  • Changed! PTZ PTZ00017 110aa 1e-46 in ref transcript
    • histone H2A; Provisional.
  • Changed! cd H2A 95aa 1e-33 in modified transcript
  • Changed! smart H2A 97aa 9e-39 in modified transcript
  • Changed! PTZ PTZ00017 101aa 1e-42 in modified transcript

H2AFV

  • refseq_H2AFV.F3 refseq_H2AFV.R3 121 251
  • NCBIGene 36.3 94239
  • Single exon skipping, size difference: 130
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_012412

  • Changed! cd H2A 105aa 1e-37 in ref transcript
    • Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
  • Changed! smart H2A 106aa 2e-43 in ref transcript
    • Histone 2A.
  • Changed! PTZ PTZ00017 110aa 1e-46 in ref transcript
    • histone H2A; Provisional.
  • Changed! cd H2A 49aa 8e-14 in modified transcript
  • Changed! smart H2A 48aa 1e-17 in modified transcript
  • Changed! PTZ PTZ00017 48aa 8e-17 in modified transcript

H2AFV

  • refseq_H2AFV.F5 refseq_H2AFV.R5 134 248
  • NCBIGene 36.3 94239
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012412

  • Changed! cd H2A 105aa 1e-37 in ref transcript
    • Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
  • Changed! smart H2A 106aa 2e-43 in ref transcript
    • Histone 2A.
  • Changed! PTZ PTZ00017 110aa 1e-46 in ref transcript
    • histone H2A; Provisional.
  • Changed! cd H2A 58aa 5e-18 in modified transcript
  • Changed! smart H2A 60aa 2e-19 in modified transcript
  • Changed! PTZ PTZ00017 64aa 2e-24 in modified transcript

HSD17B10

  • refseq_HADH2.F1 refseq_HADH2.R1 135 162
  • NCBIGene 36.3 3028
  • Alternative 5-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004493

  • Changed! TIGR 3oxo_ACP_reduc 210aa 2e-32 in ref transcript
    • This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
  • Changed! PRK fabG 254aa 2e-43 in ref transcript
    • 3-ketoacyl-(acyl-carrier-protein) reductase; Validated.
  • Changed! TIGR 3oxo_ACP_reduc 201aa 2e-26 in modified transcript
  • Changed! PRK fabG 245aa 4e-37 in modified transcript

HAGH

  • refseq_HAGH.F2 refseq_HAGH.R2 292 388
  • NCBIGene 36.3 3029
  • Single exon skipping, size difference: 96
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_005326

  • Changed! TIGR GSH_gloB 250aa 5e-70 in ref transcript
    • Members of this protein family are hydroxyacylglutathione hydrolase, a detoxification enzyme known as glyoxalase II. It follows lactoylglutathione lyase, or glyoxalase I, and acts to remove the toxic metabolite methylglyoxal and related compounds. This protein belongs to the broader metallo-beta-lactamase family (pfam00753).
  • Changed! PRK PRK10241 255aa 9e-33 in ref transcript
    • hydroxyacylglutathione hydrolase; Provisional.

HAO2

  • refseq_HAO2.F1 refseq_HAO2.R1 162 382
  • NCBIGene 36.3 51179
  • Single exon skipping, size difference: 220
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001005783

  • cd alpha_hydroxyacid_oxid_FMN 336aa 1e-120 in ref transcript
    • Family of homologous FMN-dependent alpha-hydroxyacid oxidizing enzymes. This family occurs in both prokaryotes and eukaryotes. Members of this family include flavocytochrome b2 (FCB2), glycolate oxidase (GOX), lactate monooxygenase (LMO), mandelate dehydrogenase (MDH), and long chain hydroxyacid oxidase (LCHAO). In green plants, glycolate oxidase is one of the key enzymes in photorespiration where it oxidizes glycolate to glyoxylate. LMO catalyzes the oxidation of L-lactate to acetate and carbon dioxide. MDH oxidizes (S)-mandelate to phenylglyoxalate. It is an enzyme in the mandelate pathway that occurs in several strains of Pseudomonas which converts (R)-mandelate to benzoate.
  • pfam FMN_dh 336aa 1e-129 in ref transcript
    • FMN-dependent dehydrogenase.
  • COG LldD 344aa 1e-81 in ref transcript
    • L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases [Energy production and conversion].

HAP1

  • refseq_HAP1.F1 refseq_HAP1.R1 187 343
  • NCBIGene 36.3 9001
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003949

  • Changed! pfam HAP1_N 354aa 1e-71 in ref transcript
    • HAP1 N-terminal conserved region. This family represents an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntington's disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34.
  • Changed! COG Smc 148aa 4e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! pfam HAP1_N 320aa 2e-65 in modified transcript
  • Changed! COG Smc 133aa 4e-05 in modified transcript

HAT1

  • refseq_HAT1.F2 refseq_HAT1.R2 315 391
  • NCBIGene 36.3 8520
  • Single exon skipping, size difference: 76
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003642

  • Changed! pfam Hat1_N 161aa 4e-50 in ref transcript
    • Histone acetyl transferase HAT1 N-terminus. This domain is the N-terminal half of the structure of histone acetyl transferase HAT1. It is often found in association with the C-terminal part of the GNAT Acetyltransf_1 (pfam00583) domain. It seems to be motifs C and D of the structure. Histone acetyltransferases (HATs) catalyse the transfer of an acetyl group from acetyl-CoA to the lysine E-amino groups on the N-terminal tails of histones. HATs are involved in transcription since histones tend to be hyper-acetylated in actively transcribed regions of chromatin, whereas in transcriptionally silent regions histones are hypo-acetylated.

HAX1

  • refseq_HAX1.F1 refseq_HAX1.R1 232 376
  • NCBIGene 36.3 10456
  • Alternative 3-prime, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006118

HCFC1R1

  • refseq_HCFC1R1.F1 refseq_HCFC1R1.R1 139 196
  • NCBIGene 36.3 54985
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017885

HDAC7A

  • refseq_HDAC7A.F2 refseq_HDAC7A.R2 117 237
  • NCBIGene 36.3 51564
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098415

  • pfam Hist_deacetyl 330aa 3e-82 in ref transcript
    • Histone deacetylase domain. Histones can be reversibly acetylated on several lysine residues. Regulation of transcription is caused in part by this mechanism. Histone deacetylases catalyse the removal of the acetyl group. Histone deacetylases are related to other proteins.
  • COG AcuC 309aa 3e-57 in ref transcript
    • Deacetylases, including yeast histone deacetylase and acetoin utilization protein [Chromatin structure and dynamics / Secondary metabolites biosynthesis, transport, and catabolism].

N6AMT1

  • refseq_HEMK2.F1 refseq_HEMK2.R1 121 205
  • NCBIGene 36.3 29104
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013240

  • Changed! cd AdoMet_MTases 126aa 3e-04 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! TIGR hemK_rel_arch 189aa 1e-23 in ref transcript
    • The gene hemK from E. coli was found to contribute to heme biosynthesis and originally suggested to be protoporphyrinogen oxidase. Functional analysis of the nearest homolog in Saccharomyces cerevisiae, YNL063w, finds it is not protoporphyrinogen oxidase and sequence analysis suggests that HemK homologs have S-adenosyl-methionine-dependent methyltransferase activity. Homologs are found, usually in a single copy, in nearly all completed genomes, but varying somewhat in apparent domain architecture. This model represents an archaeal and eukaryotic protein family that lacks an N-terminal domain found in HemK and its eubacterial homologs. It is found in a single copy in the first six completed archaeal and eukaryotic genomes.
  • Changed! COG HemK 171aa 1e-23 in ref transcript
    • Methylase of polypeptide chain release factors [Translation, ribosomal structure and biogenesis].
  • Changed! TIGR hemK_rel_arch 161aa 5e-15 in modified transcript
  • Changed! COG HemK 143aa 4e-14 in modified transcript

HERC4

  • refseq_HERC4.F2 refseq_HERC4.R2 182 342
  • NCBIGene 36.2 26091
  • Single exon skipping, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_022079

  • Changed! cd HECTc 347aa 1e-108 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • Changed! pfam HECT 298aa 5e-93 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • Changed! pfam RCC1 50aa 2e-07 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • Changed! pfam RCC1 48aa 1e-05 in ref transcript
  • Changed! pfam RCC1 48aa 0.004 in ref transcript
  • Changed! pfam RCC1 50aa 0.006 in ref transcript
  • Changed! COG HUL4 398aa 6e-78 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG ATS1 251aa 3e-35 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
  • Changed! COG ATS1 69aa 1e-09 in modified transcript

HERC4

  • refseq_HERC4.F3 refseq_HERC4.R3 123 147
  • NCBIGene 36.3 26091
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022079

  • cd HECTc 347aa 1e-108 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • pfam HECT 298aa 5e-93 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • pfam RCC1 50aa 2e-07 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 48aa 1e-05 in ref transcript
  • pfam RCC1 48aa 0.004 in ref transcript
  • pfam RCC1 50aa 0.006 in ref transcript
  • COG HUL4 398aa 6e-78 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • COG ATS1 251aa 3e-35 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

HERC4

  • refseq_HERC4.F4 refseq_HERC4.R4 204 296
  • NCBIGene 36.2 26091
  • Single exon skipping, size difference: 92
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_022079

  • Changed! cd HECTc 347aa 1e-108 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • Changed! pfam HECT 298aa 5e-93 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • pfam RCC1 50aa 2e-07 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 48aa 1e-05 in ref transcript
  • Changed! pfam RCC1 48aa 0.004 in ref transcript
  • Changed! pfam RCC1 50aa 0.006 in ref transcript
  • Changed! COG HUL4 398aa 6e-78 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG ATS1 251aa 3e-35 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
  • Changed! COG ATS1 204aa 2e-30 in modified transcript

HERC6

  • refseq_HERC6.F1 refseq_HERC6.R1 215 297
  • NCBIGene 36.2 55008
  • Single exon skipping, size difference: 82
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_017912

  • Changed! cd HECTc 343aa 1e-101 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • Changed! pfam HECT 288aa 2e-65 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • pfam RCC1 51aa 4e-05 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 48aa 6e-04 in ref transcript
  • Changed! COG HUL4 608aa 1e-58 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • COG ATS1 241aa 1e-23 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

HERC6

  • refseq_HERC6.F2 refseq_HERC6.R1 282 390
  • NCBIGene 36.2 55008
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017912

  • cd HECTc 343aa 1e-101 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • pfam HECT 288aa 2e-65 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • pfam RCC1 51aa 4e-05 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 48aa 6e-04 in ref transcript
  • Changed! COG HUL4 608aa 1e-58 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • COG ATS1 241aa 1e-23 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
  • Changed! COG HUL4 572aa 1e-62 in modified transcript

HERC6

  • refseq_HERC6.F4 refseq_HERC6.R3 209 364
  • NCBIGene 36.2 55008
  • Single exon skipping, size difference: 155
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017912

  • Changed! cd HECTc 343aa 1e-101 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • Changed! pfam HECT 288aa 2e-65 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • pfam RCC1 51aa 4e-05 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 48aa 6e-04 in ref transcript
  • Changed! COG HUL4 608aa 1e-58 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • COG ATS1 241aa 1e-23 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

HERC6

  • refseq_HERC6.F5 refseq_HERC6.R4 224 346
  • NCBIGene 36.2 55008
  • Single exon skipping, size difference: 122
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017912

  • Changed! cd HECTc 343aa 1e-101 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • Changed! pfam HECT 288aa 2e-65 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • pfam RCC1 51aa 4e-05 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 48aa 6e-04 in ref transcript
  • Changed! COG HUL4 608aa 1e-58 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • COG ATS1 241aa 1e-23 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

HFE

  • refseq_HFE.F1 refseq_HFE.R1 284 353
  • NCBIGene 36.3 3077
  • Alternative 3-prime, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000410

  • cd IGc 76aa 2e-12 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! pfam MHC_I 176aa 3e-31 in ref transcript
    • Class I Histocompatibility antigen, domains alpha 1 and 2.
  • pfam C1-set 72aa 2e-15 in ref transcript
    • Immunoglobulin C1-set domain.
  • Changed! pfam MHC_I 157aa 6e-27 in modified transcript

HFE

  • refseq_HFE.F3 refseq_HFE.R3 202 244
  • NCBIGene 36.3 3077
  • Alternative 3-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000410

  • Changed! cd IGc 76aa 2e-12 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam MHC_I 176aa 3e-31 in ref transcript
    • Class I Histocompatibility antigen, domains alpha 1 and 2.
  • Changed! pfam C1-set 72aa 2e-15 in ref transcript
    • Immunoglobulin C1-set domain.
  • Changed! cd IGc 92aa 3e-13 in modified transcript
  • Changed! pfam C1-set 73aa 6e-16 in modified transcript

HFE

  • refseq_HFE.F5 refseq_HFE.R3 130 406
  • NCBIGene 36.3 3077
  • Single exon skipping, size difference: 276
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000410

  • cd IGc 76aa 2e-12 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! pfam MHC_I 176aa 3e-31 in ref transcript
    • Class I Histocompatibility antigen, domains alpha 1 and 2.
  • pfam C1-set 72aa 2e-15 in ref transcript
    • Immunoglobulin C1-set domain.
  • Changed! pfam MHC_I 87aa 4e-08 in modified transcript

HFE2

  • refseq_HFE2.F2 refseq_HFE2.R2 103 289
  • NCBIGene 36.3 148738
  • Single exon skipping, size difference: 186
  • Exclusion of the protein initiation site
  • Reference transcript: NM_213653

  • pfam RGM_C 197aa 4e-81 in ref transcript
    • Repulsive guidance molecule (RGM) C-terminus. This family consists of several mammalian and one bird sequence from Gallus gallus (Chicken). This family represents the C-terminal region of several sequences but in others it represents the full protein. All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.
  • Changed! pfam RGM_N 195aa 3e-57 in ref transcript
    • Repulsive guidance molecule (RGM) N-terminus. This family consists of the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken). All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.
  • Changed! pfam RGM_N 114aa 7e-40 in modified transcript

HGD

  • refseq_HGD.F2 refseq_HGD.R2 143 366
  • NCBIGene 36.2 3081
  • Alternative 5-prime and 3-prime, size difference: 223
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001125869

  • Changed! TIGR hmgA 432aa 0.0 in ref transcript
    • Missing in human disease alkaptonuria.
  • Changed! PRK PRK05341 439aa 0.0 in ref transcript
    • homogentisate 1,2-dioxygenase; Provisional.
  • Changed! TIGR hmgA 301aa 1e-170 in modified transcript
  • Changed! PRK PRK05341 303aa 1e-132 in modified transcript

HHLA3

  • refseq_HHLA3.F2 refseq_HHLA3.R2 262 314
  • NCBIGene 36.3 11147
  • Single exon skipping, size difference: 52
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001031693

HIBCH

  • refseq_HIBCH.F1 refseq_HIBCH.R1 191 225
  • NCBIGene 36.3 26275
  • Single exon skipping, size difference: 34
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014362

  • cd crotonase-like 185aa 2e-39 in ref transcript
    • Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
  • pfam ECH 170aa 4e-15 in ref transcript
    • Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase.
  • Changed! PRK PRK05617 355aa 1e-116 in ref transcript
    • enoyl-CoA hydratase; Provisional.
  • Changed! PRK PRK05617 306aa 9e-93 in modified transcript

NFU1

  • refseq_HIRIP5.F2 refseq_HIRIP5.R2 307 443
  • NCBIGene 36.3 27247
  • Single exon skipping, size difference: 136
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015700

  • Changed! pfam Nfu_N 88aa 3e-38 in ref transcript
    • Scaffold protein Nfu/NifU N terminal. This domain is found at the N terminus of NifU and NifU related proteins, and in the human Nfu protein. Both of these proteins are thought to be involved in the the assembly of iron-sulphur clusters.
  • Changed! pfam NifU 69aa 7e-20 in ref transcript
    • NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown.
  • Changed! COG COG0694 85aa 4e-22 in ref transcript
    • Thioredoxin-like proteins and domains [Posttranslational modification, protein turnover, chaperones].

NFU1

  • refseq_HIRIP5.F3 refseq_HIRIP5.R3 184 274
  • NCBIGene 36.3 27247
  • Alternative 5-prime, size difference: 90
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001002755

  • Changed! pfam Nfu_N 88aa 2e-38 in ref transcript
    • Scaffold protein Nfu/NifU N terminal. This domain is found at the N terminus of NifU and NifU related proteins, and in the human Nfu protein. Both of these proteins are thought to be involved in the the assembly of iron-sulphur clusters.
  • Changed! pfam NifU 69aa 7e-20 in ref transcript
    • NifU-like domain. This is an alignment of the carboxy-terminal domain. This is the only common region between the NifU protein from nitrogen-fixing bacteria and rhodobacterial species. The biochemical function of NifU is unknown.
  • Changed! COG COG0694 85aa 4e-22 in ref transcript
    • Thioredoxin-like proteins and domains [Posttranslational modification, protein turnover, chaperones].

HISPPD2A

  • refseq_HISPPD2A.F1 refseq_HISPPD2A.R1 98 500
  • NCBIGene 36.3 9677
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 402
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014659

  • cd HP_HAP_like 80aa 4e-16 in ref transcript
    • Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
  • cd HP_HAP_like 139aa 8e-09 in ref transcript
  • pfam Acid_phosphat_A 407aa 5e-37 in ref transcript
    • Histidine acid phosphatase.
  • pfam Acid_phosphat_A 47aa 3e-07 in ref transcript

HISPPD2A

  • refseq_HISPPD2A.F4 refseq_HISPPD2A.R4 113 392
  • NCBIGene 36.3 9677
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 279
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014659

  • cd HP_HAP_like 80aa 4e-16 in ref transcript
    • Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
  • Changed! cd HP_HAP_like 139aa 8e-09 in ref transcript
  • Changed! pfam Acid_phosphat_A 407aa 5e-37 in ref transcript
    • Histidine acid phosphatase.
  • pfam Acid_phosphat_A 47aa 3e-07 in ref transcript
  • Changed! cd HP_HAP_like 108aa 8e-05 in modified transcript
  • Changed! pfam Acid_phosphat_A 372aa 2e-32 in modified transcript

HISPPD2A

  • refseq_HISPPD2A.F5 refseq_HISPPD2A.R5 130 193
  • NCBIGene 36.3 9677
  • Single exon skipping, size difference: 63
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_014659

  • cd HP_HAP_like 80aa 4e-16 in ref transcript
    • Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
  • cd HP_HAP_like 139aa 8e-09 in ref transcript
  • pfam Acid_phosphat_A 407aa 5e-37 in ref transcript
    • Histidine acid phosphatase.
  • pfam Acid_phosphat_A 47aa 3e-07 in ref transcript

HISPPD2A

  • refseq_HISPPD2A.F7 refseq_HISPPD2A.R7 111 135
  • NCBIGene 36.3 9677
  • Alternative 5-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014659

  • cd HP_HAP_like 80aa 4e-16 in ref transcript
    • Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
  • Changed! cd HP_HAP_like 139aa 8e-09 in ref transcript
  • Changed! pfam Acid_phosphat_A 407aa 5e-37 in ref transcript
    • Histidine acid phosphatase.
  • pfam Acid_phosphat_A 47aa 3e-07 in ref transcript
  • Changed! cd HP_HAP_like 102aa 5e-07 in modified transcript
  • Changed! pfam Acid_phosphat_A 399aa 6e-38 in modified transcript

HK1

  • refseq_HK1.F2 refseq_HK1.R2 102 156
  • NCBIGene 36.3 3098
  • Alternative 5-prime, size difference: 54
  • Exclusion in 5'UTR
  • Reference transcript: NM_033498

  • pfam Hexokinase_2 237aa 1e-113 in ref transcript
    • Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains.
  • pfam Hexokinase_2 238aa 1e-108 in ref transcript
  • pfam Hexokinase_1 206aa 6e-94 in ref transcript
    • Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam03727. Some members of the family have two copies of each of these domains.
  • pfam Hexokinase_1 201aa 8e-90 in ref transcript
  • COG COG5026 443aa 5e-84 in ref transcript
    • Hexokinase [Carbohydrate transport and metabolism].
  • COG COG5026 442aa 1e-74 in ref transcript

HK1

  • refseq_HK1.F3 refseq_HK1.R3 113 206
  • NCBIGene 36.3 3098
  • Single exon skipping, size difference: 93
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_033497

  • Changed! pfam Hexokinase_2 237aa 1e-113 in ref transcript
    • Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains.
  • Changed! pfam Hexokinase_2 238aa 1e-108 in ref transcript
  • Changed! pfam Hexokinase_1 206aa 6e-94 in ref transcript
    • Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam03727. Some members of the family have two copies of each of these domains.
  • Changed! pfam Hexokinase_1 201aa 8e-90 in ref transcript
  • Changed! COG COG5026 443aa 5e-84 in ref transcript
    • Hexokinase [Carbohydrate transport and metabolism].
  • Changed! COG COG5026 442aa 1e-74 in ref transcript

HM13

  • refseq_HM13.F2 refseq_HM13.R2 162 218
  • NCBIGene 36.3 81502
  • Alternative 5-prime, size difference: 56
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_178581

  • pfam Peptidase_A22B 286aa 1e-89 in ref transcript
    • Signal peptide peptidase. The members of this family are membrane proteins. In some proteins this region is found associated with pfam02225. This family corresponds with Merops subfamily A22B, the type example of which is signal peptide peptidase. There is a sequence-similarity relationship with pfam01080.
  • COG COG3389 176aa 3e-05 in ref transcript
    • Uncharacterized protein conserved in archaea [Function unknown].

HMG2L1

  • refseq_HMG2L1.F1 refseq_HMG2L1.R1 248 363
  • NCBIGene 36.3 10042
  • Single exon skipping, size difference: 115
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001003681

  • Changed! cd HMG-box 60aa 5e-11 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • Changed! pfam HMG_box 61aa 5e-11 in ref transcript
    • HMG (high mobility group) box.
  • Changed! COG NHP6B 72aa 9e-04 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

HMG2L1

  • refseq_HMG2L1.F3 refseq_HMG2L1.R3 100 199
  • NCBIGene 36.3 10042
  • Single exon skipping, size difference: 99
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001003681

  • cd HMG-box 60aa 5e-11 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • Changed! pfam HMG_box 61aa 5e-11 in ref transcript
    • HMG (high mobility group) box.
  • COG NHP6B 72aa 9e-04 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

HMGN3

  • refseq_HMGN3.F1 refseq_HMGN3.R1 111 152
  • NCBIGene 36.3 9324
  • Alternative 5-prime, size difference: 41
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004242

HN1

  • refseq_HN1.F2 refseq_HN1.R2 139 234
  • NCBIGene 36.3 51155
  • Single exon skipping, size difference: 95
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001002032

HNRNPA2B1

  • refseq_HNRPA2B1.F2 refseq_HNRPA2B1.R2 180 216
  • NCBIGene 36.3 3181
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031243

  • Changed! cd RRM 73aa 3e-13 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 74aa 8e-11 in ref transcript
  • Changed! TIGR SF-CC1 140aa 9e-15 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • Changed! COG COG0724 155aa 3e-10 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! cd RRM 74aa 3e-13 in modified transcript
  • Changed! TIGR SF-CC1 150aa 2e-15 in modified transcript
  • Changed! COG COG0724 164aa 1e-09 in modified transcript

HNRPAB

  • refseq_HNRPAB.F2 refseq_HNRPAB.R2 129 270
  • NCBIGene 36.3 3182
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031266

  • cd RRM 70aa 6e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 48aa 4e-10 in ref transcript
  • TIGR SF-CC1 137aa 2e-18 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • pfam CBFNT 71aa 6e-06 in ref transcript
    • CBFNT (NUC161) domain. This N terminal domain is found in proteins of CARG-binding factor A-like proteins.
  • COG COG0724 142aa 6e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

HNRPD

  • refseq_HNRPD.F2 refseq_HNRPD.R2 160 217
  • NCBIGene 36.3 3184
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031370

  • cd RRM 79aa 2e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 74aa 9e-14 in ref transcript
  • Changed! TIGR PABP-1234 143aa 2e-19 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • Changed! COG COG0724 179aa 2e-07 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! TIGR PABP-1234 155aa 2e-20 in modified transcript
  • Changed! COG COG0724 162aa 5e-07 in modified transcript

HNRPD

  • refseq_HNRPD.F3 refseq_HNRPD.R3 189 336
  • NCBIGene 36.3 3184
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031370

  • cd RRM 79aa 2e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 74aa 9e-14 in ref transcript
  • TIGR PABP-1234 143aa 2e-19 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • COG COG0724 179aa 2e-07 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

HNRPDL

  • refseq_HNRPDL.F1 refseq_HNRPDL.R1 133 238
  • NCBIGene 36.3 9987
  • Single exon skipping, size difference: 105
  • Exclusion in 3'UTR
  • Reference transcript: NR_003249

  • cd RRM 74aa 1e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 70aa 1e-14 in ref transcript
  • TIGR SF-CC1 149aa 1e-18 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • COG COG0724 170aa 3e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

HNRPH3

  • refseq_HNRPH3.F1 refseq_HNRPH3.R1 120 165
  • NCBIGene 36.3 3189
  • Alternative 5-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012207

  • cd RRM 74aa 5e-08 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 71aa 1e-07 in ref transcript
  • smart RRM_2 69aa 4e-08 in ref transcript
    • RNA recognition motif.
  • smart RRM_2 72aa 5e-08 in ref transcript

HNRPK

  • refseq_HNRPK.F1 refseq_HNRPK.R1 283 343
  • NCBIGene 36.3 3190
  • Alternative 3-prime, size difference: 60
  • Exclusion of the stop codon
  • Reference transcript: NM_002140

  • cd PCBP_like_KH 62aa 1e-11 in ref transcript
    • K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
  • cd PCBP_like_KH 65aa 1e-07 in ref transcript
  • pfam ROKNT 43aa 3e-12 in ref transcript
    • ROKNT (NUC014) domain. This presumed domain is found at the N-terminus of RNP K-like proteins that also contains KH domains pfam00013.
  • pfam KH_1 61aa 7e-10 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • PRK PRK11824 52aa 0.001 in ref transcript
    • polynucleotide phosphorylase/polyadenylase; Provisional.

HNRNPU

  • refseq_HNRPU.F2 refseq_HNRPU.R2 134 191
  • NCBIGene 36.3 3192
  • Alternative 5-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031844

  • pfam SPRY 117aa 6e-23 in ref transcript
    • SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • TIGR PNK-3'Pase 111aa 0.003 in ref transcript
    • Note that the EC number for the kinase function is: 2.7.1.78.
  • COG COG4639 151aa 0.005 in ref transcript
    • Predicted kinase [General function prediction only].

HNRPUL1

  • refseq_HNRPUL1.F2 refseq_HNRPUL1.R2 231 387
  • NCBIGene 36.2 11100
  • Alternative 3-prime, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007040

  • pfam SPRY 133aa 2e-25 in ref transcript
    • SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • TIGR selen_PSTK 146aa 0.005 in ref transcript
    • Members of this protein are L-seryl-tRNA(Sec) kinase. This enzyme is part of a two-step pathway in Eukaryota and Archaea for performing selenocysteine biosynthesis by changing serine misacylated on selenocysteine-tRNA to selenocysteine. This enzyme performs the first step, phosphorylation of the OH group of the serine side chain. This family represents archaeal proteins with this activity.
  • COG COG4639 131aa 0.009 in ref transcript
    • Predicted kinase [General function prediction only].

HOMER2

  • refseq_HOMER2.F1 refseq_HOMER2.R1 131 164
  • NCBIGene 36.3 9455
  • Alternative 3-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199330

  • cd Homer 111aa 6e-54 in ref transcript
    • Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
  • pfam WH1 104aa 6e-32 in ref transcript
    • WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homologue. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.
  • TIGR SMC_prok_A 190aa 8e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! COG Smc 188aa 0.008 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG SbcC 237aa 0.001 in modified transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].

HOMER2

  • refseq_HOMER2.F4 refseq_HOMER2.R4 281 446
  • NCBIGene 36.3 9455
  • Alternative 5-prime and 3-prime, size difference: 165
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_199330

  • cd Homer 111aa 6e-54 in ref transcript
    • Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
  • pfam WH1 104aa 6e-32 in ref transcript
    • WH1 domain. WASp Homology domain 1 (WH1) domain. WASP is the protein that is defective in Wiskott-Aldrich syndrome (WAS). The majority of point mutations occur within the amino- terminal WH1 domain. The metabotropic glutamate receptors mGluR1alpha and mGluR5 bind a protein called homer, which is a WH1 domain homologue. A subset of WH1 domains has been termed a "EVH1" domain and appear to bind a polyproline motif.
  • Changed! TIGR SMC_prok_A 190aa 8e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! COG Smc 188aa 0.008 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

HOPX

  • refseq_HOP.F1 refseq_HOP.R1 237 362
  • NCBIGene 36.3 84525
  • Single exon skipping, size difference: 125
  • Exclusion of the protein initiation site
  • Reference transcript: NM_032495

  • cd homeodomain 53aa 1e-06 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 52aa 4e-05 in ref transcript
    • Homeobox domain.

HPCAL1

  • refseq_HPCAL1.F1 refseq_HPCAL1.R1 178 351
  • NCBIGene 36.3 3241
  • Single exon skipping, size difference: 173
  • Exclusion in 5'UTR
  • Reference transcript: NM_134421

  • cd EFh 62aa 3e-11 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 75aa 2e-10 in ref transcript
  • cd EFh 61aa 0.008 in ref transcript
  • smart EFh 29aa 8e-04 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • COG FRQ1 167aa 8e-27 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

HPS1

  • refseq_HPS1.F1 refseq_HPS1.R1 106 149
  • NCBIGene 36.2 3257
  • Alternative 5-prime, size difference: 43
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_000195

HPS1

  • refseq_HPS1.F2 refseq_HPS1.R2 149 292
  • NCBIGene 36.2 3257
  • Single exon skipping, size difference: 143
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000195

HPS1

  • refseq_HPS1.F4 refseq_HPS1.R4 140 239
  • NCBIGene 36.2 3257
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000195

HPS4

  • refseq_HPS4.F2 refseq_HPS4.R2 144 198
  • NCBIGene 36.2 89781
  • Single exon skipping, size difference: 54
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_022081

HPS5

  • refseq_HPS5.F2 refseq_HPS5.R2 222 379
  • NCBIGene 36.3 11234
  • Single exon skipping, size difference: 157
  • Exclusion of the protein initiation site
  • Reference transcript: NM_181507

HR

  • refseq_HR.F1 refseq_HR.R1 243 408
  • NCBIGene 36.3 55806
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005144

  • Changed! pfam JmjC 41aa 4e-06 in ref transcript
    • JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.

HSD11B1L

  • refseq_HSD11B1L.F1 refseq_HSD11B1L.R1 189 252
  • NCBIGene 36.3 374875
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 63
  • Inclusion in the protein (no stop codon or frameshift), Exclusion of the protein initiation site
  • Reference transcript: NM_198533

  • Changed! pfam adh_short 165aa 4e-13 in ref transcript
    • short chain dehydrogenase. This family contains a wide variety of dehydrogenases.
  • Changed! PRK PRK06181 234aa 7e-28 in ref transcript
    • short chain dehydrogenase; Provisional.
  • Changed! pfam adh_short 109aa 9e-04 in modified transcript
  • Changed! PRK PRK06181 177aa 6e-14 in modified transcript

HSD11B1L

  • refseq_HSD11B1L.F3 refseq_HSD11B1L.R3 134 246
  • NCBIGene 36.3 374875
  • Single exon skipping, size difference: 112
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_198533

  • Changed! pfam adh_short 165aa 4e-13 in ref transcript
    • short chain dehydrogenase. This family contains a wide variety of dehydrogenases.
  • Changed! PRK PRK06181 234aa 7e-28 in ref transcript
    • short chain dehydrogenase; Provisional.
  • Changed! PRK PRK05866 44aa 5e-07 in modified transcript
    • short chain dehydrogenase; Provisional.

HSD11B1L

  • refseq_HSD11B1L.F6 refseq_HSD11B1L.R6 223 386
  • NCBIGene 36.3 374875
  • Alternative 3-prime, size difference: 163
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_198533

  • pfam adh_short 165aa 4e-13 in ref transcript
    • short chain dehydrogenase. This family contains a wide variety of dehydrogenases.
  • Changed! PRK PRK06181 234aa 7e-28 in ref transcript
    • short chain dehydrogenase; Provisional.
  • Changed! PRK PRK06181 216aa 9e-26 in modified transcript

HTATIP

  • refseq_HTATIP.F1 refseq_HTATIP.R1 105 261
  • NCBIGene 36.3 10524
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182710

  • cd CHROMO 53aa 0.003 in ref transcript
    • Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
  • pfam MOZ_SAS 193aa 1e-95 in ref transcript
    • MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases.
  • smart CHROMO 52aa 2e-04 in ref transcript
    • Chromatin organization modifier domain.
  • Changed! COG SAS2 275aa 6e-96 in ref transcript
    • Histone acetyltransferase (MYST family) [Chromatin structure and dynamics].
  • Changed! COG SAS2 72aa 2e-07 in ref transcript
  • Changed! COG SAS2 443aa 1e-102 in modified transcript

HTR4

  • refseq_HTR4.F1 refseq_HTR4.R1 235 355
  • NCBIGene 36.3 3360
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040173

  • pfam 7tm_1 271aa 5e-51 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

HTR4

  • refseq_HTR4.F3 refseq_HTR4.R3 223 265
  • NCBIGene 36.3 3360
  • Single exon skipping, size difference: 42
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001040173

  • Changed! pfam 7tm_1 271aa 5e-51 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 285aa 3e-50 in modified transcript

HTR4

  • refseq_HTR4.F5 refseq_HTR4.R6 247 323
  • NCBIGene 36.3 3360
  • Single exon skipping, size difference: 76
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001040169

  • pfam 7tm_1 271aa 6e-50 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

HTRA2

  • refseq_HTRA2.F1 refseq_HTRA2.R1 209 305
  • NCBIGene 36.3 27429
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013247

  • Changed! cd PDZ_serine_protease 93aa 1e-12 in ref transcript
    • PDZ domain of tryspin-like serine proteases, such as DegP/HtrA, which are oligomeric proteins involved in heat-shock response, chaperone function, and apoptosis. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
  • Changed! TIGR degP_htrA_DO 304aa 6e-63 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • Changed! COG DegQ 312aa 4e-46 in ref transcript
    • Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd PDZ_metalloprotease 51aa 1e-05 in modified transcript
    • PDZ domain of bacterial and plant zinc metalloprotases, presumably membrane-associated or integral membrane proteases, which may be involved in signalling and regulatory mechanisms. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
  • Changed! TIGR degP_htrA_DO 272aa 5e-53 in modified transcript
  • Changed! COG DegQ 280aa 2e-40 in modified transcript

HTRA2

  • refseq_HTRA2.F3 refseq_HTRA2.R3 110 305
  • NCBIGene 36.3 27429
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013247

  • cd PDZ_serine_protease 93aa 1e-12 in ref transcript
    • PDZ domain of tryspin-like serine proteases, such as DegP/HtrA, which are oligomeric proteins involved in heat-shock response, chaperone function, and apoptosis. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
  • Changed! TIGR degP_htrA_DO 304aa 6e-63 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • Changed! COG DegQ 312aa 4e-46 in ref transcript
    • Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones].
  • Changed! TIGR degP_htrA_DO 239aa 5e-30 in modified transcript
  • Changed! COG DegQ 157aa 9e-18 in modified transcript
  • Changed! PRK PRK10898 81aa 1e-04 in modified transcript
    • serine endoprotease; Provisional.

HYAL1

  • refseq_HYAL1.F1 refseq_HYAL1.R1 101 191
  • NCBIGene 36.3 3373
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153281

  • Changed! pfam Glyco_hydro_56 335aa 1e-146 in ref transcript
    • Hyaluronidase.
  • Changed! pfam Glyco_hydro_56 305aa 1e-127 in modified transcript

ICAM4

  • refseq_ICAM4.F2 refseq_ICAM4.R2 237 314
  • NCBIGene 36.3 3386
  • Alternative 3-prime, size difference: 77
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001039132

  • Changed! pfam ICAM_N 87aa 2e-11 in ref transcript
    • Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signalling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, during the viral attachment process. This family is a family that is part of the Ig superfamily and is therefore related to the family ig (pfam00047).
  • Changed! pfam ICAM_N 89aa 3e-12 in modified transcript

ICK

  • refseq_ICK.F2 refseq_ICK.R2 239 350
  • NCBIGene 36.3 22858
  • Single exon skipping, size difference: 111
  • Exclusion in 5'UTR
  • Reference transcript: NM_016513

  • cd S_TKc 282aa 3e-74 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 281aa 1e-79 in ref transcript
    • Protein kinase domain.
  • PTZ PTZ00024 275aa 2e-43 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

IDH3B

  • refseq_IDH3B.F2 refseq_IDH3B.R2 101 416
  • NCBIGene 36.3 3420
  • Alternative 3-prime, size difference: 315
  • Exclusion of the stop codon
  • Reference transcript: NM_006899

  • Changed! TIGR mito_nad_idh 334aa 1e-133 in ref transcript
    • The NADP-dependent IDH of Thermus aquaticus thermophilus strain HB8 resembles these NAD-dependent IDH, except for the residues involved in cofactor specificity, much more closely than it resembles other prokaryotic NADP-dependent IDH, including that of Thermus aquaticus strain YT1.
  • Changed! COG LeuB 336aa 1e-81 in ref transcript
    • Isocitrate/isopropylmalate dehydrogenase [Amino acid transport and metabolism].
  • Changed! TIGR mito_nad_idh 334aa 1e-133 in modified transcript
  • Changed! COG LeuB 334aa 1e-82 in modified transcript

IFIT1

  • refseq_IFIT1.F2 refseq_IFIT1.R2 201 310
  • NCBIGene 36.2 3434
  • Single exon skipping, size difference: 109
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001001887

  • Changed! cd TPR 85aa 8e-05 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • Changed! cd TPR 91aa 0.001 in ref transcript
  • Changed! TIGR PEP_TPR_lipo 412aa 1e-08 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • Changed! COG PilF 144aa 0.002 in ref transcript
    • Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].

IFNAR2

  • refseq_IFNAR2.F1 refseq_IFNAR2.R1 186 232
  • NCBIGene 36.3 3455
  • Alternative 3-prime, size difference: 46
  • Inclusion in 5'UTR
  • Reference transcript: NM_207585

  • pfam Interfer-bind 101aa 1e-48 in ref transcript
    • Interferon-alpha/beta receptor, fibronectin type III. Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha.

IFT122

  • refseq_IFT122.F2 refseq_IFT122.R2 218 395
  • NCBIGene 36.3 55764
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_052985

  • Changed! cd WD40 230aa 2e-14 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! cd WD40 259aa 6e-08 in ref transcript
  • smart WD40 39aa 2e-06 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • TIGR PEP_TPR_lipo 209aa 0.002 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • Changed! COG COG2319 229aa 6e-08 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG COG2319 269aa 3e-04 in ref transcript
  • Changed! cd WD40 293aa 3e-16 in modified transcript
  • Changed! COG COG2319 329aa 4e-13 in modified transcript

IFT122

  • refseq_IFT122.F3 refseq_IFT122.R3 119 272
  • NCBIGene 36.3 55764
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_052985

  • Changed! cd WD40 230aa 2e-14 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! cd WD40 259aa 6e-08 in ref transcript
  • smart WD40 39aa 2e-06 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • TIGR PEP_TPR_lipo 209aa 0.002 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • Changed! COG COG2319 229aa 6e-08 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG COG2319 269aa 3e-04 in ref transcript
  • Changed! cd WD40 176aa 1e-16 in modified transcript
  • Changed! cd WD40 302aa 9e-14 in modified transcript
  • Changed! COG COG2319 345aa 1e-11 in modified transcript

IFT88

  • refseq_IFT88.F1 refseq_IFT88.R1 115 344
  • NCBIGene 36.3 8100
  • Multiple exon skipping, size difference: 229
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_175605

  • cd TPR 99aa 6e-12 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • cd TPR 100aa 9e-08 in ref transcript
  • cd TPR 95aa 4e-07 in ref transcript
  • cd TPR 98aa 8e-06 in ref transcript
  • cd TPR 93aa 3e-05 in ref transcript
  • TIGR PEP_TPR_lipo 270aa 1e-14 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • TIGR PEP_TPR_lipo 135aa 0.005 in ref transcript
  • Changed! COG NrfG 259aa 3e-05 in ref transcript
    • FOG: TPR repeat [General function prediction only].
  • Changed! COG PilF 213aa 4e-05 in modified transcript
    • Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].
  • Changed! COG COG4235 125aa 8e-04 in modified transcript
    • Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones].

IGF2BP2

  • refseq_IGF2BP2.F1 refseq_IGF2BP2.R1 224 353
  • NCBIGene 36.3 10644
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006548

  • cd PCBP_like_KH 64aa 1e-09 in ref transcript
    • K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
  • cd RRM 70aa 8e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd KH-I 64aa 2e-08 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • cd KH-I 66aa 3e-08 in ref transcript
  • cd RRM 72aa 6e-07 in ref transcript
  • cd KH-I 63aa 5e-06 in ref transcript
  • smart KH 74aa 2e-08 in ref transcript
    • K homology RNA-binding domain.
  • smart KH 68aa 3e-08 in ref transcript
  • pfam KH_1 64aa 1e-07 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • pfam KH_1 63aa 4e-07 in ref transcript
  • smart RRM 66aa 5e-07 in ref transcript
    • RNA recognition motif.
  • TIGR PABP-1234 107aa 2e-06 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • TIGR arCOG04150 132aa 2e-04 in ref transcript
    • This family of proteins is universal among the 41 archaeal genomes analyzed in and is not observed outside of the archaea. The proteins contain a single KH domain (pfam00013) which is likely to confer the ability to bind RNA.
  • PRK PRK13763 140aa 8e-05 in ref transcript
    • putative RNA-processing protein; Provisional.
  • COG COG0724 76aa 1e-04 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • PRK PRK12705 123aa 9e-04 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! cd PCBP_like_KH 56aa 8e-06 in modified transcript

IGLL1

  • refseq_IGLL1.F2 refseq_IGLL1.R2 225 341
  • NCBIGene 36.3 3543
  • Single exon skipping, size difference: 116
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020070

  • Changed! cd IGc 95aa 5e-22 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! smart IGc1 75aa 2e-21 in ref transcript
    • Immunoglobulin C-Type.

IGSF3

  • refseq_IGSF3.F2 refseq_IGSF3.R2 334 394
  • NCBIGene 36.3 3321
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001542

  • cd IGv 94aa 8e-04 in ref transcript
    • Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGv 86aa 8e-04 in ref transcript
  • pfam V-set 105aa 8e-08 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • pfam V-set 102aa 4e-05 in ref transcript
  • pfam V-set 102aa 8e-05 in ref transcript
  • pfam V-set 111aa 8e-05 in ref transcript

IHPK2

  • refseq_IHPK2.F1 refseq_IHPK2.R1 268 399
  • NCBIGene 36.2 51447
  • Single exon skipping, size difference: 131
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001005910

IKIP

  • refseq_IKIP.F1 refseq_IKIP.R1 253 371
  • NCBIGene 36.3 121457
  • Single exon skipping, size difference: 118
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_201612

  • Changed! TIGR SMC_prok_B 198aa 3e-06 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 267aa 2e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

IL17RC

  • refseq_IL17RC.F2 refseq_IL17RC.R2 138 183
  • NCBIGene 36.3 84818
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153461

  • pfam SEFIR 145aa 1e-33 in ref transcript
    • SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.

IL17RC

  • refseq_IL17RC.F3 refseq_IL17RC.R3 170 383
  • NCBIGene 36.3 84818
  • Alternative 5-prime, size difference: 213
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153461

  • pfam SEFIR 145aa 1e-33 in ref transcript
    • SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.

IL17RE

  • refseq_IL17RE.F2 refseq_IL17RE.R2 211 284
  • NCBIGene 36.2 132014
  • Alternative 3-prime, size difference: 73
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_144640

IL18BP

  • refseq_IL18BP.F1 refseq_IL18BP.R1 144 256
  • NCBIGene 36.3 10068
  • Single exon skipping, size difference: 112
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039659

  • pfam Pox_vIL-18BP 100aa 2e-04 in ref transcript
    • Orthopoxvirus interleukin 18 binding protein. Interleukin-18 (IL-18) is a proinflammatory cytokine that plays a key role in the activation of natural killer and T helper 1 cell responses principally by inducing interferon-gamma (IFN-gamma). Several poxvirus genes encode proteins with sequence similarity to IL-18BPs. It has been shown that vaccinia, ectromelia and cowpox viruses secrete from infected cells a soluble IL-18BP (vIL-18BP) that may modulate the host antiviral response. The expression of vIL-18BPs by distinct poxvirus genera that cause local or general viral dissemination, or persistent or acute infections in the host, emphasises the importance of IL-18 in response to viral infections.

IL1F7

  • refseq_IL1F7.F1 refseq_IL1F7.R1 116 236
  • NCBIGene 36.3 27178
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014439

  • Changed! cd IL1 146aa 3e-11 in ref transcript
    • Interleukin-1 homologes; Cytokines with various biological functions. Interleukin 1 alpha and beta are also known as hematopoietin and catabolin. This family also contains interleukin-1 receptor antagonists (inhibitors).
  • Changed! smart IL1 146aa 8e-12 in ref transcript
    • Interleukin-1 homologues. Cytokines with various biological functions. Interluekin 1 alpha and beta are also known as hematopoietin and catabolin.
  • Changed! cd IL1 97aa 3e-11 in modified transcript
  • Changed! smart IL1 97aa 4e-11 in modified transcript

IL1F7

  • refseq_IL1F7.F3 refseq_IL1F7.R3 195 258
  • NCBIGene 36.3 27178
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014439

  • cd IL1 146aa 3e-11 in ref transcript
    • Interleukin-1 homologes; Cytokines with various biological functions. Interleukin 1 alpha and beta are also known as hematopoietin and catabolin. This family also contains interleukin-1 receptor antagonists (inhibitors).
  • smart IL1 146aa 8e-12 in ref transcript
    • Interleukin-1 homologues. Cytokines with various biological functions. Interluekin 1 alpha and beta are also known as hematopoietin and catabolin.

IL22RA2

  • refseq_IL22RA2.F1 refseq_IL22RA2.R1 154 324
  • NCBIGene 36.3 116379
  • Single exon skipping, size difference: 170
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_052962

  • Changed! pfam IL10Ra-bind 236aa 6e-15 in ref transcript
    • Interleukin-10 receptor alpha chain. IL10Ra-bind is the cell-membrane bound receptor for interleukin-10, a pleiotropic cytokine produced by subsets of activated T cells, B cells, and macrophages, with a broad spectrum of immunoregulation and anti-inflammatory effects. The activity of IL-10 is mediated through its binding with IL10Ra-bind.
  • Changed! pfam IL10Ra-bind 128aa 4e-12 in modified transcript

IL22RA2

  • refseq_IL22RA2.F2 refseq_IL22RA2.R2 256 352
  • NCBIGene 36.3 116379
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_052962

  • Changed! pfam IL10Ra-bind 236aa 6e-15 in ref transcript
    • Interleukin-10 receptor alpha chain. IL10Ra-bind is the cell-membrane bound receptor for interleukin-10, a pleiotropic cytokine produced by subsets of activated T cells, B cells, and macrophages, with a broad spectrum of immunoregulation and anti-inflammatory effects. The activity of IL-10 is mediated through its binding with IL10Ra-bind.
  • Changed! pfam IL10Ra-bind 204aa 8e-20 in modified transcript

IL24

  • refseq_IL24.F1 refseq_IL24.R1 109 451
  • NCBIGene 36.3 11009
  • Multiple exon skipping, size difference: 342
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006850

IL28RA

  • refseq_IL28RA.F1 refseq_IL28RA.R1 255 386
  • NCBIGene 36.3 163702
  • Single exon skipping, size difference: 131
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_170743

IL28RA

  • refseq_IL28RA.F4 refseq_IL28RA.R3 138 225
  • NCBIGene 36.3 163702
  • Alternative 3-prime, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170743

IL32

  • refseq_IL32.F1 refseq_IL32.R1 110 137
  • NCBIGene 36.3 9235
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004221

IL32

  • refseq_IL32.F3 refseq_IL32.R3 100 160
  • NCBIGene 36.3 9235
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004221

IL4

  • refseq_IL4.F2 refseq_IL4.R2 232 280
  • NCBIGene 36.3 3565
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000589

  • Changed! pfam IL4 148aa 4e-53 in ref transcript
    • Interleukin 4.
  • Changed! pfam IL4 132aa 2e-43 in modified transcript

IL5RA

  • refseq_IL5RA.F2 refseq_IL5RA.R2 140 209
  • NCBIGene 36.3 3568
  • Single exon skipping, size difference: 69
  • Exclusion in 5'UTR
  • Reference transcript: NM_000564

  • pfam Haemat_rec_S_F2 30aa 3e-08 in ref transcript
    • Haematopoeitin receptor binding domain short. This domain is a common signature for a number of receptors for lymphokines, haematopoietic growth factors and growth hormone-related molecules.

IL6R

  • refseq_IL6R.F2 refseq_IL6R.R2 277 371
  • NCBIGene 36.3 3570
  • Single exon skipping, size difference: 94
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000565

  • cd FN3 93aa 3e-05 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • pfam IL6Ra-bind 102aa 1e-45 in ref transcript
    • Interleukin-6 receptor alpha chain, binding. Members of this family adopt a structure consisting of an immunoglobulin-like beta-sandwich, with seven strands in two beta-sheets, in a Greek-key topology. They are required for binding to the cytokine Interleukin-6.
  • smart IGc2 59aa 4e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart FN3 86aa 1e-05 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.

IL6ST

  • refseq_IL6ST.F1 refseq_IL6ST.R1 313 396
  • NCBIGene 36.3 3572
  • Single exon skipping, size difference: 83
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002184

  • cd FN3 100aa 8e-12 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • Changed! cd FN3 92aa 8e-06 in ref transcript
  • pfam Lep_receptor_Ig 91aa 1e-30 in ref transcript
    • Ig-like C2-type domain. This domain is a ligand-binding immunoglobulin-like domain. The two cysteine residues form a disulphide bridge.
  • Changed! smart FN3 81aa 3e-09 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam fn3 90aa 1e-08 in ref transcript
    • Fibronectin type III domain.

IMPDH1

  • refseq_IMPDH1.F1 refseq_IMPDH1.R1 159 267
  • NCBIGene 36.3 3614
  • Multiple exon skipping, size difference: 108
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000883

  • cd IMPDH 242aa 1e-95 in ref transcript
    • IMPDH: The catalytic domain of the inosine monophosphate dehydrogenase. IMPDH catalyzes the NAD-dependent oxidation of inosine 5'-monophosphate (IMP) to xanthosine 5' monophosphate (XMP). It is a rate-limiting step in the de novo synthesis of the guanine nucleotides. There is often a CBS domain inserted in the middle of this domain, which is proposed to play a regulatory role. IMPDH is a key enzyme in the regulation of cell proliferation and differentiation. It has been identified as an attractive target for developing chemotherapeutic agents.
  • cd CBS_pair_IMPDH_2 116aa 4e-45 in ref transcript
    • This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the inosine 5' monophosphate dehydrogenase (IMPDH) protein. IMPDH is an essential enzyme that catalyzes the first step unique to GTP synthesis, playing a key role in the regulation of cell proliferation and differentiation. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in IMPDH have been associated with retinitis pigmentosa.
  • cd IMPDH 81aa 3e-30 in ref transcript
  • pfam IMPDH 477aa 0.0 in ref transcript
    • IMP dehydrogenase / GMP reductase domain. This family is involved in biosynthesis of guanosine nucleotide. Members of this family contain a TIM barrel structure. In the inosine monophosphate dehydrogenases 2 CBS domains pfam00571 are inserted in the TIM barrel. This family is a member of the common phosphate binding site TIM barrel family.
  • PTZ PTZ00314 501aa 0.0 in ref transcript
    • inosine-5'-monophosphate dehydrogenase; Provisional.

INSIG1

  • refseq_INSIG1.F1 refseq_INSIG1.R1 104 396
  • NCBIGene 36.3 3638
  • Multiple exon skipping, size difference: 292
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_198336

  • Changed! pfam INSIG 184aa 3e-78 in ref transcript
    • Insulin-induced protein (INSIG). This family contains a number of eukaryotic Insulin-induced proteins (INSIG-1 and INSIG-2) approximately 200 residues long. INSIG-1 and INSIG-2 are found in the endoplasmic reticulum and bind the sterol-sensing domain of SREBP cleavage-activating protein (SCAP), preventing it from escorting SREBPs to the Golgi. Their combined action permits feedback regulation of cholesterol synthesis over a wide range of sterol concentrations.
  • Changed! pfam INSIG 56aa 5e-14 in modified transcript

INSIG1

  • refseq_INSIG1.F4 refseq_INSIG1.R4 202 273
  • NCBIGene 36.3 3638
  • Single exon skipping, size difference: 71
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_198336

  • pfam INSIG 184aa 3e-78 in ref transcript
    • Insulin-induced protein (INSIG). This family contains a number of eukaryotic Insulin-induced proteins (INSIG-1 and INSIG-2) approximately 200 residues long. INSIG-1 and INSIG-2 are found in the endoplasmic reticulum and bind the sterol-sensing domain of SREBP cleavage-activating protein (SCAP), preventing it from escorting SREBPs to the Golgi. Their combined action permits feedback regulation of cholesterol synthesis over a wide range of sterol concentrations.

IQCH

  • refseq_IQCH.F1 refseq_IQCH.R1 177 239
  • NCBIGene 36.3 64799
  • Single exon skipping, size difference: 62
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031715

IQCH

  • refseq_IQCH.F3 refseq_IQCH.R3 223 329
  • NCBIGene 36.3 64799
  • Single exon skipping, size difference: 106
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001031715

IQCH

  • refseq_IQCH.F5 refseq_IQCH.R5 104 438
  • NCBIGene 36.3 64799
  • Multiple exon skipping, size difference: 334
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031715

IQWD1

  • refseq_IQWD1.F2 refseq_IQWD1.R2 281 341
  • NCBIGene 36.3 55827
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018442

  • cd WD40 244aa 4e-19 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 78aa 7e-05 in ref transcript
  • smart WD40 36aa 0.002 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • COG COG2319 274aa 4e-09 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

IRF7

  • refseq_IRF7.F1 refseq_IRF7.R1 114 201
  • NCBIGene 36.3 3665
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004031

  • cd IRF 114aa 2e-25 in ref transcript
    • Interferon Regulatory Factor (IRF); also known as tryptophan pentad repeat. The family of IRF transcription factors is important in the regulation of interferons in response to infection by virus and in the regulation of interferon-inducible genes. The IRF family is characterized by a unique 'tryptophan cluster' DNA-binding region. Viral IRFs bind to cellular IRFs; block type I and II interferons and host IRF-mediated transcriptional activation.
  • pfam IRF-3 181aa 5e-50 in ref transcript
    • Interferon-regulatory factor 3. This is the interferon-regulatory factor 3 chain of the hetero-dimeric structure which also contains the shorter chain CREB-binding protein. These two subunits make up the DRAF1 (double-stranded RNA-activated factor 1). Viral dsRNA produced during viral transcription or replication leads to the activation of DRAF1. The DNA-binding specificity of DRAF1 correlates with transcriptional induction of ISG (interferon-alpha,beta-stimulated gene). IRF-3 preexists in the cytoplasm of uninfected cells and translocates to the nucleus following viral infection. Translocation of IRF-3 is accompanied by an increase in serine and threonine phosphorylation, and association with the CREB coactivator occurs only after infection.
  • smart IRF 114aa 2e-29 in ref transcript
    • interferon regulatory factor. interferon regulatory factor, also known as trytophan pentad repeat.

ITGA3

  • refseq_ITGA3.F1 refseq_ITGA3.R1 174 316
  • NCBIGene 36.3 3675
  • Single exon skipping, size difference: 142
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_005501

  • pfam Integrin_alpha2 455aa 7e-78 in ref transcript
    • Integrin alpha. This domain is found in integrin alpha and integrin alpha precursors to the C terminus of a number of pfam01839 repeats and to the N-terminus of the pfam00357 cytoplasmic region.
  • smart Int_alpha 53aa 4e-11 in ref transcript
    • Integrin alpha (beta-propellor repeats). Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Alpha integrins are proposed to contain a domain containing a 7-fold repeat that adopts a beta-propellor fold. Some of these domains contain an inserted von Willebrand factor type-A domain. Some repeats contain putative calcium-binding sites. The 7-fold repeat domain is homologous to a similar domain in phosphatidylinositol-glycan-specific phospholipase D.
  • smart Int_alpha 49aa 1e-08 in ref transcript
  • smart Int_alpha 61aa 1e-05 in ref transcript

ITGB1BP1

  • refseq_ITGB1BP1.F2 refseq_ITGB1BP1.R2 251 401
  • NCBIGene 36.3 9270
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004763

  • Changed! cd PTB 130aa 0.003 in ref transcript
    • Phosphotyrosine-binding (PTB) domain; PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. The PTB domain of SHC binds to a NPXpY sequence. More recent studies have found that some types of PTB domains such as the neuronal protein X11 and in the cell-fate determinant protein Numb can bind to peptides which are not tyrosine phosphorylated; whereas, other PTB domains can bind motifs lacking tyrosine residues altogether.
  • Changed! pfam ICAP-1_inte_bdg 171aa 2e-81 in ref transcript
    • Beta-1 integrin binding protein. ICAP-1 is a serine/threonine-rich protein that binds to the cytoplasmic domains of beta-1 integrins in a highly specific manner, binding to a NPXY sequence motif on the beta-1 integrin. The cytoplasmic domains of integrins are essential for cell adhesion, and the fact that phosphorylation of ICAP-1 by interaction with the cell-matrix implies an important role of ICAP-1 during integrin-dependent cell adhesion. Overexpression of ICAP-1 strongly reduces the integrin-mediated cell spreading on extracellular matrix and inhibits both Cdc42 and Rac1. In addition, ICAP-1 induces release of Cdc42 from cellular membranes and prevents the dissociation of GDP from this GTPase. An additional function of ICAP-1 is to promote differentiation of osteoprogenitors by supporting their condensation through modulating the integrin high affinity state,.
  • Changed! pfam ICAP-1_inte_bdg 121aa 1e-51 in modified transcript

ITGB1BP3

  • refseq_ITGB1BP3.F1 refseq_ITGB1BP3.R1 222 354
  • NCBIGene 36.2 27231
  • Alternative 5-prime and 3-prime, size difference: 132
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_170678

  • Changed! cd NRK1 161aa 9e-69 in ref transcript
    • Nicotinamide riboside kinase (NRK) is an enzyme involved in the metabolism of nicotinamide adenine dinucleotide (NAD+). This enzyme catalyzes the phosphorylation of nicotinamide riboside (NR) to form nicotinamide mononucleotide (NMN). It defines the NR salvage pathway of NAD+ biosynthesis in addition to the pathways through nicotinic acid mononucleotide (NaMN). This enzyme can also phosphorylate the anticancer drug tiazofurin, which is an analog of nicotinamide riboside.
  • Changed! COG Udk 153aa 3e-15 in ref transcript
    • Uridine kinase [Nucleotide transport and metabolism].
  • Changed! cd NRK1 177aa 7e-48 in modified transcript
  • Changed! TIGR udk 45aa 0.009 in modified transcript
    • Model contains a number of longer eukaryotic proteins and starts bringing in phosphoribulokinase hits at scores of 160 and below.
  • Changed! COG Udk 141aa 6e-10 in modified transcript

ITGB4

  • refseq_ITGB4.F1 refseq_ITGB4.R1 203 362
  • NCBIGene 36.3 3691
  • Single exon skipping, size difference: 159
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_000213

  • cd FN3 91aa 2e-13 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 94aa 1e-11 in ref transcript
  • cd FN3 94aa 6e-10 in ref transcript
  • cd FN3 88aa 6e-09 in ref transcript
  • pfam Integrin_beta 417aa 1e-180 in ref transcript
    • Integrin, beta chain. Integrins have been found in animals and their homologues have also been found in cyanobacteria, probably due to horizontal gene transfer. The sequences repeats have been trimmed due to an overlap with EGF.
  • pfam Integrin_B_tail 86aa 7e-21 in ref transcript
    • Integrin beta tail domain. This is the beta tail domain of the Integrin protein. Integrins are receptors which are involved in cell-cell and cell-extracellular matrix interactions.
  • pfam fn3 84aa 5e-17 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 87aa 8e-16 in ref transcript
  • smart Calx_beta 64aa 2e-15 in ref transcript
    • Domains in Na-Ca exchangers and integrin-beta4. Domain in Na-Ca exchangers and integrin subunit beta4 (and some cyanobacterial proteins).
  • pfam fn3 82aa 5e-12 in ref transcript
  • pfam fn3 90aa 1e-10 in ref transcript

ITGB4

  • refseq_ITGB4.F3 refseq_ITGB4.R3 131 341
  • NCBIGene 36.3 3691
  • Single exon skipping, size difference: 210
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000213

  • cd FN3 91aa 2e-13 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 94aa 1e-11 in ref transcript
  • cd FN3 94aa 6e-10 in ref transcript
  • cd FN3 88aa 6e-09 in ref transcript
  • pfam Integrin_beta 417aa 1e-180 in ref transcript
    • Integrin, beta chain. Integrins have been found in animals and their homologues have also been found in cyanobacteria, probably due to horizontal gene transfer. The sequences repeats have been trimmed due to an overlap with EGF.
  • pfam Integrin_B_tail 86aa 7e-21 in ref transcript
    • Integrin beta tail domain. This is the beta tail domain of the Integrin protein. Integrins are receptors which are involved in cell-cell and cell-extracellular matrix interactions.
  • pfam fn3 84aa 5e-17 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 87aa 8e-16 in ref transcript
  • smart Calx_beta 64aa 2e-15 in ref transcript
    • Domains in Na-Ca exchangers and integrin-beta4. Domain in Na-Ca exchangers and integrin subunit beta4 (and some cyanobacterial proteins).
  • pfam fn3 82aa 5e-12 in ref transcript
  • pfam fn3 90aa 1e-10 in ref transcript

EIF6

  • refseq_ITGB4BP.F1 refseq_ITGB4BP.R1 215 391
  • NCBIGene 36.3 3692
  • Single exon skipping, size difference: 176
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002212

  • Changed! cd IF6 222aa 1e-90 in ref transcript
    • Ribosome anti-association factor IF6 binds the large ribosomal subunit and prevents the two subunits from associating during translation initiation. IF6 comprises a family of translation factors that includes both eukaryotic (eIF6) and archeal (aIF6) members. All members of this family have a conserved pentameric fold referred to as a beta/alpha propeller. The eukaryotic IF6 members have a moderately conserved C-terminal extension which is not required for ribosomal binding, and may have an alternative function.
  • Changed! smart eIF6 201aa 9e-90 in ref transcript
    • translation initiation factor 6.
  • Changed! PTZ PTZ00136 244aa 1e-118 in ref transcript
    • eukaryotic translation initiation factor 6; Provisional.
  • Changed! cd IF6 67aa 1e-20 in modified transcript
  • Changed! smart eIF6 65aa 2e-22 in modified transcript
  • Changed! PTZ PTZ00136 68aa 6e-27 in modified transcript

ITM2C

  • refseq_ITM2C.F1 refseq_ITM2C.R1 248 359
  • NCBIGene 36.3 81618
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_030926

  • Changed! pfam BRICHOS 93aa 6e-28 in ref transcript
    • BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer. Its exact function is unknown; roles that have been proposed for it include (a) in targeting of the protein to the secretory pathway, (b) intramolecular chaperone-like function, and (c) assisting the specialised intracellular protease processing system.
  • Changed! pfam BRICHOS 56aa 5e-08 in modified transcript

ITM2C

  • refseq_ITM2C.F3 refseq_ITM2C.R3 149 290
  • NCBIGene 36.3 81618
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_030926

  • pfam BRICHOS 93aa 6e-28 in ref transcript
    • BRICHOS domain. The BRICHOS domain is about 100 amino acids long. It is found in a variety of proteins implicated in dementia, respiratory distress and cancer. Its exact function is unknown; roles that have been proposed for it include (a) in targeting of the protein to the secretory pathway, (b) intramolecular chaperone-like function, and (c) assisting the specialised intracellular protease processing system.

ITPA

  • refseq_ITPA.F1 refseq_ITPA.R1 146 197
  • NCBIGene 36.3 3704
  • Alternative 5-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033453

  • Changed! cd HAM1 179aa 1e-56 in ref transcript
    • NTPase/HAM1. This family consists of the HAM1 protein and pyrophosphate-releasing xanthosine/ inosine triphosphatase. HAM1 protects the cell against mutagenesis by the base analog 6-N-hydroxylaminopurine (HAP) in E. Coli and S. cerevisiae. A Ham1-related protein from Methanococcus jannaschii is a novel NTPase that has been shown to hydrolyze nonstandard nucleotides such as XTP to XMP and ITP to IMP, but not the standard nucleotides, in the presence of Mg or Mn ions. The enzyme exists as a homodimer. The HAM1 protein may be acting as an NTPase by hydrolyzing the HAP triphosphate.
  • Changed! pfam Ham1p_like 179aa 2e-39 in ref transcript
    • Ham1 family. This family consists of the HAM1 protein and hypothetical archaeal bacterial and Caenorhabditis elegans proteins. HAM1 controls 6-N-hydroxylaminopurine (HAP) sensitivity and mutagenesis in Saccharomyces cerevisiae HAM1. The HAM1 protein protects the cell from HAP, either on the level of deoxynucleoside triphosphate or the DNA level by a yet unidentified set of reactions.
  • Changed! COG COG0127 187aa 1e-42 in ref transcript
    • Xanthosine triphosphate pyrophosphatase [Nucleotide transport and metabolism].
  • Changed! cd HAM1 166aa 2e-50 in modified transcript
  • Changed! pfam Ham1p_like 165aa 8e-35 in modified transcript
  • Changed! COG COG0127 170aa 3e-37 in modified transcript

IVNS1ABP

  • refseq_IVNS1ABP.F2 refseq_IVNS1ABP.R2 110 251
  • NCBIGene 36.2 10625
  • Single exon skipping, size difference: 141
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006469

  • pfam BTB 105aa 2e-22 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • Changed! smart BACK 87aa 2e-13 in ref transcript
    • BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.
  • Changed! smart Kelch 46aa 3e-11 in ref transcript
    • Kelch domain.
  • Changed! smart Kelch 47aa 5e-11 in ref transcript
  • Changed! pfam Kelch_1 41aa 4e-10 in ref transcript
    • Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
  • Changed! smart Kelch 47aa 1e-09 in ref transcript
  • Changed! pfam Kelch_1 47aa 8e-08 in ref transcript
  • Changed! smart Kelch 48aa 2e-07 in ref transcript
  • Changed! TIGR muta_rot_YjhT 179aa 9e-06 in ref transcript
    • Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
  • Changed! COG COG3055 81aa 6e-05 in ref transcript
    • Uncharacterized protein conserved in bacteria [Function unknown].
  • Changed! COG COG3055 171aa 0.001 in ref transcript
  • Changed! smart BACK 87aa 1e-11 in modified transcript

JAG2

  • refseq_JAG2.F2 refseq_JAG2.R2 100 214
  • NCBIGene 36.3 3714
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002226

  • cd EGF_CA 39aa 5e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • cd EGF_CA 37aa 0.002 in ref transcript
  • cd EGF_CA 37aa 0.003 in ref transcript
  • Changed! cd EGF_CA 36aa 0.005 in ref transcript
  • pfam MNNL 81aa 2e-22 in ref transcript
    • N terminus of Notch ligand. This entry represents a region of conserved sequence at the N terminus of several Notch ligand proteins.
  • pfam DSL 63aa 4e-21 in ref transcript
    • Delta serrate ligand.
  • smart VWC_out 72aa 1e-12 in ref transcript
    • von Willebrand factor (vWF) type C domain.
  • smart EGF_CA 39aa 4e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • smart EGF_CA 37aa 6e-04 in ref transcript
  • Changed! smart EGF_CA 36aa 0.002 in ref transcript
  • smart EGF_CA 37aa 0.004 in ref transcript
  • Changed! cd EGF_CA 36aa 0.002 in modified transcript
  • Changed! smart EGF_CA 36aa 6e-04 in modified transcript

KLHDC9

  • refseq_KARCA1.F1 refseq_KARCA1.R1 100 119
  • NCBIGene 36.3 126823
  • Alternative 3-prime, size difference: 19
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_152366

CCBL2

  • refseq_KAT3.F2 refseq_KAT3.R2 164 264
  • NCBIGene 36.3 56267
  • Single exon skipping, size difference: 100
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001008661

  • cd AAT_like 383aa 6e-78 in ref transcript
    • Aspartate aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). Pyridoxal phosphate combines with an alpha-amino acid to form a compound called a Schiff base or aldimine intermediate, which depending on the reaction, is the substrate in four kinds of reactions (1) transamination (movement of amino groups), (2) racemization (redistribution of enantiomers), (3) decarboxylation (removing COOH groups), and (4) various side-chain reactions depending on the enzyme involved. Pyridoxal phosphate (PLP) dependent enzymes were previously classified into alpha, beta and gamma classes, based on the chemical characteristics (carbon atom involved) of the reaction they catalyzed. The availability of several structures allowed a comprehensive analysis of the evolutionary classification of PLP dependent enzymes, and it was found that the functional classification did not always agree with the evolutionary history of these enzymes. The major groups in this CD corresponds to Aspartate aminotransferase a, b and c, Tyrosine, Alanine, Aromatic-amino-acid, Glutamine phenylpyruvate, 1-Aminocyclopropane-1-carboxylate synthase, Histidinol-phosphate, gene products of malY and cobC, Valine-pyruvate aminotransferase and Rhizopine catabolism regulatory protein.
  • pfam Aminotran_1_2 368aa 3e-44 in ref transcript
    • Aminotransferase class I and II.
  • PRK PRK08912 391aa 2e-97 in ref transcript
    • hypothetical protein; Provisional.

KBTBD3

  • refseq_KBTBD3.F2 refseq_KBTBD3.R2 125 326
  • NCBIGene 36.3 143879
  • Single exon skipping, size difference: 201
  • Exclusion in 5'UTR
  • Reference transcript: NM_198439

  • pfam BACK 102aa 1e-25 in ref transcript
    • BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
  • pfam BTB 108aa 3e-17 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • pfam Kelch_1 50aa 3e-04 in ref transcript
    • Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
  • pfam Kelch_1 44aa 0.002 in ref transcript

KBTBD4

  • refseq_KBTBD4.F1 refseq_KBTBD4.R1 274 383
  • NCBIGene 36.3 55709
  • Alternative 5-prime and 3-prime, size difference: 109
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_016506

  • pfam BTB 104aa 3e-22 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • smart BACK 83aa 6e-12 in ref transcript
    • BTB And C-terminal Kelch. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues.

KCNAB2

  • refseq_KCNAB2.F2 refseq_KCNAB2.R2 100 142
  • NCBIGene 36.3 8514
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003636

  • Changed! cd Aldo_ket_red 316aa 8e-76 in ref transcript
    • Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
  • Changed! TIGR Kv_beta 317aa 1e-179 in ref transcript
    • Plant beta subunits and their closely related bacterial homologs (in Deinococcus radiudurans, Xylella fastidiosa, etc.) appear more closely related to each other than to animal forms. However, the bacterial species lack convincing counterparts the Kv alpha subunit and the Kv beta homolog may serve as an enzyme. Cutoffs are set for this model such that yeast and plant forms and bacterial close homologs score between trusted and noise cutoffs.
  • Changed! COG Tas 321aa 2e-67 in ref transcript
    • Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) [Energy production and conversion].
  • Changed! cd Aldo_ket_red 316aa 1e-75 in modified transcript
  • Changed! TIGR Kv_beta 317aa 1e-179 in modified transcript
  • Changed! COG Tas 323aa 8e-68 in modified transcript

KCNC2

  • refseq_KCNC2.F2 refseq_KCNC2.R2 125 290
  • NCBIGene 36.3 3747
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139136

  • pfam K_tetra 51aa 6e-19 in ref transcript
    • K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
  • pfam Ion_trans 189aa 2e-16 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • smart BTB 143aa 5e-07 in ref transcript
    • Broad-Complex, Tramtrack and Bric a brac. Domain in Broad-Complex, Tramtrack and Bric a brac. Also known as POZ (poxvirus and zinc finger) domain. Known to be a protein-protein interaction motif found at the N-termini of several C2H2-type transcription factors as well as Shaw-type potassium channels. Known structure reveals a tightly intertwined dimer formed via interactions between N-terminal strand and helix structures. However in a subset of BTB/POZ domains, these two secondary structures appear to be missing. Be aware SMART predicts BTB/POZ domains without the beta1- and alpha1-secondary structures.
  • PRK PRK10537 25aa 0.001 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCND3

  • refseq_KCND3.F1 refseq_KCND3.R1 204 261
  • NCBIGene 36.3 3752
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004980

  • pfam K_tetra 90aa 5e-33 in ref transcript
    • K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
  • pfam Ion_trans 162aa 6e-27 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • PRK PRK10537 87aa 6e-09 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCNG3

  • refseq_KCNG3.F1 refseq_KCNG3.R1 134 167
  • NCBIGene 36.3 170850
  • Alternative 5-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_133329

  • pfam Ion_trans 186aa 2e-25 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam K_tetra 101aa 4e-19 in ref transcript
    • K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
  • PRK PRK10537 51aa 5e-06 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCNH1

  • refseq_KCNH1.F1 refseq_KCNH1.R1 290 371
  • NCBIGene 36.3 3756
  • Alternative 5-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172362

  • cd CAP_ED 110aa 5e-17 in ref transcript
    • effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
  • cd PAS 95aa 8e-07 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • smart cNMP 115aa 1e-15 in ref transcript
    • Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
  • Changed! pfam Ion_trans_2 60aa 3e-09 in ref transcript
    • Ion channel. This family includes the two membrane helix type ion channels found in bacteria.
  • Changed! pfam Ion_trans 120aa 4e-07 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • smart PAC 42aa 9e-06 in ref transcript
    • Motif C-terminal to PAS motifs (likely to contribute to PAS structural domain). PAC motif occurs C-terminal to a subset of all known PAS motifs. It is proposed to contribute to the PAS domain fold.
  • TIGR sensory_box 123aa 2e-05 in ref transcript
    • The PAS domain was previously described. This sensory box, or S-box domain occupies the central portion of the PAS domain but is more widely distributed. It is often tandemly repeated. Known prosthetic groups bound in the S-box domain include heme in the oxygen sensor FixL, FAD in the redox potential sensor NifL, and a 4-hydroxycinnamyl chromophore in photoactive yellow protein. Proteins containing the domain often contain other regulatory domains such as response regulator or sensor histidine kinase domains. Other S-box proteins include phytochromes and the aryl hydrocarbon receptor nuclear translocator.
  • PRK PRK13558 100aa 7e-13 in ref transcript
    • bacterio-opsin activator; Provisional.
  • COG Crp 121aa 5e-09 in ref transcript
    • cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
  • PRK PRK10537 49aa 0.001 in ref transcript
    • voltage-gated potassium channel; Provisional.
  • Changed! pfam Ion_trans 185aa 4e-10 in modified transcript

KCNH5

  • refseq_KCNH5.F1 refseq_KCNH5.R1 205 402
  • NCBIGene 36.3 27133
  • Single exon skipping, size difference: 197
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_139318

  • Changed! cd CAP_ED 110aa 2e-16 in ref transcript
    • effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
  • cd PAS 95aa 9e-06 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • Changed! smart cNMP 100aa 4e-13 in ref transcript
    • Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
  • Changed! pfam Ion_trans_2 75aa 7e-10 in ref transcript
    • Ion channel. This family includes the two membrane helix type ion channels found in bacteria.
  • pfam Ion_trans 184aa 7e-07 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam PAS 92aa 1e-05 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • PRK PRK13558 100aa 4e-12 in ref transcript
    • bacterio-opsin activator; Provisional.
  • Changed! COG Crp 215aa 2e-08 in ref transcript
    • cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
  • PRK PRK10537 52aa 1e-04 in ref transcript
    • voltage-gated potassium channel; Provisional.
  • Changed! cd CAP_ED 52aa 2e-06 in modified transcript
  • Changed! pfam Ion_trans_2 77aa 3e-08 in modified transcript
  • Changed! smart cNMP 57aa 3e-07 in modified transcript

KCNH6

  • refseq_KCNH6.F1 refseq_KCNH6.R1 189 297
  • NCBIGene 36.3 81033
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_030779

  • cd CAP_ED 111aa 4e-17 in ref transcript
    • effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
  • cd PAS 92aa 2e-06 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • smart cNMP 100aa 2e-16 in ref transcript
    • Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
  • pfam Ion_trans 177aa 2e-10 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam PAS 92aa 5e-07 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • PRK PRK13559 95aa 5e-14 in ref transcript
    • hypothetical protein; Provisional.
  • COG Crp 121aa 2e-09 in ref transcript
    • cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
  • PRK PRK10537 73aa 1e-04 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCNH7

  • refseq_KCNH7.F2 refseq_KCNH7.R2 119 140
  • NCBIGene 36.3 90134
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033272

  • cd CAP_ED 111aa 1e-17 in ref transcript
    • effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
  • cd PAS 90aa 3e-07 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • smart cNMP 115aa 3e-18 in ref transcript
    • Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
  • pfam Ion_trans 177aa 2e-11 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam PAS 89aa 7e-05 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • PRK PRK13557 80aa 8e-11 in ref transcript
    • histidine kinase; Provisional.
  • COG Crp 144aa 1e-09 in ref transcript
    • cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
  • PRK PRK10537 80aa 8e-05 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCNIP1

  • refseq_KCNIP1.F2 refseq_KCNIP1.R2 172 205
  • NCBIGene 36.3 30820
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034837

  • cd EFh 72aa 3e-08 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 63aa 6e-06 in ref transcript
  • smart EFh 29aa 0.002 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • COG FRQ1 167aa 5e-19 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

KCNIP2

  • refseq_KCNIP2.F1 refseq_KCNIP2.R1 121 229
  • NCBIGene 36.3 30819
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014591

  • Changed! cd EFh 63aa 6e-08 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! cd EFh 72aa 1e-07 in ref transcript
  • Changed! smart EH 87aa 7e-04 in ref transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
  • Changed! COG FRQ1 167aa 3e-18 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! cd EFh 72aa 1e-06 in modified transcript
  • Changed! COG FRQ1 104aa 2e-08 in modified transcript

KCNJ1

  • refseq_KCNJ1.F1 refseq_KCNJ1.R1 229 399
  • NCBIGene 36.3 3758
  • Single exon skipping, size difference: 170
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_153765

  • Changed! pfam IRK 338aa 1e-153 in ref transcript
    • Inward rectifier potassium channel.

KCNJ1

  • refseq_KCNJ1.F2 refseq_KCNJ1.R2 258 391
  • NCBIGene 36.3 3758
  • Single exon skipping, size difference: 133
  • Exclusion in 5'UTR
  • Reference transcript: NM_153767

  • pfam IRK 338aa 1e-153 in ref transcript
    • Inward rectifier potassium channel.

KCNQ1

  • refseq_KCNQ1.F1 refseq_KCNQ1.R1 122 214
  • NCBIGene 36.2 3784
  • Single exon skipping, size difference: 92
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000218

  • Changed! pfam KCNQ_channel 163aa 5e-62 in ref transcript
    • KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels.
  • Changed! pfam Ion_trans 190aa 4e-18 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.

KCNQ2

  • refseq_KCNQ2.F1 refseq_KCNQ2.R1 153 183
  • NCBIGene 36.3 3785
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172107

  • pfam KCNQ_channel 201aa 2e-70 in ref transcript
    • KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels.
  • pfam Ion_trans 185aa 8e-25 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • PRK PRK10537 40aa 0.004 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCNQ4

  • refseq_KCNQ4.F1 refseq_KCNQ4.R1 105 267
  • NCBIGene 36.3 9132
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004700

  • pfam KCNQ_channel 197aa 5e-72 in ref transcript
    • KCNQ voltage-gated potassium channel. This family matches to the C-terminal tail of KCNQ type potassium channels.
  • pfam Ion_trans 185aa 1e-18 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • PRK PRK10537 69aa 0.002 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCNRG

  • refseq_KCNRG.F2 refseq_KCNRG.R2 267 365
  • NCBIGene 36.3 283518
  • Single exon skipping, size difference: 98
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_173605

  • pfam K_tetra 89aa 4e-12 in ref transcript
    • K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.

KIAA0101

  • refseq_KIAA0101.F1 refseq_KIAA0101.R1 129 292
  • NCBIGene 36.3 9768
  • Single exon skipping, size difference: 163
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014736

KIAA0841

  • refseq_KIAA0841.F2 refseq_KIAA0841.R2 137 198
  • NCBIGene 36.2 23354
  • Alternative 3-prime, size difference: 61
  • Exclusion of the protein initiation site
  • Reference transcript: XM_932194

KIAA0859

  • refseq_KIAA0859.F2 refseq_KIAA0859.R2 101 426
  • NCBIGene 36.3 51603
  • Alternative 5-prime, size difference: 325
  • Exclusion of the protein initiation site
  • Reference transcript: NM_015935

  • Changed! cd AdoMet_MTases 109aa 8e-08 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! pfam Methyltransf_11 106aa 6e-09 in ref transcript
    • Methyltransferase domain. Members of this family are SAM dependent methyltransferases.
  • pfam Spermine_synth 94aa 3e-06 in ref transcript
    • Spermine/spermidine synthase. Spermine and spermidine are polyamines. This family includes spermidine synthase that catalyses the fifth (last) step in the biosynthesis of spermidine from arginine, and spermine synthase.
  • COG SpeE 169aa 1e-11 in ref transcript
    • Spermidine synthase [Amino acid transport and metabolism].
  • Changed! PRK PRK08317 221aa 5e-09 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! PRK PRK08317 64aa 7e-04 in modified transcript

KIAA1462

  • refseq_KIAA1462.F2 refseq_KIAA1462.R2 147 187
  • NCBIGene 36.2 57608
  • Single exon skipping, size difference: 40
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_929312

FNIP1

  • refseq_KIAA1961.F1 refseq_KIAA1961.R1 149 233
  • NCBIGene 36.3 96459
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_133372

KIF23

  • refseq_KIF23.F1 refseq_KIF23.R1 104 416
  • NCBIGene 36.3 9493
  • Single exon skipping, size difference: 312
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138555

  • cd KISc_KIF23_like 219aa 1e-102 in ref transcript
    • Kinesin motor domain, KIF23-like subgroup. Members of this group may play a role in mitosis. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
  • cd KISc_KIF23_like 123aa 9e-50 in ref transcript
  • pfam Kinesin 228aa 1e-74 in ref transcript
    • Kinesin motor domain.
  • smart KISc 126aa 3e-30 in ref transcript
    • Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
  • pfam SMC_N 168aa 4e-04 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG KIP1 236aa 2e-36 in ref transcript
    • Kinesin-like protein [Cytoskeleton].
  • COG KIP1 75aa 1e-11 in ref transcript
  • COG Smc 184aa 2e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

KIF25

  • refseq_KIF25.F1 refseq_KIF25.R1 145 301
  • NCBIGene 36.3 3834
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_030615

  • Changed! cd KISc_C_terminal 336aa 5e-63 in ref transcript
    • Kinesin motor domain, KIFC2/KIFC3/ncd-like carboxy-terminal kinesins. Ncd is a spindle motor protein necessary for chromosome segregation in meiosis. KIFC2/KIFC3-like kinesins have been implicated in motility of the Golgi apparatus as well as dentritic and axonal transport in neurons. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In this subgroup the motor domain is found at the C-terminus (C-type). C-type kinesins are (-) end-directed motors, i.e. they transport cargo towards the (-) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
  • Changed! pfam Kinesin 333aa 1e-65 in ref transcript
    • Kinesin motor domain.
  • Changed! COG KIP1 333aa 6e-38 in ref transcript
    • Kinesin-like protein [Cytoskeleton].
  • Changed! cd KISc_C_terminal 284aa 7e-43 in modified transcript
  • Changed! smart KISc 287aa 5e-46 in modified transcript
    • Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
  • Changed! COG KIP1 281aa 9e-28 in modified transcript

KIF9

  • refseq_KIF9.F1 refseq_KIF9.R1 201 396
  • NCBIGene 36.3 64147
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182902

  • cd KISc_KIF9_like 333aa 1e-144 in ref transcript
    • Kinesin motor domain, KIF9-like subgroup; might play a role in cell shape remodeling. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. Kinesin motor domains hydrolyze ATP at a rate of about 80 per second, and move along the microtubule at a speed of about 6400 Angstroms per second. To achieve that, kinesin head groups work in pairs. Upon replacing ADP with ATP, a kinesin motor domain increases its affinity for microtubule binding and locks in place. Also, the neck linker binds to the motor domain, which repositions the other head domain through the coiled-coil domain close to a second tubulin dimer, about 80 Angstroms along the microtubule. Meanwhile, ATP hydrolysis takes place, and when the second head domain binds to the microtubule, the first domain again replaces ADP with ATP, triggering a conformational change that pulls the first domain forward.
  • pfam Kinesin 329aa 1e-107 in ref transcript
    • Kinesin motor domain.
  • COG KIP1 300aa 2e-54 in ref transcript
    • Kinesin-like protein [Cytoskeleton].

KIRREL2

  • refseq_KIRREL2.F1 refseq_KIRREL2.R1 202 352
  • NCBIGene 36.3 84063
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199180

  • cd IGcam 82aa 8e-10 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 79aa 2e-05 in ref transcript
  • Changed! cd IGcam 95aa 0.001 in ref transcript
  • cd IGcam 73aa 0.002 in ref transcript
  • pfam I-set 80aa 9e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam C2-set_2 90aa 3e-07 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
  • Changed! pfam I-set 83aa 7e-07 in ref transcript
  • pfam I-set 74aa 3e-04 in ref transcript
  • Changed! smart IG_like 44aa 0.005 in modified transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.

KL

  • refseq_KL.F2 refseq_KL.R2 209 255
  • NCBIGene 36.2 9365
  • Alternative 5-prime and 3-prime, size difference: 46
  • Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004795

  • pfam Glyco_hydro_1 449aa 1e-115 in ref transcript
    • Glycosyl hydrolase family 1.
  • Changed! pfam Glyco_hydro_1 372aa 6e-78 in ref transcript
  • COG BglB 446aa 4e-79 in ref transcript
    • Beta-glucosidase/6-phospho-beta-glucosidase/beta- galactosidase [Carbohydrate transport and metabolism].
  • Changed! COG BglB 371aa 5e-49 in ref transcript

KLHL5

  • refseq_KLHL5.F1 refseq_KLHL5.R1 144 327
  • NCBIGene 36.3 51088
  • Single exon skipping, size difference: 183
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015990

  • pfam BACK 100aa 4e-35 in ref transcript
    • BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
  • Changed! pfam BTB 108aa 2e-26 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • pfam Kelch_1 46aa 1e-12 in ref transcript
    • Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
  • smart Kelch 47aa 3e-12 in ref transcript
    • Kelch domain.
  • pfam Kelch_1 44aa 2e-10 in ref transcript
  • smart Kelch 45aa 9e-10 in ref transcript
  • smart Kelch 46aa 1e-08 in ref transcript
  • pfam Kelch_1 52aa 2e-07 in ref transcript
  • TIGR mutarot_permut 200aa 6e-06 in ref transcript
    • Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
  • TIGR muta_rot_YjhT 119aa 5e-05 in ref transcript
    • Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
  • Changed! pfam BTB 74aa 8e-13 in modified transcript

KLHL7

  • refseq_KLHL7.F1 refseq_KLHL7.R1 240 371
  • NCBIGene 36.3 55975
  • Single exon skipping, size difference: 131
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001031710

  • Changed! pfam BACK 103aa 2e-32 in ref transcript
    • BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
  • Changed! pfam BTB 107aa 5e-29 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • Changed! pfam Kelch_1 46aa 6e-09 in ref transcript
    • Kelch motif. The kelch motif was initially discovered in Kelch. In this protein there are six copies of the motif. It has been shown that the Drosophila ring canal kelch protein is related to Galactose Oxidase for which a structure has been solved. The kelch motif forms a beta sheet. Several of these sheets associate to form a beta propeller structure as found in pfam00064, pfam00400 and pfam00415.
  • Changed! pfam Kelch_1 48aa 7e-08 in ref transcript
  • Changed! smart Kelch 48aa 2e-05 in ref transcript
    • Kelch domain.
  • Changed! TIGR muta_rot_YjhT 65aa 0.002 in ref transcript
    • Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.

KLK12

  • refseq_KLK12.F1 refseq_KLK12.R1 131 391
  • NCBIGene 36.3 43849
  • Single exon skipping, size difference: 260
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_019598

  • Changed! cd Tryp_SPc 215aa 8e-50 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! smart Tryp_SPc 216aa 9e-56 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 236aa 2e-15 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Tryp_SPc 50aa 9e-11 in modified transcript
  • Changed! smart Tryp_SPc 43aa 2e-12 in modified transcript
  • Changed! COG COG5640 63aa 9e-06 in modified transcript

KLK3

  • refseq_KLK3.F2 refseq_KLK3.R2 247 376
  • NCBIGene 36.3 354
  • Alternative 3-prime, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001648

  • Changed! cd Tryp_SPc 232aa 3e-64 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! smart Tryp_SPc 230aa 5e-71 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 250aa 1e-21 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Tryp_SPc 189aa 1e-52 in modified transcript
  • Changed! smart Tryp_SPc 187aa 2e-60 in modified transcript
  • Changed! COG COG5640 207aa 8e-19 in modified transcript

KLRC1

  • refseq_KLRC1.F1 refseq_KLRC1.R1 215 269
  • NCBIGene 36.3 3821
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_213658

  • cd CLECT_NK_receptors_like 113aa 4e-26 in ref transcript
    • CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
  • smart CLECT 112aa 4e-17 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

KLRD1

  • refseq_KLRD1.F1 refseq_KLRD1.R1 135 228
  • NCBIGene 36.3 3824
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002262

  • cd CLECT_NK_receptors_like 116aa 3e-26 in ref transcript
    • CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
  • smart CLECT 115aa 9e-21 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

KREMEN1

  • refseq_KREMEN1.F2 refseq_KREMEN1.R2 186 237
  • NCBIGene 36.3 83999
  • Alternative 5-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039570

  • cd KR 84aa 9e-18 in ref transcript
    • Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
  • cd CUB 107aa 1e-15 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • pfam WSC 81aa 3e-19 in ref transcript
    • WSC domain. This domain may be involved in carbohydrate binding.
  • smart KR 84aa 3e-18 in ref transcript
    • Kringle domain. Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.
  • pfam CUB 105aa 1e-14 in ref transcript
    • CUB domain.

KREMEN2

  • refseq_KREMEN2.F1 refseq_KREMEN2.R1 266 345
  • NCBIGene 36.3 79412
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172229

  • cd CUB 106aa 2e-15 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd KR 86aa 2e-15 in ref transcript
    • Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
  • pfam WSC 82aa 7e-17 in ref transcript
    • WSC domain. This domain may be involved in carbohydrate binding.
  • smart KR 86aa 3e-16 in ref transcript
    • Kringle domain. Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.
  • pfam CUB 105aa 9e-14 in ref transcript
    • CUB domain.

KRIT1

  • refseq_KRIT1.F2 refseq_KRIT1.R2 153 301
  • NCBIGene 36.3 889
  • Single exon skipping, size difference: 148
  • Exclusion in 5'UTR
  • Reference transcript: NM_194456

  • cd ANK 121aa 1e-14 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart B41 220aa 1e-32 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • COG Arp 127aa 3e-06 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

KRIT1

  • refseq_KRIT1.F4 refseq_KRIT1.R4 231 306
  • NCBIGene 36.3 889
  • Alternative 5-prime and 3-prime, size difference: 75
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_194456

  • cd ANK 121aa 1e-14 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart B41 220aa 1e-32 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • COG Arp 127aa 3e-06 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

KRIT1

  • refseq_KRIT1.F5 refseq_KRIT1.R5 103 466
  • NCBIGene 36.3 889
  • Alternative 5-prime, size difference: 363
  • Exclusion in 5'UTR
  • Reference transcript: NM_004912

  • cd ANK 121aa 1e-14 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart B41 220aa 1e-32 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • COG Arp 127aa 3e-06 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

KRIT1

  • refseq_KRIT1.F8 refseq_KRIT1.R8 135 279
  • NCBIGene 36.3 889
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_194456

  • Changed! cd ANK 121aa 1e-14 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart B41 220aa 1e-32 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • Changed! COG Arp 127aa 3e-06 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! cd ANK 85aa 1e-06 in modified transcript
  • Changed! PTZ PTZ00322 74aa 2e-04 in modified transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

KRT13

  • refseq_KRT13.F2 refseq_KRT13.R2 136 162
  • NCBIGene 36.3 3860
  • Single exon skipping, size difference: 26
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_153490

  • pfam Filament 313aa 1e-109 in ref transcript
    • Intermediate filament protein.

L3MBTL

  • refseq_L3MBTL.F1 refseq_L3MBTL.R1 241 400
  • NCBIGene 36.2 26013
  • Intron retention, size difference: 118
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015478

  • smart MBT 97aa 8e-37 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • smart MBT 93aa 2e-36 in ref transcript
  • smart MBT 98aa 1e-35 in ref transcript
  • pfam zf-C2HC 29aa 4e-10 in ref transcript
    • Zinc finger, C2HC type. This is a DNA binding zinc finger domain.

L3MBTL2

  • refseq_L3MBTL2.F2 refseq_L3MBTL2.R2 176 281
  • NCBIGene 36.2 83746
  • Single exon skipping, size difference: 105
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_031488

  • smart MBT 94aa 6e-38 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • smart MBT 97aa 3e-28 in ref transcript
  • smart MBT 101aa 6e-19 in ref transcript
  • smart MBT 93aa 2e-17 in ref transcript

L3MBTL3

  • refseq_L3MBTL3.F2 refseq_L3MBTL3.R2 204 279
  • NCBIGene 36.3 84456
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032438

  • cd SAM 64aa 4e-08 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • smart MBT 97aa 1e-33 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • smart MBT 93aa 2e-33 in ref transcript
  • smart MBT 95aa 7e-30 in ref transcript
  • pfam SAM_1 63aa 5e-11 in ref transcript
    • SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.

LAIR1

  • refseq_LAIR1.F1 refseq_LAIR1.R1 265 316
  • NCBIGene 36.3 3903
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002287

LAIR2

  • refseq_LAIR2.F2 refseq_LAIR2.R2 280 331
  • NCBIGene 36.3 3904
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002288

LARGE

  • refseq_LARGE.F1 refseq_LARGE.R1 174 237
  • NCBIGene 36.3 9215
  • Single exon skipping, size difference: 63
  • Exclusion in 5'UTR
  • Reference transcript: NM_004737

  • cd GT8_LARGE_C 280aa 1e-165 in ref transcript
    • LARGE catalytic domain has closest homology to GT8 glycosyltransferase involved in lipooligosaccharide synthesis. The catalytic domain of LARGE is a putative glycosyltransferase. Mutations of LARGE in mouse and human cause dystroglycanopathies, a disease associated with hypoglycosylation of the membrane protein alpha-dystroglycan (alpha-DG) and consequent loss of extracellular ligand binding. LARGE needs to both physically interact with alpha-dystroglycan and function as a glycosyltransferase in order to stimulate alpha-dystroglycan hyperglycosylation. LARGE localizes to the Golgi apparatus and contains three conserved DxD motifs. While two of the motifs are indispensible for glycosylation function, one is important for localization of th eenzyme. LARGE was originally named because it covers approximately large trunck of genomic DNA, more than 600bp long. The predicted protein structure contains an N-terminal cytoplasmic domain, a transmembrane region, a coiled-coil motif, and two putative catalytic domains. This catalytic domain has closest homology to GT8 glycosyltransferase involved in lipooligosaccharide synthesis.
  • pfam Glyco_transf_8 245aa 3e-17 in ref transcript
    • Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
  • COG RfaJ 251aa 2e-10 in ref transcript
    • Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases [Cell envelope biogenesis, outer membrane].

LARP4

  • refseq_LARP4.F2 refseq_LARP4.R2 161 374
  • NCBIGene 36.3 113251
  • Single exon skipping, size difference: 213
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199188

  • cd RRM 67aa 0.004 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart LA 79aa 5e-29 in ref transcript
    • Domain in the RNA-binding Lupus La protein; unknown function.

LASS2

  • refseq_LASS2.F2 refseq_LASS2.R2 224 264
  • NCBIGene 36.3 29956
  • Alternative 3-prime, size difference: 40
  • Inclusion in 5'UTR
  • Reference transcript: NM_022075

  • cd homeodomain 58aa 0.001 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • smart TLC 202aa 1e-42 in ref transcript
    • TRAM, LAG1 and CLN8 homology domains. Protein domain with at least 5 transmembrane alpha-helices. Lag1p and Lac1p are essential for acyl-CoA-dependent ceramide synthesis, TRAM is a subunit of the translocon and the CLN8 gene is mutated in Northern epilepsy syndrome. The family may possess multiple functions such as lipid trafficking, metabolism, or sensing. Trh homologues possess additional homeobox domains.
  • COG LAG1 220aa 3e-22 in ref transcript
    • Protein transporter of the TRAM (translocating chain-associating membrane) superfamily, longevity assurance factor [Intracellular trafficking and secretion].

LAT2

  • refseq_LAT2.F1 refseq_LAT2.R1 117 187
  • NCBIGene 36.3 7462
  • Alternative 5-prime, size difference: 70
  • Exclusion in 5'UTR
  • Reference transcript: NM_032464

LCMT1

  • refseq_LCMT1.F1 refseq_LCMT1.R1 116 281
  • NCBIGene 36.3 51451
  • Multiple exon skipping, size difference: 165
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016309

  • Changed! pfam LCM 299aa 1e-127 in ref transcript
    • Leucine carboxyl methyltransferase. Family of leucine carboxyl methyltransferases EC:2.1.1.-. This family may need divides a the full alignment contains a significantly shorter mouse sequence.
  • Changed! COG COG3315 195aa 6e-14 in ref transcript
    • O-Methyltransferase involved in polyketide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam LCM 244aa 1e-101 in modified transcript
  • Changed! COG COG3315 140aa 7e-09 in modified transcript

LDHC

  • refseq_LDHC.F1 refseq_LDHC.R1 121 175
  • NCBIGene 36.3 3948
  • Alternative 5-prime, size difference: 54
  • Exclusion in 5'UTR
  • Reference transcript: NM_002301

  • cd LDH_1 312aa 1e-145 in ref transcript
    • A subgroup of L-lactate dehydrogenases. L-lactate dehydrogenases (LDH) are tetrameric enzymes catalyzing the last step of glycolysis in which pyruvate is converted to L-lactate. This subgroup is composed of eukaryotic LDHs. Vertebrate LDHs are non-allosteric. This is in contrast to some bacterial LDHs that are activated by an allosteric effector such as fructose-1,6-bisphosphate. LDHs are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate dehydrogenases, formate/glycerate dehydrogenases, siroheme synthases, 6-phosphogluconate dehydrogenases, aminoacid dehydrogenases, repressor rex, and NAD-binding potassium channel domains, among others.
  • TIGR L-LDH-NAD 301aa 1e-110 in ref transcript
    • This model represents the NAD-dependent L-lactate dehydrogenases from bacteria and eukaryotes. This enzyme function as as the final step in anaerobic glycolysis. Although lactate dehydrogenases have in some cases been mistaken for malate dehydrogenases due to the similarity of these two substrates and the apparent ease with which evolution can toggle these activities, critical residues have been identified which can discriminate between the two activities. At the time of the creation of this model no hits above the trusted cutoff contained critical residues typical of malate dehydrogenases.
  • PRK ldh 309aa 8e-87 in ref transcript
    • L-lactate dehydrogenase; Reviewed.

LDHD

  • refseq_LDHD.F1 refseq_LDHD.R1 225 294
  • NCBIGene 36.3 197257
  • Alternative 3-prime, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153486

  • Changed! TIGR glcD 437aa 2e-84 in ref transcript
    • This protein, the glycolate oxidase GlcD subunit, is similar in sequence to that of several D-lactate dehydrogenases, including that of E. coli. The glycolate oxidase has been found to have some D-lactate dehydrogenase activity.
  • Changed! COG GlcD 474aa 2e-80 in ref transcript
    • FAD/FMN-containing dehydrogenases [Energy production and conversion].
  • Changed! TIGR glcD 414aa 3e-86 in modified transcript
  • Changed! COG GlcD 451aa 5e-83 in modified transcript

LETMD1

  • refseq_LETMD1.F1 refseq_LETMD1.R1 104 220
  • NCBIGene 36.2 25875
  • Single exon skipping, size difference: 116
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015416

  • Changed! pfam LETM1 255aa 4e-32 in ref transcript
    • LETM1-like protein. Members of this family are inner mitochondrial membrane proteins which play a role in potassium and hydrogen ion exchange. Deletion of LETM1 is thought to be involved in the development of Wolf-Hirschhorn syndrome in humans.

LETMD1

  • refseq_LETMD1.F4 refseq_LETMD1.R4 166 273
  • NCBIGene 36.3 25875
  • Alternative 5-prime, size difference: 107
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015416

  • Changed! pfam LETM1 255aa 4e-32 in ref transcript
    • LETM1-like protein. Members of this family are inner mitochondrial membrane proteins which play a role in potassium and hydrogen ion exchange. Deletion of LETM1 is thought to be involved in the development of Wolf-Hirschhorn syndrome in humans.

LGALS14

  • refseq_LGALS14.F1 refseq_LGALS14.R1 102 407
  • NCBIGene 36.3 56891
  • Single exon skipping, size difference: 305
  • Exclusion of the protein initiation site
  • Reference transcript: NM_203471

  • Changed! cd GLECT 130aa 4e-28 in ref transcript
    • Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
  • Changed! pfam Gal-bind_lectin 129aa 8e-31 in ref transcript
    • Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).
  • Changed! cd GLECT 131aa 1e-28 in modified transcript
  • Changed! pfam Gal-bind_lectin 131aa 3e-31 in modified transcript

LGALS8

  • refseq_LGALS8.F2 refseq_LGALS8.R2 206 332
  • NCBIGene 36.3 3964
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201545

  • cd GLECT 132aa 5e-34 in ref transcript
    • Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
  • cd GLECT 129aa 2e-32 in ref transcript
  • pfam Gal-bind_lectin 132aa 7e-39 in ref transcript
    • Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).
  • pfam Gal-bind_lectin 129aa 4e-37 in ref transcript

LGMN

  • refseq_LGMN.F2 refseq_LGMN.R2 171 289
  • NCBIGene 36.3 5641
  • Single exon skipping, size difference: 118
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008530

  • pfam Peptidase_C13 257aa 1e-125 in ref transcript
    • Peptidase C13 family. Members of this family are asparaginyl peptidases. The blood fluke parasite Schistosoma mansoni has at least five Clan CA cysteine peptidases in its digestive tract including cathepsins B (2 isoforms), C, F and L. All have been recombinantly expressed as active enzymes, albeit in various stages of activation. In addition, a Clan CD peptidase, termed asparaginyl endopeptidase or 'legumain' has been identified. This has formerly been characterised as a 'haemoglobinase', but this term is probably incorrect. Two cDNAs have been described for Schistosoma mansoni legumain; one encodes an active enzyme whereas the active site cysteine residue encoded by the second cDNA is substituted by an asparagine residue. Both forms have been recombinantly expressed.
  • COG GPI8 194aa 2e-26 in ref transcript
    • Glycosylphosphatidylinositol transamidase (GPIT), subunit GPI8 [Posttranslational modification, protein turnover, chaperones].

LGR6

  • refseq_LGR6.F1 refseq_LGR6.R1 100 388
  • NCBIGene 36.3 59352
  • Multiple exon skipping, size difference: 288
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001017403

  • Changed! cd LRR_RI 262aa 6e-08 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! cd LRR_RI 143aa 1e-06 in ref transcript
  • Changed! COG COG4886 215aa 1e-17 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • Changed! COG COG4886 253aa 2e-14 in ref transcript
  • Changed! cd LRR_RI 226aa 9e-09 in modified transcript
  • Changed! COG COG4886 241aa 9e-16 in modified transcript

LHX6

  • refseq_LHX6.F2 refseq_LHX6.R2 287 391
  • NCBIGene 36.3 26468
  • Single exon skipping, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014368

  • cd homeodomain 58aa 6e-14 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 57aa 1e-17 in ref transcript
    • Homeobox domain.
  • pfam LIM 58aa 1e-09 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 55aa 8e-09 in ref transcript
  • COG COG5576 91aa 3e-10 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

LIAS

  • refseq_LIAS.F2 refseq_LIAS.R2 213 325
  • NCBIGene 36.3 11019
  • Single exon skipping, size difference: 112
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006859

  • Changed! cd Radical_SAM 197aa 7e-07 in ref transcript
    • Radical SAM superfamily. Enzymes of this family generate radicals by combining a 4Fe-4S cluster and S-adenosylmethionine (SAM) in close proximity. They are characterized by a conserved CxxxCxxC motif, which coordinates the conserved iron-sulfur cluster. Mechanistically, they share the transfer of a single electron from the iron-sulfur cluster to SAM, which leads to its reductive cleavage to methionine and a 5'-deoxyadenosyl radical, which, in turn, abstracts a hydrogen from the appropriately positioned carbon atom. Depending on the enzyme, SAM is consumed during this process or it is restored and reused. Radical SAM enzymes catalyze steps in metabolism, DNA repair, the biosynthesis of vitamins and coenzymes, and the biosynthesis of many antibiotics. Examples are biotin synthase (BioB), lipoyl synthase (LipA), pyruvate formate-lyase (PFL), coproporphyrinogen oxidase (HemN), lysine 2,3-aminomutase (LAM), anaerobic ribonucleotide reductase (ARR), and MoaA, an enzyme of the biosynthesis of molybdopterin.
  • Changed! TIGR lipA 302aa 1e-104 in ref transcript
    • The family shows strong sequence conservation.
  • Changed! PRK PRK05481 293aa 1e-126 in ref transcript
    • lipoyl synthase; Provisional.
  • Changed! cd Radical_SAM 177aa 6e-07 in modified transcript
  • Changed! TIGR lipA 255aa 7e-86 in modified transcript
  • Changed! PRK PRK05481 248aa 1e-105 in modified transcript

LILRA5

  • refseq_LILRA5.F2 refseq_LILRA5.R2 167 203
  • NCBIGene 36.3 353514
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021250

LLGL2

  • refseq_LLGL2.F1 refseq_LLGL2.R1 101 144
  • NCBIGene 36.3 3993
  • Single exon skipping, size difference: 43
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031803

  • cd WD40 233aa 6e-05 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam LLGL 105aa 6e-39 in ref transcript
    • LLGL2. This domain is found in lethal giant larvae homolog 2 (LLGL2) proteins and syntaxin-binding proteins like tomosyn. It has been identified in eukaryotes and tends to be found together with WD repeats (pfam00400).
  • COG COG2319 227aa 3e-05 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

LMNA

  • refseq_LMNA.F1 refseq_LMNA.R1 218 308
  • NCBIGene 36.3 4000
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170707

  • pfam Filament 349aa 3e-52 in ref transcript
    • Intermediate filament protein.
  • Changed! pfam IF_tail 111aa 1e-47 in ref transcript
    • Intermediate filament tail domain. The Prosite motif does not correspond to this Pfam entry.
  • COG Smc 290aa 4e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! pfam IF_tail 102aa 3e-44 in modified transcript

LMX1A

  • refseq_LMX1A.F1 refseq_LMX1A.R1 109 242
  • NCBIGene 36.3 4009
  • Single exon skipping, size difference: 133
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_177398

  • cd homeodomain 57aa 4e-13 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 57aa 6e-17 in ref transcript
    • Homeobox domain.
  • pfam LIM 56aa 1e-12 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 59aa 2e-10 in ref transcript
  • COG COG5576 77aa 1e-08 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

LOC144983

  • refseq_LOC144983.F1 refseq_LOC144983.R1 248 389
  • NCBIGene 36.3 144983
  • Single exon skipping, size difference: 141
  • Exclusion in 5'UTR
  • Reference transcript: NM_001011724

  • cd RRM 74aa 2e-15 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 73aa 2e-13 in ref transcript
  • smart RRM 70aa 6e-16 in ref transcript
    • RNA recognition motif.
  • TIGR SF-CC1 142aa 5e-15 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • COG COG0724 171aa 1e-10 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

LOC150223

  • refseq_LOC150223.F1 refseq_LOC150223.R1 100 159
  • NCBIGene 36.2 150223
  • Alternative 5-prime, size difference: 59
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001017964

  • Changed! pfam YdjC 283aa 8e-60 in ref transcript
    • YdjC-like protein. Family of YdjC-like proteins. This region is possibly involved in the the cleavage of cellobiose-phosphate.
  • Changed! PRK PRK02134 294aa 3e-34 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! pfam YdjC 105aa 6e-27 in modified transcript
  • Changed! PRK PRK02134 104aa 8e-15 in modified transcript

LOC150383

  • refseq_LOC150383.F1 refseq_LOC150383.R1 140 263
  • NCBIGene 36.2 150383
  • Single exon skipping, size difference: 123
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008917

LOC219854

  • refseq_LOC219854.F2 refseq_LOC219854.R2 103 162
  • NCBIGene 36.2 219854
  • Single exon skipping, size difference: 59
  • Exclusion in 5'UTR
  • Reference transcript: XM_933911

LOC219854

  • refseq_LOC219854.F4 refseq_LOC219854.R4 114 162
  • NCBIGene 36.2 219854
  • Alternative 5-prime, size difference: 48
  • Exclusion in 5'UTR
  • Reference transcript: XM_933911

LOC219854

  • refseq_LOC219854.F6 refseq_LOC219854.R6 214 292
  • NCBIGene 36.2 219854
  • Single exon skipping, size difference: 78
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: XM_933894

LOC255411

  • refseq_LOC255411.F2 refseq_LOC255411.R2 218 303
  • NCBIGene 36.3 255411
  • Single exon skipping, size difference: 85
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_170708

C19orf63

  • refseq_LOC284361.F1 refseq_LOC284361.R1 241 328
  • NCBIGene 36.3 284361
  • Single exon skipping, size difference: 87
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_206538

LOC339287

  • refseq_LOC339287.F1 refseq_LOC339287.R1 109 157
  • NCBIGene 36.2 339287
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_932097

C16orf84

  • refseq_LOC348180.F2 refseq_LOC348180.R2 224 283
  • NCBIGene 36.3 348180
  • Single exon skipping, size difference: 59
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001012759

  • pfam DUF2392 101aa 5e-37 in ref transcript
    • Protein of unknown function (DUF2392). This is a family of proteins conserved from plants to humans. The function is not known. It carries a characteristic GRG sequence motif.
  • COG MesJ 153aa 0.001 in ref transcript
    • Predicted ATPase of the PP-loop superfamily implicated in cell cycle control [Cell division and chromosome partitioning].

LOC348801

  • refseq_LOC348801.F1 refseq_LOC348801.R1 220 259
  • NCBIGene 36.2 348801
  • Alternative 3-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_931908

LOC352909

  • refseq_LOC352909.F1 refseq_LOC352909.R1 112 188
  • NCBIGene 36.2 352909
  • Alternative 5-prime, size difference: 76
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_178837

LOC374569

  • refseq_LOC374569.F1 refseq_LOC374569.R1 101 119
  • NCBIGene 36.2 374569
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_370777

  • cd Asparaginase 340aa 8e-93 in ref transcript
    • Asparaginase (amidohydrolase): Asparaginases are tetrameric enzymes that catalyze the hydrolysis of asparagine to aspartic acid and ammonia. In bacteria, there are two classes of amidohydrolases, one highly specific for asparagine and localised to the periplasm, and a second (asparaginase- glutaminase) present in the cytosol that hydrolyzises both asparagine and glutamine with similar specificities.
  • cd ANK 124aa 2e-17 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart Asparaginase 339aa 1e-82 in ref transcript
    • Asparaginase, which is found in various plant, animal and bacterial cells, catalyses the deamination of asparagine to yield aspartic acid and an ammonium ion, resulting in a depletion of free circulatory asparagine in plasma PUBMED:3026924. The enzyme is effective in the treatment of human malignant lymphomas, which have a diminished capacity to produce asparagine synthetase: in order to survive, such cells absorb asparagine from blood plasma PUBMED:2407723, PUBMED:3379033 - if Asn levels have been depleted by injection of asparaginase, the lymphoma cells die.
  • pfam Ank 32aa 3e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • TIGR trp 76aa 1e-04 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • PRK ansA 353aa 1e-101 in ref transcript
    • cytoplasmic asparaginase I; Provisional.
  • PTZ PTZ00322 81aa 7e-10 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.
  • COG Arp 132aa 2e-08 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

NHLRC3

  • refseq_LOC387921.F1 refseq_LOC387921.R1 129 330
  • NCBIGene 36.3 387921
  • Single exon skipping, size difference: 201
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012754

  • pfam NHL 28aa 1e-05 in ref transcript
    • NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in the peptidyl-glycine alpha-amidating monooxygenase (PAM), proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localised to the repeats. The human tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is me diated by the NHL repeats.
  • Changed! COG COG3391 243aa 1e-05 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! COG COG3391 153aa 9e-07 in modified transcript

LOC388276

  • refseq_LOC388276.F1 refseq_LOC388276.R1 106 234
  • NCBIGene 36.2 388276
  • Multiple exon skipping, size difference: 128
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
  • Reference transcript: XM_931511

LOC389073

  • refseq_LOC389073.F1 refseq_LOC389073.R1 119 238
  • NCBIGene 36.2 389073
  • Single exon skipping, size difference: 119
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_371592

LOC441155

  • refseq_LOC441155.F1 refseq_LOC441155.R1 108 390
  • NCBIGene 36.2 441155
  • Alternative 3-prime, size difference: 282
  • Inclusion in 5'UTR
  • Reference transcript: XM_930997

LOC51149

  • refseq_LOC51149.F1 refseq_LOC51149.R1 155 235
  • NCBIGene 36.2 51149
  • Single exon skipping, size difference: 80
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_016175

LOC643446

  • refseq_LOC643446.F1 refseq_LOC643446.R1 149 237
  • NCBIGene 36.3 643446
  • Alternative 5-prime, size difference: 88
  • Exclusion in 5'UTR
  • Reference transcript: XM_001130637

  • cd RRM 73aa 3e-15 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM 70aa 8e-14 in ref transcript
    • RNA recognition motif.
  • COG COG0724 76aa 3e-05 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

LOC643446

  • refseq_LOC643446.F3 refseq_LOC643446.R3 149 337
  • NCBIGene 36.3 643446
  • Alternative 3-prime, size difference: 188
  • Exclusion of the protein initiation site
  • Reference transcript: XM_001130681

  • cd RRM 73aa 3e-15 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM 70aa 8e-14 in ref transcript
    • RNA recognition motif.
  • COG COG0724 76aa 3e-05 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

LOC643847

  • refseq_LOC643847.F1 refseq_LOC643847.R1 101 131
  • NCBIGene 36.2 643847
  • Alternative 3-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001129508

  • cd pepsin_A 323aa 1e-173 in ref transcript
    • Pepsin A, aspartic protease produced in gastric mucosa of mammals. Pepsin, a well-known aspartic protease, is produced by the human gastric mucosa in seven different zymogen isoforms, subdivided into two types: pepsinogen A and pepsinogen C. The prosequence of the zymogens are self cleaved under acidic pH. The mature enzymes are called pepsin A and pepsin C, correspondingly. The well researched porcine pepsin is also in this pepsin A family. Pepsins play an integral role in the digestion process of vertebrates. Pepsins are bilobal enzymes, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. One lobe may be evolved from the other through ancient gene-duplication event. More recently evolved enzymes have similar three-dimensional structures, however their amino acid sequences are more divergent except for the conserved catalytic site motif. Pepsins specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
  • pfam Asp 315aa 1e-129 in ref transcript
    • Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
  • Changed! pfam A1_Propeptide 29aa 7e-09 in ref transcript
    • A1 Propeptide. Most eukaryotic endopeptidases (Merops Family A1) are synthesised with signal and propeptides. The animal pepsin-like endopeptidase propeptides form a distinct family of propeptides, which contain a conserved motif approximately 30 residues long. In pepsinogen A, the first 11 residues of the mature pepsin sequence are displaced by residues of the propeptide. The propeptide contains two helices that block the active site cleft, in particular the conserved Asp11 residue, in pepsin, hydrogen bonds to a conserved Arg residues in the propeptide. This hydrogen bond stabilises the propeptide conformation and is probably responsible for triggering the conversion of pepsinogen to pepsin under acidic conditions.
  • Changed! PTZ PTZ00165 367aa 2e-63 in ref transcript
    • aspartyl protease; Provisional.
  • Changed! PTZ PTZ00165 327aa 2e-63 in modified transcript

LOC645687

  • refseq_LOC645687.F1 refseq_LOC645687.R1 150 356
  • NCBIGene 36.2 645687
  • Alternative 3-prime, size difference: 206
  • Exclusion of the stop codon
  • Reference transcript: XM_001133543

LOC653423

  • refseq_LOC653423.F1 refseq_LOC653423.R1 319 397
  • NCBIGene 36.2 653423
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_933397

  • pfam Sperm_Ag_HE2 70aa 2e-28 in ref transcript
    • Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.

LOC653423

  • refseq_LOC653423.F2 refseq_LOC653423.R2 148 224
  • NCBIGene 36.2 653423
  • Single exon skipping, size difference: 76
  • Inclusion in the protein causing a frameshift
  • Reference transcript: XM_929074

  • Changed! pfam Sperm_Ag_HE2 70aa 2e-30 in ref transcript
    • Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.
  • Changed! pfam Defensin_beta 36aa 3e-05 in ref transcript
    • Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonisation.
  • Changed! pfam Sperm_Ag_HE2 74aa 8e-30 in modified transcript

LOC653423

  • refseq_LOC653423.F4 refseq_LOC653423.R4 263 369
  • NCBIGene 36.2 653423
  • Multiple exon skipping, size difference: 106
  • Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_933397

  • pfam Sperm_Ag_HE2 70aa 2e-28 in ref transcript
    • Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.

LOC653489

  • refseq_LOC653489.F1 refseq_LOC653489.R1 102 120
  • NCBIGene 36.2 653489
  • Alternative 5-prime and 3-prime, size difference: 18
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_934115

  • cd RanBD 121aa 2e-35 in ref transcript
    • Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
  • cd RanBD 117aa 1e-34 in ref transcript
  • cd TPR 97aa 1e-05 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • pfam Ran_BP1 122aa 1e-50 in ref transcript
    • RanBP1 domain.
  • pfam Ran_BP1 120aa 3e-50 in ref transcript
  • pfam GRIP 45aa 3e-09 in ref transcript
    • GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue. At least some of these domains have been shown to bind to GTPase Arl1, see structures.
  • TIGR type_IV_pilW 152aa 0.008 in ref transcript
    • Members of this family are designated PilF in ref (PMID:8973346) and PilW in ref (PMID:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
  • COG YRB1 160aa 2e-31 in ref transcript
    • Ran GTPase-activating protein (Ran-binding protein) [Intracellular trafficking and secretion].
  • COG YRB1 160aa 2e-23 in ref transcript

LOC653689

  • refseq_LOC653689.F1 refseq_LOC653689.R1 124 166
  • NCBIGene 36.2 653689
  • Alternative 5-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_928923

  • Changed! cd GST_C_Theta 126aa 4e-39 in ref transcript
    • GST_C family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenation of dihalomethanes. This is an essential process in methylotrophic bacteria to enable them to use chloromethane and DCM as sole carbon and energy sources. The presence of polymorphisms in human GSTT1-1 and its relationship to the onset of diseases including cancer is subject of many studies. Human GSTT2-2 exhibits a highly specific sulfatase activity, catalyzing the cleavage of sulfate ions from aralkyl sufate esters, but not from the aryl or alkyl sulfate esters.
  • cd GST_N_Theta 76aa 3e-27 in ref transcript
    • GST_N family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenation of dihalomethanes. This is an essential process in methylotrophic bacteria to enable them to use chloromethane and DCM as sole carbon and energy sources. The presence of polymorphisms in human GSTT1-1 and its relationship to the onset of diseases including cancer is subject of many studies. Human GSTT2-2 exhibits a highly specific sulfatase activity, catalyzing the cleavage of sulfate ions from aralkyl sufate esters, but not from aryl or alkyl sulfate esters.
  • Changed! TIGR maiA 143aa 2e-11 in ref transcript
    • Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (EC 5.2.1.2) is described as active also on maleylpyruvate, and the example from a Ralstonia sp. catabolic plasmid is described as a maleylpyruvate isomerase involved in gentisate catabolism.
  • Changed! COG Gst 208aa 3e-19 in ref transcript
    • Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd GST_C_Theta 112aa 4e-30 in modified transcript
  • Changed! TIGR maiA 134aa 7e-10 in modified transcript
  • Changed! COG Gst 194aa 2e-16 in modified transcript

LOC653820

  • refseq_LOC653820.F1 refseq_LOC653820.R1 278 398
  • NCBIGene 36.2 653820
  • Alternative 5-prime, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_930579

LOC727761

  • refseq_LOC727761.F2 refseq_LOC727761.R2 139 211
  • NCBIGene 36.3 727761
  • Alternative 3-prime, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001126211

  • Changed! cd TMPK 185aa 3e-23 in ref transcript
    • Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
  • Changed! pfam Thymidylate_kin 181aa 3e-54 in ref transcript
    • Thymidylate kinase.
  • Changed! COG Tmk 188aa 2e-32 in ref transcript
    • Thymidylate kinase [Nucleotide transport and metabolism].
  • Changed! cd TMPK 161aa 4e-17 in modified transcript
  • Changed! pfam Thymidylate_kin 157aa 7e-41 in modified transcript
  • Changed! COG Tmk 164aa 2e-23 in modified transcript

LOC727761

  • refseq_LOC727761.F4 refseq_LOC727761.R3 98 227
  • NCBIGene 36.3 727761
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 129
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001126211

  • Changed! cd TMPK 185aa 3e-23 in ref transcript
    • Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
  • Changed! pfam Thymidylate_kin 181aa 3e-54 in ref transcript
    • Thymidylate kinase.
  • Changed! COG Tmk 188aa 2e-32 in ref transcript
    • Thymidylate kinase [Nucleotide transport and metabolism].
  • Changed! cd TMPK 142aa 2e-11 in modified transcript
  • Changed! pfam Thymidylate_kin 138aa 2e-31 in modified transcript
  • Changed! COG Tmk 145aa 4e-16 in modified transcript

LOC727778

  • refseq_LOC727778.F2 refseq_LOC727778.R2 273 334
  • NCBIGene 36.2 727778
  • Alternative 5-prime, size difference: 61
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001125727

  • Changed! pfam DUF1242 36aa 3e-05 in ref transcript
    • Protein of unknown function (DUF1242). This family consists of a number of eukaryotic proteins of around 72 residues in length. The function of this family is unknown.

LOC728005

  • refseq_LOC728005.F1 refseq_LOC728005.R1 150 315
  • NCBIGene 36.2 728005
  • Multiple exon skipping, size difference: 165
  • Inclusion in 5'UTR, Inclusion in 5'UTR
  • Reference transcript: XM_001127140

  • cd PH_centaurin 61aa 2e-11 in ref transcript
    • Centaurin Pleckstrin homology (PH) domain. Centaurin beta and gamma consist of a PH domain, an ArfGAP domain and three ankyrin repeats. Centaurain gamma also has an N-terminal Ras homology domain. Centaurin alpha has a different domain architecture and its PH domain is in a different subfamily. Centaurin can bind to phosphatidlyinositol (3,4,5)P3. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • cd ANK 90aa 3e-11 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd PH_centaurin 32aa 2e-04 in ref transcript
  • pfam ArfGap 117aa 6e-34 in ref transcript
    • Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • smart PH 64aa 8e-08 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • pfam PH 160aa 1e-06 in ref transcript
    • PH domain. PH stands for pleckstrin homology.
  • pfam Ank 32aa 8e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • COG COG5347 117aa 4e-23 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
  • PTZ PTZ00322 97aa 2e-05 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

LOC728229

  • refseq_LOC728229.F1 refseq_LOC728229.R1 127 157
  • NCBIGene 36.2 728229
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 30
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001128423

LOC728239

  • refseq_LOC728239.F2 refseq_LOC728239.R2 215 263
  • NCBIGene 36.2 728239
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001127104

  • Changed! pfam MAGE 186aa 1e-71 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
  • PRK PRK07003 168aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.
  • Changed! pfam MAGE 170aa 8e-75 in modified transcript

LOC728461

  • refseq_LOC728461.F2 refseq_LOC728461.R2 198 249
  • NCBIGene 36.2 728461
  • Alternative 5-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001126956

LOC728635

  • refseq_LOC728635.F2 refseq_LOC728635.R2 239 350
  • NCBIGene 36.2 728635
  • Multiple exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001131318

  • Changed! TIGR 3oxo_ACP_reduc 242aa 1e-45 in ref transcript
    • This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
  • Changed! PRK fabG 247aa 5e-53 in ref transcript
    • 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional.
  • Changed! TIGR 3oxo_ACP_reduc 205aa 3e-37 in modified transcript
  • Changed! PRK fabG 210aa 2e-44 in modified transcript

LOC728742

  • refseq_LOC728742.F1 refseq_LOC728742.R1 148 246
  • NCBIGene 36.2 728742
  • Single exon skipping, size difference: 98
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_001128742

  • Changed! cd CobW_like 165aa 7e-44 in ref transcript
    • The function of this protein family is unkown. The amino acid sequence of YjiA protein in E. coli contains several conserved motifs that characterizes it as a P-loop GTPase. YijA gene is among the genes significantly induced in response to DNA-damage caused by mitomycin. YijA gene is a homologue of the CobW gene which encodes the cobalamin synthesis protein/P47K.
  • Changed! TIGR CobW 328aa 9e-39 in ref transcript
    • A broader CobW family is delineated by two PFAM models which identify the N- and C-terminal domains (pfam02492 and pfam07683).
  • Changed! COG COG0523 335aa 8e-67 in ref transcript
    • Putative GTPases (G3E family) [General function prediction only].

LOC728835

  • refseq_LOC728835.F1 refseq_LOC728835.R1 165 206
  • NCBIGene 36.2 728835
  • Alternative 3-prime, size difference: 41
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_001133183

  • cd Chemokine_CC 31aa 6e-06 in ref transcript
    • Chemokine_CC: 1 of 4 subgroup designations based on the arrangement of the two N-terminal cysteine residues; includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity; some members (e.g. 2HCC) contain an additional disulfide bond which is thought to compensate for the highly conserved Trp missing in these; chemotatic for monocytes, macrophages, eosinophils, basophils, and T cells, but not neutrophils; exist as monomers and dimers, but are believed to be functional as monomers; found only in vertebrates and a few viruses; a subgroup of CC, identified by an N-terminal DCCL motif (Exodus-1, Exodus-2, and Exodus-3), has been shown to inhibit specific types of human cancer cell growth in a mouse model. See CDs: Chemokine (cd00169) for the general alignment of chemokines, or Chemokine_CXC (cd00273), Chemokine_C (cd00271), and Chemokine_CX3C (cd00274) for the additional chemokine subgroups, and Chemokine_CC_DCCL for the DCCL subgroup of this CD.
  • Changed! pfam IL8 41aa 2e-08 in ref transcript
    • Small cytokines (intecrine/chemokine), interleukin-8 like. Includes a number of secreted growth factors and interferons involved in mitogenic, chemotactic, and inflammatory activity. Structure contains two highly conserved disulfide bonds.
  • Changed! pfam IL8 40aa 1e-07 in modified transcript

LOC728860

  • refseq_LOC728860.F1 refseq_LOC728860.R1 100 128
  • NCBIGene 36.2 728860
  • Alternative 3-prime, size difference: 28
  • Inclusion in the protein causing a frameshift
  • Reference transcript: XM_001133267

  • Changed! cd ARM 113aa 8e-22 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • Changed! cd ARM 121aa 7e-21 in ref transcript
  • Changed! cd ARM 126aa 1e-18 in ref transcript
  • Changed! pfam IBB 95aa 8e-29 in ref transcript
    • Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex; required for classical NLS import of proteins into the nucleus, through the nuclear pore complex and across the nuclear envelope. Also in the alignment is the NLS of importin alpha which overlaps with the IBB domain.
  • Changed! pfam Arm 41aa 4e-07 in ref transcript
    • Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
  • Changed! pfam Arm 40aa 5e-07 in ref transcript
  • Changed! smart ARM 41aa 2e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • Changed! pfam Arm 39aa 2e-04 in ref transcript
  • Changed! pfam Arm 39aa 4e-04 in ref transcript
  • Changed! pfam Arm 41aa 9e-04 in ref transcript
  • Changed! smart ARM 40aa 0.001 in ref transcript
  • Changed! COG SRP1 519aa 1e-137 in ref transcript
    • Karyopherin (importin) alpha [Intracellular trafficking and secretion].

LOC728860

  • refseq_LOC728860.F3 refseq_LOC728860.R3 157 187
  • NCBIGene 36.2 728860
  • Alternative 3-prime, size difference: 30
  • Inclusion in 5'UTR
  • Reference transcript: XM_001133267

  • cd ARM 113aa 8e-22 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 121aa 7e-21 in ref transcript
  • cd ARM 126aa 1e-18 in ref transcript
  • pfam IBB 95aa 8e-29 in ref transcript
    • Importin beta binding domain. This family consists of the importin alpha (karyopherin alpha), importin beta (karyopherin beta) binding domain. The domain mediates formation of the importin alpha beta complex; required for classical NLS import of proteins into the nucleus, through the nuclear pore complex and across the nuclear envelope. Also in the alignment is the NLS of importin alpha which overlaps with the IBB domain.
  • pfam Arm 41aa 4e-07 in ref transcript
    • Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
  • pfam Arm 40aa 5e-07 in ref transcript
  • smart ARM 41aa 2e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • pfam Arm 39aa 2e-04 in ref transcript
  • pfam Arm 39aa 4e-04 in ref transcript
  • pfam Arm 41aa 9e-04 in ref transcript
  • smart ARM 40aa 0.001 in ref transcript
  • COG SRP1 519aa 1e-137 in ref transcript
    • Karyopherin (importin) alpha [Intracellular trafficking and secretion].

ANAPC11

  • refseq_LOC728919.F1 refseq_ANAPC11.R1 94 102
  • NCBIGene 36.3 51529
  • Alternative 5-prime, size difference: 8
  • Exclusion in 5'UTR
  • Reference transcript: NM_001002248

LOC728919

  • refseq_LOC728919.F1 refseq_LOC728919.R1 101 123
  • NCBIGene 36.2 728919
  • Alternative 5-prime, size difference: 22
  • Exclusion in 5'UTR
  • Reference transcript: XM_001133292

  • COG APC11 88aa 2e-19 in ref transcript
    • Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning].

LOC729059

  • refseq_LOC729059.F2 refseq_LOC729059.R2 301 400
  • NCBIGene 36.2 729059
  • Alternative 3-prime, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001133127

LOC729264

  • refseq_LOC729264.F1 refseq_LOC729264.R1 156 280
  • NCBIGene 36.3 729264
  • Alternative 5-prime, size difference: 124
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001133674

LOC729355

  • refseq_LOC729264.F1 refseq_LOC729355.R1 154 278
  • NCBIGene 36.2 729355
  • Alternative 5-prime, size difference: 124
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001133681

LOC730258

  • refseq_LOC730258.F1 refseq_LOC730258.R1 209 327
  • NCBIGene 36.2 730258
  • Single exon skipping, size difference: 118
  • Exclusion in 5'UTR
  • Reference transcript: XM_001134401

LONRF3

  • refseq_LONRF3.F2 refseq_LONRF3.R2 257 380
  • NCBIGene 36.3 79836
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031855

  • Changed! cd TPR 96aa 3e-10 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • cd RING 40aa 4e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam LON 172aa 4e-19 in ref transcript
    • ATP-dependent protease La (LON) domain.
  • TIGR rad18 86aa 1e-09 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! TIGR 3a0801s09 115aa 1e-07 in ref transcript
  • TIGR rad18 40aa 7e-06 in ref transcript
  • COG COG2802 197aa 1e-15 in ref transcript
    • Uncharacterized protein, similar to the N-terminal domain of Lon protease [General function prediction only].
  • COG RAD18 61aa 4e-06 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].
  • COG RAD18 39aa 2e-04 in ref transcript
  • Changed! cd TPR 63aa 2e-05 in modified transcript
  • Changed! TIGR 3a0801s09 66aa 0.001 in modified transcript

LRDD

  • refseq_LRDD.F1 LRDD.u.r.22 121 154
  • NCBIGene 36.3 55367
  • Alternative 5-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145886

  • pfam Death 79aa 3e-09 in ref transcript
    • Death domain.
  • pfam Peptidase_S68 34aa 2e-08 in ref transcript
    • Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
  • smart ZU5 63aa 9e-04 in ref transcript
    • Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
  • COG COG4886 96aa 3e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

LRDD

  • refseq_LRDD.F4 refseq_LRDD.R4 260 320
  • NCBIGene 36.3 55367
  • Alternative 3-prime, size difference: 60
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_145886

  • Changed! pfam Death 79aa 3e-09 in ref transcript
    • Death domain.
  • Changed! pfam Peptidase_S68 34aa 2e-08 in ref transcript
    • Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
  • Changed! smart ZU5 63aa 9e-04 in ref transcript
    • Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
  • Changed! COG COG4886 96aa 3e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

LRDD

  • refseq_LRDD.F6 refseq_LRDD.R6 145 196
  • NCBIGene 36.3 55367
  • Alternative 5-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145886

  • pfam Death 79aa 3e-09 in ref transcript
    • Death domain.
  • pfam Peptidase_S68 34aa 2e-08 in ref transcript
    • Peptidase S68. This family of serine peptidases contains PIDD proteins. PIDD forms a complex with RAIDD and procaspase-2 that is known as the 'PIDDosome'. The PIDDosome forms when DNA damage occurs and either activates NF-kappaB, leading to cell survival, or caspase-2, which leads to apoptosis.
  • smart ZU5 63aa 9e-04 in ref transcript
    • Domain present in ZO-1 and Unc5-like netrin receptors. Domain of unknown function.
  • COG COG4886 96aa 3e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

LRP8

  • refseq_LRP8.F2 refseq_LRP8.R2 167 392
  • NCBIGene 36.3 7804
  • Single exon skipping, size difference: 225
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004631

  • cd LDLa 32aa 7e-06 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • cd LDLa 35aa 1e-05 in ref transcript
  • cd LDLa 36aa 2e-04 in ref transcript
  • cd LDLa 37aa 3e-04 in ref transcript
  • cd LDLa 27aa 0.002 in ref transcript
  • pfam Ldl_recept_b 41aa 2e-11 in ref transcript
    • Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
  • smart LY 43aa 1e-10 in ref transcript
    • Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
  • smart LY 43aa 3e-08 in ref transcript
  • smart LDLa 33aa 9e-07 in ref transcript
    • Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
  • pfam Ldl_recept_b 40aa 1e-06 in ref transcript
  • pfam Ldl_recept_a 37aa 4e-06 in ref transcript
    • Low-density lipoprotein receptor domain class A.
  • pfam Ldl_recept_a 38aa 8e-06 in ref transcript
  • pfam Ldl_recept_a 27aa 3e-05 in ref transcript
  • smart EGF_CA 31aa 8e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • smart LY 36aa 0.002 in ref transcript
  • smart LDLa 34aa 0.003 in ref transcript
  • pfam Ldl_recept_b 44aa 0.005 in ref transcript
  • Changed! COG COG3391 189aa 3e-05 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! COG COG3391 205aa 1e-05 in modified transcript

LRP8

  • refseq_LRP8.F3 refseq_LRP8.R3 100 487
  • NCBIGene 36.3 7804
  • Single exon skipping, size difference: 387
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004631

  • cd LDLa 32aa 7e-06 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • cd LDLa 35aa 1e-05 in ref transcript
  • Changed! cd LDLa 36aa 2e-04 in ref transcript
  • cd LDLa 37aa 3e-04 in ref transcript
  • Changed! cd LDLa 27aa 0.002 in ref transcript
  • pfam Ldl_recept_b 41aa 2e-11 in ref transcript
    • Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
  • smart LY 43aa 1e-10 in ref transcript
    • Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
  • smart LY 43aa 3e-08 in ref transcript
  • smart LDLa 33aa 9e-07 in ref transcript
    • Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
  • pfam Ldl_recept_b 40aa 1e-06 in ref transcript
  • pfam Ldl_recept_a 37aa 4e-06 in ref transcript
    • Low-density lipoprotein receptor domain class A.
  • Changed! pfam Ldl_recept_a 38aa 8e-06 in ref transcript
  • Changed! pfam Ldl_recept_a 27aa 3e-05 in ref transcript
  • smart EGF_CA 31aa 8e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • smart LY 36aa 0.002 in ref transcript
  • smart LDLa 34aa 0.003 in ref transcript
  • pfam Ldl_recept_b 44aa 0.005 in ref transcript
  • COG COG3391 189aa 3e-05 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

LRP8

  • refseq_LRP8.F6 refseq_LRP8.R6 136 313
  • NCBIGene 36.3 7804
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004631

  • cd LDLa 32aa 7e-06 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • cd LDLa 35aa 1e-05 in ref transcript
  • cd LDLa 36aa 2e-04 in ref transcript
  • cd LDLa 37aa 3e-04 in ref transcript
  • cd LDLa 27aa 0.002 in ref transcript
  • pfam Ldl_recept_b 41aa 2e-11 in ref transcript
    • Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
  • smart LY 43aa 1e-10 in ref transcript
    • Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
  • smart LY 43aa 3e-08 in ref transcript
  • smart LDLa 33aa 9e-07 in ref transcript
    • Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.
  • pfam Ldl_recept_b 40aa 1e-06 in ref transcript
  • pfam Ldl_recept_a 37aa 4e-06 in ref transcript
    • Low-density lipoprotein receptor domain class A.
  • pfam Ldl_recept_a 38aa 8e-06 in ref transcript
  • pfam Ldl_recept_a 27aa 3e-05 in ref transcript
  • smart EGF_CA 31aa 8e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • smart LY 36aa 0.002 in ref transcript
  • smart LDLa 34aa 0.003 in ref transcript
  • pfam Ldl_recept_b 44aa 0.005 in ref transcript
  • COG COG3391 189aa 3e-05 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

LRRC17

  • refseq_LRRC17.F1 refseq_LRRC17.R1 198 251
  • NCBIGene 36.3 10234
  • Single exon skipping, size difference: 53
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001031692

  • Changed! cd LRR_RI 93aa 0.008 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! COG COG4886 163aa 1e-05 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

LRRC20

  • refseq_LRRC20.F1 refseq_LRRC20.R1 127 295
  • NCBIGene 36.3 55222
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207119

  • Changed! cd LRR_RI 84aa 0.003 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! COG COG4886 126aa 4e-04 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

LRRC20

  • refseq_LRRC20.F3 refseq_LRRC20.R3 187 337
  • NCBIGene 36.3 55222
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207119

  • Changed! cd LRR_RI 84aa 0.003 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! COG COG4886 126aa 4e-04 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • Changed! cd LRR_RI 74aa 0.009 in modified transcript
  • Changed! COG COG4886 84aa 0.001 in modified transcript

LRRC23

  • refseq_LRRC23.F1 refseq_LRRC23.R1 100 398
  • NCBIGene 36.3 10233
  • Single exon skipping, size difference: 298
  • Exclusion of the stop codon
  • Reference transcript: NM_201650

  • Changed! COG COG4886 163aa 0.008 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • Changed! COG COG4886 169aa 3e-05 in modified transcript

LRRFIP2

  • refseq_LRRFIP2.F1 refseq_LRRFIP2.R1 254 326
  • NCBIGene 36.3 9209
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006309

  • Changed! pfam DUF2051 354aa 1e-54 in ref transcript
    • Double stranded RNA binding protein (DUF2051). This is a novel protein identified as interacting with the leucine-rich repeat domain of human flightless-I, FliI protein.
  • COG SbcC 333aa 3e-07 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! pfam DUF2051 330aa 1e-55 in modified transcript

LRRN2

  • refseq_LRRN5.F1 refseq_LRRN5.R1 270 398
  • NCBIGene 36.3 10446
  • Single exon skipping, size difference: 128
  • Exclusion in 5'UTR
  • Reference transcript: NM_006338

  • cd IGcam 92aa 6e-14 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd LRR_RI 174aa 6e-05 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • cd LRR_RI 236aa 1e-04 in ref transcript
  • pfam I-set 85aa 2e-13 in ref transcript
    • Immunoglobulin I-set domain.
  • TIGR PCC 79aa 3e-09 in ref transcript
    • Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
  • COG COG4886 241aa 1e-10 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

LSR

  • refseq_LSR.F2 refseq_LSR.R2 205 262
  • NCBIGene 36.3 51599
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_205834

LST1

  • refseq_LST1.F2 refseq_LST1.R2 180 225
  • NCBIGene 36.3 7940
  • Alternative 3-prime, size difference: 45
  • Exclusion of the protein initiation site
  • Reference transcript: NM_205839

  • Changed! pfam LST1 39aa 3e-08 in modified transcript
    • LST-1 protein. B144/LST1 is a gene encoded in the human major histocompatibility complex that produces multiple forms of alternatively spliced mRNA and encodes peptides fewer than 100 amino acids in length. B144/LST1 is strongly expressed in dendritic cells. Transfection of B144/LST1 into a variety of cells induces morphologic changes including the production of long, thin filopodia.

LTB

  • refseq_LTB.F1 refseq_LTB.R1 160 206
  • NCBIGene 36.3 4050
  • Single exon skipping, size difference: 46
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002341

  • Changed! cd TNF 154aa 1e-27 in ref transcript
    • Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
  • Changed! smart TNF 142aa 3e-29 in ref transcript
    • Tumour necrosis factor family. Family of cytokines that form homotrimeric or heterotrimeric complexes. TNF mediates mature T-cell receptor-induced apoptosis through the p75 TNF receptor.

LTK

  • refseq_LTK.F1 refseq_LTK.R1 129 312
  • NCBIGene 36.3 4058
  • Single exon skipping, size difference: 183
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002344

  • cd PTKc_ALK_LTK 277aa 1e-160 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinases, Anaplastic Lymphoma Kinase and Leukocyte Tyrosine Kinase. Protein Tyrosine Kinase (PTK) family; Anaplastic lymphoma kinase (ALK) and Leukocyte tyrosine (tyr) kinase (LTK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyr residues in protein substrates. ALK and LTK are orphan receptor tyr kinases (RTKs) whose ligands are not yet well-defined. RTKs contain an extracellular ligand-binding domain, a transmembrane region, and an intracellular tyr kinase domain. They are usually activated through ligand binding, which causes dimerization and autophosphorylation of the intracellular tyr kinase catalytic domain. ALK appears to play an important role in mammalian neural development as well as visceral muscle differentiation in Drosophila. ALK is aberrantly expressed as fusion proteins, due to chromosomal translocations, in about 60% of anaplastic large cell lymphomas (ALCLs). ALK fusion proteins are also found in rare cases of diffuse large B cell lymphomas (DLBCLs). LTK is mainly expressed in B lymphocytes and neuronal tissues. It is important in cell proliferation and survival. Transgenic mice expressing TLK display retarded growth and high mortality rate. In addition, a polymorphism in mouse and human LTK is implicated in the pathogenesis of systemic lupus erythematosus.
  • pfam Pkinase_Tyr 268aa 1e-104 in ref transcript
    • Protein tyrosine kinase.
  • COG SPS1 269aa 1e-20 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

LY6G5C

  • refseq_LY6G5C.F2 refseq_LY6G5C.R2 123 217
  • NCBIGene 36.2 80741
  • Alternative 5-prime, size difference: 94
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_025262

LYCAT

  • refseq_LYCAT.F2 refseq_LYCAT.R2 208 400
  • NCBIGene 36.3 253558
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 192
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_182551

  • smart PlsC 122aa 1e-13 in ref transcript
    • Phosphate acyltransferases. Function in phospholipid biosynthesis and have either glycerolphosphate, 1-acylglycerolphosphate, or 2-acylglycerolphosphoethanolamine acyltransferase activities. Tafazzin, the product of the gene mutated in patients with Barth syndrome, is a member of this family.
  • COG PlsC 151aa 2e-11 in ref transcript
    • 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism].

LYK5

  • refseq_LYK5.F1 refseq_LYK5.R1 103 132
  • NCBIGene 36.3 92335
  • Single exon skipping, size difference: 29
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001003787

  • Changed! cd STKc_OSR1_SPAK 311aa 2e-57 in ref transcript
    • Serine/threonine kinases (STKs), oxidative stress response kinase (OSR1) and Ste20-related proline alanine-rich kinase (SPAK) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The OSR1 and SPAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. SPAK is also referred to as STK39 or PASK (proline-alanine-rich STE20-related kinase). OSR1 and SPAK regulate the activity of cation-chloride cotransporters through direct interaction and phosphorylation. They are also implicated in cytoskeletal rearrangement, cell differentiation, transformation and proliferation. OSR1 and SPAK contain a conserved C-terminal (CCT) domain, which recognizes a unique motif ([RK]FX[VI]) present in their activating kinases (WNK1/WNK4) and their substrates.
  • Changed! smart S_TKc 296aa 2e-41 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 323aa 4e-19 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

LYST

  • refseq_LYST.F1 refseq_LYST.R1 125 285
  • NCBIGene 36.2 1130
  • Alternative 5-prime, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000081

  • Changed! cd Beach 291aa 1e-109 in ref transcript
    • BEACH (Beige and Chediak-Higashi) domains, implicated in membrane trafficking, are present in a family of proteins conserved throughout eukaryotes. This group contains human lysosomal trafficking regulator (LYST), LPS-responsive and beige-like anchor (LRBA) and neurobeachin. Disruption of LYST leads to Chediak-Higashi syndrome, characterized by severe immunodeficiency, albinism, poor blood coagulation and neurologic problems. Neurobeachin is a candidate gene linked to autism. LBRA seems to be upregulated in several cancer types. It has been shown that the BEACH domain itself is important for the function of these proteins.
  • Changed! cd WD40 302aa 2e-22 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! pfam Beach 291aa 1e-114 in ref transcript
    • Beige/BEACH domain.
  • Changed! COG COG2319 253aa 7e-11 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

M-RIP

  • refseq_M-RIP.F1 refseq_M-RIP.R1 114 177
  • NCBIGene 36.3 23164
  • Single exon skipping, size difference: 63
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015134

  • cd PH_outspread 104aa 5e-48 in ref transcript
    • Outspread Pleckstrin homology (PH) domain. Outspread contains two PH domains and a C-terminal coiled-coil region. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
  • cd PH 92aa 1e-09 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • smart PH 94aa 7e-13 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • pfam PH 102aa 1e-06 in ref transcript
    • PH domain. PH stands for pleckstrin homology.

MACF1

  • refseq_MACF1.F1 refseq_MACF1.R1 102 120
  • NCBIGene 36.3 23499
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033044

  • cd SPEC 210aa 1e-20 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd SPEC 215aa 3e-19 in ref transcript
  • cd SPEC 217aa 5e-17 in ref transcript
  • cd SPEC 218aa 1e-16 in ref transcript
  • cd SPEC 214aa 1e-15 in ref transcript
  • cd SPEC 214aa 1e-12 in ref transcript
  • cd SPEC 213aa 8e-12 in ref transcript
  • cd SPEC 216aa 1e-11 in ref transcript
  • cd SPEC 217aa 4e-11 in ref transcript
  • cd SPEC 229aa 2e-10 in ref transcript
  • cd SPEC 222aa 6e-10 in ref transcript
  • cd SPEC 245aa 2e-09 in ref transcript
  • cd SPEC 218aa 6e-09 in ref transcript
  • cd SPEC 248aa 7e-08 in ref transcript
  • cd SPEC 216aa 9e-08 in ref transcript
  • cd SPEC 213aa 1e-07 in ref transcript
  • cd SPEC 216aa 1e-06 in ref transcript
  • cd EFh 63aa 1e-06 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd SPEC 214aa 4e-06 in ref transcript
  • cd SPEC 238aa 3e-05 in ref transcript
  • cd SPEC 198aa 1e-04 in ref transcript
  • cd SPEC 214aa 0.005 in ref transcript
  • Changed! smart GAS2 79aa 5e-32 in ref transcript
    • Growth-Arrest-Specific Protein 2 Domain. GROWTH-ARREST-SPECIFIC PROTEIN 2 Domain.
  • pfam SMC_N 882aa 4e-13 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • smart SPEC 98aa 8e-11 in ref transcript
    • Spectrin repeats.
  • smart SPEC 101aa 2e-10 in ref transcript
  • smart SPEC 103aa 8e-10 in ref transcript
  • pfam Plectin 42aa 7e-09 in ref transcript
    • Plectin repeat. This family includes repeats from plectin, desmoplakin, envoplakin and bullous pemphigoid antigen.
  • pfam Plectin 45aa 9e-08 in ref transcript
  • smart SPEC 102aa 1e-07 in ref transcript
  • smart SPEC 103aa 2e-07 in ref transcript
  • pfam Plectin 45aa 2e-07 in ref transcript
  • smart SPEC 102aa 3e-07 in ref transcript
  • pfam Plectin 45aa 3e-07 in ref transcript
  • TIGR SMC_prok_B 802aa 5e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • pfam Plectin 43aa 2e-06 in ref transcript
  • pfam Plectin 45aa 3e-06 in ref transcript
  • pfam Plectin 43aa 4e-06 in ref transcript
  • pfam Plectin 45aa 6e-06 in ref transcript
  • smart SPEC 101aa 7e-06 in ref transcript
  • smart SPEC 103aa 8e-06 in ref transcript
  • TIGR SMC_prok_B 408aa 3e-05 in ref transcript
  • smart SPEC 102aa 6e-05 in ref transcript
  • pfam Spectrin 109aa 3e-04 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • TIGR SMC_prok_B 308aa 4e-04 in ref transcript
  • smart SPEC 101aa 0.006 in ref transcript
  • smart SPEC 111aa 0.006 in ref transcript
  • smart SPEC 119aa 0.009 in ref transcript
  • COG Smc 839aa 3e-19 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 828aa 7e-09 in ref transcript
  • COG FRQ1 56aa 0.001 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • PRK mukB 199aa 0.002 in ref transcript
    • cell division protein MukB; Provisional.
  • Changed! COG SbcC 571aa 0.010 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! smart GAS2 73aa 5e-34 in modified transcript

MAD1L1

  • refseq_MAD1L1.F2 refseq_MAD1L1.R2 162 202
  • NCBIGene 36.3 8379
  • Alternative 3-prime, size difference: 40
  • Inclusion in 5'UTR
  • Reference transcript: NM_001013837

  • pfam MAD 705aa 1e-154 in ref transcript
    • Mitotic checkpoint protein. This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in yeast and higher eukaryotes. In S.cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated.
  • COG Smc 213aa 0.006 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

MADCAM1

  • refseq_MADCAM1.F1 refseq_MADCAM1.R1 115 376
  • NCBIGene 36.3 8174
  • Single exon skipping, size difference: 261
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130760

  • Changed! pfam Adhes-Ig_like 112aa 5e-40 in ref transcript
    • Adhesion molecule, immunoglobulin-like. Members of this family are found in a set of mucosal cellular adhesion proteins and adopt an immunoglobulin-like beta-sandwich structure, with seven strands arranged in two beta-sheets in a Greek-key topology. They are essential for recruitment of lymphocytes to specific tissues.
  • Changed! pfam Adhes-Ig_like 110aa 2e-39 in modified transcript

MADD

  • refseq_MADD.F10 refseq_MADD.R10 118 178
  • NCBIGene 36.3 8567
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003682

  • pfam DENN 230aa 3e-62 in ref transcript
    • DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
  • pfam uDENN 91aa 2e-20 in ref transcript
    • uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
  • pfam dDENN 74aa 2e-11 in ref transcript
    • dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.

MADD

  • refseq_MADD.F2 refseq_MADD.R2 181 310
  • NCBIGene 36.3 8567
  • Alternative 5-prime, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003682

  • pfam DENN 230aa 3e-62 in ref transcript
    • DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
  • pfam uDENN 91aa 2e-20 in ref transcript
    • uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
  • pfam dDENN 74aa 2e-11 in ref transcript
    • dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.

MADD

  • refseq_MADD.F3 refseq_MADD.R3 116 186
  • NCBIGene 36.3 8567
  • Single exon skipping, size difference: 70
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003682

  • pfam DENN 230aa 3e-62 in ref transcript
    • DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
  • pfam uDENN 91aa 2e-20 in ref transcript
    • uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
  • pfam dDENN 74aa 2e-11 in ref transcript
    • dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.

MADD

  • refseq_MADD.F5 refseq_MADD.R5 152 215
  • NCBIGene 36.3 8567
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003682

  • pfam DENN 230aa 3e-62 in ref transcript
    • DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
  • pfam uDENN 91aa 2e-20 in ref transcript
    • uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
  • pfam dDENN 74aa 2e-11 in ref transcript
    • dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.

MADD

  • refseq_MADD.F8 refseq_MADD.R8 175 229
  • NCBIGene 36.3 8567
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003682

  • pfam DENN 230aa 3e-62 in ref transcript
    • DENN (AEX-3) domain. DENN (after differentially expressed in neoplastic vs normal cells) is a domain which occurs in several proteins involved in Rab- mediated processes or regulation of MAPK signalling pathways.
  • pfam uDENN 91aa 2e-20 in ref transcript
    • uDENN domain. This region is always found associated with pfam02141. It is predicted to form an all beta domain.
  • pfam dDENN 74aa 2e-11 in ref transcript
    • dDENN domain. This region is always found associated with pfam02141. It is predicted to form a globular domain. This domain is predicted to be completely alpha helical. Although not statistically supported it has been suggested that this domain may be similar to members of the Rho/Rac/Cdc42 GEF family.
  • Changed! pfam HPC2 208aa 0.007 in modified transcript
    • Histone promoter control 2 (HPC2). HPC2 is required for cell-cycle regulation of histone transcription. It regulates transcription of the histone genes during the S-phase of the cell cycle by repressing transcription at other cell cycle stages. HPC2 mutants display synthetic interactions with FACT complex which allows RNA Pol II to elongate through nucleosomes.

MAEA

  • refseq_MAEA.F1 refseq_MAEA.R1 117 240
  • NCBIGene 36.3 10296
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001017405

  • smart CRA 93aa 6e-13 in ref transcript
    • CT11-RanBPM. protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi).
  • Changed! smart CTLH 58aa 1e-10 in ref transcript
    • C-terminal to LisH motif. Alpha-helical motif of unknown function.
  • Changed! smart LisH 34aa 2e-04 in ref transcript
    • Lissencephaly type-1-like homology motif. Alpha-helical motif present in Lis1, treacle, Nopp140, some katanin p60 subunits, muskelin, tonneau, LEUNIG and numerous WD40 repeat-containing proteins. It is suggested that LisH motifs contribute to the regulation of microtubule dynamics, either by mediating dimerisation, or else by binding cytoplasmic dynein heavy chain or microtubules directly.
  • Changed! smart CTLH 51aa 8e-06 in modified transcript

MAG

  • refseq_MAG.F1 refseq_MAG.R1 187 232
  • NCBIGene 36.3 4099
  • Single exon skipping, size difference: 45
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002361

  • cd IGcam 81aa 2e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 80aa 1e-04 in ref transcript
  • smart IG_like 78aa 3e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam I-set 83aa 5e-06 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam C2-set_2 82aa 3e-05 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

MAGEA10

  • refseq_MAGEA10.F1 refseq_MAGEA10.R1 293 384
  • NCBIGene 36.3 4109
  • Single exon skipping, size difference: 91
  • Exclusion in 5'UTR
  • Reference transcript: NM_001011543

  • pfam MAGE 171aa 6e-79 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.

MAGEA2

  • refseq_MAGEA2.F1 refseq_MAGEA2.R1 110 192
  • NCBIGene 36.3 4101
  • Single exon skipping, size difference: 82
  • Exclusion in 5'UTR
  • Reference transcript: NM_175742

  • pfam MAGE 171aa 1e-64 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.

MAGEA2

  • refseq_MAGEA2.F3 refseq_MAGEA2.R3 148 246
  • NCBIGene 36.3 4101
  • Single exon skipping, size difference: 98
  • Exclusion in 5'UTR
  • Reference transcript: NM_175743

  • pfam MAGE 171aa 1e-64 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.

MAGEB1

  • refseq_MAGEB1.F1 refseq_MAGEB1.R1 114 197
  • NCBIGene 36.3 4112
  • Single exon skipping, size difference: 83
  • Exclusion in 5'UTR
  • Reference transcript: NM_002363

  • pfam MAGE 171aa 4e-78 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.

MAGED1

  • refseq_MAGED1.F2 refseq_MAGED1.R2 111 279
  • NCBIGene 36.3 9500
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005333

  • pfam MAGE 170aa 1e-70 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.

MAGED4B

  • refseq_MAGED4.F1 refseq_MAGED4.R1 131 161
  • NCBIGene 36.3 81557
  • Alternative 3-prime, size difference: 30
  • Inclusion in 5'UTR
  • Reference transcript: NM_177535

  • pfam MAGE 170aa 8e-75 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
  • PRK PRK07003 168aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.

MAGI1

  • refseq_MAGI1.F1 refseq_MAGI1.R1 194 283
  • NCBIGene 36.3 9223
  • Single exon skipping, size difference: 89
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001033057

  • cd PDZ_signaling 95aa 7e-17 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 78aa 8e-16 in ref transcript
  • cd PDZ_signaling 76aa 3e-13 in ref transcript
  • cd GMPK 70aa 4e-12 in ref transcript
    • Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
  • cd PDZ_signaling 68aa 1e-10 in ref transcript
  • cd PDZ_signaling 80aa 6e-10 in ref transcript
  • cd WW 30aa 3e-06 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd WW 30aa 0.001 in ref transcript
  • smart GuKc 183aa 3e-31 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • smart PDZ 83aa 1e-18 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 99aa 3e-17 in ref transcript
  • smart PDZ 87aa 3e-14 in ref transcript
  • smart PDZ 87aa 4e-14 in ref transcript
  • smart PDZ 84aa 8e-14 in ref transcript
  • TIGR degP_htrA_DO 235aa 6e-11 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • pfam WW 30aa 3e-07 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
  • pfam WW 30aa 1e-05 in ref transcript
  • COG Gmk 62aa 5e-14 in ref transcript
    • Guanylate kinase [Nucleotide transport and metabolism].
  • COG Prc 76aa 8e-07 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 77aa 8e-05 in ref transcript
  • COG Prc 70aa 3e-04 in ref transcript

MAGI1

  • refseq_MAGI1.F3 refseq_MAGI1.R3 257 341
  • NCBIGene 36.3 9223
  • Single exon skipping, size difference: 84
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001033057

  • cd PDZ_signaling 95aa 7e-17 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 78aa 8e-16 in ref transcript
  • cd PDZ_signaling 76aa 3e-13 in ref transcript
  • cd GMPK 70aa 4e-12 in ref transcript
    • Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
  • cd PDZ_signaling 68aa 1e-10 in ref transcript
  • cd PDZ_signaling 80aa 6e-10 in ref transcript
  • cd WW 30aa 3e-06 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd WW 30aa 0.001 in ref transcript
  • smart GuKc 183aa 3e-31 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • smart PDZ 83aa 1e-18 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 99aa 3e-17 in ref transcript
  • smart PDZ 87aa 3e-14 in ref transcript
  • smart PDZ 87aa 4e-14 in ref transcript
  • smart PDZ 84aa 8e-14 in ref transcript
  • TIGR degP_htrA_DO 235aa 6e-11 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • pfam WW 30aa 3e-07 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
  • pfam WW 30aa 1e-05 in ref transcript
  • COG Gmk 62aa 5e-14 in ref transcript
    • Guanylate kinase [Nucleotide transport and metabolism].
  • COG Prc 76aa 8e-07 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 77aa 8e-05 in ref transcript
  • COG Prc 70aa 3e-04 in ref transcript

MAGI3

  • refseq_MAGI3.F1 refseq_MAGI3.R1 107 182
  • NCBIGene 36.2 260425
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020965

  • cd PDZ_signaling 79aa 2e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd GMPK 101aa 2e-15 in ref transcript
    • Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
  • cd PDZ_signaling 87aa 3e-15 in ref transcript
  • cd PDZ_signaling 78aa 4e-11 in ref transcript
  • cd PDZ_signaling 65aa 2e-10 in ref transcript
  • cd WW 30aa 3e-06 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd PDZ_signaling 68aa 3e-04 in ref transcript
  • cd PDZ_signaling 35aa 0.002 in ref transcript
  • smart GuKc 86aa 2e-20 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • smart PDZ 81aa 4e-18 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 88aa 9e-15 in ref transcript
  • smart PDZ 81aa 3e-13 in ref transcript
  • smart PDZ 84aa 5e-13 in ref transcript
  • smart PDZ 82aa 1e-07 in ref transcript
  • pfam WW 30aa 4e-07 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
  • smart PDZ 40aa 0.009 in ref transcript
  • PRK gmk 74aa 7e-18 in ref transcript
    • guanylate kinase; Provisional.
  • COG Prc 67aa 5e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 55aa 8e-05 in ref transcript
  • COG Prc 83aa 0.002 in ref transcript
  • Changed! COG HUL4 53aa 0.007 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd WW 31aa 2e-04 in modified transcript
  • Changed! pfam WW 30aa 9e-06 in modified transcript
  • Changed! COG HUL4 76aa 2e-04 in modified transcript

MAL

  • refseq_MAL.F2 refseq_MAL.R2 191 317
  • NCBIGene 36.3 4118
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002371

  • Changed! pfam MARVEL 115aa 5e-14 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
  • Changed! pfam MARVEL 75aa 0.002 in modified transcript

MAL

  • refseq_MAL.F3 refseq_MAL.R3 111 279
  • NCBIGene 36.3 4118
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002371

  • Changed! pfam MARVEL 115aa 5e-14 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
  • Changed! pfam MARVEL 65aa 4e-06 in modified transcript

MALT1

  • refseq_MALT1.F1 refseq_MALT1.R1 101 134
  • NCBIGene 36.3 10892
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006785

  • cd IGcam 67aa 4e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 65aa 5e-05 in ref transcript
  • pfam Peptidase_C14 218aa 3e-21 in ref transcript
    • Caspase domain.
  • smart IG_like 70aa 1e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 76aa 2e-06 in ref transcript
  • COG COG4249 234aa 8e-07 in ref transcript
    • Uncharacterized protein containing caspase domain [General function prediction only].

MANBAL

  • refseq_MANBAL.F1 refseq_MANBAL.R1 232 349
  • NCBIGene 36.3 63905
  • Single exon skipping, size difference: 117
  • Exclusion in 5'UTR
  • Reference transcript: NM_022077

MANEAL

  • refseq_MANEAL.F1 refseq_MANEAL.R1 156 356
  • NCBIGene 36.3 149175
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 200
  • Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031740

MAP2

  • refseq_MAP2.F1 refseq_MAP2.R1 119 290
  • NCBIGene 36.3 4133
  • Single exon skipping, size difference: 171
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_002374

  • pfam MAP2_projctn 1129aa 0.0 in ref transcript
    • MAP2/Tau projection domain. This domain is found in the MAP2/Tau family of proteins which includes MAP2, MAP4, Tau, and their homologs. All isoforms contain a conserved C-terminal domain containing tubulin-binding repeats (pfam00418), and a N-terminal projection domain of varying size. This domain has a net negative charge and exerts a long-range repulsive force. This provides a mechanism that can regulate microtubule spacing which might facilitate efficient organelle transport.
  • pfam Tubulin-binding 31aa 5e-09 in ref transcript
    • Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
  • pfam Tubulin-binding 32aa 1e-06 in ref transcript
  • pfam Tubulin-binding 32aa 6e-06 in ref transcript

MAP2

  • refseq_MAP2.F3 refseq_MAP2.R3 300 393
  • NCBIGene 36.3 4133
  • Single exon skipping, size difference: 93
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_002374

  • pfam MAP2_projctn 1129aa 0.0 in ref transcript
    • MAP2/Tau projection domain. This domain is found in the MAP2/Tau family of proteins which includes MAP2, MAP4, Tau, and their homologs. All isoforms contain a conserved C-terminal domain containing tubulin-binding repeats (pfam00418), and a N-terminal projection domain of varying size. This domain has a net negative charge and exerts a long-range repulsive force. This provides a mechanism that can regulate microtubule spacing which might facilitate efficient organelle transport.
  • pfam Tubulin-binding 31aa 5e-09 in ref transcript
    • Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
  • pfam Tubulin-binding 32aa 1e-06 in ref transcript
  • pfam Tubulin-binding 32aa 6e-06 in ref transcript
  • Changed! pfam Tubulin-binding 31aa 6e-09 in modified transcript

MAP3K3

  • refseq_MAP3K3.F2 refseq_MAP3K3.R2 133 226
  • NCBIGene 36.3 4215
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203351

  • cd STKc_MEKK3 266aa 1e-158 in ref transcript
    • Serine/threonine kinases (STKs), MAP/ERK kinase kinase 3 (MEKK3) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MEKK3 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MEKK3 is a mitogen-activated protein kinase (MAPK) kinase kinase (MAPKKK or MKKK or MAP3K), that phosphorylates and activates the MAPK kinase MEK5 (or MKK5), which in turn phosphorylates and activates extracellular signal-regulated kinase 5 (ERK5). The ERK5 cascade plays roles in promoting cell proliferation, differentiation, neuronal survival, and neuroprotection. MEKK3 plays an essential role in embryonic angiogenesis and early heart development. In addition, MEKK3 is involved in interleukin-1 receptor and Toll-like receptor 4 signaling. It is also a specific regulator of the proinflammatory cytokines IL-6 and GM-CSF in some immune cells. MEKK3 also regulates calcineurin, which plays a critical role in T cell activation, apoptosis, skeletal myocyte differentiation, and cardiac hypertrophy.
  • cd PB1_Mekk2_3 79aa 1e-34 in ref transcript
    • The PB1 domain is present in the two mitogen-activated protein kinase kinases MEKK2 and MEKK3 which are two members of the signaling kinase cascade involved in angiogenesis and early cardiovascular development. The PB1 domain of MEKK2 (and/or MEKK3) interacts with the PB1 domain of another member of the kinase cascade Map2k5. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. Interactions of PB1 domains with other protein domains have been described as noncanonical PB1-interactions. The PB1 domain module is conserved in amoebas, fungi, animals, and plants. The MEKK2 and MEKK3 proteins contain a type II PB1 domain.
  • smart S_TKc 250aa 3e-79 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart PB1 77aa 6e-12 in ref transcript
    • PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
  • COG SPS1 263aa 4e-30 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MAP3K4

  • refseq_MAP3K4.F1 refseq_MAP3K4.R1 139 289
  • NCBIGene 36.3 4216
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005922

  • cd STKc_MEKK4 260aa 1e-134 in ref transcript
    • Serine/threonine kinases (STKs), MAP/ERK kinase kinase 4 (MEKK4) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MEKK4 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MEKK4 is a mitogen-activated protein kinase (MAPK) kinase kinase (MAPKKK or MKKK or MAP3K), that phosphorylates and activates MAPK kinases (MAPKKs or MKKs or MAP2Ks), which in turn phosphorylate and activate MAPKs during signaling cascades that are important in mediating cellular responses to extracellular signals. MEKK4 activates the c-Jun N-terminal kinase (JNK) and p38 MAPK signaling pathways by directly activating their respective MAPKKs, MKK4/MKK7 and MKK3/MKK6. JNK and p38 are collectively known as stress-activated MAPKs, as they are activated in response to a variety of environmental stresses and pro-inflammatory cytokines. MEKK4 also plays roles in the re-polarization of the actin cytoskeleton in response to osmotic stress, in the proper closure of the neural tube, in cardiovascular development, and in immune responses.
  • smart S_TKc 248aa 7e-73 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 267aa 3e-35 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MAP3K7

  • refseq_MAP3K7.F1 refseq_MAP3K7.R1 207 323
  • NCBIGene 36.3 6885
  • Single exon skipping, size difference: 116
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_145331

  • cd S_TKc 240aa 4e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart TyrKc 242aa 3e-61 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • Changed! pfam Mnd1 83aa 8e-04 in ref transcript
    • Mnd1 family. This family of proteins includes MND1 from Saccharomyces cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair.
  • COG SPS1 318aa 1e-26 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MAP3K7

  • refseq_MAP3K7.F3 refseq_MAP3K7.R3 152 233
  • NCBIGene 36.3 6885
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145331

  • cd S_TKc 240aa 4e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart TyrKc 242aa 3e-61 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • pfam Mnd1 83aa 8e-04 in ref transcript
    • Mnd1 family. This family of proteins includes MND1 from Saccharomyces cerevisiae. The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair.
  • COG SPS1 318aa 1e-26 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MAP4K4

  • refseq_MAP4K4.F1 refseq_MAP4K4.R1 146 377
  • NCBIGene 36.3 9448
  • Single exon skipping, size difference: 231
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145686

  • cd STKc_MAP4K4_6 282aa 1e-149 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
  • smart CNH 299aa 2e-87 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 255aa 2e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 293aa 5e-27 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG ROM1 286aa 1e-06 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

MAP4K4

  • refseq_MAP4K4.F3 refseq_MAP4K4.R3 172 265
  • NCBIGene 36.3 9448
  • Alternative 3-prime, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145686

  • cd STKc_MAP4K4_6 282aa 1e-149 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
  • smart CNH 299aa 2e-87 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 255aa 2e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 293aa 5e-27 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG ROM1 286aa 1e-06 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

MAP4K4

  • refseq_MAP4K4.F5 refseq_MAP4K4.R5 136 160
  • NCBIGene 36.3 9448
  • Alternative 5-prime, size difference: 24
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_145686

  • cd STKc_MAP4K4_6 282aa 1e-149 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
  • Changed! smart CNH 299aa 2e-87 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 255aa 2e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 293aa 5e-27 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! COG ROM1 286aa 1e-06 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].
  • Changed! smart CNH 307aa 4e-86 in modified transcript
  • Changed! COG ROM1 294aa 2e-06 in modified transcript

MAP4K5

  • refseq_MAP4K5.F1 refseq_MAP4K5.R1 256 304
  • NCBIGene 36.3 11183
  • Alternative 5-prime, size difference: 48
  • Exclusion in 5'UTR
  • Reference transcript: NM_198794

  • cd STKc_MAP4K5 267aa 1e-159 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 5 (MAPKKKK5 or MAP4K5) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K5 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain, similar to MAP4K4/6. MAP4Ks are involved in some MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K5, also called germinal center kinase-related enzyme (GCKR), has been shown to activate the MAPK c-Jun N-terminal kinase (JNK). MAP4K5 also facilitates Wnt signaling in B cells, and may therefore be implicated in the control of cell fate, proliferation, and polarity.
  • smart CNH 316aa 2e-81 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 248aa 3e-66 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 349aa 3e-33 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MAPK10

  • refseq_MAPK10.F1 refseq_MAPK10.R1 138 197
  • NCBIGene 36.3 5602
  • Single exon skipping, size difference: 59
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_138982

  • Changed! cd S_TKc 297aa 1e-66 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 284aa 1e-67 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 292aa 1e-34 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 62aa 1e-10 in modified transcript
  • Changed! smart STYKc 61aa 3e-09 in modified transcript
    • Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.
  • Changed! COG SPS1 59aa 0.003 in modified transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MAPK7

  • refseq_MAPK7.F1 refseq_MAPK7.R1 97 500
  • NCBIGene 36.3 5598
  • Multiple exon skipping, size difference: 403
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_139033

  • Changed! cd S_TKc 294aa 2e-74 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 283aa 8e-72 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 293aa 9e-45 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 205aa 2e-51 in modified transcript
  • Changed! smart S_TKc 205aa 2e-53 in modified transcript
  • Changed! PTZ PTZ00024 214aa 4e-34 in modified transcript

MAPK7

  • refseq_MAPK7.F4 refseq_MAPK7.R4 157 245
  • NCBIGene 36.3 5598
  • Alternative 5-prime, size difference: 88
  • Exclusion in 5'UTR
  • Reference transcript: NM_002749

  • cd S_TKc 294aa 2e-74 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 283aa 8e-72 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00024 293aa 9e-45 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

MAPK8IP3

  • refseq_MAPK8IP3.F1 refseq_MAPK8IP3.R1 102 120
  • NCBIGene 36.3 23162
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015133

  • pfam Jnk-SapK_ap_N 158aa 1e-69 in ref transcript
    • JNK_SAPK-associated protein-1. This is the N-terminal 200 residues of a set of proteins conserved from yeasts to humans. Most of the proteins in this entry have an RhoGEF pfam00621 domain at their C-terminal end.
  • pfam Trypan_PARP 70aa 2e-04 in ref transcript
    • Procyclic acidic repetitive protein (PARP). This family consists of several Trypanosoma brucei procyclic acidic repetitive protein (PARP) like sequences. The procyclic acidic repetitive protein (parp) genes of Trypanosoma brucei encode a small family of abundant surface proteins whose expression is restricted to the procyclic form of the parasite. They are found at two unlinked loci, parpA and parpB; transcription of both loci is developmentally regulated.
  • Changed! TIGR SMC_prok_B 127aa 3e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG SbcC 155aa 1e-05 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! COG Smc 158aa 5e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 185aa 7e-05 in modified transcript
  • Changed! COG Smc 176aa 5e-05 in modified transcript

MAPKAP1

  • refseq_MAPKAP1.F1 refseq_MAPKAP1.R1 129 270
  • NCBIGene 36.3 79109
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001006617

  • Changed! pfam SIN1 464aa 1e-140 in ref transcript
    • Stress-activated map kinase interacting protein 1 (SIN1). This family consists of several stress-activated map kinase interacting protein 1 (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1 mitogen-activated protein (MAP) kinase is a member of the eukaryotic stress-activated MAP kinase (SAPK) family. Sin1 interacts with Sty1/Spc1. Cells lacking Sin1 display many, but not all, of the phenotypes of cells lacking the Sty1/Spc1 MAP kinase including sterility, multiple stress sensitivity and a cell-cycle delay. Sin1 is phosphorylated after stress but this is not Sty1/Spc1-dependent.
  • Changed! pfam SIN1 417aa 1e-107 in modified transcript

MAPKAP1

  • refseq_MAPKAP1.F3 refseq_MAPKAP1.R3 226 334
  • NCBIGene 36.3 79109
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001006617

  • Changed! pfam SIN1 464aa 1e-140 in ref transcript
    • Stress-activated map kinase interacting protein 1 (SIN1). This family consists of several stress-activated map kinase interacting protein 1 (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1 mitogen-activated protein (MAP) kinase is a member of the eukaryotic stress-activated MAP kinase (SAPK) family. Sin1 interacts with Sty1/Spc1. Cells lacking Sin1 display many, but not all, of the phenotypes of cells lacking the Sty1/Spc1 MAP kinase including sterility, multiple stress sensitivity and a cell-cycle delay. Sin1 is phosphorylated after stress but this is not Sty1/Spc1-dependent.
  • Changed! pfam SIN1 428aa 1e-131 in modified transcript

MAPKAP1

  • refseq_MAPKAP1.F5 refseq_MAPKAP1.R5 135 463
  • NCBIGene 36.3 79109
  • Single exon skipping, size difference: 328
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001006617

  • Changed! pfam SIN1 464aa 1e-140 in ref transcript
    • Stress-activated map kinase interacting protein 1 (SIN1). This family consists of several stress-activated map kinase interacting protein 1 (MAPKAP1 OR SIN1) sequences. The fission yeast Sty1/Spc1 mitogen-activated protein (MAP) kinase is a member of the eukaryotic stress-activated MAP kinase (SAPK) family. Sin1 interacts with Sty1/Spc1. Cells lacking Sin1 display many, but not all, of the phenotypes of cells lacking the Sty1/Spc1 MAP kinase including sterility, multiple stress sensitivity and a cell-cycle delay. Sin1 is phosphorylated after stress but this is not Sty1/Spc1-dependent.
  • Changed! pfam SIN1 289aa 4e-87 in modified transcript

MAPT

  • refseq_MAPT.F2 refseq_MAPT.R2 128 221
  • NCBIGene 36.3 4137
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016835

  • Changed! pfam Tubulin-binding 31aa 8e-10 in ref transcript
    • Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
  • pfam Tubulin-binding 31aa 2e-08 in ref transcript
  • pfam Tubulin-binding 32aa 2e-06 in ref transcript
  • pfam Tubulin-binding 32aa 3e-06 in ref transcript
  • pfam Hid1 123aa 3e-04 in ref transcript
    • High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.

MAPT

  • refseq_MAPT.F3 refseq_MAPT.R3 181 379
  • NCBIGene 36.3 4137
  • Single exon skipping, size difference: 198
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016835

  • pfam Tubulin-binding 31aa 8e-10 in ref transcript
    • Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
  • pfam Tubulin-binding 31aa 2e-08 in ref transcript
  • pfam Tubulin-binding 32aa 2e-06 in ref transcript
  • pfam Tubulin-binding 32aa 3e-06 in ref transcript
  • pfam Hid1 123aa 3e-04 in ref transcript
    • High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.

MAPT

  • refseq_MAPT.F5 refseq_MAPT.R5 109 283
  • NCBIGene 36.3 4137
  • Multiple exon skipping, size difference: 174
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016835

  • pfam Tubulin-binding 31aa 8e-10 in ref transcript
    • Tau and MAP protein, tubulin-binding repeat. This family includes the vertebrate proteins MAP2, MAP4 and Tau, as well as other animal homologs. MAP4 is present in many tissues but is usually absent from neurons; MAP2 and Tau are mainly neuronal. Members of this family have the ability to bind to and stabilise microtubules. As a result, they are involved in neuronal migration, supporting dendrite elongation, and regulating microtubules during mitotic metaphase. Note that Tau is involved in neurofibrillary tangle formation in Alzheimer's disease and some other dementias. This family features a C-terminal microtubule binding repeat that contains a conserved KXGS motif.
  • pfam Tubulin-binding 31aa 2e-08 in ref transcript
  • pfam Tubulin-binding 32aa 2e-06 in ref transcript
  • pfam Tubulin-binding 32aa 3e-06 in ref transcript
  • Changed! pfam Hid1 123aa 3e-04 in ref transcript
    • High-temperature-induced dauer-formation protein. Hid1 (high-temperature-induced dauer-formation protein 1) represents proteins of approximately 800 residues long and is conserved from fungi to humans. It contains up to seven potential transmembrane domains separated by regions of low complexity. Functionally it might be involved in vesicle secretion or be an inter-cellular signalling protein or be a novel insulin receptor.

MARCH2

  • refseq_MARCH2.F1 refseq_MARCH2.R1 123 394
  • NCBIGene 36.3 51257
  • Single exon skipping, size difference: 271
  • Exclusion in 5'UTR
  • Reference transcript: NM_016496

  • smart RINGv 48aa 1e-12 in ref transcript
    • The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Classical RING domain: C-x (2) -C-x (9-39)-C-x(1-3)-H-x(2-3)-C-x(2)-C-x(4-48) -C-x(2)-C.
  • pfam Vpu 52aa 0.002 in ref transcript
    • Vpu protein. The Vpu protein contains an N-terminal transmembrane spanning region and a C-terminal cytoplasmic region. The HIV-1 Vpu protein stimulates virus production by enhancing the release of viral particles from infected cells. The VPU protein binds specifically to CD4.
  • COG SSM4 58aa 3e-12 in ref transcript
    • Protein involved in mRNA turnover and stability [RNA processing and modification].

MARCH2

  • refseq_MARCH2.F3 refseq_MARCH2.R3 159 369
  • NCBIGene 36.3 51257
  • Single exon skipping, size difference: 210
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005415

  • smart RINGv 48aa 1e-12 in ref transcript
    • The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Classical RING domain: C-x (2) -C-x (9-39)-C-x(1-3)-H-x(2-3)-C-x(2)-C-x(4-48) -C-x(2)-C.
  • Changed! pfam Vpu 52aa 0.002 in ref transcript
    • Vpu protein. The Vpu protein contains an N-terminal transmembrane spanning region and a C-terminal cytoplasmic region. The HIV-1 Vpu protein stimulates virus production by enhancing the release of viral particles from infected cells. The VPU protein binds specifically to CD4.
  • COG SSM4 58aa 3e-12 in ref transcript
    • Protein involved in mRNA turnover and stability [RNA processing and modification].

MARCH8

  • refseq_MARCH8.F1 refseq_MARCH8.R1 124 217
  • NCBIGene 36.3 220972
  • Single exon skipping, size difference: 93
  • Exclusion in 5'UTR
  • Reference transcript: NM_001002265

  • smart RINGv 49aa 1e-15 in ref transcript
    • The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Classical RING domain: C-x (2) -C-x (9-39)-C-x(1-3)-H-x(2-3)-C-x(2)-C-x(4-48) -C-x(2)-C.
  • COG SSM4 60aa 8e-16 in ref transcript
    • Protein involved in mRNA turnover and stability [RNA processing and modification].

MARK2

  • refseq_MARK2.F2 refseq_MARK2.R2 106 133
  • NCBIGene 36.3 2011
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039469

  • cd S_TKc 253aa 4e-87 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd UBA 38aa 0.001 in ref transcript
    • Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
  • smart S_TKc 241aa 8e-89 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam KA1 43aa 7e-14 in ref transcript
    • Kinase associated domain 1.
  • smart UBA 37aa 3e-05 in ref transcript
    • Ubiquitin associated domain. Present in Rad23, SNF1-like kinases. The newly-found UBA in p62 is known to bind ubiquitin.
  • PTZ PTZ00263 250aa 3e-46 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • COG RER1 45aa 0.002 in ref transcript
    • Golgi protein involved in Golgi-to-ER retrieval [Intracellular trafficking and secretion].

MARK2

  • refseq_MARK2.F4 refseq_MARK2.R4 112 274
  • NCBIGene 36.3 2011
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039469

  • cd S_TKc 253aa 4e-87 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd UBA 38aa 0.001 in ref transcript
    • Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
  • smart S_TKc 241aa 8e-89 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam KA1 43aa 7e-14 in ref transcript
    • Kinase associated domain 1.
  • smart UBA 37aa 3e-05 in ref transcript
    • Ubiquitin associated domain. Present in Rad23, SNF1-like kinases. The newly-found UBA in p62 is known to bind ubiquitin.
  • PTZ PTZ00263 250aa 3e-46 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • COG RER1 45aa 0.002 in ref transcript
    • Golgi protein involved in Golgi-to-ER retrieval [Intracellular trafficking and secretion].

MATN2

  • refseq_MATN2.F1 refseq_MATN2.R1 139 196
  • NCBIGene 36.3 4147
  • Alternative 3-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002380

  • cd vWA_Matrilin 223aa 7e-99 in ref transcript
    • VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
  • Changed! cd vWA_Matrilin 241aa 5e-92 in ref transcript
  • cd vWA_Matrilin 44aa 5e-09 in ref transcript
  • cd vWA_Matrilin 44aa 2e-06 in ref transcript
  • cd vWA_Matrilin 44aa 2e-06 in ref transcript
  • cd vWA_Matrilin 43aa 5e-06 in ref transcript
  • cd vWA_Matrilin 52aa 5e-06 in ref transcript
  • cd vWA_Matrilin 43aa 7e-06 in ref transcript
  • cd vWA_Matrilin 50aa 5e-04 in ref transcript
  • pfam VWA 176aa 4e-49 in ref transcript
    • von Willebrand factor type A domain.
  • pfam VWA 176aa 2e-48 in ref transcript
  • pfam Matrilin_ccoil 44aa 1e-11 in ref transcript
    • Trimeric coiled-coil oligomerisation domain of matrilin. This short domain is a coiled coil structure and has a single cysteine residue at the start which is likely to form a di-sulfide bridge with a corresponding cysteine in an upstream EGF (pfam00008) domain thereby spanning a VWA (pfam00092) domain. All three domains can be associated together as in the cartilage matrix protein matrilin, where this domain is likely to be responsible for oligomerisation.
  • smart EGF_CA 39aa 0.008 in ref transcript
    • Calcium-binding EGF-like domain.
  • Changed! cd vWA_Matrilin 222aa 6e-89 in modified transcript

MATN4

  • refseq_MATN4.F2 refseq_MATN4.R2 256 379
  • NCBIGene 36.3 8785
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003833

  • cd vWA_Matrilin 237aa 3e-88 in ref transcript
    • VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.
  • Changed! cd vWA_Matrilin 223aa 6e-86 in ref transcript
  • Changed! cd vWA_Matrilin 44aa 0.001 in ref transcript
  • pfam VWA 176aa 2e-47 in ref transcript
    • von Willebrand factor type A domain.
  • pfam VWA 172aa 2e-41 in ref transcript
  • pfam Matrilin_ccoil 44aa 2e-08 in ref transcript
    • Trimeric coiled-coil oligomerisation domain of matrilin. This short domain is a coiled coil structure and has a single cysteine residue at the start which is likely to form a di-sulfide bridge with a corresponding cysteine in an upstream EGF (pfam00008) domain thereby spanning a VWA (pfam00092) domain. All three domains can be associated together as in the cartilage matrix protein matrilin, where this domain is likely to be responsible for oligomerisation.
  • Changed! smart EGF_CA 39aa 5e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • Changed! cd vWA_Matrilin 223aa 3e-82 in modified transcript
  • Changed! smart EGF_CA 40aa 0.006 in modified transcript

MAX

  • refseq_MAX.F2 refseq_MAX.R2 125 226
  • NCBIGene 36.3 4149
  • Single exon skipping, size difference: 101
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002382

  • cd HLH 57aa 6e-07 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam HLH 52aa 5e-12 in ref transcript
    • Helix-loop-helix DNA-binding domain.

MAX

  • refseq_MAX.F3 refseq_MAX.R3 141 168
  • NCBIGene 36.3 4149
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002382

  • Changed! cd HLH 57aa 6e-07 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam HLH 52aa 5e-12 in ref transcript
    • Helix-loop-helix DNA-binding domain.
  • Changed! cd HLH 59aa 4e-07 in modified transcript

MBD1

  • refseq_MBD1.F1 refseq_MBD1.R1 123 261
  • NCBIGene 36.3 4152
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015846

  • cd MeCP2_MBD 62aa 1e-15 in ref transcript
    • MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
  • smart MBD 74aa 7e-20 in ref transcript
    • Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
  • pfam zf-CXXC 47aa 1e-11 in ref transcript
    • CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
  • pfam zf-CXXC 48aa 8e-08 in ref transcript
  • pfam zf-CXXC 46aa 1e-06 in ref transcript

MBD1

  • refseq_MBD1.F4 refseq_MBD1.R4 135 204
  • NCBIGene 36.3 4152
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015846

  • cd MeCP2_MBD 62aa 1e-15 in ref transcript
    • MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
  • smart MBD 74aa 7e-20 in ref transcript
    • Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
  • pfam zf-CXXC 47aa 1e-11 in ref transcript
    • CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
  • pfam zf-CXXC 48aa 8e-08 in ref transcript
  • pfam zf-CXXC 46aa 1e-06 in ref transcript

MBD1

  • refseq_MBD1.F5 refseq_MBD1.R5 163 235
  • NCBIGene 36.3 4152
  • Single exon skipping, size difference: 72
  • Exclusion of the stop codon
  • Reference transcript: NM_015846

  • cd MeCP2_MBD 62aa 1e-15 in ref transcript
    • MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
  • smart MBD 74aa 7e-20 in ref transcript
    • Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
  • pfam zf-CXXC 47aa 1e-11 in ref transcript
    • CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
  • pfam zf-CXXC 48aa 8e-08 in ref transcript
  • pfam zf-CXXC 46aa 1e-06 in ref transcript

MBD1

  • refseq_MBD1.F7 refseq_MBD1.R7 156 303
  • NCBIGene 36.3 4152
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015846

  • cd MeCP2_MBD 62aa 1e-15 in ref transcript
    • MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
  • smart MBD 74aa 7e-20 in ref transcript
    • Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
  • pfam zf-CXXC 47aa 1e-11 in ref transcript
    • CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
  • Changed! pfam zf-CXXC 48aa 8e-08 in ref transcript
  • Changed! pfam zf-CXXC 46aa 1e-06 in ref transcript
  • Changed! pfam zf-CXXC 46aa 9e-08 in modified transcript

MBD1

  • refseq_MBD1.F9 refseq_MBD1.R9 144 312
  • NCBIGene 36.3 4152
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015846

  • cd MeCP2_MBD 62aa 1e-15 in ref transcript
    • MeCP2, MBD1, MBD2, MBD3, and MBD4 are members of a protein family that share the methyl-CpG-binding domain (MBD). The MBD, consists of about 70 residues and is defined as the minimal region required for binding to methylated DNA by a methyl-CpG-binding protein which binds specifically to methylated DNA. The MBD can recognize a single symmetrically methylated CpG either as naked DNA or within chromatin. MeCP2, MBD1 and MBD2 (and likely MBD3) form complexes with histone deacetylase and are involved in histone deacetylase-dependent repression of transcription. MBD4 is an endonuclease that forms a complex with the DNA mismatch-repair protein MLH1.
  • smart MBD 74aa 7e-20 in ref transcript
    • Methyl-CpG binding domain. Methyl-CpG binding domain, also known as the TAM (TTF-IIP5, ARBP, MeCP1) domain.
  • Changed! pfam zf-CXXC 47aa 1e-11 in ref transcript
    • CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
  • pfam zf-CXXC 48aa 8e-08 in ref transcript
  • pfam zf-CXXC 46aa 1e-06 in ref transcript

MBNL1

  • refseq_MBNL1.F2 refseq_MBNL1.R2 153 189
  • NCBIGene 36.3 4154
  • Single exon skipping, size difference: 36
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_207293

  • smart ZnF_C3H1 25aa 0.004 in ref transcript
    • zinc finger.

MBNL1

  • refseq_MBNL1.F3 refseq_MBNL1.R3 298 352
  • NCBIGene 36.3 4154
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207293

  • smart ZnF_C3H1 25aa 0.004 in ref transcript
    • zinc finger.

MBNL1

  • refseq_MBNL1.F5 refseq_MBNL1.R5 108 312
  • NCBIGene 36.3 4154
  • Single exon skipping, size difference: 204
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207293

  • Changed! smart ZnF_C3H1 25aa 0.004 in ref transcript
    • zinc finger.
  • Changed! smart ZnF_C3H1 25aa 0.008 in modified transcript

MBNL1

  • refseq_MBNL1.F7 refseq_MBNL1.R7 159 254
  • NCBIGene 36.3 4154
  • Single exon skipping, size difference: 95
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_021038

  • smart ZnF_C3H1 25aa 0.005 in ref transcript
    • zinc finger.

MBNL1

  • refseq_MBNL1.F8 refseq_MBNL1.R8 115 179
  • NCBIGene 36.3 4154
  • Single exon skipping, size difference: 64
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_207293

  • smart ZnF_C3H1 25aa 0.004 in ref transcript
    • zinc finger.

MBNL2

  • refseq_MBNL2.F1 refseq_MBNL2.R1 205 336
  • NCBIGene 36.3 10150
  • Multiple exon skipping, size difference: 131
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_144778

MBNL3

  • refseq_MBNL3.F2 refseq_MBNL3.R2 156 230
  • NCBIGene 36.3 55796
  • Single exon skipping, size difference: 74
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_018388

  • smart ZnF_C3H1 27aa 0.005 in ref transcript
    • zinc finger.

MBNL3

  • refseq_MBNL3.F4 refseq_MBNL3.R4 269 374
  • NCBIGene 36.3 55796
  • Alternative 3-prime, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018388

  • smart ZnF_C3H1 27aa 0.005 in ref transcript
    • zinc finger.

MBP

  • refseq_MBP.F2 refseq_MBP.R2 134 212
  • NCBIGene 36.3 4155
  • Single exon skipping, size difference: 78
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001025101

  • Changed! pfam Myelin_MBP 154aa 1e-59 in ref transcript
    • Myelin basic protein.
  • Changed! pfam Myelin_MBP 180aa 7e-56 in modified transcript

MCHR2

  • refseq_MCHR2.F1 refseq_MCHR2.R1 219 458
  • NCBIGene 36.3 84539
  • Alternative 5-prime, size difference: 239
  • Exclusion in 5'UTR
  • Reference transcript: NM_001040179

  • pfam 7tm_1 250aa 7e-24 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

MCM8

  • refseq_MCM8.F2 refseq_MCM8.R2 269 323
  • NCBIGene 36.3 84515
  • Alternative 5-prime, size difference: 54
  • Inclusion in 5'UTR
  • Reference transcript: NM_032485

  • cd AAA 159aa 1e-04 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • smart MCM 543aa 1e-121 in ref transcript
    • minichromosome maintenance proteins.
  • COG MCM2 699aa 1e-120 in ref transcript
    • Predicted ATPase involved in replication control, Cdc46/Mcm family [DNA replication, recombination, and repair].

MCM8

  • refseq_MCM8.F3 refseq_MCM8.R3 105 153
  • NCBIGene 36.3 84515
  • Alternative 3-prime, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032485

  • cd AAA 159aa 1e-04 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! smart MCM 543aa 1e-121 in ref transcript
    • minichromosome maintenance proteins.
  • Changed! COG MCM2 699aa 1e-120 in ref transcript
    • Predicted ATPase involved in replication control, Cdc46/Mcm family [DNA replication, recombination, and repair].
  • Changed! smart MCM 527aa 1e-116 in modified transcript
  • Changed! COG MCM2 683aa 1e-116 in modified transcript

MECR

  • refseq_MECR.F1 refseq_MECR.R1 233 360
  • NCBIGene 36.3 51102
  • Alternative 3-prime, size difference: 127
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016011

  • Changed! cd CBS_like 75aa 2e-05 in ref transcript
    • CBS_like: This subgroup includes Cystathionine beta-synthase (CBS) and Cysteine synthase. CBS is a unique heme-containing enzyme that catalyzes a pyridoxal 5'-phosphate (PLP)-dependent condensation of serine and homocysteine to give cystathionine. Deficiency of CBS leads to homocystinuria, an inherited disease of sulfur metabolism characterized by increased levels of the toxic metabolite homocysteine. Cysteine synthase on the other hand catalyzes the last step of cysteine biosynthesis. This subgroup also includes an O-Phosphoserine sulfhydrylase found in hyperthermophilic archaea which produces L-cysteine from sulfide and the more thermostable O-phospho-L-serine.
  • Changed! smart PKS_ER 285aa 3e-26 in ref transcript
    • Enoylreductase. Enoylreductase in Polyketide synthases.
  • Changed! COG Qor 330aa 6e-50 in ref transcript
    • NADPH:quinone reductase and related Zn-dependent oxidoreductases [Energy production and conversion / General function prediction only].

MED8

  • refseq_MED8.F1 refseq_MED8.R1 106 167
  • NCBIGene 36.2 112950
  • Alternative 3-prime, size difference: 61
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001001651

  • Changed! pfam Arc32 178aa 3e-55 in ref transcript
    • Mediator of RNA polymerase II transcription. Arc32, or Med8, is one of the subunits of the Mediator complex of RNA polymerase II. The region conserved contains two alpha helices putatively necessary for binding to other subunits within the core of the Mediator complex.

MED8

  • refseq_MED8.F3 refseq_MED8.R3 102 120
  • NCBIGene 36.2 112950
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001651

  • pfam Arc32 178aa 3e-55 in ref transcript
    • Mediator of RNA polymerase II transcription. Arc32, or Med8, is one of the subunits of the Mediator complex of RNA polymerase II. The region conserved contains two alpha helices putatively necessary for binding to other subunits within the core of the Mediator complex.

MEIS2

  • refseq_MEIS2.F2 refseq_MEIS2.R2 101 122
  • NCBIGene 36.3 4212
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170675

  • cd homeodomain 54aa 2e-10 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • smart HOX 54aa 2e-11 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.

MEIS2

  • refseq_MEIS2.F4 refseq_MEIS2.R4 299 395
  • NCBIGene 36.3 4212
  • Single exon skipping, size difference: 96
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_170675

  • cd homeodomain 54aa 2e-10 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • smart HOX 54aa 2e-11 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.

MEIS2

  • refseq_MEIS2.F5 refseq_MEIS2.R5 130 153
  • NCBIGene 36.3 4212
  • Alternative 3-prime, size difference: 23
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_170675

  • Changed! cd homeodomain 54aa 2e-10 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • Changed! smart HOX 54aa 2e-11 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.

MEIS2

  • refseq_MEIS2.F7 refseq_MEIS2.R7 174 251
  • NCBIGene 36.2 4212
  • Single exon skipping, size difference: 77
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_170675

  • Changed! cd homeodomain 54aa 2e-10 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • Changed! smart HOX 54aa 2e-11 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.

MEIS3

  • refseq_MEIS3.F2 refseq_MEIS3.R2 169 307
  • NCBIGene 36.3 56917
  • Alternative 3-prime, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020160

  • Changed! cd homeodomain 37aa 2e-08 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • Changed! smart HOX 37aa 9e-09 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
  • Changed! cd homeodomain 56aa 1e-09 in modified transcript
  • Changed! smart HOX 56aa 8e-11 in modified transcript

MEIS3

  • refseq_MEIS3.F4 refseq_MEIS3.R4 141 192
  • NCBIGene 36.3 56917
  • Alternative 3-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020160

  • cd homeodomain 37aa 2e-08 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • smart HOX 37aa 9e-09 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.

MEN1

  • refseq_MEN1.F1 refseq_MEN1.R1 109 355
  • NCBIGene 36.3 4221
  • Alternative 5-prime, size difference: 246
  • Exclusion in 5'UTR
  • Reference transcript: NM_130802

  • pfam Menin 615aa 0.0 in ref transcript
    • Menin. MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23.

MEOX1

  • refseq_MEOX1.F1 refseq_MEOX1.R1 227 400
  • NCBIGene 36.3 4222
  • Single exon skipping, size difference: 173
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004527

  • Changed! cd homeodomain 59aa 5e-18 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • Changed! pfam Homeobox 57aa 8e-24 in ref transcript
    • Homeobox domain.
  • Changed! COG COG5576 71aa 1e-08 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

METT11D1

  • refseq_METT11D1.F2 refseq_METT11D1.R2 108 140
  • NCBIGene 36.2 64745
  • Alternative 3-prime, size difference: 32
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001029991

  • Changed! pfam Rsm22 267aa 5e-48 in ref transcript
    • Mitochondrial small ribosomal subunit Rsm22. Rsm22 has been identified as a mitochondrial small ribosomal subunit and is a methyltransferase. In Schizosaccharomyces pombe, Rsm22 is tandemly fused to Cox11 (a factor required for copper insertion into cytochrome oxidase) and the two proteins are proteolytically cleaved after import into the mitochondria.
  • Changed! COG COG5459 263aa 1e-26 in ref transcript
    • Predicted rRNA methylase [Translation, ribosomal structure and biogenesis].

METTL1

  • refseq_METTL1.F1 refseq_METTL1.R1 163 348
  • NCBIGene 36.3 4234
  • Single exon skipping, size difference: 185
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005371

  • Changed! pfam Methyltransf_4 200aa 2e-62 in ref transcript
    • Putative methyltransferase. This is a family of putative methyltransferases. The aligned region contains the GXGXG S-AdoMet binding site suggesting a putative methyltransferase activity.
  • Changed! COG COG0220 243aa 3e-39 in ref transcript
    • Predicted S-adenosylmethionine-dependent methyltransferase [General function prediction only].
  • Changed! COG COG0220 78aa 0.002 in modified transcript

MGA

  • refseq_MGA.F1 refseq_MGA.R1 100 247
  • NCBIGene 36.2 23269
  • Alternative 3-prime, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_932720

  • cd TBOX 191aa 9e-87 in ref transcript
    • T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
  • pfam T-box 186aa 2e-80 in ref transcript
    • T-box. The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.

C16orf13

  • refseq_MGC13114.F1 refseq_MGC13114.R1 237 400
  • NCBIGene 36.3 84326
  • Single exon skipping, size difference: 163
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001040160

  • Changed! pfam DUF938 137aa 6e-48 in ref transcript
    • Protein of unknown function (DUF938). This family consists of several hypothetical proteins from both prokaryotes and eukaryotes. The function of this family is unknown.
  • Changed! pfam DUF938 62aa 8e-21 in modified transcript
  • Changed! pfam DUF938 40aa 2e-12 in modified transcript

C17orf91

  • refseq_MGC14376.F1 refseq_MGC14376.R1 267 350
  • NCBIGene 36.3 84981
  • Single exon skipping, size difference: 83
  • Exclusion in 5'UTR
  • Reference transcript: NM_032895

MGC15875

  • refseq_MGC15875.F2 refseq_MGC15875.R2 130 218
  • NCBIGene 36.2 85007
  • Alternative 3-prime, size difference: 88
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_153373

  • Changed! cd OAT_like 404aa 1e-105 in ref transcript
    • Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation.
  • Changed! pfam Aminotran_3 340aa 8e-65 in ref transcript
    • Aminotransferase class-III.
  • Changed! PRK PRK06148 429aa 1e-153 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! cd OAT_like 205aa 1e-46 in modified transcript
  • Changed! pfam Aminotran_3 201aa 2e-30 in modified transcript
  • Changed! PRK PRK06148 227aa 2e-80 in modified transcript

MGC15875

  • refseq_MGC15875.F3 refseq_MGC15875.R3 143 378
  • NCBIGene 36.2 85007
  • Multiple exon skipping, size difference: 235
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153373

  • Changed! cd OAT_like 404aa 1e-105 in ref transcript
    • Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation.
  • Changed! pfam Aminotran_3 340aa 8e-65 in ref transcript
    • Aminotransferase class-III.
  • Changed! PRK PRK06148 429aa 1e-153 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! cd OAT_like 31aa 5e-06 in modified transcript
  • Changed! TIGR argD 28aa 6e-05 in modified transcript
    • Members of this family may also act on ornithine, like ornithine aminotransferase (EC 2.6.1.13) and on succinyldiaminopimelate, like N-succinyldiaminopmelate-aminotransferase (EC 2.6.1.17, DapC, an enzyme of lysine biosynthesis).
  • Changed! PRK PRK06148 55aa 4e-09 in modified transcript

MGC15875

  • refseq_MGC15875.F5 refseq_MGC15875.R5 106 197
  • NCBIGene 36.2 85007
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 91
  • Inclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153373

  • Changed! cd OAT_like 404aa 1e-105 in ref transcript
    • Acetyl ornithine aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major groups in this CD correspond to ornithine aminotransferase, acetylornithine aminotransferase, alanine-glyoxylate aminotransferase, dialkylglycine decarboxylase, 4-aminobutyrate aminotransferase, beta-alanine-pyruvate aminotransferase, adenosylmethionine-8-amino-7-oxononanoate aminotransferase, and glutamate-1-semialdehyde 2,1-aminomutase. All the enzymes belonging to this family act on basic amino acids and their derivatives are involved in transamination or decarboxylation.
  • Changed! pfam Aminotran_3 340aa 8e-65 in ref transcript
    • Aminotransferase class-III.
  • Changed! PRK PRK06148 429aa 1e-153 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! cd OAT_like 126aa 7e-35 in modified transcript
  • Changed! pfam Aminotran_3 128aa 9e-22 in modified transcript
  • Changed! PRK PRK06148 163aa 7e-60 in modified transcript

MGC19604

  • refseq_MGC19604.F1 refseq_MGC19604.R1 181 218
  • NCBIGene 36.2 112812
  • Alternative 3-prime, size difference: 37
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031734

  • Changed! cd fer2 90aa 7e-08 in ref transcript
    • 2Fe-2S iron-sulfur cluster binding domain. Iron-sulfur proteins play an important role in electron transfer processes and in various enzymatic reactions. The family includes plant and algal ferredoxins, which act as electron carriers in photosynthesis and ferredoxins, which participate in redox chains (from bacteria to mammals). Fold is ismilar to thioredoxin.
  • Changed! TIGR fdx_isc 86aa 2e-16 in ref transcript
    • This family consists of proteobacterial ferredoxins associated with and essential to the ISC system of 2Fe-2S cluster assembly. This family is closely related to (but excludes) eukaryotic (mitochondrial) adrenodoxins, which are ferredoxins involved in electron transfer to P450 cytochromes.
  • Changed! COG Fdx 102aa 2e-15 in ref transcript
    • Ferredoxin [Energy production and conversion].
  • Changed! cd fer2 68aa 4e-05 in modified transcript
  • Changed! TIGR fdx_isc 46aa 1e-07 in modified transcript
  • Changed! COG Fdx 71aa 7e-07 in modified transcript

PRELID2

  • refseq_MGC21644.F1 refseq_MGC21644.R1 140 343
  • NCBIGene 36.3 153768
  • Single exon skipping, size difference: 203
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_182960

  • Changed! pfam PRELI 170aa 4e-39 in ref transcript
    • PRELI-like family. This family includes a conserved region found in the PRELI protein and yeast YLR168C gene MSF1 product. The function of this protein is unknown, though it is thought to be involved in intra-mitochondrial protein sorting. This region is also found in a number of other eukaryotic proteins.

PRELID2

  • refseq_MGC21644.F2 refseq_MGC21644.R2 100 136
  • NCBIGene 36.3 153768
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182960

  • Changed! pfam PRELI 170aa 4e-39 in ref transcript
    • PRELI-like family. This family includes a conserved region found in the PRELI protein and yeast YLR168C gene MSF1 product. The function of this protein is unknown, though it is thought to be involved in intra-mitochondrial protein sorting. This region is also found in a number of other eukaryotic proteins.
  • Changed! pfam PRELI 158aa 1e-40 in modified transcript

MGC3207

  • refseq_MGC3207.F1 refseq_MGC3207.R1 247 388
  • NCBIGene 36.3 84245
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 141
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031727

  • Changed! TIGR salvage_mtnA 343aa 1e-122 in ref transcript
    • The delineation of this family was based in part on a discussion and neighbor-joining phylogenetic study, by Kyrpides and Woese, of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. This clade is now recognized to include the methionine salvage pathway enzyme MtnA.
  • Changed! PRK mtnA 351aa 2e-84 in ref transcript
    • methylthioribose-1-phosphate isomerase; Reviewed.
  • Changed! TIGR salvage_mtnA 146aa 1e-67 in modified transcript
  • Changed! TIGR salvage_mtnA 93aa 5e-27 in modified transcript
  • Changed! PRK mtnA 154aa 8e-58 in modified transcript
  • Changed! COG COG0182 96aa 9e-13 in modified transcript
    • Predicted translation initiation factor 2B subunit, eIF-2B alpha/beta/delta family [Translation, ribosomal structure and biogenesis].

FAM133B

  • refseq_MGC40405.F1 refseq_MGC40405.R1 160 259
  • NCBIGene 36.3 257415
  • Single exon skipping, size difference: 99
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_152789

MID1

  • refseq_MID1.F2 refseq_MID1.R2 213 313
  • NCBIGene 36.2 4281
  • Alternative 5-prime and 3-prime, size difference: 100
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000381

  • cd FN3 100aa 9e-07 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd RING 55aa 2e-04 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 39aa 4e-04 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • Changed! smart SPRY 119aa 2e-25 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart BBC 126aa 1e-13 in ref transcript
    • B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
  • smart FN3 90aa 4e-08 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart BBOX 42aa 3e-06 in ref transcript
    • B-Box-type zinc finger.
  • smart RING 50aa 3e-05 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • smart PRY 38aa 0.002 in ref transcript
    • associated with SPRY domains.

MID2

  • refseq_MID2.F1 refseq_MID2.R1 205 295
  • NCBIGene 36.3 11043
  • Alternative 5-prime, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012216

  • cd RING 55aa 2e-04 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 38aa 5e-04 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • Changed! cd FN3 47aa 0.001 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • smart BBC 126aa 8e-29 in ref transcript
    • B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
  • smart SPRY 109aa 4e-24 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • pfam zf-B_box 42aa 2e-05 in ref transcript
    • B-box zinc finger.
  • smart RING 50aa 2e-05 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! smart FN3 120aa 1e-04 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • PRK PRK00409 124aa 1e-04 in ref transcript
    • recombination and DNA strand exchange inhibitor protein; Reviewed.
  • Changed! cd FN3 100aa 1e-06 in modified transcript
  • Changed! smart FN3 90aa 4e-08 in modified transcript

MINA

  • refseq_MINA.F1 refseq_MINA.R1 264 368
  • NCBIGene 36.2 84864
  • Alternative 3-prime, size difference: 3
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_032778

  • Changed! pfam Cupin_4 312aa 5e-89 in ref transcript
    • Cupin superfamily protein. This family contains many hypothetical proteins that belong to the cupin superfamily.
  • COG COG2850 202aa 2e-10 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam Cupin_4 313aa 3e-88 in modified transcript

MINA

  • refseq_MINA.F1 refseq_MINA.R3 124 225
  • NCBIGene 36.2 84864
  • Single exon skipping, size difference: 101
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_032778

  • Changed! pfam Cupin_4 312aa 5e-89 in ref transcript
    • Cupin superfamily protein. This family contains many hypothetical proteins that belong to the cupin superfamily.
  • COG COG2850 202aa 2e-10 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam Cupin_4 227aa 2e-71 in modified transcript

MINK1

  • refseq_MINK1.F1 refseq_MINK1.R1 131 155
  • NCBIGene 36.3 50488
  • Single exon skipping, size difference: 24
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_153827

  • cd STKc_MAP4K4_6 192aa 1e-114 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
  • smart CNH 299aa 9e-83 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 188aa 2e-54 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 205aa 2e-24 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG ROM1 286aa 3e-08 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

MINK1

  • refseq_MINK1.F3 refseq_MINK1.R3 136 247
  • NCBIGene 36.3 50488
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153827

  • cd STKc_MAP4K4_6 192aa 1e-114 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
  • smart CNH 299aa 9e-83 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 188aa 2e-54 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 205aa 2e-24 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG ROM1 286aa 3e-08 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].
  • Changed! PRK PRK07764 140aa 2e-04 in modified transcript
    • DNA polymerase III subunits gamma and tau; Validated.

MINK1

  • refseq_MINK1.F5 refseq_MINK1.R5 246 306
  • NCBIGene 36.3 50488
  • Alternative 3-prime, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153827

  • cd STKc_MAP4K4_6 192aa 1e-114 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 4 (MAPKKKK4 or MAP4K4) and MAPKKKK6 (or MAP4K6) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K4/MAP4K6 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Members of this subfamily contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain. MAP4Ks (or MAPKKKKs) are involved in MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K4 is also called Nck Interacting kinase (NIK). It facilitates the activation of the MAPKs, extracellular signal-regulated kinase (ERK) 1, ERK2, and c-Jun N-terminal kinase (JNK), by phosphorylating and activating MEKK1. MAP4K4 plays a role in tumor necrosis factor (TNF) alpha-induced insulin resistance. MAP4K4 silencing in skeletal muscle cells from type II diabetic patients restores insulin-mediated glucose uptake. MAP4K4, through JNK, also plays a broad role in cell motility, which impacts inflammation, homeostasis, as well as the invasion and spread of cancer. MAP4K4 is found to be highly expressed in most tumor cell lines relative to normal tissue. MAP4K6 (also called MINK for Misshapen/NIKs-related kinase) is activated after Ras induction and mediates activation of p38 MAPK. MAP4K6 plays a role in cell cycle arrest, cytoskeleton organization, cell adhesion, and cell motility.
  • smart CNH 299aa 9e-83 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 188aa 2e-54 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 205aa 2e-24 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG ROM1 286aa 3e-08 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

MKNK1

  • refseq_MKNK1.F2 refseq_MKNK1.R2 215 259
  • NCBIGene 36.3 8569
  • Single exon skipping, size difference: 44
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003684

  • cd S_TKc 322aa 3e-62 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 316aa 2e-66 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 323aa 1e-33 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! COG SPS1 336aa 3e-34 in modified transcript

MKNK1

  • refseq_MKNK1.F4 refseq_MKNK1.R4 132 255
  • NCBIGene 36.3 8569
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003684

  • Changed! cd S_TKc 322aa 3e-62 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 316aa 2e-66 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 323aa 1e-33 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd S_TKc 281aa 3e-67 in modified transcript
  • Changed! smart S_TKc 275aa 2e-71 in modified transcript
  • Changed! COG SPS1 282aa 4e-39 in modified transcript

MLH3

  • refseq_MLH3.F1 refseq_MLH3.R1 153 225
  • NCBIGene 36.3 27030
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040108

  • cd MutL_Trans_MLH3 144aa 4e-53 in ref transcript
    • MutL_Trans_MLH3: transducer domain, having a ribosomal S5 domain 2-like fold, found in proteins similar to yeast and human MLH3 (MutL homologue 3). MLH3 belongs to the DNA mismatch repair (MutL/MLH1/PMS2) family. This transducer domain is homologous to the second domain of the DNA gyrase B subunit, which is known to be important in nucleotide hydrolysis and the transduction of structural signals from ATP-binding site to the DNA breakage/reunion regions of the enzymes. MLH1 forms heterodimers with MLH3. The MLH1-MLH3 complex plays a role in meiosis. A role for hMLH1-hMLH3 in DNA mismatch repair (MMR) has not been established. It has been suggested that hMLH3 may be a low risk gene for colorectal cancer; however there is little evidence to support it having a role in classical HNPCC.
  • TIGR mutl 269aa 1e-44 in ref transcript
    • All proteins in this family for which the functions are known are involved in the process of generalized mismatch repair. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! smart MutL_C 174aa 4e-21 in ref transcript
    • MutL C terminal dimerisation domain. MutL and MutS are key components of the DNA repair machinery that corrects replication errors. MutS recognises mispaired or unpaired bases in a DNA duplex and in the presence of ATP, recruits MutL to form a DNA signaling complex for repair. The N terminal region of MutL contains the ATPase domain and the C terminal is involved in dimerisation.
  • pfam DNA_mis_repair 134aa 1e-05 in ref transcript
    • DNA mismatch repair protein, C-terminal domain. This family represents the C-terminal domain of the mutL/hexB/PMS1 family. This domain has a ribosomal S5 domain 2-like fold.
  • COG MutL 437aa 4e-45 in ref transcript
    • DNA mismatch repair enzyme (predicted ATPase) [DNA replication, recombination, and repair].
  • Changed! PRK mutL 221aa 3e-22 in ref transcript
    • DNA mismatch repair protein; Reviewed.
  • Changed! smart MutL_C 150aa 6e-11 in modified transcript
  • Changed! COG MutL 152aa 6e-13 in modified transcript

MLL3

  • refseq_MLL3.F1 refseq_MLL3.R1 229 400
  • NCBIGene 36.2 58508
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 171
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_170606

  • pfam SET 129aa 5e-28 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • smart FYRC 88aa 2e-19 in ref transcript
    • "FY-rich" domain, C-terminal region. is sometimes closely juxtaposed with the N-terminal region (FYRN), but sometimes is far distant. Unknown function, but occurs frequently in chromatin-associated proteins.
  • pfam FYRN 55aa 3e-16 in ref transcript
    • F/Y-rich N-terminus. This region is normally found in the trithorax/ALL1 family proteins. It is similar to SMART:SM00541.
  • pfam PHD 47aa 9e-09 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • pfam PHD 49aa 5e-08 in ref transcript
  • pfam PAT1 178aa 7e-08 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
  • pfam PHD 51aa 1e-07 in ref transcript
  • pfam PHD 48aa 3e-06 in ref transcript
  • pfam PHD 52aa 1e-05 in ref transcript
  • smart HMG 52aa 4e-04 in ref transcript
    • high mobility group.
  • pfam PHD 52aa 0.001 in ref transcript
  • pfam ATG13 135aa 0.009 in ref transcript
    • Autophagy-related protein 13. Members of this family of phosphoproteins are involved in cytoplasm to vacuole transport (Cvt), and more specifically in Cvt vesicle formation. They are probably involved in the switching machinery regulating the conversion between the Cvt pathway and autophagy. Finally, ATG13 is also required for glycogen storage.
  • COG COG2940 187aa 2e-18 in ref transcript
    • Proteins containing SET domain [General function prediction only].
  • COG COG5141 115aa 0.001 in ref transcript
    • PHD zinc finger-containing protein [General function prediction only].
  • COG COG5141 137aa 0.003 in ref transcript
  • Changed! PRK PRK07003 331aa 0.010 in modified transcript
    • DNA polymerase III subunits gamma and tau; Validated.

MLL5

  • refseq_MLL5.F2 refseq_MLL5.R2 249 397
  • NCBIGene 36.3 55904
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 148
  • Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_182931

  • pfam SET 128aa 5e-25 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • pfam PHD 46aa 2e-10 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • TIGR PABP-1234 119aa 0.003 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • pfam PAT1 113aa 0.003 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
  • COG COG2940 298aa 1e-04 in ref transcript
    • Proteins containing SET domain [General function prediction only].
  • PRK PRK13729 87aa 0.007 in ref transcript
    • conjugal transfer pilus assembly protein TraB; Provisional.

MLLT10

  • refseq_MLLT10.F1 refseq_MLLT10.R1 157 205
  • NCBIGene 36.3 8028
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004641

  • pfam PHD 47aa 1e-06 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • smart PHD 59aa 0.005 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the R ING finger and the FY VE finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent B ROMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were R ING fingers misidentified as PHD fingers.
  • COG COG5141 179aa 1e-37 in ref transcript
    • PHD zinc finger-containing protein [General function prediction only].

MLLT4

  • refseq_MLLT4.F1 refseq_MLLT4.R1 158 203
  • NCBIGene 36.3 4301
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040001

  • cd AF6_RA_repeat1 112aa 3e-53 in ref transcript
    • The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
  • cd AF6_RA_repeat2 100aa 2e-42 in ref transcript
    • The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
  • cd PDZ_signaling 84aa 3e-12 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • Changed! cd FHA 90aa 2e-05 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • pfam DIL 107aa 5e-26 in ref transcript
    • DIL domain. The DIL domain has no known function.
  • smart RA 94aa 7e-17 in ref transcript
    • Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
  • smart PDZ 88aa 4e-14 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart RA 100aa 2e-13 in ref transcript
  • smart FHA 52aa 4e-08 in ref transcript
    • Forkhead associated domain. Found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain.
  • COG Prc 133aa 0.001 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • Changed! cd FHA 101aa 1e-05 in modified transcript

MLX

  • refseq_MLX.F2 refseq_MLX.R2 115 277
  • NCBIGene 36.3 6945
  • Alternative 5-prime, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170607

  • cd HLH 66aa 2e-10 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam HLH 59aa 1e-12 in ref transcript
    • Helix-loop-helix DNA-binding domain.

MLX

  • refseq_MLX.F3 refseq_MLX.R3 125 215
  • NCBIGene 36.3 6945
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170607

  • cd HLH 66aa 2e-10 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam HLH 59aa 1e-12 in ref transcript
    • Helix-loop-helix DNA-binding domain.

MLXIPL

  • refseq_MLXIPL.F1 refseq_MLXIPL.R1 147 204
  • NCBIGene 36.3 51085
  • Alternative 3-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032951

  • Changed! cd HLH 61aa 3e-09 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • Changed! pfam HLH 55aa 8e-09 in ref transcript
    • Helix-loop-helix DNA-binding domain.

MMP19

  • refseq_MMP19.F1 refseq_MMP19.R1 101 448
  • NCBIGene 36.2 4327
  • Multiple exon skipping, size difference: 347
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002429

  • Changed! cd ZnMc_MMP 154aa 4e-53 in ref transcript
    • Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
  • Changed! cd HX 186aa 7e-48 in ref transcript
    • Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
  • Changed! pfam Peptidase_M10 154aa 3e-57 in ref transcript
    • Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis.
  • Changed! smart HX 39aa 2e-07 in ref transcript
    • Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs).
  • Changed! smart HX 44aa 2e-06 in ref transcript
  • Changed! pfam PG_binding_1 50aa 6e-05 in ref transcript
    • Putative peptidoglycan binding domain. This domain is composed of three alpha helices. This domain is found at the N or C terminus of a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function. This family is found N-terminal to the catalytic domain of matrixins.
  • Changed! smart HX 43aa 4e-04 in ref transcript
  • Changed! smart HX 47aa 0.010 in ref transcript

MMP23B

  • refseq_MMP23B.F1 refseq_MMP23B.R1 112 252
  • NCBIGene 36.2 8510
  • Single exon skipping, size difference: 140
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006983

  • Changed! cd ZnMc_MMP 168aa 2e-48 in ref transcript
    • Zinc-dependent metalloprotease, matrix metalloproteinase (MMP) sub-family. MMPs are responsible for a great deal of pericellular proteolysis of extracellular matrix and cell surface molecules, playing crucial roles in morphogenesis, cell fate specification, cell migration, tissue repair, tumorigenesis, gain or loss of tissue-specific functions, and apoptosis. In many instances, they are anchored to cell membranes via trans-membrane domains, and their activity is controlled via TIMPs (tissue inhibitors of metalloproteinases).
  • Changed! cd IGcam 61aa 0.004 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! pfam Peptidase_M10 168aa 7e-40 in ref transcript
    • Matrixin. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis.
  • Changed! smart IGc2 61aa 3e-04 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! smart ShKT 35aa 0.004 in ref transcript
    • ShK toxin domain. ShK toxin domain.

MOG

  • refseq_MOG.F1 refseq_MOG.R1 120 370
  • NCBIGene 36.3 4340
  • Alternative 3-prime, size difference: 250
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002433

  • smart IGv 81aa 2e-12 in ref transcript
    • Immunoglobulin V-Type.

MOG

  • refseq_MOG.F5 refseq_MOG.R5 275 413
  • NCBIGene 36.2 4340
  • Single exon skipping, size difference: 138
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002433

  • smart IGv 81aa 2e-12 in ref transcript
    • Immunoglobulin V-Type.

MORF4L1

  • refseq_MORF4L1.F1 refseq_MORF4L1.R1 143 260
  • NCBIGene 36.3 10933
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206839

  • Changed! pfam MRG 261aa 1e-81 in ref transcript
    • MRG. This family consists of three different eukaryotic proteins (mortality factor 4 (MORF4/MRG15), male-specific lethal 3(MSL-3) and ESA1-associated factor 3(EAF3)). It is thought that the MRG family is involved in transcriptional regulation via histone acetylation. It contains 2 chromo domains and a leucine zipper motif.
  • Changed! cd CHROMO 49aa 0.003 in modified transcript
    • Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
  • Changed! pfam MRG 261aa 1e-81 in modified transcript

MOSPD3

  • refseq_MOSPD3.F1 refseq_MOSPD3.R1 128 158
  • NCBIGene 36.3 64598
  • Alternative 3-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040097

  • Changed! pfam Motile_Sperm 64aa 1e-10 in ref transcript
    • MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins.
  • Changed! pfam Motile_Sperm 54aa 9e-08 in modified transcript

MPG

  • refseq_MPG.F2 refseq_MPG.R2 111 304
  • NCBIGene 36.3 4350
  • Single exon skipping, size difference: 193
  • Exclusion of the protein initiation site
  • Reference transcript: NM_002434

  • cd AAG 190aa 3e-57 in ref transcript
    • Alkyladenine DNA glycosylase (AAG), also known as 3-methyladenine DNA glycosylase, catalyzes the first step in base excision repair (BER) by cleaving damaged DNA bases within double-stranded DNA to produce an abasic site. AAG bends DNA by intercalating between the base pairs, causing the damaged base to flip out of the double helix and into the enzyme active site for cleavage. Although AAG represents one of six DNA glycosylase classes, it lacks the helix-hairpin-helix active site motif associated with the other BER glycosylases and is structurally quite distinct from them.
  • TIGR 3mg 201aa 1e-77 in ref transcript
    • This families are based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). All proteins in this family for which the function is known are involved in the base excision repair of alkylation damage to DNA. The exact specificty of the type of alkylation damage repaired by each of these varies somewhat between species. Substrates include 3-methyl adenine, 7-methyl-guanaine, and 3-methyl-guanine.
  • PRK PRK00802 204aa 4e-53 in ref transcript
    • 3-methyladenine DNA glycosylase; Reviewed.

MPST

  • refseq_MPST.F1 refseq_MPST.R1 237 413
  • NCBIGene 36.3 4357
  • Single exon skipping, size difference: 176
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_021126

  • Changed! cd TST_Repeat_2 116aa 5e-38 in ref transcript
    • Thiosulfate sulfurtransferase (TST), C-terminal, catalytic domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the second repeat. Only the second repeat contains the catalytically active Cys residue.
  • Changed! cd TST_Repeat_1 130aa 9e-37 in ref transcript
    • Thiosulfate sulfurtransferase (TST), N-terminal, inactive domain. TST contains 2 copies of the Rhodanese Homology Domain; this is the 1st repeat, which does not contain the catalytically active Cys residue. The role of the 1st repeat is uncertain, but it is believed to be involved in protein interaction.
  • Changed! pfam Rhodanese 118aa 3e-19 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • Changed! pfam Rhodanese 123aa 2e-16 in ref transcript
  • Changed! COG SseA 284aa 4e-70 in ref transcript
    • Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism].

MPZL1

  • refseq_MPZL1.F1 refseq_MPZL1.R1 146 249
  • NCBIGene 36.3 9019
  • Single exon skipping, size difference: 103
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003953

  • pfam V-set 114aa 6e-12 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

MRE11A

  • refseq_MRE11A.F1 refseq_MRE11A.R1 186 293
  • NCBIGene 36.3 4361
  • Alternative 5-prime, size difference: 107
  • Inclusion in 5'UTR
  • Reference transcript: NM_005591

  • TIGR mre11 393aa 1e-140 in ref transcript
    • All proteins in this family for which functions are known are subunits of a nuclease complex made up of multiple proteins including MRE11 and RAD50 homologs. The functions of this nuclease complex include recombinational repair and non-homolgous end joining. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). The proteins in this family are distantly related to proteins in the SbcCD complex of bacteria.
  • COG SbcD 309aa 2e-20 in ref transcript
    • DNA repair exonuclease [DNA replication, recombination, and repair].

MRPL10

  • refseq_MRPL10.F2 refseq_MRPL10.R2 167 265
  • NCBIGene 36.3 124995
  • Single exon skipping, size difference: 98
  • Exclusion of the protein initiation site
  • Reference transcript: NM_148887

  • cd Ribosomal_L10 132aa 9e-22 in ref transcript
    • Ribosomal protein L10 family, L10 subfamily; composed of bacterial 50S ribosomal protein and eukaryotic mitochondrial 39S ribosomal protein, L10. L10 occupies the L7/L12 stalk of the ribosome. The N-terminal domain (NTD) of L10 interacts with L11 protein and forms the base of the L7/L12 stalk, while the extended C-terminal helix binds to two or three dimers of the NTD of L7/L12 (L7 and L12 are identical except for an acetylated N-terminus). The L7/L12 stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-interacting protein (WIP). These bacteria and eukaryotic sequences have no additional C-terminal domain, present in other eukaryotic and archaeal orthologs.
  • COG RplJ 128aa 3e-07 in ref transcript
    • Ribosomal protein L10 [Translation, ribosomal structure and biogenesis].

MRPL11

  • refseq_MRPL11.F1 refseq_MRPL11.R1 148 226
  • NCBIGene 36.3 65003
  • Alternative 5-prime, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016050

  • Changed! cd Ribosomal_L11 136aa 1e-36 in ref transcript
    • Ribosomal protein L11. Ribosomal protein L11, together with proteins L10 and L7/L12, and 23S rRNA, form the L7/L12 stalk on the surface of the large subunit of the ribosome. The homologous eukaryotic cytoplasmic protein is also called 60S ribosomal protein L12, which is distinct from the L12 involved in the formation of the L7/L12 stalk. The C-terminal domain (CTD) of L11 is essential for binding 23S rRNA, while the N-terminal domain (NTD) contains the binding site for the antibiotics thiostrepton and micrococcin. L11 and 23S rRNA form an essential part of the GTPase-associated region (GAR). Based on differences in the relative positions of the L11 NTD and CTD during the translational cycle, L11 is proposed to play a significant role in the binding of initiation factors, elongation factors, and release factors to the ribosome. Several factors, including the class I release factors RF1 and RF2, are known to interact directly with L11. In eukaryotes, L11 has been implicated in regulating the levels of ubiquinated p53 and MDM2 in the MDM2-p53 feedback loop, which is responsible for apoptosis in response to DNA damage. In bacteria, the "stringent response" to harsh conditions allows bacteria to survive, and ribosomes that lack L11 are deficient in stringent factor stimulation.
  • Changed! smart RL11 137aa 8e-36 in ref transcript
    • Ribosomal protein L11/L12.
  • Changed! PRK rplK 143aa 1e-27 in ref transcript
    • 50S ribosomal protein L11; Validated.
  • Changed! cd Ribosomal_L11 114aa 8e-36 in modified transcript
  • Changed! smart RL11 115aa 5e-35 in modified transcript
  • Changed! PRK rplK 116aa 1e-26 in modified transcript

MRPL11

  • refseq_MRPL11.F3 refseq_MRPL11.R3 193 287
  • NCBIGene 36.3 65003
  • Single exon skipping, size difference: 94
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016050

  • cd Ribosomal_L11 136aa 1e-36 in ref transcript
    • Ribosomal protein L11. Ribosomal protein L11, together with proteins L10 and L7/L12, and 23S rRNA, form the L7/L12 stalk on the surface of the large subunit of the ribosome. The homologous eukaryotic cytoplasmic protein is also called 60S ribosomal protein L12, which is distinct from the L12 involved in the formation of the L7/L12 stalk. The C-terminal domain (CTD) of L11 is essential for binding 23S rRNA, while the N-terminal domain (NTD) contains the binding site for the antibiotics thiostrepton and micrococcin. L11 and 23S rRNA form an essential part of the GTPase-associated region (GAR). Based on differences in the relative positions of the L11 NTD and CTD during the translational cycle, L11 is proposed to play a significant role in the binding of initiation factors, elongation factors, and release factors to the ribosome. Several factors, including the class I release factors RF1 and RF2, are known to interact directly with L11. In eukaryotes, L11 has been implicated in regulating the levels of ubiquinated p53 and MDM2 in the MDM2-p53 feedback loop, which is responsible for apoptosis in response to DNA damage. In bacteria, the "stringent response" to harsh conditions allows bacteria to survive, and ribosomes that lack L11 are deficient in stringent factor stimulation.
  • smart RL11 137aa 8e-36 in ref transcript
    • Ribosomal protein L11/L12.
  • PRK rplK 143aa 1e-27 in ref transcript
    • 50S ribosomal protein L11; Validated.

MRPL22

  • refseq_MRPL22.F1 refseq_MRPL22.R1 188 306
  • NCBIGene 36.3 29093
  • Single exon skipping, size difference: 118
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014180

  • Changed! cd Ribosomal_L22 106aa 9e-23 in ref transcript
    • Ribosomal protein L22/L17e. L22 (L17 in eukaryotes) is a core protein of the large ribosomal subunit. It is the only ribosomal protein that interacts with all six domains of 23S rRNA, and is one of the proteins important for directing the proper folding and stabilizing the conformation of 23S rRNA. L22 is the largest protein contributor to the surface of the polypeptide exit channel, the tunnel through which the polypeptide product passes. L22 is also one of six proteins located at the putative translocon binding site on the exterior surface of the ribosome.
  • Changed! pfam Ribosomal_L22 96aa 3e-09 in ref transcript
    • Ribosomal protein L22p/L17e. This family includes L22 from prokaryotes and chloroplasts and L17 from eukaryotes.
  • Changed! PRK rplV 103aa 1e-14 in ref transcript
    • 50S ribosomal protein L22; Reviewed.

MRPL24

  • refseq_MRPL24.F2 refseq_MRPL24.R2 183 224
  • NCBIGene 36.3 79590
  • Alternative 5-prime, size difference: 41
  • Exclusion in 5'UTR
  • Reference transcript: NM_145729

  • TIGR rplX_bact 96aa 7e-15 in ref transcript
    • This model recognizes bacterial and organellar forms of ribosomal protein L24. It excludes eukaryotic and archaeal forms, designated L26 in eukaryotes.
  • PRK rplX 96aa 2e-13 in ref transcript
    • 50S ribosomal protein L24; Reviewed.

MRPL33

  • refseq_MRPL33.F2 refseq_MRPL33.R2 192 299
  • NCBIGene 36.3 9553
  • Single exon skipping, size difference: 107
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004891

  • Changed! TIGR rpmG_bact 49aa 0.002 in ref transcript
    • This model describes bacterial ribosomal protein L33 and its chloroplast and mitochondrial equivalents.
  • Changed! PRK rpmG 51aa 3e-05 in ref transcript
    • 50S ribosomal protein L33; Validated.

MRPL43

  • refseq_MRPL43.F2 refseq_MRPL43.R2 190 317
  • NCBIGene 36.3 84545
  • Single exon skipping, size difference: 127
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_176794

  • pfam L51_S25_CI-B8 51aa 1e-10 in ref transcript
    • Mitochondrial ribosomal protein L51 / S25 / CI-B8 domain. The proteins in this family are located in the mitochondrion. The family includes ribosomal protein L51, and S25. This family also includes mitochondrial NADH-ubiquinone oxidoreductase B8 subunit (CI-B8) EC:1.6.5.3. It is not known whether all members of this family form part of the NADH-ubiquinone oxidoreductase and whether they are also all ribosomal proteins.

MRPL47

  • refseq_MRPL47.F2 refseq_MRPL47.R2 180 326
  • NCBIGene 36.3 57129
  • Single exon skipping, size difference: 146
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020409

  • Changed! pfam MRP-L47 86aa 3e-29 in ref transcript
    • Mitochondrial 39-S ribosomal protein L47 (MRP-L47). This family represents the N-terminal region (approximately 8 residues) of the eukaryotic mitochondrial 39-S ribosomal protein L47 (MRP-L47). Mitochondrial ribosomal proteins (MRPs) are the counterparts of the cytoplasmic ribosomal proteins, in that they fulfil similar functions in protein biosynthesis. However, they are distinct in number, features and primary structure.

MRPL48

  • refseq_MRPL48.F2 refseq_MRPL48.R2 343 468
  • NCBIGene 36.2 51642
  • Single exon skipping, size difference: 125
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016055

  • Changed! pfam Ribosomal_S10 98aa 3e-16 in ref transcript
    • Ribosomal protein S10p/S20e. This family includes small ribosomal subunit S10 from prokaryotes and S20 from eukaryotes.

MRPL52

  • refseq_MRPL52.F1 refseq_MRPL52.R1 101 175
  • NCBIGene 36.3 122704
  • Single exon skipping, size difference: 74
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_178336

MRPL55

  • refseq_MRPL55.F1 refseq_MRPL55.R1 168 276
  • NCBIGene 36.3 128308
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181462

  • Changed! pfam Mitoc_L55 118aa 1e-32 in ref transcript
    • Mitochondrial ribosomal protein L55. Members of this family are involved in mitochondrial biogenesis and G2/M phase cell cycle progression. They form a component of the mitochondrial ribosome large subunit (39S) which comprises a 16S rRNA and about 50 distinct proteins.
  • Changed! pfam Mitoc_L55 119aa 3e-33 in modified transcript

MRPL55

  • refseq_MRPL55.F3 refseq_MRPL55.R3 145 190
  • NCBIGene 36.3 128308
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 45
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_181454

  • pfam Mitoc_L55 119aa 3e-33 in ref transcript
    • Mitochondrial ribosomal protein L55. Members of this family are involved in mitochondrial biogenesis and G2/M phase cell cycle progression. They form a component of the mitochondrial ribosome large subunit (39S) which comprises a 16S rRNA and about 50 distinct proteins.

MRRF

  • refseq_MRRF.F1 refseq_MRRF.R1 109 269
  • NCBIGene 36.3 92399
  • Single exon skipping, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_138777

  • Changed! cd RRF 178aa 3e-27 in ref transcript
    • Ribosome recycling factor (RRF). Ribosome recycling factor dissociates the posttermination complex, composed of the ribosome, deacylated tRNA, and mRNA, after termination of translation. Thus ribosomes are "recycled" and ready for another round of protein synthesis. RRF is believed to bind the ribosome at the A-site in a manner that mimics tRNA, but the specific mechanisms remain unclear. RRF is essential for bacterial growth. It is not necessary for cell growth in archaea or eukaryotes, but is found in mitochondria or chloroplasts of some eukaryotic species.
  • Changed! pfam RRF 164aa 3e-22 in ref transcript
    • Ribosome recycling factor. The ribosome recycling factor (RRF / ribosome release factor) dissociates the ribosome from the mRNA after termination of translation, and is essential bacterial growth. Thus ribosomes are "recycled" and ready for another round of protein synthesis.
  • Changed! PRK frr 183aa 7e-30 in ref transcript
    • ribosome recycling factor; Reviewed.
  • Changed! cd RRF 109aa 2e-14 in modified transcript
  • Changed! TIGR frr 103aa 2e-09 in modified transcript
    • This model finds only eubacterial proteins. Mitochondrial and/or chloroplast forms might be expected but are not currently known. This protein was previously called ribosome releasing factor. By releasing ribosomes from mRNA at the end of protein biosynthesis, it prevents inappropriate translation from 3-prime regions of the mRNA and frees the ribosome for new rounds of translation. EGAD|53116|YHR038W is part of the frr superfamily.
  • Changed! PRK frr 114aa 1e-15 in modified transcript

MS4A1

  • refseq_MS4A1.F2 refseq_MS4A1.R2 137 400
  • NCBIGene 36.3 931
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 263
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_152866

  • pfam CD20 167aa 2e-32 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.

MS4A3

  • refseq_MS4A3.F1 refseq_MS4A3.R1 191 329
  • NCBIGene 36.3 932
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006138

  • Changed! pfam CD20 148aa 7e-24 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
  • Changed! pfam CD20 106aa 1e-20 in modified transcript

MS4A6A

  • refseq_MS4A6A.F2 refseq_MS4A6A.R2 141 245
  • NCBIGene 36.3 64231
  • Alternative 5-prime, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_152852

  • Changed! pfam CD20 163aa 3e-27 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
  • Changed! pfam CD20 101aa 1e-14 in modified transcript

MS4A7

  • refseq_MS4A7.F1 refseq_MS4A7.R1 101 149
  • NCBIGene 36.3 58475
  • Alternative 5-prime, size difference: 48
  • Exclusion in 5'UTR
  • Reference transcript: NM_021201

  • pfam CD20 162aa 6e-33 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.

MS4A7

  • refseq_MS4A7.F4 refseq_MS4A7.R4 162 297
  • NCBIGene 36.3 58475
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206939

  • Changed! pfam CD20 162aa 6e-33 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
  • Changed! pfam CD20 121aa 1e-21 in modified transcript

MSH5

  • refseq_MSH5.F1 refseq_MSH5.R1 219 270
  • NCBIGene 36.3 4439
  • Alternative 3-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025259

  • cd ABC_MSH5_euk 207aa 5e-77 in ref transcript
    • MutS5 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.
  • TIGR mutS1 529aa 9e-66 in ref transcript
  • COG MutS 561aa 1e-84 in ref transcript
    • Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair].

MSH5

  • refseq_MSH5.F4 refseq_MSH5.R4 114 272
  • NCBIGene 36.3 4439
  • Alternative 5-prime, size difference: 158
  • Inclusion in 5'UTR
  • Reference transcript: NM_025259

  • cd ABC_MSH5_euk 207aa 5e-77 in ref transcript
    • MutS5 homolog in eukaryotes. The MutS protein initiates DNA mismatch repair by recognizing mispaired and unpaired bases embedded in duplex DNA and activating endo- and exonucleases to remove the mismatch. Members of the MutS family possess C-terminal domain with a conserved ATPase activity that belongs to the ATP binding cassette (ABC) superfamily. MutS homologs (MSH) have been identified in most prokaryotic and all eukaryotic organisms examined. Prokaryotes have two homologs (MutS1 and MutS2), whereas seven MSH proteins (MSH1 to MSH7) have been identified in eukaryotes. The homodimer MutS1 and heterodimers MSH2-MSH3 and MSH2-MSH6 are primarily involved in mitotic mismatch repair, whereas MSH4-MSH5 is involved in resolution of Holliday junctions during meiosis. All members of the MutS family contain the highly conserved Walker A/B ATPase domain, and many share a common mechanism of action. MutS1, MSH2-MSH3, MSH2-MSH6, and MSH4-MSH5 dimerize to form sliding clamps, and recognition of specific DNA structures or lesions results in ADP/ATP exchange.
  • TIGR mutS1 529aa 9e-66 in ref transcript
  • COG MutS 561aa 1e-84 in ref transcript
    • Mismatch repair ATPase (MutS family) [DNA replication, recombination, and repair].

MSL3L1

  • refseq_MSL3L1.F2 refseq_MSL3L1.R2 139 222
  • NCBIGene 36.3 10943
  • Single exon skipping, size difference: 83
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_078629

  • Changed! cd CHROMO 46aa 0.005 in ref transcript
    • Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
  • Changed! pfam MRG 110aa 5e-26 in ref transcript
    • MRG. This family consists of three different eukaryotic proteins (mortality factor 4 (MORF4/MRG15), male-specific lethal 3(MSL-3) and ESA1-associated factor 3(EAF3)). It is thought that the MRG family is involved in transcriptional regulation via histone acetylation. It contains 2 chromo domains and a leucine zipper motif.
  • Changed! pfam MRG 57aa 5e-19 in ref transcript
  • Changed! smart CHROMO 43aa 2e-04 in ref transcript
    • Chromatin organization modifier domain.

MSMB

  • refseq_MSMB.F1 refseq_MSMB.R1 107 213
  • NCBIGene 36.3 4477
  • Single exon skipping, size difference: 106
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002443

  • Changed! pfam PSP94 114aa 2e-50 in ref transcript
    • Beta-microseminoprotein (PSP-94). This family consists of the mammalian specific protein beta-microseminoprotein. Prostatic secretory protein of 94 amino acids (PSP94), also called beta-microseminoprotein, is a small, nonglycosylated protein, rich in cysteine residues. It was first isolated as a major protein from human seminal plasma. The exact function of this protein is unknown.
  • Changed! pfam PSP94 36aa 2e-13 in modified transcript

MSRB3

  • refseq_MSRB3.F1 refseq_MSRB3.R1 177 304
  • NCBIGene 36.3 253827
  • Single exon skipping, size difference: 127
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_198080

  • Changed! pfam SelR 123aa 3e-52 in ref transcript
    • SelR domain. Methionine sulfoxide reduction is an important process, by which cells regulate biological processes and cope with oxidative stress. MsrA, a protein involved in the reduction of methionine sulfoxides in proteins, has been known for four decades and has been extensively characterised with respect to structure and function. However, recent studies revealed that MsrA is only specific for methionine-S-sulfoxides. Because oxidised methionines occur in a mixture of R and S isomers in vivo, it was unclear how stereo-specific MsrA could be responsible for the reduction of all protein methionine sulfoxides. It appears that a second methionine sulfoxide reductase, SelR, evolved that is specific for methionine-R-sulfoxides, the activity that is different but complementary to that of MsrA. Thus, these proteins, working together, could reduce both stereoisomers of methionine sulfoxide. This domain is found both in SelR proteins and fused with the peptide methionine sulfoxide reductase enzymatic domain pfam01625. The domain has two conserved cysteine and histidines. The domain binds both selenium and zinc. The final cysteine is found to be replaced by the rare amino acid selenocysteine in some members of the family. This family has methionine-R-sulfoxide reductase activity.
  • Changed! PRK PRK00222 129aa 3e-61 in ref transcript
    • methionine sulfoxide reductase B; Provisional.

MCAT

  • refseq_MT.F1 refseq_MT.R1 110 328
  • NCBIGene 36.3 27349
  • Single exon skipping, size difference: 218
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_173467

  • Changed! TIGR fabD 295aa 4e-53 in ref transcript
    • The seed alignment for this family of proteins contains a single member each from a number of bacterial species but also an additional pair of closely related, uncharacterized proteins from B. subtilis, one of which has a long C-terminal extension.
  • Changed! COG FabD 299aa 1e-55 in ref transcript
    • (acyl-carrier-protein) S-malonyltransferase [Lipid metabolism].
  • Changed! TIGR fabD 107aa 6e-21 in modified transcript
  • Changed! COG FabD 108aa 5e-19 in modified transcript

MTHFD2

  • refseq_MTHFD2.F1 refseq_MTHFD2.R1 176 274
  • NCBIGene 36.3 10797
  • Single exon skipping, size difference: 98
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006636

  • Changed! cd NAD_bind_m-THF_DH_Cyclohyd 180aa 5e-67 in ref transcript
    • NADP binding domain of methylene-tetrahydrofolate dehydrogenase/cyclohydrolase. NADP binding domain of the Methylene-Tetrahydrofolate Dehydrogenase/cyclohydrolase (m-THF DH/cyclohydrolase) bifunctional enzyme. Tetrahydrofolate is a versatile carrier of activated one-carbon units. The major one-carbon folate donors are N-5 methyltetrahydrofolate, N5,N10-m-THF, and N10-formayltetrahydrofolate. The oxidation of metabolic intermediate m-THF to m-THF requires the enzyme m-THF DH. In addition, most DHs also have an associated cyclohydrolase activity which catalyzes its hydrolysis to N10-formyltetrahydrofolate. m-THF DH is typically found as part of a multifunctional protein in eukaryotes. NADP-dependent m-THF DH in mammals, birds and yeast are components of a trifunctional enzyme with DH, cyclohydrolase, and synthetase activities. Certain eukaryotic cells also contain homodimeric bifunctional DH/cyclodrolase form. In bacteria, monofucntional DH, as well as bifunctional m-THF m-THF DHm-THF DHDH/cyclodrolase are found. In addition, yeast (S. cerevisiae) also express an monofunctional DH. This family contains the bifunctional DH/cyclohydrolase. M-THF DH, like other amino acid DH-like NAD(P)-binding domains, is a member of the Rossmann fold superfamily which includes glutamate, leucine, and phenylalanine DHs, m-THF DH, methylene-tetrahydromethanopterin DH, m-THF DH/cyclohydrolase, Shikimate DH-like proteins, malate oxidoreductases, and glutamyl tRNA reductase. Amino acid DHs catalyze the deamination of amino acids to keto acids with NAD(P)+ as a cofactor. The NAD(P)-binding Rossmann fold superfamily includes a wide variety of protein families including NAD(P)- binding domains of alcohol DHs, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate DH, lactate/malate DHs, formate/glycerate DHs, siroheme synthases, 6-phosphogluconate DH, amino acid DHs, repressor rex, NAD-binding potassium channel domain, CoA-binding, and ornithine cyclodeaminase-like domains.
  • Changed! pfam THF_DHG_CYH_C 173aa 1e-64 in ref transcript
    • Tetrahydrofolate dehydrogenase/cyclohydrolase, NAD(P)-binding domain.
  • Changed! pfam THF_DHG_CYH 118aa 4e-35 in ref transcript
    • Tetrahydrofolate dehydrogenase/cyclohydrolase, catalytic domain.
  • Changed! COG FolD 296aa 2e-83 in ref transcript
    • 5,10-methylene-tetrahydrofolate dehydrogenase/Methenyl tetrahydrofolate cyclohydrolase [Coenzyme metabolism].

MTIF2

  • refseq_MTIF2.F2 refseq_MTIF2.R2 229 367
  • NCBIGene 36.3 4528
  • Single exon skipping, size difference: 138
  • Exclusion in 5'UTR
  • Reference transcript: NM_001005369

  • cd IF2_eIF5B 165aa 2e-76 in ref transcript
    • IF2/eIF5B (initiation factors 2/ eukaryotic initiation factor 5B) subfamily. IF2/eIF5B contribute to ribosomal subunit joining and function as GTPases that are maximally activated by the presence of both ribosomal subunits. As seen in other GTPases, IF2/IF5B undergoes conformational changes between its GTP- and GDP-bound states. Eukaryotic IF2/eIF5Bs possess three characteristic segments, including a divergent N-terminal region followed by conserved central and C-terminal segments. This core region is conserved among all known eukaryotic and archaeal IF2/eIF5Bs and eubacterial IF2s.
  • cd IF2_mtIF2_II 95aa 1e-32 in ref transcript
    • This family represents the domain II of bacterial Initiation Factor 2 (IF2) and its eukaryotic mitochondrial homologue mtIF2. IF2, the largest initiation factor is an essential GTP binding protein. In E. coli three natural forms of IF2 exist in the cell, IF2alpha, IF2beta1, and IF2beta2. Bacterial IF-2 is structurally and functionally related to eukaryotic mitochondrial mtIF-2.
  • cd mtIF2_IVc 88aa 5e-22 in ref transcript
    • mtIF2_IVc: this family represents the C2 subdomain of domain IV of mitochondrial translation initiation factor 2 (mtIF2) which adopts a beta-barrel fold displaying a high degree of structural similarity with domain II of the translation elongation factor EF-Tu. The C-terminal part of mtIF2 contains the entire fMet-tRNAfmet binding site of IF-2 and is resistant to proteolysis. This C-terminal portion consists of two domains, IF2 C1 and IF2 C2. IF2 C2 been shown to contain all molecular determinants necessary and sufficient for the recognition and binding of fMet-tRNAfMet. Like IF2 from certain prokaryotes such as Thermus thermophilus, mtIF2lacks domain II which is thought to be involved in binding of E.coli IF-2 to 30S subunits.
  • cd IF2_eIF5B 162aa 8e-04 in ref transcript
  • TIGR IF-2 545aa 1e-146 in ref transcript
    • This model discriminates eubacterial (and mitochondrial) translation initiation factor 2 (IF-2), encoded by the infB gene in bacteria, from similar proteins in the Archaea and Eukaryotes. In the bacteria and in organelles, the initiator tRNA is charged with N-formyl-Met instead of Met. This translation factor acts in delivering the initator tRNA to the ribosome. It is one of a number of GTP-binding translation factors recognized by the pfam HMM GTP_EFTU.
  • PRK infB 563aa 0.0 in ref transcript
    • translation initiation factor IF-2; Validated.

MTMR2

  • refseq_MTMR2.F1 refseq_MTMR2.R1 102 225
  • NCBIGene 36.3 8898
  • Single exon skipping, size difference: 123
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016156

  • Changed! pfam Myotub-related 118aa 5e-54 in ref transcript
    • Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
  • Changed! pfam GRAM 62aa 5e-09 in ref transcript
    • GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins.
  • Changed! smart PTPc_motif 107aa 6e-06 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.

MTMR2

  • refseq_MTMR2.F2 refseq_MTMR2.R2 154 225
  • NCBIGene 36.3 8898
  • Single exon skipping, size difference: 71
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016156

  • Changed! pfam Myotub-related 118aa 5e-54 in ref transcript
    • Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
  • Changed! pfam GRAM 62aa 5e-09 in ref transcript
    • GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins.
  • Changed! smart PTPc_motif 107aa 6e-06 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.

MTMR3

  • refseq_MTMR3.F1 refseq_MTMR3.R1 138 165
  • NCBIGene 36.3 8897
  • Single exon skipping, size difference: 27
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_021090

  • Changed! cd FYVE 55aa 4e-15 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • pfam Myotub-related 163aa 5e-38 in ref transcript
    • Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
  • Changed! smart FYVE 66aa 4e-22 in ref transcript
    • Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PHD finger and the RIN G finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.
  • smart PTPc_motif 40aa 7e-05 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.
  • Changed! cd FYVE 64aa 8e-13 in modified transcript
  • Changed! smart FYVE 75aa 8e-20 in modified transcript

MTMR3

  • refseq_MTMR3.F3 refseq_MTMR3.R3 185 296
  • NCBIGene 36.3 8897
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021090

  • cd FYVE 55aa 4e-15 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • pfam Myotub-related 163aa 5e-38 in ref transcript
    • Myotubularin-related. This family represents a region within eukaryotic myotubularin-related proteins that is sometimes found with pfam02893. Myotubularin is a dual-specific lipid phosphatase that dephosphorylates phosphatidylinositol 3-phosphate and phosphatidylinositol (3,5)-bi-phosphate. Mutations in gene encoding myotubularin-related proteins have been associated with disease.
  • smart FYVE 66aa 4e-22 in ref transcript
    • Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PHD finger and the RIN G finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.
  • smart PTPc_motif 40aa 7e-05 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.

MTO1

  • refseq_MTO1.F2 refseq_MTO1.R2 181 256
  • NCBIGene 36.3 25821
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_133645

  • Changed! TIGR gidA 652aa 0.0 in ref transcript
    • GidA, the longer of two forms of GidA-related proteins, appears to be present in all complete eubacterial genomes so far, as well as Saccharomyces cerevisiae. A subset of these organisms have a closely related protein. GidA is absent in the Archaea. It appears to act with MnmE, in an alpha2/beta2 heterotetramer, in the 5-carboxymethylaminomethyl modification of uridine 34 in certain tRNAs. The shorter, related protein, previously called gid or gidA(S), is now called TrmFO (see model TIGR00137).
  • Changed! PRK PRK05192 657aa 0.0 in ref transcript
    • tRNA uridine 5-carboxymethylaminomethyl modification enzyme GidA; Validated.
  • Changed! TIGR gidA 627aa 0.0 in modified transcript
  • Changed! PRK PRK05192 632aa 0.0 in modified transcript

MTP18

  • refseq_MTP18.F1 refseq_MTP18.R1 158 391
  • NCBIGene 36.3 51537
  • Single exon skipping, size difference: 233
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016498

  • Changed! pfam MTP18 130aa 1e-45 in ref transcript
    • Mitochondrial 18 KDa protein (MTP18). This family of proteins are mitochondrial 18KDa proteins that are often misannotated as carbonic anhydrases. It was shown that knockdown of MTP18 protein results in a cytochrome c release from mitochondria and consequently leads to apoptosis. Overexpression studies suggest that MTP18 is required for mitochondrial fission.
  • Changed! pfam MTP18 54aa 1e-16 in modified transcript

MTRR

  • refseq_MTRR.F2 refseq_MTRR.R2 114 140
  • NCBIGene 36.3 4552
  • Alternative 5-prime, size difference: 26
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_024010

  • Changed! cd methionine_synthase_red 421aa 1e-164 in ref transcript
    • Human methionine synthase reductase (MSR) restores methionine sythase which is responsible for the regeneration of methionine from homocysteine, as well as the coversion of methyltetrahydrofolate to tetrahydrofolate. In MSR, electrons are transferred from NADPH to FAD to FMN to cob(II)alamin. MSR resembles proteins of the cytochrome p450 family including nitric oxide synthase, the alpha subunit of sulfite reductase, but contains an extended hinge region. NADPH cytochrome p450 reductase (CYPOR) serves as an electron donor in several oxygenase systems and is a component of nitric oxide synthases and methionine synthase reductases. CYPOR transfers two electrons from NADPH to the heme of cytochrome p450 via FAD and FMN. CYPORs resemble ferredoxin reductase (FNR) but have a connecting subdomain inserted within the flavin binding region, which helps orient the FMN binding doamin with the FNR module. Ferredoxin-NADP+ (oxido)reductase is an FAD-containing enzyme that catalyzes the reversible electron transfer between NADP(H) and electron carrier proteins such as ferredoxin and flavodoxin. Isoforms of these flavoproteins (i.e. having a non-covalently bound FAD as a prosthetic group) are present in chloroplasts, mitochondria, and bacteria in which they participate in a wide variety of redox metabolic pathways. The C-terminal domain contains most of the NADP(H) binding residues and the N-terminal domain interacts non-covalently with the isoalloxazine rings of the flavin molecule which lies largely in a large gap betweed the two domains. Ferredoxin-NADP+ reductase first accepts one electron from reduced ferredoxin to form a flavin semiquinone intermediate. The enzyme then accepts a second electron to form FADH2 which then transfers two electrons and a proton to NADP+ to form NADPH.
  • Changed! TIGR cysJ 414aa 4e-49 in ref transcript
    • This model describes an NADPH-dependent sulfite reductase flavoprotein subunit. Most members of this family are found in Cys biosynthesis gene clusters. The closest homologs below the trusted cutoff are designated as subunits nitrate reductase.
  • Changed! pfam Flavodoxin_1 137aa 1e-29 in ref transcript
    • Flavodoxin.
  • Changed! COG CysJ 432aa 1e-70 in ref transcript
    • Sulfite reductase, alpha subunit (flavoprotein) [Inorganic ion transport and metabolism].
  • Changed! COG CysJ 188aa 1e-19 in ref transcript

MTUS1

  • refseq_MTUS1.F2 refseq_MTUS1.R2 105 166
  • NCBIGene 36.2 57509
  • Single exon skipping, size difference: 61
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001001924

  • Changed! TIGR SMC_prok_B 297aa 3e-14 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 320aa 2e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

MTX1

  • refseq_MTX1.F1 refseq_MTX1.R1 114 207
  • NCBIGene 36.3 4580
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002455

  • Changed! cd GST_C_Metaxin1_3 137aa 3e-51 in ref transcript
    • GST_C family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins. Mammalian metaxin (or metaxin 1) is a component of the preprotein import complex of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken, and mammals.
  • Changed! cd GST_N_Metaxin1_like 74aa 7e-23 in ref transcript
    • GST_N family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins including Tom37 from fungi. Mammalian metaxin (or metaxin 1) and the fungal protein Tom37 are components of preprotein import complexes of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals.
  • Changed! pfam Tom37 219aa 4e-58 in ref transcript
    • Outer mitochondrial membrane transport complex protein. The TOM37 protein is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with TOM70 it guides peptides without an MTS into TOM40, the protein that forms the passage through the outer membrane. It has homology with Metaxin-1, also part of the outer mitochondrial membrane beta-barrel protein transport complex.
  • Changed! cd GST_C_Metaxin1_3 134aa 3e-50 in modified transcript
  • Changed! cd GST_N_Metaxin1_like 73aa 1e-22 in modified transcript
  • Changed! pfam Tom37 188aa 8e-43 in modified transcript

MTX2

  • refseq_MTX2.F1 refseq_MTX2.R1 153 357
  • NCBIGene 36.3 10651
  • Single exon skipping, size difference: 204
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006554

  • Changed! cd GST_C_Metaxin2 126aa 2e-62 in ref transcript
    • GST_C family, Metaxin subfamily, Metaxin 2; a metaxin 1 binding protein identified through a yeast two-hybrid system using metaxin 1 as the bait. Metaxin 2 shares sequence similarity with metaxin 1 but does not contain a C-terminal mitochondrial outer membrane signal-anchor domain. It associates with mitochondrial membranes through its interaction with metaxin 1, which is a component of the mitochondrial preprotein import complex of the outer membrane. The biological function of metaxin 2 is unknown. It is likely that it also plays a role in protein translocation into the mitochondria. However, this has not been experimentally validated. In a recent proteomics study, it has been shown that metaxin 2 is overexpressed in response to lipopolysaccharide-induced liver injury.
  • Changed! cd GST_N_Metaxin2 74aa 8e-32 in ref transcript
    • GST_N family, Metaxin subfamily, Metaxin 2; a metaxin 1 binding protein identified through a yeast two-hybrid system using metaxin 1 as the bait. Metaxin 2 shares sequence similarity with metaxin 1 but does not contain a C-terminal mitochondrial outer membrane signal-anchor domain. It associates with mitochondrial membranes through its interaction with metaxin 1, which is a component of the mitochondrial preprotein import complex of the outer membrane. The biological function of metaxin 2 is unknown. It is likely that it also plays a role in protein translocation into the mitochondria. However, this has not been experimentally validated. In a recent proteomics study, it has been shown that metaxin 2 is overexpressed in response to lipopolysaccharide-induced liver injury.
  • Changed! pfam Tom37 197aa 2e-11 in ref transcript
    • Outer mitochondrial membrane transport complex protein. The TOM37 protein is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with TOM70 it guides peptides without an MTS into TOM40, the protein that forms the passage through the outer membrane. It has homology with Metaxin-1, also part of the outer mitochondrial membrane beta-barrel protein transport complex.
  • Changed! COG Gst 187aa 0.002 in ref transcript
    • Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].

MXI1

  • refseq_MXI1.F1 refseq_MXI1.R1 144 174
  • NCBIGene 36.3 4601
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130439

  • Changed! cd HLH 60aa 2e-06 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • Changed! smart HLH 53aa 1e-07 in ref transcript
    • helix loop helix domain.
  • Changed! cd HLH 47aa 0.003 in modified transcript
  • Changed! smart HLH 48aa 2e-05 in modified transcript

MXRA7

  • refseq_MXRA7.F2 refseq_MXRA7.R2 307 388
  • NCBIGene 36.3 439921
  • Single exon skipping, size difference: 81
  • Exclusion of the stop codon
  • Reference transcript: NM_001008529

MYADM

  • refseq_MYADM.F1 refseq_MYADM.R1 107 171
  • NCBIGene 36.3 91663
  • Alternative 3-prime, size difference: 64
  • Inclusion in 5'UTR
  • Reference transcript: NM_138373

  • pfam MARVEL 108aa 2e-10 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.
  • pfam MARVEL 152aa 2e-07 in ref transcript

MYBPC1

  • refseq_MYBPC1.F1 refseq_MYBPC1.R1 129 188
  • NCBIGene 36.3 4604
  • Single exon skipping, size difference: 59
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002465

  • cd IGcam 89aa 2e-15 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 84aa 1e-11 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 81aa 1e-10 in ref transcript
  • cd FN3 85aa 2e-08 in ref transcript
  • cd IGcam 80aa 4e-07 in ref transcript
  • cd IGcam 90aa 1e-06 in ref transcript
  • cd IGcam 70aa 0.001 in ref transcript
  • cd IGcam 75aa 0.003 in ref transcript
  • pfam I-set 90aa 1e-17 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 83aa 2e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam I-set 78aa 6e-11 in ref transcript
  • pfam I-set 85aa 4e-10 in ref transcript
  • pfam I-set 69aa 2e-09 in ref transcript
  • pfam I-set 84aa 8e-08 in ref transcript
  • smart FN3 74aa 2e-07 in ref transcript
  • smart IG_like 97aa 4e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 82aa 3e-06 in ref transcript
  • smart FN3 83aa 1e-04 in ref transcript

MYBPC1

  • refseq_MYBPC1.F4 refseq_MYBPC1.R4 105 180
  • NCBIGene 36.3 4604
  • Multiple exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002465

  • cd IGcam 89aa 2e-15 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 84aa 1e-11 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 81aa 1e-10 in ref transcript
  • cd FN3 85aa 2e-08 in ref transcript
  • cd IGcam 80aa 4e-07 in ref transcript
  • cd IGcam 90aa 1e-06 in ref transcript
  • cd IGcam 70aa 0.001 in ref transcript
  • cd IGcam 75aa 0.003 in ref transcript
  • pfam I-set 90aa 1e-17 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 83aa 2e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam I-set 78aa 6e-11 in ref transcript
  • pfam I-set 85aa 4e-10 in ref transcript
  • pfam I-set 69aa 2e-09 in ref transcript
  • pfam I-set 84aa 8e-08 in ref transcript
  • smart FN3 74aa 2e-07 in ref transcript
  • smart IG_like 97aa 4e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 82aa 3e-06 in ref transcript
  • smart FN3 83aa 1e-04 in ref transcript

MYBPC1

  • refseq_MYBPC1.F5 refseq_MYBPC1.R5 190 244
  • NCBIGene 36.3 4604
  • Single exon skipping, size difference: 54
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_002465

  • cd IGcam 89aa 2e-15 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 84aa 1e-11 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 81aa 1e-10 in ref transcript
  • Changed! cd FN3 85aa 2e-08 in ref transcript
  • cd IGcam 80aa 4e-07 in ref transcript
  • cd IGcam 90aa 1e-06 in ref transcript
  • cd IGcam 70aa 0.001 in ref transcript
  • cd IGcam 75aa 0.003 in ref transcript
  • pfam I-set 90aa 1e-17 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 83aa 2e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam I-set 78aa 6e-11 in ref transcript
  • pfam I-set 85aa 4e-10 in ref transcript
  • pfam I-set 69aa 2e-09 in ref transcript
  • pfam I-set 84aa 8e-08 in ref transcript
  • smart FN3 74aa 2e-07 in ref transcript
  • smart IG_like 97aa 4e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 82aa 3e-06 in ref transcript
  • Changed! smart FN3 83aa 1e-04 in ref transcript
  • Changed! cd FN3 103aa 2e-05 in modified transcript
  • Changed! smart FN3 101aa 0.001 in modified transcript

MYH11

  • refseq_MYH11.F1 refseq_MYH11.R1 172 211
  • NCBIGene 36.3 4629
  • Single exon skipping, size difference: 39
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001040114

  • cd MYSc_type_II 709aa 0.0 in ref transcript
    • Myosin motor domain, type II myosins. Myosin II mediates cortical contraction in cell motility, and is the motor in smooth and skeletal muscle. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • pfam Myosin_head 692aa 0.0 in ref transcript
    • Myosin head (motor domain).
  • Changed! pfam Myosin_tail_1 820aa 1e-126 in ref transcript
    • Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
  • pfam SMC_N 233aa 1e-09 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • pfam Myosin_N 43aa 1e-09 in ref transcript
    • Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown.
  • COG COG5022 1265aa 0.0 in ref transcript
    • Myosin heavy chain [Cytoskeleton].
  • Changed! COG Smc 392aa 2e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG COG4372 227aa 2e-04 in ref transcript
    • Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].
  • COG Smc 356aa 0.001 in ref transcript
  • Changed! pfam Myosin_tail_1 819aa 1e-126 in modified transcript
  • Changed! COG Smc 386aa 2e-05 in modified transcript

MYH11

  • refseq_MYH11.F3 refseq_MYH11.R3 104 125
  • NCBIGene 36.3 4629
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040114

  • Changed! cd MYSc_type_II 709aa 0.0 in ref transcript
    • Myosin motor domain, type II myosins. Myosin II mediates cortical contraction in cell motility, and is the motor in smooth and skeletal muscle. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • Changed! pfam Myosin_head 692aa 0.0 in ref transcript
    • Myosin head (motor domain).
  • pfam Myosin_tail_1 820aa 1e-126 in ref transcript
    • Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
  • pfam SMC_N 233aa 1e-09 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • pfam Myosin_N 43aa 1e-09 in ref transcript
    • Myosin N-terminal SH3-like domain. This domain has an SH3-like fold. It is found at the N-terminus of many but not all myosins. The function of this domain is unknown.
  • Changed! COG COG5022 1265aa 0.0 in ref transcript
    • Myosin heavy chain [Cytoskeleton].
  • COG Smc 392aa 2e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG COG4372 227aa 2e-04 in ref transcript
    • Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].
  • COG Smc 356aa 0.001 in ref transcript
  • Changed! cd MYSc_type_II 702aa 0.0 in modified transcript
  • Changed! pfam Myosin_head 685aa 0.0 in modified transcript
  • Changed! TIGR SMC_prok_A 236aa 0.001 in modified transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! COG COG5022 1258aa 0.0 in modified transcript

MYL6

  • refseq_MYL6.F2 refseq_MYL6.R2 241 286
  • NCBIGene 36.3 4637
  • Single exon skipping, size difference: 45
  • Exclusion of the stop codon
  • Reference transcript: NM_021019

  • Changed! cd EFh 56aa 4e-05 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • smart EFh 29aa 0.003 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • Changed! PTZ PTZ00184 147aa 4e-30 in ref transcript
    • calmodulin; Provisional.
  • Changed! cd EFh 56aa 1e-05 in modified transcript
  • Changed! PTZ PTZ00184 147aa 6e-30 in modified transcript

MYL9

  • refseq_MYL9.F2 refseq_MYL9.R2 261 423
  • NCBIGene 36.3 10398
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006097

  • Changed! cd EFh 62aa 3e-04 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! cd EFh 58aa 4e-04 in ref transcript
  • smart EFh 29aa 0.002 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • Changed! COG FRQ1 150aa 1e-26 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! cd EFh 78aa 6e-04 in modified transcript
  • Changed! PTZ PTZ00184 85aa 2e-08 in modified transcript
    • calmodulin; Provisional.

MYLK

  • refseq_MYLK.F2 refseq_MYLK.R2 155 375
  • NCBIGene 36.2 4638
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 220
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_053025

  • Changed! cd S_TKc 256aa 7e-74 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd IGcam 88aa 1e-16 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 90aa 9e-16 in ref transcript
  • cd IGcam 90aa 5e-15 in ref transcript
  • Changed! cd IGcam 90aa 9e-15 in ref transcript
  • cd IGcam 90aa 2e-14 in ref transcript
  • cd IGcam 89aa 3e-13 in ref transcript
  • cd IGcam 86aa 3e-12 in ref transcript
  • cd IGcam 88aa 1e-10 in ref transcript
  • cd FN3 66aa 4e-09 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 87aa 8e-05 in ref transcript
  • Changed! smart S_TKc 245aa 4e-77 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! pfam I-set 91aa 2e-30 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 90aa 1e-23 in ref transcript
  • pfam I-set 91aa 1e-21 in ref transcript
  • pfam I-set 90aa 2e-21 in ref transcript
  • pfam I-set 91aa 5e-19 in ref transcript
  • pfam I-set 90aa 3e-18 in ref transcript
  • pfam I-set 87aa 3e-18 in ref transcript
  • pfam I-set 89aa 7e-15 in ref transcript
  • pfam I-set 89aa 2e-13 in ref transcript
  • pfam fn3 59aa 2e-06 in ref transcript
    • Fibronectin type III domain.
  • Changed! COG SPS1 351aa 1e-38 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MYLK

  • refseq_MYLK.F3 refseq_MYLK.R3 208 361
  • NCBIGene 36.3 4638
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_053025

  • Changed! cd S_TKc 256aa 7e-74 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd IGcam 88aa 1e-16 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 90aa 9e-16 in ref transcript
  • cd IGcam 90aa 5e-15 in ref transcript
  • cd IGcam 90aa 9e-15 in ref transcript
  • cd IGcam 90aa 2e-14 in ref transcript
  • cd IGcam 89aa 3e-13 in ref transcript
  • cd IGcam 86aa 3e-12 in ref transcript
  • cd IGcam 88aa 1e-10 in ref transcript
  • cd FN3 66aa 4e-09 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 87aa 8e-05 in ref transcript
  • Changed! smart S_TKc 245aa 4e-77 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam I-set 91aa 2e-30 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 90aa 1e-23 in ref transcript
  • pfam I-set 91aa 1e-21 in ref transcript
  • pfam I-set 90aa 2e-21 in ref transcript
  • pfam I-set 91aa 5e-19 in ref transcript
  • pfam I-set 90aa 3e-18 in ref transcript
  • pfam I-set 87aa 3e-18 in ref transcript
  • pfam I-set 89aa 7e-15 in ref transcript
  • pfam I-set 89aa 2e-13 in ref transcript
  • pfam fn3 59aa 2e-06 in ref transcript
    • Fibronectin type III domain.
  • Changed! COG SPS1 351aa 1e-38 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd S_TKc 220aa 3e-56 in modified transcript
  • Changed! smart S_TKc 209aa 3e-59 in modified transcript
  • Changed! COG SPS1 342aa 1e-29 in modified transcript

MYLK

  • refseq_MYLK.F4 refseq_MYLK.R4 140 347
  • NCBIGene 36.3 4638
  • Single exon skipping, size difference: 207
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_053025

  • cd S_TKc 256aa 7e-74 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd IGcam 88aa 1e-16 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! cd IGcam 90aa 9e-16 in ref transcript
  • cd IGcam 90aa 5e-15 in ref transcript
  • cd IGcam 90aa 9e-15 in ref transcript
  • cd IGcam 90aa 2e-14 in ref transcript
  • cd IGcam 89aa 3e-13 in ref transcript
  • cd IGcam 86aa 3e-12 in ref transcript
  • cd IGcam 88aa 1e-10 in ref transcript
  • cd FN3 66aa 4e-09 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 87aa 8e-05 in ref transcript
  • smart S_TKc 245aa 4e-77 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam I-set 91aa 2e-30 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 90aa 1e-23 in ref transcript
  • pfam I-set 91aa 1e-21 in ref transcript
  • pfam I-set 90aa 2e-21 in ref transcript
  • Changed! pfam I-set 91aa 5e-19 in ref transcript
  • pfam I-set 90aa 3e-18 in ref transcript
  • pfam I-set 87aa 3e-18 in ref transcript
  • pfam I-set 89aa 7e-15 in ref transcript
  • pfam I-set 89aa 2e-13 in ref transcript
  • pfam fn3 59aa 2e-06 in ref transcript
    • Fibronectin type III domain.
  • COG SPS1 351aa 1e-38 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MYO18A

  • refseq_MYO18A.F2 refseq_MYO18A.R2 255 300
  • NCBIGene 36.3 399687
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_078471

  • cd MYSc_type_XVIII 779aa 0.0 in ref transcript
    • Myosin motor domain, type XVIII myosins. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • cd PDZ_signaling 80aa 6e-10 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart MYSc 786aa 1e-105 in ref transcript
    • Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
  • pfam Myosin_tail_1 527aa 1e-36 in ref transcript
    • Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
  • Changed! TIGR SMC_prok_B 288aa 2e-11 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart PDZ 81aa 5e-08 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG COG5022 1200aa 1e-103 in ref transcript
    • Myosin heavy chain [Cytoskeleton].
  • Changed! COG Smc 738aa 5e-21 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 331aa 2e-08 in modified transcript
  • Changed! COG Smc 691aa 1e-18 in modified transcript

MYOHD1

  • refseq_MYOHD1.F2 refseq_MYOHD1.R2 115 165
  • NCBIGene 36.2 80179
  • Single exon skipping, size difference: 50
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_025109

  • cd MYSc 402aa 1e-116 in ref transcript
    • Myosin motor domain. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • Changed! cd MYSc_type_V 116aa 8e-08 in ref transcript
    • Myosin motor domain, type V myosins. Myosins V transport a variety of intracellular cargo processively along actin filaments, such as membraneous organelles and mRNA. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • smart MYSc 404aa 1e-124 in ref transcript
    • Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
  • Changed! smart MYSc 116aa 4e-07 in ref transcript
  • COG COG5022 438aa 1e-107 in ref transcript
    • Myosin heavy chain [Cytoskeleton].
  • Changed! COG COG5022 185aa 2e-11 in ref transcript
  • Changed! cd MYSc_type_V 34aa 1e-07 in modified transcript
  • Changed! smart MYSc 34aa 9e-07 in modified transcript
  • Changed! COG COG5022 34aa 5e-07 in modified transcript

MZF1

  • refseq_MZF1.F1 refseq_MZF1.R1 100 418
  • NCBIGene 36.3 7593
  • Alternative 5-prime, size difference: 318
  • Exclusion in 5'UTR
  • Reference transcript: NM_198055

  • smart SCAN 113aa 2e-43 in ref transcript
    • leucine rich region.
  • COG COG5048 165aa 6e-08 in ref transcript
    • FOG: Zn-finger [General function prediction only].

NLRP1

  • refseq_NALP1.F2 refseq_NALP1.R2 184 316
  • NCBIGene 36.3 22861
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033004

  • cd LRR_RI 179aa 8e-30 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • cd AAA 95aa 0.009 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam NACHT 170aa 5e-45 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam CARD 82aa 2e-16 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • pfam PAAD_DAPIN 58aa 2e-08 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • smart LRR_RI 28aa 0.002 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.
  • COG RNA1 142aa 0.004 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].

NLRP1

  • refseq_NALP1.F3 refseq_NALP1.R3 178 268
  • NCBIGene 36.3 22861
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033004

  • Changed! cd LRR_RI 179aa 8e-30 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • cd AAA 95aa 0.009 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam NACHT 170aa 5e-45 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam CARD 82aa 2e-16 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • pfam PAAD_DAPIN 58aa 2e-08 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • smart LRR_RI 28aa 0.002 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.
  • COG RNA1 142aa 0.004 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
  • Changed! cd LRR_RI 161aa 4e-27 in modified transcript

NLRP12

  • refseq_NALP12.F1 refseq_NALP12.R1 206 377
  • NCBIGene 36.3 91662
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144687

  • Changed! cd LRR_RI 319aa 6e-34 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! cd LRR_RI 306aa 2e-23 in ref transcript
  • pfam NACHT 170aa 1e-43 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam PAAD_DAPIN 81aa 5e-24 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • Changed! cd LRR_RI 286aa 3e-31 in modified transcript

NLRP12

  • refseq_NALP12.F4 refseq_NALP12.R4 168 358
  • NCBIGene 36.3 91662
  • Single exon skipping, size difference: 190
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_144687

  • Changed! cd LRR_RI 319aa 6e-34 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! cd LRR_RI 306aa 2e-23 in ref transcript
  • pfam NACHT 170aa 1e-43 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam PAAD_DAPIN 81aa 5e-24 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • Changed! cd LRR_RI 188aa 4e-05 in modified transcript

NLRP7

  • refseq_NALP7.F2 refseq_NALP7.R2 200 284
  • NCBIGene 36.3 199713
  • Alternative 3-prime, size difference: 84
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_139176

  • cd LRR_RI 266aa 7e-27 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • pfam NACHT 168aa 1e-35 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam PAAD_DAPIN 67aa 3e-11 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • COG RNA1 165aa 5e-04 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].

NLRP7

  • refseq_NALP7.F3 refseq_NALP7.R3 203 374
  • NCBIGene 36.3 199713
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139176

  • Changed! cd LRR_RI 266aa 7e-27 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • pfam NACHT 168aa 1e-35 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam PAAD_DAPIN 67aa 3e-11 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • Changed! COG RNA1 165aa 5e-04 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].
  • Changed! cd LRR_RI 246aa 1e-23 in modified transcript

NASP

  • refseq_NASP.F1 refseq_NASP.R1 140 213
  • NCBIGene 36.3 4678
  • Single exon skipping, size difference: 73
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172164

  • Changed! cd TPR 97aa 5e-04 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • Changed! pfam SHNi-TPR 38aa 2e-07 in ref transcript
    • SHNi-TPR. SHNi-TPR family members contain a reiterated sequence motif that is an interrupted form of TPR repeat.
  • Changed! TIGR 2A1904 189aa 0.001 in ref transcript
  • Changed! COG MDN1 198aa 0.004 in ref transcript
    • AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].

NAT5

  • refseq_NAT5.F1 refseq_NAT5.R1 178 324
  • NCBIGene 36.3 51126
  • Single exon skipping, size difference: 146
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016100

  • Changed! cd GNAT 86aa 1e-07 in ref transcript
    • GCN5-related N-acetyltransferases (GNAT) represent a large superfamily of functionally diverse enzymes that catalyze the transfer of an acetyl group from acetyl-Coenzyme A to the primary amine of a wide range of acceptor substrates. Members of this superfamily include aminoglycoside N-acetyltransferases, serotonin N-acetyltransferase, glucosamine-6-phosphate N-acetyltransferase, the histone acetyltransferases, mycothiol synthase, and the Fem family of amino acyl transferases.
  • Changed! pfam Acetyltransf_1 79aa 2e-12 in ref transcript
    • Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins.
  • Changed! COG RimI 156aa 1e-19 in ref transcript
    • Acetyltransferases [General function prediction only].
  • Changed! pfam Acetyltransf_1 44aa 3e-04 in modified transcript
  • Changed! COG RimI 93aa 1e-09 in modified transcript

NAV2

  • refseq_NAV2.F2 refseq_NAV2.R2 165 264
  • NCBIGene 36.3 89797
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182964

  • cd CH 102aa 4e-12 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • pfam CH 102aa 5e-14 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • smart AAA 109aa 0.001 in ref transcript
    • ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.
  • COG SAC6 98aa 3e-08 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • PRK PRK06835 104aa 0.006 in ref transcript
    • DNA replication protein DnaC; Validated.

NAV2

  • refseq_NAV2.F4 refseq_NAV2.R4 341 410
  • NCBIGene 36.3 89797
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182964

  • cd CH 102aa 4e-12 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • pfam CH 102aa 5e-14 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • smart AAA 109aa 0.001 in ref transcript
    • ATPases associated with a variety of cellular activities. AAA - ATPases associated with a variety of cellular activities. This profile/alignment only detects a fraction of this vast family. The poorly conserved N-terminal helix is missing from the alignment.
  • COG SAC6 98aa 3e-08 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • PRK PRK06835 104aa 0.006 in ref transcript
    • DNA replication protein DnaC; Validated.

NBR1

  • refseq_NBR1.F1 refseq_NBR1.R1 103 435
  • NCBIGene 36.3 4077
  • Alternative 5-prime, size difference: 332
  • Exclusion in 5'UTR
  • Reference transcript: NM_031858

  • cd PB1_NBR1 81aa 1e-35 in ref transcript
    • The PB1 domain is an essential part of NBR1 protein, next to BRCA1, a scaffold protein mediating specific protein-protein interaction with both titin protein kinase and with another scaffold protein p62. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The NBR1 protein contains a type I PB1 domain.
  • cd ZZ_NBR1_like 45aa 6e-12 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in Drosophila ref(2)P, NBR1, Human sequestosome 1 and related proteins. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Drosophila ref(2)P appears to control the multiplication of sigma rhabdovirus. NBR1 (Next to BRCA1 gene 1 protein) interacts with fasciculation and elongation protein zeta-1 (FEZ1) and calcium and integrin binding protein (CIB), and may function in cell signalling pathways. Sequestosome 1 is a phosphotyrosine independent ligand for the Lck SH2 domain and binds noncovalently to ubiquitin via its UBA domain.
  • smart PB1 80aa 1e-12 in ref transcript
    • PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.
  • smart ZnF_ZZ 45aa 9e-11 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].

NCAM1

  • refseq_NCAM1.F1 refseq_NCAM1.R1 154 184
  • NCBIGene 36.3 4684
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181351

  • cd IGcam 91aa 5e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 87aa 8e-10 in ref transcript
  • Changed! cd IGcam 104aa 1e-09 in ref transcript
  • cd IGcam 73aa 2e-09 in ref transcript
  • cd FN3 98aa 5e-07 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 88aa 2e-04 in ref transcript
  • pfam I-set 83aa 1e-15 in ref transcript
    • Immunoglobulin I-set domain.
  • Changed! smart IG_like 91aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 69aa 8e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 85aa 2e-08 in ref transcript
  • pfam fn3 89aa 7e-08 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 83aa 1e-06 in ref transcript
  • Changed! cd IGcam 94aa 9e-11 in modified transcript
  • Changed! smart IG_like 81aa 2e-12 in modified transcript

NCKAP1

  • refseq_NCKAP1.F1 refseq_NCKAP1.R1 102 120
  • NCBIGene 36.3 10787
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_205842

  • Changed! pfam Nckap1 1123aa 0.0 in ref transcript
    • Membrane-associated apoptosis protein. Expression of this protein was found to be markedly reduced in patients with Alzheimer's disease. It is involved in the regulation of actin polymerisation in the brain as part of a WAVE2 signalling complex.
  • Changed! pfam Nckap1 1117aa 0.0 in modified transcript

NCKIPSD

  • refseq_NCKIPSD.F1 refseq_NCKIPSD.R1 107 128
  • NCBIGene 36.3 51517
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016453

  • cd SH3 55aa 4e-09 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam DUF2013 117aa 2e-33 in ref transcript
    • Protein of unknown function (DUF2013). This region is found at the C terminal of a group of cytoskeletal proteins.
  • smart SH3 56aa 7e-10 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

NCOA1

  • refseq_NCOA1.F1 refseq_NCOA1.R1 114 174
  • NCBIGene 36.3 8648
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 60
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_147233

  • cd PAS 94aa 2e-08 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd HLH 58aa 4e-05 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam Nuc_rec_co-act 47aa 8e-13 in ref transcript
    • Nuclear receptor coactivator. This region is found on eukaryotic nuclear receptor coactivators and forms an alpha helical structure.
  • pfam SRC-1 79aa 7e-10 in ref transcript
    • Steroid receptor coactivator. This domain is found in steroid/nuclear receptor coactivators and contains two LXXLL motifs that are involved in receptor binding. The family includes SRC-1/NcoA-1, NcoA-2/TIF2, pCIP/ACTR/GRIP-1/AIB1.
  • smart PAS 58aa 8e-09 in ref transcript
    • PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
  • smart HLH 55aa 3e-06 in ref transcript
    • helix loop helix domain.
  • pfam DUF1518 38aa 0.001 in ref transcript
    • Domain of unknown function (DUF1518). This domain, which is usually found tandemly repeated, is found various receptor co-activating proteins.

NDEL1

  • refseq_NDEL1.F2 refseq_NDEL1.R2 130 165
  • NCBIGene 36.3 81565
  • Single exon skipping, size difference: 35
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_030808

  • pfam NUDE_C 181aa 5e-32 in ref transcript
    • NUDE protein, C-terminal conserved region. This family represents the C-terminal conserved region of the NUDE proteins. NUDE proteins are involved in nuclear migration.

NDRG2

  • refseq_NDRG2.F1 refseq_NDRG2.R1 140 182
  • NCBIGene 36.3 57447
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201537

  • pfam Ndr 279aa 1e-124 in ref transcript
    • Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
  • COG MhpC 259aa 3e-05 in ref transcript
    • Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].

NDRG2

  • refseq_NDRG2.F3 refseq_NDRG2.R3 106 173
  • NCBIGene 36.3 57447
  • Single exon skipping, size difference: 67
  • Exclusion in 5'UTR
  • Reference transcript: NM_201540

  • pfam Ndr 279aa 1e-124 in ref transcript
    • Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
  • COG MhpC 259aa 3e-05 in ref transcript
    • Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].

NDRG2

  • refseq_NDRG2.F5 refseq_NDRG2.R3 130 197
  • NCBIGene 36.3 57447
  • Single exon skipping, size difference: 67
  • Exclusion in 5'UTR
  • Reference transcript: NM_201535

  • pfam Ndr 279aa 1e-124 in ref transcript
    • Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
  • COG MhpC 259aa 3e-05 in ref transcript
    • Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].

NDRG3

  • refseq_NDRG3.F1 refseq_NDRG3.R1 119 155
  • NCBIGene 36.3 57446
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032013

  • pfam Ndr 286aa 1e-123 in ref transcript
    • Ndr family. This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted. The precise molecular and cellular function of members of this family is still unknown. Yet, they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases pfam00561, suggesting that this family may have an enzymatic function (Bateman A pers. obs.).
  • COG MhpC 275aa 4e-08 in ref transcript
    • Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) [General function prediction only].

NDUFB6

  • refseq_NDUFB6.F1 refseq_NDUFB6.R1 225 270
  • NCBIGene 36.3 4712
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002493

  • Changed! pfam NDUF_B6 128aa 2e-45 in ref transcript
    • NADH:ubiquinone oxidoreductase, NDUFB6/B17 subunit. Members of this family mediate the transfer of electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone, the reaction that occurs being: NADH + ubiquinone = NAD(+) + ubiquinol.
  • Changed! pfam NDUF_B6 113aa 2e-36 in modified transcript

NEBL

  • refseq_NEBL.F1 refseq_NEBL.R1 131 374
  • NCBIGene 36.3 10529
  • Multiple exon skipping, size difference: 243
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006393

  • cd SH3 54aa 1e-11 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart SH3 57aa 2e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • pfam Nebulin 29aa 4e-04 in ref transcript
    • Nebulin repeat.
  • smart NEBU 30aa 7e-04 in ref transcript
    • The Nebulin repeat is present also in Las1. Tandem arrays of these repeats are known to bind actin.
  • pfam Nebulin 28aa 0.002 in ref transcript
  • smart NEBU 26aa 0.003 in ref transcript
  • pfam Nebulin 29aa 0.005 in ref transcript

NF2

  • refseq_NF2.F1 refseq_NF2.R1 217 343
  • NCBIGene 36.3 4771
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000268

  • cd FERM_C 93aa 2e-22 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • Changed! smart B41 204aa 6e-53 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam ERM 234aa 1e-35 in ref transcript
    • Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins.
  • pfam FERM_C 87aa 1e-32 in ref transcript
    • FERM C-terminal PH-like domain.
  • Changed! smart B41 143aa 9e-36 in modified transcript

NF2

  • refseq_NF2.F2 refseq_NF2.R2 100 223
  • NCBIGene 36.3 4771
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000268

  • cd FERM_C 93aa 2e-22 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • Changed! smart B41 204aa 6e-53 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam ERM 234aa 1e-35 in ref transcript
    • Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins.
  • pfam FERM_C 87aa 1e-32 in ref transcript
    • FERM C-terminal PH-like domain.
  • Changed! smart B41 163aa 2e-38 in modified transcript

NFAT5

  • refseq_NFAT5.F1 refseq_NFAT5.R1 262 324
  • NCBIGene 36.3 10725
  • Single exon skipping, size difference: 62
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_138713

  • Changed! cd IPT_NFAT 100aa 5e-36 in ref transcript
    • IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
  • Changed! pfam RHD 158aa 9e-20 in ref transcript
    • Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
  • Changed! smart IPT 98aa 4e-08 in ref transcript
    • ig-like, plexins, transcription factors.
  • Changed! pfam PAT1 225aa 2e-04 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.

NFAT5

  • refseq_NFAT5.F3 refseq_NFAT5.R3 112 166
  • NCBIGene 36.3 10725
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138713

  • cd IPT_NFAT 100aa 5e-36 in ref transcript
    • IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
  • pfam RHD 158aa 9e-20 in ref transcript
    • Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
  • smart IPT 98aa 4e-08 in ref transcript
    • ig-like, plexins, transcription factors.
  • pfam PAT1 225aa 2e-04 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.

NFATC1

  • refseq_NFATC1.F1 refseq_NFATC1.R1 181 488
  • NCBIGene 36.3 4772
  • Alternative 5-prime, size difference: 307
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172387

  • cd IPT_NFAT 101aa 5e-33 in ref transcript
    • IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
  • pfam RHD 161aa 6e-30 in ref transcript
    • Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
  • pfam TIG 98aa 8e-09 in ref transcript
    • IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.

NFATC2

  • refseq_NFATC2.F2 refseq_NFATC2.R2 230 318
  • NCBIGene 36.3 4773
  • Single exon skipping, size difference: 88
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_173091

  • cd IPT_NFAT 101aa 1e-40 in ref transcript
    • IPT domain of the NFAT family of transcription factors. NFAT transcription complexes are a target of calcineurin, a calcium dependent phosphatase, and activate genes mainly involved in cell-cell-interaction.
  • pfam RHD 161aa 9e-26 in ref transcript
    • Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
  • smart IPT 100aa 3e-10 in ref transcript
    • ig-like, plexins, transcription factors.

NFIC

  • refseq_NFIC.F1 refseq_NFIC.R1 107 347
  • NCBIGene 36.3 4782
  • Multiple exon skipping, size difference: 240
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_205843

  • cd MH1 114aa 5e-17 in ref transcript
    • MH1 is a small DNA binding domain, binding in an unusal way involving a beta hairpin structure binding to the major groove. MH1 is present in Smad proteins, an important family of proteins involved in TGF-beta signalling and frequent targets of tumorigenic mutations. Also known as Domain A in dwarfin family proteins.
  • Changed! pfam CTF_NFI 292aa 1e-107 in ref transcript
    • CTF/NF-I family transcription modulation region.
  • pfam NfI_DNAbd_pre-N 38aa 5e-19 in ref transcript
    • Nuclear factor I protein pre-N-terminus. The Nuclear factor I (NFI) family of site-specific DNA-binding proteins (also known as CTF or CAAT box transcription factor) functions both in viral DNA replication and in the regulation of gene expression in higher organisms. The N-terminal 200 residues contains the DNA-binding and dimerisation domain, but also has an 8-47 residue highly conserved region 5' of this, whose function is not known. Deletion of the N-terminal 200 amino acids removes the DNA-binding activity, dimerisation-ability and the stimulation of adenovirus DNA replication.
  • smart DWA 102aa 9e-15 in ref transcript
    • Domain A in dwarfin family proteins.
  • Changed! pfam CTF_NFI 207aa 2e-85 in modified transcript

NFKBIB

  • refseq_NFKBIB.F2 refseq_NFKBIB.R2 213 269
  • NCBIGene 36.3 4793
  • Alternative 3-prime, size difference: 56
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002503

  • Changed! cd ANK 97aa 3e-15 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd ANK 86aa 7e-12 in ref transcript
  • Changed! cd ANK 139aa 2e-07 in ref transcript
  • Changed! TIGR trp 68aa 2e-07 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! COG Arp 85aa 2e-09 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! COG Arp 94aa 7e-09 in ref transcript
  • Changed! COG Arp 210aa 6e-06 in ref transcript

NFYA

  • refseq_NFYA.F1 refseq_NFYA.R1 212 299
  • NCBIGene 36.3 4800
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002505

  • smart CBF 60aa 9e-27 in ref transcript
    • CCAAT-Binding transcription Factor.
  • COG HAP2 64aa 1e-10 in ref transcript
    • CCAAT-binding factor, subunit B [Transcription].

NGFRAP1

  • refseq_NGFRAP1.F2 refseq_NGFRAP1.R2 241 288
  • NCBIGene 36.3 27018
  • Alternative 3-prime, size difference: 47
  • Exclusion of the protein initiation site
  • Reference transcript: NM_206915

  • Changed! pfam BEX 91aa 2e-20 in ref transcript
    • Brain expressed X-linked like family.
  • Changed! pfam BEX 100aa 3e-22 in modified transcript

ISCU

  • refseq_NIFUN.F1 refseq_NIFUN.R1 125 221
  • NCBIGene 36.3 23479
  • Single exon skipping, size difference: 96
  • Exclusion of the protein initiation site
  • Reference transcript: NM_014301

  • Changed! TIGR iscU 123aa 1e-60 in ref transcript
    • This model represents IscU, a homolog of the N-terminal region of NifU, an Fe-S cluster assembly protein found mostly in nitrogen-fixing bacteria. IscU is considered part of the IscSUA-hscAB-fdx system of Fe-S assembly, whereas NifU is found in nitrogenase-containing (nitrogen-fixing) species. A NifU-type protein is also found in Helicobacter and Campylobacter. IscU and NifU are considered scaffold proteins on which Fe-S clusters are assembled before transfer to apoproteins. This model excludes true NifU proteins as in Klebsiella pneumoniae and Anabaena sp. as well as archaeal homologs. It includes largely proteobacterial and eukaryotic forms.
  • Changed! PRK PRK11325 123aa 1e-66 in ref transcript
    • scaffold protein; Provisional.
  • Changed! TIGR iscU 86aa 9e-40 in modified transcript
  • Changed! PRK PRK11325 86aa 2e-43 in modified transcript

NIPA2

  • refseq_NIPA2.F1 refseq_NIPA2.R1 125 247
  • NCBIGene 36.3 81614
  • Single exon skipping, size difference: 122
  • Exclusion in 5'UTR
  • Reference transcript: NM_030922

  • pfam DUF803 301aa 7e-96 in ref transcript
    • Protein of unknown function (DUF803). This family consists of several eukaryotic proteins of unknown function.

NIPA2

  • refseq_NIPA2.F4 refseq_NIPA2.R4 189 246
  • NCBIGene 36.3 81614
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008860

  • Changed! pfam DUF803 301aa 7e-96 in ref transcript
    • Protein of unknown function (DUF803). This family consists of several eukaryotic proteins of unknown function.
  • Changed! pfam DUF803 224aa 2e-81 in modified transcript

NKTR

  • refseq_NKTR.F2 refseq_NKTR.R2 139 167
  • NCBIGene 36.2 4820
  • Single exon skipping, size difference: 28
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_005385

  • Changed! cd cyclophilin_ABH_like 168aa 8e-58 in ref transcript
    • cyclophilin_ABH_like: Cyclophilin A, B and H-like cyclophilin-type peptidylprolyl cis- trans isomerase (PPIase) domain. This family represents the archetypal cystolic cyclophilin similar to human cyclophilins A, B and H. PPIase is an enzyme which accelerates protein folding by catalyzing the cis-trans isomerization of the peptide bonds preceding proline residues. These enzymes have been implicated in protein folding processes which depend on catalytic /chaperone-like activities. As cyclophilins, Human hCyP-A, human cyclophilin-B (hCyP-19), S. cerevisiae Cpr1 and C. elegans Cyp-3, are inhibited by the immunosuppressive drug cyclopsporin A (CsA). CsA binds to the PPIase active site. Cyp-3. S. cerevisiae Cpr1 interacts with the Rpd3 - Sin3 complex and in addition is a component of the Set3 complex. S. cerevisiae Cpr1 has also been shown to have a role in Zpr1p nuclear transport. Human cyclophilin H associates with the [U4/U6.U5] tri-snRNP particles of the splicesome.
  • Changed! pfam Pro_isomerase 169aa 8e-72 in ref transcript
    • Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
  • Changed! PTZ PTZ00060 171aa 1e-51 in ref transcript
    • cyclophilin; Provisional.
  • Changed! cd cyclophilin_ABH_like 88aa 1e-22 in modified transcript
  • Changed! pfam Pro_isomerase 89aa 4e-27 in modified transcript
  • Changed! PTZ PTZ00060 91aa 7e-22 in modified transcript

NKTR

  • refseq_NKTR.F3 refseq_NKTR.R3 148 176
  • NCBIGene 36.2 4820
  • Single exon skipping, size difference: 28
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_005385

  • Changed! cd cyclophilin_ABH_like 168aa 8e-58 in ref transcript
    • cyclophilin_ABH_like: Cyclophilin A, B and H-like cyclophilin-type peptidylprolyl cis- trans isomerase (PPIase) domain. This family represents the archetypal cystolic cyclophilin similar to human cyclophilins A, B and H. PPIase is an enzyme which accelerates protein folding by catalyzing the cis-trans isomerization of the peptide bonds preceding proline residues. These enzymes have been implicated in protein folding processes which depend on catalytic /chaperone-like activities. As cyclophilins, Human hCyP-A, human cyclophilin-B (hCyP-19), S. cerevisiae Cpr1 and C. elegans Cyp-3, are inhibited by the immunosuppressive drug cyclopsporin A (CsA). CsA binds to the PPIase active site. Cyp-3. S. cerevisiae Cpr1 interacts with the Rpd3 - Sin3 complex and in addition is a component of the Set3 complex. S. cerevisiae Cpr1 has also been shown to have a role in Zpr1p nuclear transport. Human cyclophilin H associates with the [U4/U6.U5] tri-snRNP particles of the splicesome.
  • Changed! pfam Pro_isomerase 169aa 8e-72 in ref transcript
    • Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
  • Changed! PTZ PTZ00060 171aa 1e-51 in ref transcript
    • cyclophilin; Provisional.
  • Changed! cd cyclophilin_ABH_like 126aa 2e-39 in modified transcript
  • Changed! pfam Pro_isomerase 121aa 1e-45 in modified transcript
  • Changed! PTZ PTZ00060 126aa 2e-38 in modified transcript

NME1

  • refseq_NME1.F2 refseq_NME1.R2 167 387
  • NCBIGene 36.3 4830
  • Single exon skipping, size difference: 220
  • Exclusion of the protein initiation site
  • Reference transcript: NM_198175

  • cd NDPk_I 130aa 1e-65 in ref transcript
    • Nucleoside diphosphate kinase Group I (NDPk_I)-like: NDP kinase domains are present in a large family of structurally and functionally conserved proteins from bacteria to humans that generally catalyze the transfer of gamma-phosphates of a nucleoside triphosphate (NTP) donor onto a nucleoside diphosphate (NDP) acceptor through a phosphohistidine intermediate. The mammalian nm23/NDP kinase gene family can be divided into two distinct groups. The group I genes encode proteins that generally have highly homologous counterparts in other organisms and possess the classic enzymatic activity of a kinase. This group includes vertebrate NDP kinases A-D (Nm23- H1 to -H4), and its counterparts in bacteria, archea and other eukaryotes. NDP kinases exist in two different quaternary structures; all known eukaryotic enzymes are hexamers, while some bacterial enzymes are tetramers, as in Myxococcus. They possess the NDP kinase active site motif (NXXH[G/A]SD) and the nine residues that are most essential for catalysis.
  • pfam NDK 135aa 2e-72 in ref transcript
    • Nucleoside diphosphate kinase.
  • PTZ PTZ00093 147aa 2e-72 in ref transcript
    • nucleoside diphosphate kinase; Provisional.

NME7

  • refseq_NME7.F1 refseq_NME7.R1 141 172
  • NCBIGene 36.3 29922
  • Alternative 3-prime, size difference: 31
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_013330

  • Changed! cd NDPk7A 131aa 3e-63 in ref transcript
    • Nucleoside diphosphate kinase 7 domain A (NDPk7A): The nm23-H7 class of nucleoside diphosphate kinase (NDPk7) consists of an N-terminal DM10 domain and two functional catalytic NDPk modules, NDPk7A and NDPk7B. The function of the DM10 domain, which also occurs in multiple copies in other proteins, is unknown. NDPk7 is predominantly expressed in testes, although appreciable amount are also found in liver, heart, brain, ovary, small intestine and spleen. The nm23-H7 gene is located in or near the hereditary prostrate cancer susceptibility locus. Nm23-H7 may be involved in the development of colon and gastric carcinoma, the latter possibly in a type-specific manner.
  • Changed! cd NDPk7B 134aa 3e-62 in ref transcript
    • Nucleoside diphosphate kinase 7 domain B (NDPk7B): The nm23-H7 class of nucleoside diphosphate kinase (NDPk7) consists of an N-terminal DM10 domain and two functional catalytic NDPk modules, NDPk7A and NDPk7B. The function of the DM10 domain, which also occurs in multiple copies in other proteins, is unknown. NDPk7 is predominantly expressed in testes, although appreciable amount are also found in liver, heart, brain, ovary, small intestine and spleen. The nm23-H7 gene is located in or near the hereditary prostrate cancer susceptibility locus. Nm23-H7 may be involved in the development of colon and gastric carcinoma, the latter possibly in a type-specific manner.
  • Changed! smart NDK 134aa 4e-47 in ref transcript
    • These are enzymes that catalyze nonsubstrate specific conversions of nucleoside diphosphates to nucleoside triphosphates. These enzymes play important roles in bacterial growth, signal transduction and pathogenicity.
  • Changed! smart NDK 138aa 1e-44 in ref transcript
  • Changed! smart DM10 89aa 6e-25 in ref transcript
    • Domains in hypothetical proteins in Drosophila, C. elegans and mammals. Occurs singly in some nucleoside diphosphate kinases.
  • Changed! COG Ndk 134aa 4e-31 in ref transcript
    • Nucleoside diphosphate kinase [Nucleotide transport and metabolism].
  • Changed! COG Ndk 133aa 3e-22 in ref transcript

NNAT

  • refseq_NNAT.F1 refseq_NNAT.R1 316 397
  • NCBIGene 36.3 4826
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005386

NOL1

  • refseq_NOL1.F1 refseq_NOL1.R1 116 184
  • NCBIGene 36.3 4839
  • Alternative 5-prime, size difference: 68
  • Exclusion in 5'UTR
  • Reference transcript: NM_001033714

  • cd AdoMet_MTases 129aa 1e-05 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • pfam Nol1_Nop2_Fmu 286aa 8e-92 in ref transcript
    • NOL1/NOP2/sun family.
  • COG Sun 330aa 2e-79 in ref transcript
    • tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis].

NOLA1

  • refseq_NOLA1.F1 refseq_NOLA1.R1 117 376
  • NCBIGene 36.3 54433
  • Alternative 5-prime, size difference: 259
  • Exclusion in 5'UTR
  • Reference transcript: NM_018983

  • pfam Gar1 104aa 3e-36 in ref transcript
    • Gar1 protein RNA binding region. Gar1 is a small nucleolar RNP that is required for pre-mRNA processing and pseudouridylation. It is co-immunoprecipitated with the H/ACA families of snoRNAs. This family represents the conserved central region of Gar1. This region is necessary and sufficient for normal cell growth, and specifically binds two snoRNAs snR10 and snR30. This region is also necessary for nucleolar targeting, and it is thought that the protein is co-transported to the nucleolus as part of a nucleoprotein complex. In humans, Gar1 is also component of telomerase in vivo.
  • COG GAR1 78aa 4e-12 in ref transcript
    • RNA-binding protein involved in rRNA processing [Translation, ribosomal structure and biogenesis].

NOLA2

  • refseq_NOLA2.F2 refseq_NOLA2.R2 269 375
  • NCBIGene 36.3 55651
  • Single exon skipping, size difference: 106
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017838

  • Changed! pfam Ribosomal_L7Ae 94aa 1e-18 in ref transcript
    • Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from eukaryotes and archaebacteria. Gadd45 and MyD118.
  • Changed! COG RPL8A 114aa 2e-16 in ref transcript
    • Ribosomal protein HS6-type (S12/L30/L7a) [Translation, ribosomal structure and biogenesis].
  • Changed! pfam Ribosomal_L7Ae 32aa 0.003 in modified transcript

NOSTRIN

  • refseq_NOSTRIN.F1 refseq_NOSTRIN.R1 122 208
  • NCBIGene 36.3 115677
  • Single exon skipping, size difference: 86
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001039724

  • Changed! cd SH3 52aa 1e-14 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! smart SH3 57aa 9e-17 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

NOX1

  • refseq_NOX1.F1 refseq_NOX1.R1 165 312
  • NCBIGene 36.3 27035
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007052

  • Changed! cd NOX_Duox_like_FAD_NADP 268aa 3e-42 in ref transcript
    • NADPH oxidase (NOX) catalyzes the generation of reactive oxygen species (ROS) such as superoxide and hydrogen peroxide. ROS were originally identified as bactericidal agents in phagocytes, but are now also implicated in cell signaling and metabolism. NOX has a 6-alpha helix heme-binding transmembrane domain fused to a flavoprotein with the nucleotide binding domain located in the cytoplasm. Duox enzymes link a peroxidase domain to the NOX domain via a single transmembrane and EF-hand Ca2+ binding sites. The flavoprotein module has a ferredoxin like FAD/NADPH binding domain. In classical phagocytic NOX2, electron transfer occurs from NADPH to FAD to the heme of cytb to oxygen leading to superoxide formation.
  • Changed! pfam NAD_binding_6 150aa 5e-27 in ref transcript
    • Ferric reductase NAD binding domain.
  • pfam FAD_binding_8 95aa 6e-21 in ref transcript
    • FAD-binding domain.
  • pfam Ferric_reduct 156aa 2e-18 in ref transcript
    • Ferric reductase like transmembrane component. This family includes a common region in the transmembrane proteins mammalian cytochrome B-245 heavy chain (gp91-phox), ferric reductase transmembrane component in yeast and respiratory burst oxidase from mouse-ear cress. This may be a family of flavocytochromes capable of moving electrons across the plasma membrane. The Frp1 protein from S. pombe is a ferric reductase component and is required for cell surface ferric reductase activity, mutants in frp1 are deficient in ferric iron uptake. Cytochrome B-245 heavy chain is a FAD-dependent dehydrogenase it is also has electron transferase activity which reduces molecular oxygen to superoxide anion, a precursor in the production of microbicidal oxidants. Mutations in the sequence of cytochrome B-245 heavy chain (gp91-phox) lead to the X-linked chronic granulomatous disease. The bacteriocidal ability of phagocytic cells is reduced and is characterised by the absence of a functional plasma membrane associated NADPH oxidase. The chronic granulomatous disease gene codes for the beta chain of cytochrome B-245 and cytochrome B-245 is missing from patients with the disease.
  • Changed! COG COG4097 263aa 9e-09 in ref transcript
    • Predicted ferric reductase [Inorganic ion transport and metabolism].
  • Changed! cd NOX_Duox_like_FAD_NADP 219aa 6e-34 in modified transcript
  • Changed! pfam NAD_binding_6 101aa 4e-12 in modified transcript
  • Changed! COG COG4097 214aa 6e-10 in modified transcript

NPAS3

  • refseq_NPAS3.F1 refseq_NPAS3.R1 147 204
  • NCBIGene 36.3 64067
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 57
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_173159

  • cd PAS 81aa 1e-09 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd PAS 53aa 3e-07 in ref transcript
  • cd HLH 58aa 3e-06 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam PAS_3 69aa 1e-10 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • smart PAS 59aa 6e-09 in ref transcript
    • PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
  • smart HLH 46aa 1e-05 in ref transcript
    • helix loop helix domain.

NPTN

  • refseq_NPTN.F1 refseq_NPTN.R1 103 451
  • NCBIGene 36.3 27020
  • Single exon skipping, size difference: 348
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012428

  • cd IGcam 77aa 6e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! cd IGcam 89aa 1e-05 in ref transcript
  • pfam I-set 96aa 5e-10 in ref transcript
    • Immunoglobulin I-set domain.
  • Changed! pfam I-set 88aa 1e-07 in ref transcript

NR1I2

  • refseq_NR1I2.F1 refseq_NR1I2.R1 265 376
  • NCBIGene 36.3 8856
  • Alternative 3-prime, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022002

  • Changed! cd NR_LBD_PXR_like 193aa 1e-102 in ref transcript
    • The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • Changed! cd NR_LBD_PXR_like 33aa 1e-09 in ref transcript
  • pfam Hormone_recep 181aa 2e-32 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • pfam zf-C4 74aa 1e-30 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • Changed! cd NR_LBD_PXR_like 249aa 1e-113 in modified transcript

NR4A1

  • refseq_NR4A1.F1 refseq_NR4A1.R1 140 290
  • NCBIGene 36.3 3164
  • Single exon skipping, size difference: 150
  • Exclusion in 5'UTR
  • Reference transcript: NM_002135

  • cd NR_LBD_NGFI-B 238aa 1e-138 in ref transcript
    • The ligand binding domain of Nurr1, a member of conserved family of nuclear receptors. The ligand binding domain of Nerve growth factor-induced-B (NGFI-B): NGFI-B is a member of the nuclear#steroid receptor superfamily. NGFI-B is classified as an orphan receptor because no ligand has yet been identified. NGFI-B is an early immediate gene product of the embryo development that is rapidly produced in response to a variety of cellular signals including nerve growth factor. It is involved in T-cell-mediated apoptosis, as well as neuronal differentiation and function. NGFI-B regulates transcription by binding to a specific DNA target upstream of its target genes and regulating the rate of transcriptional initiation. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, NGFI-B has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam Hormone_recep 179aa 4e-36 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • pfam zf-C4 74aa 1e-28 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.

NR4A2

  • refseq_NR4A2.F1 refseq_NR4A2.R1 162 216
  • NCBIGene 36.3 4929
  • Alternative 3-prime, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006186

  • Changed! cd NR_LBD_Nurr1 235aa 1e-137 in ref transcript
    • The ligand binding domain of Nurr1, a member of conserved family of nuclear receptors. The ligand binding domain of nuclear receptor Nurr1: Nurr1 belongs to the conserved family of nuclear receptors. It is a transcription factor that is expressed in the embryonic ventral midbrain and is critical for the development of dopamine (DA) neurons. Structural studies have shown that the ligand binding pocket of Nurr1 is filled by bulky hydrophobic residues, making it unable to bind to ligands. Therefore, it belongs to the class of orphan receptors. However, Nurr1 forms heterodimers with RXR and can promote signaling via its partner, RXR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, Nurr1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 75aa 1e-39 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • Changed! pfam Hormone_recep 175aa 8e-34 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • Changed! cd NR_LBD_Nurr1 217aa 1e-122 in modified transcript
  • Changed! pfam Hormone_recep 157aa 2e-28 in modified transcript

NR4A2

  • refseq_NR4A2.F2 refseq_NR4A2.R2 115 293
  • NCBIGene 36.3 4929
  • Alternative 3-prime, size difference: 178
  • Exclusion of the protein initiation site
  • Reference transcript: NM_006186

  • cd NR_LBD_Nurr1 235aa 1e-137 in ref transcript
    • The ligand binding domain of Nurr1, a member of conserved family of nuclear receptors. The ligand binding domain of nuclear receptor Nurr1: Nurr1 belongs to the conserved family of nuclear receptors. It is a transcription factor that is expressed in the embryonic ventral midbrain and is critical for the development of dopamine (DA) neurons. Structural studies have shown that the ligand binding pocket of Nurr1 is filled by bulky hydrophobic residues, making it unable to bind to ligands. Therefore, it belongs to the class of orphan receptors. However, Nurr1 forms heterodimers with RXR and can promote signaling via its partner, RXR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, Nurr1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 75aa 1e-39 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • pfam Hormone_recep 175aa 8e-34 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.

NR5A2

  • refseq_NR5A2.F1 refseq_NR5A2.R1 124 262
  • NCBIGene 36.3 2494
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_205860

  • cd NR_LBD_Lrh-1 241aa 1e-140 in ref transcript
    • The ligand binding domain of the liver receptor homolog-1, a member of nuclear receptor superfamily,. The ligand binding domain (LBD) of the liver receptor homolog-1 (LRH-1): LRH-1 belongs to nuclear hormone receptor superfamily, and is expressed mainly in the liver, intestine, exocrine pancreas, and ovary. Most nuclear receptors function as homodimer or heterodimers. However, LRH-1 binds DNA as a monomer, and is a regulator of bile-acid homeostasis, steroidogenesis, reverse cholesterol transport and the initial stages of embryonic development. Recently, phospholipids have been identified as potential ligand for LRH-1 and steroidogenic factor-1 (SF-1). Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, LRH-1 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam Hormone_recep 186aa 1e-41 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • pfam zf-C4 76aa 4e-38 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.

NRAP

  • refseq_NRAP.F2 refseq_NRAP.R2 124 229
  • NCBIGene 36.3 4892
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198060

  • pfam LIM 52aa 2e-08 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • smart NEBU 31aa 7e-05 in ref transcript
    • The Nebulin repeat is present also in Las1. Tandem arrays of these repeats are known to bind actin.
  • smart NEBU 31aa 7e-04 in ref transcript
  • smart NEBU 31aa 0.001 in ref transcript
  • smart NEBU 31aa 0.002 in ref transcript
  • pfam Nebulin 29aa 0.002 in ref transcript
    • Nebulin repeat.
  • smart NEBU 31aa 0.003 in ref transcript
  • smart NEBU 31aa 0.005 in ref transcript

NRCAM

  • refseq_NRCAM.F2 refseq_NRCAM.R2 150 180
  • NCBIGene 36.3 4897
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037132

  • cd IGcam 94aa 7e-14 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 88aa 5e-13 in ref transcript
  • Changed! cd FN3 90aa 8e-12 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 88aa 4e-10 in ref transcript
  • cd IGcam 89aa 4e-10 in ref transcript
  • cd FN3 96aa 6e-10 in ref transcript
  • cd FN3 100aa 5e-09 in ref transcript
  • cd IGcam 74aa 4e-07 in ref transcript
  • cd FN3 88aa 4e-04 in ref transcript
  • cd FN3 77aa 0.004 in ref transcript
  • pfam I-set 89aa 2e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 89aa 1e-11 in ref transcript
  • smart IGc2 63aa 1e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! pfam fn3 85aa 5e-11 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 86aa 6e-11 in ref transcript
  • pfam fn3 93aa 4e-10 in ref transcript
  • pfam I-set 87aa 1e-09 in ref transcript
  • pfam I-set 71aa 2e-09 in ref transcript
  • pfam fn3 83aa 4e-05 in ref transcript
  • pfam fn3 78aa 2e-04 in ref transcript
  • Changed! cd FN3 91aa 1e-12 in modified transcript
  • Changed! pfam fn3 87aa 1e-11 in modified transcript

NRCAM

  • refseq_NRCAM.F4 refseq_NRCAM.R4 102 120
  • NCBIGene 36.3 4897
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037132

  • cd IGcam 94aa 7e-14 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 88aa 5e-13 in ref transcript
  • cd FN3 90aa 8e-12 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 88aa 4e-10 in ref transcript
  • cd IGcam 89aa 4e-10 in ref transcript
  • cd FN3 96aa 6e-10 in ref transcript
  • cd FN3 100aa 5e-09 in ref transcript
  • cd IGcam 74aa 4e-07 in ref transcript
  • cd FN3 88aa 4e-04 in ref transcript
  • cd FN3 77aa 0.004 in ref transcript
  • pfam I-set 89aa 2e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 89aa 1e-11 in ref transcript
  • smart IGc2 63aa 1e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 85aa 5e-11 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 86aa 6e-11 in ref transcript
  • pfam fn3 93aa 4e-10 in ref transcript
  • pfam I-set 87aa 1e-09 in ref transcript
  • pfam I-set 71aa 2e-09 in ref transcript
  • pfam fn3 83aa 4e-05 in ref transcript
  • pfam fn3 78aa 2e-04 in ref transcript

NRCAM

  • refseq_NRCAM.F5 refseq_NRCAM.R5 153 468
  • NCBIGene 36.3 4897
  • Multiple exon skipping, size difference: 315
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037132

  • cd IGcam 94aa 7e-14 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 88aa 5e-13 in ref transcript
  • cd FN3 90aa 8e-12 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 88aa 4e-10 in ref transcript
  • cd IGcam 89aa 4e-10 in ref transcript
  • cd FN3 96aa 6e-10 in ref transcript
  • cd FN3 100aa 5e-09 in ref transcript
  • cd IGcam 74aa 4e-07 in ref transcript
  • cd FN3 88aa 4e-04 in ref transcript
  • Changed! cd FN3 77aa 0.004 in ref transcript
  • pfam I-set 89aa 2e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 89aa 1e-11 in ref transcript
  • smart IGc2 63aa 1e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 85aa 5e-11 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 86aa 6e-11 in ref transcript
  • pfam fn3 93aa 4e-10 in ref transcript
  • pfam I-set 87aa 1e-09 in ref transcript
  • pfam I-set 71aa 2e-09 in ref transcript
  • pfam fn3 83aa 4e-05 in ref transcript
  • Changed! pfam fn3 78aa 2e-04 in ref transcript

NRXN2

  • refseq_NRXN2.F2 refseq_NRXN2.R2 153 180
  • NCBIGene 36.3 9379
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015080

  • cd LamG 154aa 3e-24 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 171aa 3e-20 in ref transcript
  • cd LamG 168aa 1e-19 in ref transcript
  • cd LamG 174aa 2e-12 in ref transcript
  • cd LamG 157aa 4e-11 in ref transcript
  • Changed! cd LamG 157aa 6e-11 in ref transcript
  • cd EGF_CA 34aa 0.001 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart LamG 136aa 2e-28 in ref transcript
    • Laminin G domain.
  • smart LamG 151aa 2e-24 in ref transcript
  • smart LamG 147aa 8e-24 in ref transcript
  • Changed! pfam Laminin_G_2 128aa 6e-16 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • smart LamG 154aa 3e-13 in ref transcript
  • smart LamG 139aa 2e-11 in ref transcript
  • pfam FAP 88aa 0.009 in ref transcript
    • Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
  • Changed! cd LamG 148aa 7e-14 in modified transcript
  • Changed! smart LamG 128aa 2e-18 in modified transcript

NRXN2

  • refseq_NRXN2.F3 refseq_NRXN2.R3 229 301
  • NCBIGene 36.3 9379
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015080

  • cd LamG 154aa 3e-24 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 171aa 3e-20 in ref transcript
  • cd LamG 168aa 1e-19 in ref transcript
  • cd LamG 174aa 2e-12 in ref transcript
  • cd LamG 157aa 4e-11 in ref transcript
  • cd LamG 157aa 6e-11 in ref transcript
  • cd EGF_CA 34aa 0.001 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart LamG 136aa 2e-28 in ref transcript
    • Laminin G domain.
  • smart LamG 151aa 2e-24 in ref transcript
  • smart LamG 147aa 8e-24 in ref transcript
  • pfam Laminin_G_2 128aa 6e-16 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • smart LamG 154aa 3e-13 in ref transcript
  • smart LamG 139aa 2e-11 in ref transcript
  • pfam FAP 88aa 0.009 in ref transcript
    • Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.

NRXN2

  • refseq_NRXN2.F5 refseq_NRXN2.R5 220 310
  • NCBIGene 36.3 9379
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015080

  • cd LamG 154aa 3e-24 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 171aa 3e-20 in ref transcript
  • cd LamG 168aa 1e-19 in ref transcript
  • Changed! cd LamG 174aa 2e-12 in ref transcript
  • cd LamG 157aa 4e-11 in ref transcript
  • cd LamG 157aa 6e-11 in ref transcript
  • cd EGF_CA 34aa 0.001 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart LamG 136aa 2e-28 in ref transcript
    • Laminin G domain.
  • smart LamG 151aa 2e-24 in ref transcript
  • smart LamG 147aa 8e-24 in ref transcript
  • pfam Laminin_G_2 128aa 6e-16 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • Changed! smart LamG 154aa 3e-13 in ref transcript
  • smart LamG 139aa 2e-11 in ref transcript
  • Changed! pfam FAP 88aa 0.009 in ref transcript
    • Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
  • Changed! cd LamG 144aa 2e-14 in modified transcript
  • Changed! smart LamG 124aa 8e-17 in modified transcript

NRXN2

  • refseq_NRXN2.F7 refseq_NRXN2.R7 102 123
  • NCBIGene 36.3 9379
  • Alternative 5-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015080

  • cd LamG 154aa 3e-24 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 171aa 3e-20 in ref transcript
  • Changed! cd LamG 168aa 1e-19 in ref transcript
  • cd LamG 174aa 2e-12 in ref transcript
  • cd LamG 157aa 4e-11 in ref transcript
  • cd LamG 157aa 6e-11 in ref transcript
  • cd EGF_CA 34aa 0.001 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart LamG 136aa 2e-28 in ref transcript
    • Laminin G domain.
  • smart LamG 151aa 2e-24 in ref transcript
  • Changed! smart LamG 147aa 8e-24 in ref transcript
  • pfam Laminin_G_2 128aa 6e-16 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • smart LamG 154aa 3e-13 in ref transcript
  • smart LamG 139aa 2e-11 in ref transcript
  • pfam FAP 88aa 0.009 in ref transcript
    • Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.
  • Changed! cd LamG 161aa 1e-20 in modified transcript
  • Changed! smart LamG 140aa 1e-24 in modified transcript

NSFL1C

  • refseq_NSFL1C.F1 refseq_NSFL1C.R1 142 335
  • NCBIGene 36.3 55968
  • Alternative 3-prime, size difference: 95
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016143

  • Changed! cd p47_UBX 79aa 3e-32 in ref transcript
    • p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
  • Changed! smart SEP 93aa 2e-38 in ref transcript
    • Domain present in Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47.
  • Changed! pfam UBX 80aa 2e-15 in ref transcript
    • UBX domain. This domain is present in ubiquitin-regulatory proteins and is a general Cdc48-interacting module.

NSFL1C

  • refseq_NSFL1C.F4 refseq_NSFL1C.R4 102 195
  • NCBIGene 36.3 55968
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016143

  • cd p47_UBX 79aa 3e-32 in ref transcript
    • p47_UBX p47 is an adaptor molecule of the cytosolic AAA ATPase p97. The principal role of the p97-p47 complex is to regulate membrane fusion events. Mono-ubiquitin recognition by p47 is crucial for p97-p47-mediated Golgi membrane fusion events. p47 has carboxy-terminal SEP and UBX domains. The UBX domain has a beta-grasp fold similar to that of ubiquitin however, UBX lacks the c-terminal double glycine motif and is thus unlikely to be conjugated to other proteins.
  • Changed! smart SEP 93aa 2e-38 in ref transcript
    • Domain present in Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47.
  • pfam UBX 80aa 2e-15 in ref transcript
    • UBX domain. This domain is present in ubiquitin-regulatory proteins and is a general Cdc48-interacting module.
  • Changed! smart SEP 92aa 9e-38 in modified transcript

NSUN5B

  • refseq_NSUN5B.F1 refseq_NSUN5B.R1 139 247
  • NCBIGene 36.3 155400
  • Alternative 5-prime, size difference: 108
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039575

  • TIGR nop2p 71aa 2e-04 in ref transcript
  • COG Sun 101aa 1e-09 in ref transcript
    • tRNA and rRNA cytosine-C5-methylases [Translation, ribosomal structure and biogenesis].

NT5C1B

  • refseq_NT5C1B.F1 refseq_NT5C1B.R1 151 331
  • NCBIGene 36.3 93034
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001002006

  • pfam 5-nucleotidase 261aa 1e-116 in ref transcript
    • 5'-nucleotidase. This family consists of both eukaryotic and prokaryotic 5'-nucleotidase sequences (EC:3.1.3.5).

NT5C3

  • refseq_NT5C3.F1 refseq_NT5C3.R1 243 298
  • NCBIGene 36.3 51251
  • Single exon skipping, size difference: 55
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001002010

  • Changed! TIGR HAD-SF-IE 275aa 1e-134 in ref transcript
    • This group of sequences was found during searches for members of the haloacid dehalogenase (HAD) superfamily. All of the conserved catalytic motifs are found. The placement of the variable domain between motifs 1 and 2 indicates membership in subfamily I of the superfamily, but these sequences are sufficiently different from any of the branches (IA, TIGR01493, TIGR01509, TIGR01549; IB, TIGR01488; IC, TIGR01494; ID, TIGR01658; IF TIGR01545) of that subfamily as to constitute a separate branch to now be called IE. Considering that the closest identifiable hit outside of the noise range is to a phosphoserine phosphatase, this group may be considered to be most closely allied to subfamily IB.
  • Changed! COG SerB 122aa 0.009 in ref transcript
    • Phosphoserine phosphatase [Amino acid transport and metabolism].

NTRK2

  • refseq_NTRK2.F2 refseq_NTRK2.R2 147 195
  • NCBIGene 36.3 4915
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006180

  • cd PTKc_TrkB 288aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Tropomyosin Related Kinase B. Protein Tyrosine Kinase (PTK) family; Tropomyosin related kinase B (TrkB); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. TrkB is a member of the Trk subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular region with arrays of leucine-rich motifs flanked by two cysteine-rich clusters followed by two immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. Binding of TrkB to its ligands, brain-derived neurotrophic factor (BDNF) or neurotrophin 4 (NT4), results in receptor oligomerization and activation of the catalytic domain. TrkB is broadly expressed in the nervous system and in some non-neural tissues. It plays important roles in cell proliferation, differentiation, and survival. BDNF/Trk signaling plays a key role in regulating activity-dependent synaptic plasticity. TrkB also contributes to protection against gp120-induced neuronal cell death. TrkB overexpression is associated with poor prognosis in neuroblastoma (NB) and other human cancers. It acts as a suppressor of anoikis (detachment-induced apoptosis) and contributes to tumor metastasis.
  • cd IGcam 79aa 4e-08 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 69aa 3e-04 in ref transcript
  • pfam Pkinase_Tyr 270aa 1e-115 in ref transcript
    • Protein tyrosine kinase.
  • pfam I-set 79aa 3e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 67aa 4e-06 in ref transcript
  • smart LRRCT 46aa 8e-05 in ref transcript
    • Leucine rich repeat C-terminal domain.
  • pfam LRRNT 30aa 0.006 in ref transcript
    • Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.
  • COG SPS1 262aa 4e-24 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

NTRK3

  • refseq_NTRK3.F1 refseq_NTRK3.R1 194 236
  • NCBIGene 36.3 4916
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012338

  • Changed! cd PTKc_TrkC 305aa 1e-179 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Tropomyosin Related Kinase C. Protein Tyrosine Kinase (PTK) family; Tropomyosin related kinase C (TrkC); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. TrkC is a member of the Trk subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular region with arrays of leucine-rich motifs flanked by two cysteine-rich clusters followed by two immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. Binding of TrkC to its ligand, neurotrophin 3 (NT3), results in receptor oligomerization and activation of the catalytic domain. TrkC is broadly expressed in the nervous system and in some non-neural tissues including the developing heart. NT3/TrkC signaling plays an important role in the innervation of the cardiac conducting system and the development of smooth muscle cells. Mice deficient with NT3 and TrkC have multiple heart defects. NT3/TrkC signaling is also critical for the development and maintenance of enteric neurons that are important for the control of gut peristalsis.
  • cd IGcam 89aa 7e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 89aa 2e-05 in ref transcript
  • Changed! pfam Pkinase_Tyr 287aa 1e-108 in ref transcript
    • Protein tyrosine kinase.
  • smart IG_like 85aa 2e-10 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart LRRCT 46aa 1e-07 in ref transcript
    • Leucine rich repeat C-terminal domain.
  • pfam I-set 68aa 5e-07 in ref transcript
    • Immunoglobulin I-set domain.
  • Changed! COG SPS1 283aa 3e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd PTKc_TrkC 291aa 0.0 in modified transcript
  • Changed! pfam Pkinase_Tyr 273aa 1e-111 in modified transcript
  • Changed! COG SPS1 269aa 1e-23 in modified transcript

NUDT1

  • refseq_NUDT1.F1 refseq_NUDT1.R1 113 186
  • NCBIGene 36.3 4521
  • Alternative 5-prime, size difference: 73
  • Exclusion of the protein initiation site
  • Reference transcript: NM_198949

  • cd MTH1 136aa 5e-30 in ref transcript
    • MutT homolog-1 (MTH1) is a member of the Nudix hydrolase superfamily. MTH1, the mammalian counterpart of MutT, hydrolyzes oxidized purine nucleoside triphosphates, such as 8-oxo-dGTP and 2-hydroxy-ATP, to monophosphates, thereby preventing the incorporation of such oxygen radicals during replication. This is an important step in the repair mechanism in genomic and mitochondrial DNA. Like other members of the Nudix family, it requires a divalent cation, such as Mg2+ or Mn2+, for activity, and contain the Nudix motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as a metal binding and catalytic site. MTH1 is predominantly localized in the cytoplasm and mitochondria. Structurally, this enzyme adopts a similar fold to MutT despite low sequence similarity outside the conserved nudix motif. The most distinctive structural difference between MutT and MTH1 is the presence of a beta-hairpin, which is absent in MutT. This results in a much deeper and narrower substrate binding pocket. Mechanistically, MTH1 contains dual specificity for nucleotides that contain 2-OH-adenine bases and those that contain 8-oxo-guanine bases.
  • pfam NUDIX 93aa 1e-14 in ref transcript
    • NUDIX domain.
  • COG COG1051 76aa 4e-05 in ref transcript
    • ADP-ribose pyrophosphatase [Nucleotide transport and metabolism].

NUDT2

  • refseq_NUDT2.F2 refseq_NUDT2.R2 116 187
  • NCBIGene 36.3 318
  • Alternative 3-prime, size difference: 71
  • Inclusion in 5'UTR
  • Reference transcript: NM_147172

  • cd Ap4A_hydrolase_human_like 139aa 4e-43 in ref transcript
    • Diadenosine tetraphosphate (Ap4A) hydrolase is a member of the Nudix hydrolase superfamily. Ap4A hydrolases are well represented in a variety of prokaryotic and eukaryotic organisms. Phylogenetic analysis reveals two distinct subgroups where plant enzymes fall into one subfamily and fungi/animals/archaea enzymes, represented by this subfamily, fall into another. Bacterial enzymes are found in both subfamilies. Ap4A is a potential by-product of aminoacyl tRNA synthesis, and accumulation of Ap4A has been implicated in a range of biological events, such as DNA replication, cellular differentiation, heat shock, metabolic stress, and apoptosis. Ap4A hydrolase cleaves Ap4A asymmetrically into ATP and AMP. It is important in the invasive properties of bacteria and thus presents a potential target for inhibition of such invasive bacteria. Besides the signature nudix motif (G[X5]E[X7]REUXEEXGU, where U is Ile, Leu, or Val) that functions as a metal binding and catalytic site, and a required divalent cation, Ap4A hydrolase is structurally similar to the other members of the nudix superfamily with some degree of variation. Several regions in the sequences are poorly defined and substrate and metal binding sites are only predicted based on kinetic studies.
  • pfam NUDIX 112aa 5e-16 in ref transcript
    • NUDIX domain.
  • COG MutT 121aa 3e-07 in ref transcript
    • NTP pyrophosphohydrolases including oxidative damage repair enzymes [DNA replication, recombination, and repair / General function prediction only].

NUDT9

  • refseq_NUDT9.F1 refseq_NUDT9.R1 109 446
  • NCBIGene 36.3 53343
  • Alternative 5-prime, size difference: 337
  • Exclusion of the protein initiation site
  • Reference transcript: NM_024047

  • cd ADPRase_NUDT9 187aa 9e-77 in ref transcript
    • ADP-ribose pyrophosphatase (ADPRase) catalyzes the hydrolysis of ADP-ribose to AMP and ribose-5-P. Like other members of the Nudix hydrolase superfamily of enzymes, it is thought to require a divalent cation, such as Mg2+, for its activity. It also contains a 23-residue Nudix motif (GX5EX7REUXEEXGU, where U = I, L or V) which functions as a metal binding site/catalytic site. In addition to the Nudix motif, there are additional conserved amino acid residues, distal from the signature sequence, that correlate with substrate specificity. In humans, there are four distinct ADPRase activities, three putative cytosolic (ADPRase-I, -II, and -Mn) and a single mitochondrial enzyme (ADPRase-m). ADPRase-m is also known as NUDT9. It can be distinugished from the cytosolic ADPRase by a N-terminal target sequence unique to mitochondrial ADPRase. NUDT9 functions as a monomer.
  • pfam NUDIX 38aa 2e-05 in ref transcript
    • NUDIX domain.
  • COG COG1051 135aa 7e-06 in ref transcript
    • ADP-ribose pyrophosphatase [Nucleotide transport and metabolism].

NUMB

  • refseq_NUMB.F1 refseq_NUMB.R1 127 271
  • NCBIGene 36.3 8650
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005743

  • cd Numb 149aa 2e-72 in ref transcript
    • Numb Phosphotyrosine-binding (PTB) domain. Numb is a membrane associated adaptor protein, which is a determinant of asymmetric cell division. Numb has an N-terminal PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
  • pfam PID 134aa 7e-34 in ref transcript
    • Phosphotyrosine interaction domain (PTB/PID).
  • pfam NumbF 82aa 8e-32 in ref transcript
    • NUMB domain. This presumed domain is found in the Numb family of proteins adjacent to the PTB domain.

NUP35

  • refseq_NUP35.F1 refseq_NUP35.R1 217 359
  • NCBIGene 36.2 129401
  • Single exon skipping, size difference: 142
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_138285

  • Changed! pfam MPPN 84aa 9e-33 in ref transcript
    • MPPN (rrm-like) domain. The MPPN (Mitotic PhosphoProtein N' end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins. This family also includes a suppressor of thermosensitive mutations in the DNA polymerase delta gene (Pol III). The conserved central region appears to be distantly related to the pfam00076 domain, suggesting an RNA binding function for this protein (Bateman A. pers obs).

NUP62

  • refseq_NUP62.F1 refseq_NUP62.R1 140 173
  • NCBIGene 36.3 23636
  • Alternative 3-prime, size difference: 33
  • Inclusion in 5'UTR
  • Reference transcript: NM_016553

  • pfam Nsp1_C 102aa 8e-39 in ref transcript
    • Nsp1-like C-terminal region. This family probably forms a coiled-coil. This important region of Nsp1 is involved in binding Nup82.

NUPL1

  • refseq_NUPL1.F2 refseq_NUPL1.R2 156 192
  • NCBIGene 36.3 9818
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014089

NVL

  • refseq_NVL.F1 refseq_NVL.R1 189 263
  • NCBIGene 36.3 4931
  • Single exon skipping, size difference: 74
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002533

  • Changed! cd AAA 141aa 2e-24 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! cd AAA 137aa 1e-23 in ref transcript
  • Changed! TIGR CDC48 614aa 1e-161 in ref transcript
    • This subfamily of the AAA family ATPases includes two members each from three archaeal species. It also includes yeast CDC48 (cell division control protein 48) and the human ortholog, transitional endoplasmic reticulum ATPase (valosin-containing protein). These proteins in eukaryotes are involved in the budding and transfer of membrane from the transitional endoplasmic reticulum to the Golgi apparatus.
  • Changed! COG SpoVK 557aa 1e-117 in ref transcript
    • ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones].

NXF5

  • refseq_NXF5.F1 refseq_NXF5.R1 191 380
  • NCBIGene 36.2 55998
  • Multiple exon skipping, size difference: 189
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032946

  • Changed! pfam Tap-RNA_bind 88aa 5e-39 in ref transcript
    • Tap, RNA-binding. Members of this family adopt a structure consisting of an alpha+beta sandwich with an antiparallel beta-sheet, arranged in a 2(beta-alpha-beta) motif. They are mainly found in mRNA export factors, and mediate the sequence nonspecific nuclear export of cellular mRNAs as well as the sequence-specific export of retroviral mRNAs bearing the constitutive transport element.

NXF5

  • refseq_NXF5.F5 refseq_NXF5.R5 224 288
  • NCBIGene 36.2 55998
  • Alternative 3-prime, size difference: 64
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032946

  • pfam Tap-RNA_bind 88aa 5e-39 in ref transcript
    • Tap, RNA-binding. Members of this family adopt a structure consisting of an alpha+beta sandwich with an antiparallel beta-sheet, arranged in a 2(beta-alpha-beta) motif. They are mainly found in mRNA export factors, and mediate the sequence nonspecific nuclear export of cellular mRNAs as well as the sequence-specific export of retroviral mRNAs bearing the constitutive transport element.

OAS1

  • refseq_OAS1.F1 refseq_OAS1.R1 201 299
  • NCBIGene 36.3 4938
  • Alternative 3-prime, size difference: 98
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001032409

  • Changed! pfam OAS1_C 182aa 1e-100 in ref transcript
    • 2'-5'-oligoadenylate synthetase 1, domain 2, C-terminus. This is the largely alpha-helical, C-terminal half of 2'-5'-oligoadenylate synthetase 1, being described as domain 2 of the enzyme and homologous to a tandem ubiquitin repeat. It carries the region of enzymic activity between 320 and 344 at the extreme C-terminal end. Oligoadenylate synthetases are antiviral enzymes that counteract vial attack by degrading viral RNA. The enzyme uses ATP in 2'-specific nucleotidyl transfer reactions to synthesise 2'.5'-oligoadenylates, which activate latent ribonuclease, resulting in degradation of viral RNA and inhibition of virus replication. This domain is often associated with NTP_transf_2 pfam01909.
  • Changed! pfam OAS1_C 187aa 1e-100 in modified transcript

OASL

  • refseq_OASL.F1 refseq_OASL.R1 106 348
  • NCBIGene 36.3 8638
  • Single exon skipping, size difference: 242
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003733

  • Changed! cd OASL_repeat1 80aa 5e-34 in ref transcript
    • OASL_repeat1 (2'-5' oligoadenylate synthetase-like protein) belongs to a family of interferon-induced 2'-5' oligoadenylate synthetases which are important for the antiviral activity of interferons. While each member of this famliy has a conserved N-terminal OAS catalytic domain, only OASL has two tandem ubiquitin-like repeats located at the C-terminus and this CD represents one of those repeats.
  • Changed! pfam OAS1_C 187aa 2e-76 in ref transcript
    • 2'-5'-oligoadenylate synthetase 1, domain 2, C-terminus. This is the largely alpha-helical, C-terminal half of 2'-5'-oligoadenylate synthetase 1, being described as domain 2 of the enzyme and homologous to a tandem ubiquitin repeat. It carries the region of enzymic activity between 320 and 344 at the extreme C-terminal end. Oligoadenylate synthetases are antiviral enzymes that counteract vial attack by degrading viral RNA. The enzyme uses ATP in 2'-specific nucleotidyl transfer reactions to synthesise 2'.5'-oligoadenylates, which activate latent ribonuclease, resulting in degradation of viral RNA and inhibition of virus replication. This domain is often associated with NTP_transf_2 pfam01909.
  • Changed! pfam ubiquitin 67aa 6e-07 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
  • Changed! pfam OAS1_C 66aa 8e-22 in modified transcript

OATL1

  • refseq_OATL1.F1 refseq_OATL1.R1 230 385
  • NCBIGene 36.2 4943
  • Single exon skipping, size difference: 155
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002536

  • Changed! smart TBC 212aa 2e-46 in ref transcript
    • Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
  • Changed! COG COG5210 215aa 1e-26 in ref transcript
    • GTPase-activating protein [General function prediction only].

OCIAD2

  • refseq_OCIAD2.F1 refseq_OCIAD2.R1 101 219
  • NCBIGene 36.3 132299
  • Single exon skipping, size difference: 118
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001014446

  • Changed! pfam OCIA 118aa 1e-34 in ref transcript
    • Ovarian carcinoma immunoreactive antigen (OCIA). This family consists of several ovarian carcinoma immunoreactive antigen (OCIA) and related eukaryotic sequences. The function of this family is unknown.
  • Changed! pfam OCIA 86aa 7e-27 in modified transcript

OCRL

  • refseq_OCRL.F1 refseq_OCRL.R1 123 147
  • NCBIGene 36.3 4952
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000276

  • Changed! cd RhoGAP_OCRL1 229aa 6e-65 in ref transcript
    • RhoGAP_OCRL1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in OCRL1-like proteins. OCRL1 (oculocerebrorenal syndrome of Lowe 1)-like proteins contain two conserved domains: a central inositol polyphosphate 5-phosphatase domain and a C-terminal Rho GAP domain, this GAP domain lacks the catalytic residue and therefore maybe inactive. OCRL-like proteins are type II inositol polyphosphate 5-phosphatases that can hydrolyze lipid PI(4,5)P2 and PI(3,4,5)P3 and soluble Ins(1,4,5)P3 and Ins(1,3,4,5)P4, but their individual specificities vary. The functionality of the RhoGAP domain is still unclear. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • smart IPPc 302aa 1e-112 in ref transcript
    • Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes.
  • smart RhoGAP 162aa 1e-34 in ref transcript
    • GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.
  • COG COG5411 318aa 7e-53 in ref transcript
    • Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms].
  • Changed! cd RhoGAP_OCRL1 221aa 7e-66 in modified transcript

ODF2

  • refseq_ODF2.F2 refseq_ODF2.R2 115 374
  • NCBIGene 36.3 4957
  • Multiple exon skipping, size difference: 259
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_002540

  • TIGR SMC_prok_B 601aa 2e-14 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 329aa 5e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 353aa 5e-09 in ref transcript

ODF2L

  • refseq_ODF2L.F1 refseq_ODF2L.R1 188 321
  • NCBIGene 36.3 57489
  • Single exon skipping, size difference: 133
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020729

  • Changed! pfam SMC_N 255aa 7e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! TIGR SMC_prok_B 198aa 0.006 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.

ODZ3

  • refseq_ODZ3.F1 refseq_ODZ3.R1 102 123
  • NCBIGene 36.2 55714
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_931600

  • pfam Ten_N 279aa 4e-63 in ref transcript
    • Teneurin Intracellular Region. This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
  • pfam Keratin_B2 153aa 0.005 in ref transcript
    • Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesised during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
  • COG RhsA 630aa 1e-07 in ref transcript
    • Rhs family protein [Cell envelope biogenesis, outer membrane].

ODZ3

  • refseq_ODZ3.F3 refseq_ODZ3.R3 101 128
  • NCBIGene 36.2 55714
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_931600

  • pfam Ten_N 279aa 4e-63 in ref transcript
    • Teneurin Intracellular Region. This family is found in the intracellular N-terminal region of the Teneurin family of proteins. These proteins are 'pair-rule' genes and are involved in tissue patterning, specifically probably neural patterning. The intracellular domain is cleaved in response to homophilic interaction of the extracellular domain, and translocates to the nucleus. Here it probably carries out to some transcriptional regulatory activity. The length of this region and the conservation suggests that there may be two structural domains here (personal obs:C Yeats).
  • pfam Keratin_B2 153aa 0.005 in ref transcript
    • Keratin, high sulfur B2 protein. High sulfur proteins are cysteine-rich proteins synthesised during the differentiation of hair matrix cells, and form hair fibres in association with hair keratin intermediate filaments. This family has been divided up into four regions, with the second region containing 8 copies of a short repeat. This family is also known as B2 or KAP1.
  • COG RhsA 630aa 1e-07 in ref transcript
    • Rhs family protein [Cell envelope biogenesis, outer membrane].

OGT

  • refseq_OGT.F2 refseq_OGT.R2 106 136
  • NCBIGene 36.3 8473
  • Alternative 3-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181672

  • cd TPR 100aa 4e-18 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • cd TPR 100aa 4e-18 in ref transcript
  • cd TPR 100aa 9e-18 in ref transcript
  • cd TPR 96aa 1e-16 in ref transcript
  • cd TPR 95aa 1e-14 in ref transcript
  • cd TPR 98aa 2e-08 in ref transcript
  • Changed! TIGR PEP_TPR_lipo 444aa 5e-39 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • COG Spy 199aa 8e-46 in ref transcript
    • Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones].
  • COG Spy 396aa 1e-38 in ref transcript
  • COG NrfG 211aa 1e-15 in ref transcript
    • FOG: TPR repeat [General function prediction only].
  • COG TadD 199aa 2e-11 in ref transcript
    • Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion].
  • Changed! TIGR PEP_TPR_lipo 433aa 5e-39 in modified transcript

OSGIN1

  • refseq_OKL38.F1 refseq_OKL38.R1 110 270
  • NCBIGene 36.3 29948
  • Single exon skipping, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_013370

  • Changed! pfam Pyr_redox_2 208aa 3e-06 in ref transcript
    • Pyridine nucleotide-disulphide oxidoreductase. This family includes both class I and class II oxidoreductases and also NADH oxidases and peroxidases. This domain is actually a small NADH binding domain within a larger FAD binding domain.
  • Changed! PRK PRK12770 80aa 4e-07 in ref transcript
    • putative glutamate synthase subunit beta; Provisional.
  • Changed! COG GltD 205aa 8e-07 in ref transcript
    • NADPH-dependent glutamate synthase beta chain and related oxidoreductases [Amino acid transport and metabolism / General function prediction only].
  • Changed! COG TrkA 202aa 0.003 in ref transcript
    • Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism].

OLAH

  • refseq_OLAH.F1 refseq_OLAH.R1 227 386
  • NCBIGene 36.3 55301
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018324

  • Changed! pfam Thioesterase 282aa 9e-26 in ref transcript
    • Thioesterase domain. Peptide synthetases are involved in the non-ribosomal synthesis of peptide antibiotics. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates. There are also modules within the peptide synthetases that also share this similarity. With respect to antibiotic production, thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Thioesterases (non-integrated) have molecular masses of 25-29 kDa.
  • Changed! COG GrsT 278aa 8e-34 in ref transcript
    • Predicted thioesterase involved in non-ribosomal peptide biosynthesis [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam Thioesterase 229aa 4e-33 in modified transcript
  • Changed! COG GrsT 225aa 4e-42 in modified transcript

OPA1

  • refseq_OPA1.F1 OPA1.u.r.6 129 183
  • NCBIGene 36.3 4976
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130837

  • pfam Dynamin_N 179aa 4e-38 in ref transcript
    • Dynamin family.

OPN3

  • refseq_OPN3.F2 refseq_OPN3.R2 206 366
  • NCBIGene 36.2 23596
  • Alternative 5-prime, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014322

  • Changed! pfam 7tm_1 243aa 6e-23 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 115aa 2e-08 in modified transcript

OPN4

  • refseq_OPN4.F1 refseq_OPN4.R1 142 175
  • NCBIGene 36.3 94233
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001030015

  • Changed! pfam 7tm_1 257aa 6e-41 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 258aa 8e-43 in modified transcript

OPN5

  • refseq_OPN5.F2 refseq_OPN5.R2 242 322
  • NCBIGene 36.3 221391
  • Single exon skipping, size difference: 80
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_181744

  • Changed! pfam 7tm_1 248aa 1e-25 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 79aa 5e-10 in modified transcript

OPRL1

  • refseq_OPRL1.F1 refseq_OPRL1.R1 134 285
  • NCBIGene 36.3 4987
  • Single exon skipping, size difference: 151
  • Exclusion in 5'UTR
  • Reference transcript: NM_182647

  • pfam 7tm_1 246aa 4e-38 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

OPRS1

  • refseq_OPRS1.F2 refseq_OPRS1.R2 140 248
  • NCBIGene 36.2 10280
  • Alternative 5-prime, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005866

  • Changed! pfam ERG2_Sigma1R 189aa 3e-73 in ref transcript
    • ERG2 and Sigma1 receptor like protein. This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8-delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum, which interacts with endogenous steroid hormones, such as progesterone and testosterone. It also binds the sigma ligands, which are are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.
  • Changed! pfam ERG2_Sigma1R 167aa 7e-67 in modified transcript

OPRS1

  • refseq_OPRS1.F3 refseq_OPRS1.R3 129 222
  • NCBIGene 36.3 10280
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005866

  • Changed! pfam ERG2_Sigma1R 189aa 3e-73 in ref transcript
    • ERG2 and Sigma1 receptor like protein. This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8-delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum, which interacts with endogenous steroid hormones, such as progesterone and testosterone. It also binds the sigma ligands, which are are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.
  • Changed! pfam ERG2_Sigma1R 158aa 3e-54 in modified transcript

ORC4L

  • refseq_ORC4L.F2 refseq_ORC4L.R2 133 158
  • NCBIGene 36.3 5000
  • Alternative 5-prime, size difference: 25
  • Exclusion in 5'UTR
  • Reference transcript: NM_002552

  • cd AAA 168aa 5e-08 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • TIGR TIGR02928 182aa 6e-14 in ref transcript
    • Members of this protein family are found exclusively in the archaea. This set of DNA binding proteins shows homology to the origin recognition complex subunit 1/cell division control protein 6 family in eukaryotes. Several members may be found in genome and interact with each other.
  • COG CDC6 176aa 2e-16 in ref transcript
    • Cdc6-related protein, AAA superfamily ATPase [DNA replication, recombination, and repair / Posttranslational modification, protein turnover, chaperones].

OS9

  • refseq_OS9.F1 refseq_OS9.R1 300 465
  • NCBIGene 36.3 10956
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006812

  • pfam PRKCSH 69aa 5e-13 in ref transcript
    • Glucosidase II beta subunit-like protein. The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II, which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown. This family also includes an ER sensor for misfolded glycoproteins and is therefore likely to be a generic sugar binding domain.

OS9

  • refseq_OS9.F3 refseq_OS9.R3 214 259
  • NCBIGene 36.3 10956
  • Alternative 5-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006812

  • pfam PRKCSH 69aa 5e-13 in ref transcript
    • Glucosidase II beta subunit-like protein. The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II, which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown. This family also includes an ER sensor for misfolded glycoproteins and is therefore likely to be a generic sugar binding domain.

OSBPL1A

  • refseq_OSBPL1A.F2 refseq_OSBPL1A.R2 105 174
  • NCBIGene 36.2 114876
  • Alternative 5-prime and 3-prime, size difference: 69
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080597

  • cd ANK 154aa 2e-19 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 74aa 3e-07 in ref transcript
  • cd PH_oxysterol_bp 95aa 3e-06 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • Changed! pfam Oxysterol_BP 402aa 2e-72 in ref transcript
    • Oxysterol-binding protein.
  • pfam Ank 32aa 1e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 32aa 4e-05 in ref transcript
  • pfam Ank 31aa 3e-04 in ref transcript
  • smart PH 97aa 0.001 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • PTZ PTZ00322 96aa 9e-10 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.
  • COG Arp 169aa 5e-09 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! pfam Oxysterol_BP 379aa 5e-58 in modified transcript

OSBPL3

  • refseq_OSBPL3.F2 refseq_OSBPL3.R2 102 195
  • NCBIGene 36.3 26031
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015550

  • cd PH_oxysterol_bp 91aa 2e-18 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam Oxysterol_BP 300aa 2e-56 in ref transcript
    • Oxysterol-binding protein.
  • smart PH 91aa 3e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

OSBPL5

  • refseq_OSBPL5.F1 refseq_OSBPL5.R1 112 316
  • NCBIGene 36.3 114879
  • Single exon skipping, size difference: 204
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020896

  • Changed! cd PH_oxysterol_bp 114aa 4e-20 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam Oxysterol_BP 332aa 1e-52 in ref transcript
    • Oxysterol-binding protein.
  • Changed! smart PH 117aa 4e-08 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

OSBPL6

  • refseq_OSBPL6.F2 refseq_OSBPL6.R2 241 316
  • NCBIGene 36.3 114880
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145739

  • cd PH_oxysterol_bp 91aa 7e-15 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam Oxysterol_BP 342aa 3e-58 in ref transcript
    • Oxysterol-binding protein.
  • smart PH 89aa 8e-08 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

OSBPL8

  • refseq_OSBPL8.F1 refseq_OSBPL8.R1 188 234
  • NCBIGene 36.3 114882
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 46
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020841

  • Changed! cd PH_oxysterol_bp 114aa 2e-19 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • Changed! pfam Oxysterol_BP 333aa 3e-51 in ref transcript
    • Oxysterol-binding protein.
  • Changed! smart PH 117aa 2e-08 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

OSBPL9

  • refseq_OSBPL9.F1 refseq_OSBPL9.R1 104 143
  • NCBIGene 36.3 114883
  • Single exon skipping, size difference: 39
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_148909

  • cd PH_oxysterol_bp 93aa 1e-28 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam Oxysterol_BP 342aa 2e-49 in ref transcript
    • Oxysterol-binding protein.
  • smart PH 95aa 6e-13 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

OSBPL9

  • refseq_OSBPL9.F2 refseq_OSBPL9.R2 303 399
  • NCBIGene 36.3 114883
  • Single exon skipping, size difference: 96
  • Exclusion in 5'UTR
  • Reference transcript: NM_148904

  • pfam Oxysterol_BP 342aa 1e-49 in ref transcript
    • Oxysterol-binding protein.

OSBPL9

  • refseq_OSBPL9.F4 refseq_OSBPL9.R4 173 252
  • NCBIGene 36.3 114883
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_148909

  • Changed! cd PH_oxysterol_bp 93aa 1e-28 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • Changed! pfam Oxysterol_BP 342aa 2e-49 in ref transcript
    • Oxysterol-binding protein.
  • Changed! smart PH 95aa 6e-13 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • Changed! cd PH_oxysterol_bp 49aa 2e-13 in modified transcript
  • Changed! smart PH 51aa 0.001 in modified transcript

OSBPL9

  • refseq_OSBPL9.F6 refseq_OSBPL9.R6 130 253
  • NCBIGene 36.3 114883
  • Alternative 5-prime, size difference: 123
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_148907

  • Changed! pfam Oxysterol_BP 342aa 1e-49 in ref transcript
    • Oxysterol-binding protein.

OSCAR

  • refseq_OSCAR.F1 refseq_OSCAR.R1 134 158
  • NCBIGene 36.2 126014
  • Alternative 5-prime and 3-prime, size difference: 24
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_206818

OTOF

  • refseq_OTOF.F1 refseq_OTOF.R1 117 315
  • NCBIGene 36.3 9381
  • Single exon skipping, size difference: 198
  • Exclusion of the stop codon
  • Reference transcript: NM_194248

  • cd C2 90aa 5e-14 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd C2 91aa 5e-09 in ref transcript
  • cd C2 110aa 7e-09 in ref transcript
  • cd C2 100aa 4e-05 in ref transcript
  • cd C2 99aa 0.006 in ref transcript
  • pfam FerB 78aa 2e-32 in ref transcript
    • FerB (NUC096) domain. This is central domain B in proteins of the Ferlin family.
  • pfam FerI 72aa 3e-31 in ref transcript
    • FerI (NUC094) domain. This domain is present in proteins of the Ferlin family. It is often located between two C2 domains.
  • pfam C2 84aa 2e-15 in ref transcript
    • C2 domain.
  • pfam C2 90aa 6e-11 in ref transcript
  • smart C2 103aa 5e-10 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • smart C2 97aa 4e-07 in ref transcript
  • smart C2 105aa 0.001 in ref transcript
  • COG COG5038 88aa 9e-11 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
  • COG COG5038 107aa 8e-04 in ref transcript

OTOF

  • refseq_OTOF.F3 refseq_OTOF.R3 136 196
  • NCBIGene 36.3 9381
  • Alternative 3-prime, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_194248

  • cd C2 90aa 5e-14 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd C2 91aa 5e-09 in ref transcript
  • cd C2 110aa 7e-09 in ref transcript
  • cd C2 100aa 4e-05 in ref transcript
  • cd C2 99aa 0.006 in ref transcript
  • pfam FerB 78aa 2e-32 in ref transcript
    • FerB (NUC096) domain. This is central domain B in proteins of the Ferlin family.
  • pfam FerI 72aa 3e-31 in ref transcript
    • FerI (NUC094) domain. This domain is present in proteins of the Ferlin family. It is often located between two C2 domains.
  • pfam C2 84aa 2e-15 in ref transcript
    • C2 domain.
  • pfam C2 90aa 6e-11 in ref transcript
  • smart C2 103aa 5e-10 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • smart C2 97aa 4e-07 in ref transcript
  • smart C2 105aa 0.001 in ref transcript
  • COG COG5038 88aa 9e-11 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
  • COG COG5038 107aa 8e-04 in ref transcript

OTX2

  • refseq_OTX2.F1 refseq_OTX2.R1 125 149
  • NCBIGene 36.3 5015
  • Alternative 3-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021728

  • cd homeodomain 56aa 3e-16 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 56aa 2e-21 in ref transcript
    • Homeobox domain.
  • pfam TF_Otx 66aa 5e-20 in ref transcript
    • Otx1 transcription factor.
  • COG COG5576 59aa 5e-11 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

P2RX2

  • refseq_P2RX2.F1 refseq_P2RX2.R1 154 222
  • NCBIGene 36.3 22953
  • Alternative 5-prime, size difference: 68
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_170683

  • Changed! TIGR P2X 392aa 1e-178 in ref transcript
    • ACC channels are probably hetero- or homomultimers and transport small monovalent cations (Me+). Some also transport Ca2+; a few also transport small metabolites.
  • Changed! TIGR P2X 32aa 6e-06 in modified transcript

P2RX2

  • refseq_P2RX2.F2 refseq_P2RX2.R2 136 208
  • NCBIGene 36.3 22953
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170683

  • Changed! TIGR P2X 392aa 1e-178 in ref transcript
    • ACC channels are probably hetero- or homomultimers and transport small monovalent cations (Me+). Some also transport Ca2+; a few also transport small metabolites.
  • Changed! TIGR P2X 368aa 1e-163 in modified transcript

P2RX5

  • refseq_P2RX5.F1 refseq_P2RX5.R1 294 369
  • NCBIGene 36.3 5026
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002561

  • Changed! pfam P2X_receptor 344aa 0.0 in ref transcript
    • ATP P2X receptor.
  • Changed! pfam P2X_receptor 320aa 1e-169 in modified transcript

P2RX5

  • refseq_P2RX5.F2 refseq_P2RX5.R3 126 198
  • NCBIGene 36.3 5026
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002561

  • Changed! pfam P2X_receptor 344aa 0.0 in ref transcript
    • ATP P2X receptor.
  • Changed! pfam P2X_receptor 320aa 1e-169 in modified transcript

P2RY10

  • refseq_P2RY10.F2 refseq_P2RY10.R2 187 379
  • NCBIGene 36.3 27334
  • Multiple exon skipping, size difference: 192
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_014499

  • pfam 7tm_1 230aa 1e-27 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

P2RY2

  • refseq_P2RY2.F1 refseq_P2RY2.R1 106 240
  • NCBIGene 36.3 5029
  • Alternative 3-prime, size difference: 134
  • Inclusion in 5'UTR
  • Reference transcript: NM_002564

  • pfam 7tm_1 251aa 2e-33 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

P2RY6

  • refseq_P2RY6.F1 refseq_P2RY6.R1 114 274
  • NCBIGene 36.3 5031
  • Single exon skipping, size difference: 160
  • Exclusion in 5'UTR
  • Reference transcript: NM_176796

  • pfam 7tm_1 250aa 2e-22 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

P2RY6

  • refseq_P2RY6.F3 refseq_P2RY6.R3 130 216
  • NCBIGene 36.3 5031
  • Single exon skipping, size difference: 86
  • Exclusion in 5'UTR
  • Reference transcript: NM_176796

  • pfam 7tm_1 250aa 2e-22 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

PAEP

  • refseq_PAEP.F1 refseq_PAEP.R1 146 175
  • NCBIGene 36.3 5047
  • Alternative 5-prime, size difference: 29
  • Exclusion in 3'UTR
  • Reference transcript: NM_001018049

  • pfam Lipocalin 142aa 2e-24 in ref transcript
    • Lipocalin / cytosolic fatty-acid binding protein family. Lipocalins are transporters for small hydrophobic molecules, such as lipids, steroid hormones, bilins, and retinoids. The family also encompasses the enzyme prostaglandin D synthase (EC:5.3.99.2). Alignment subsumes both the lipocalin and fatty acid binding protein signatures from PROSITE. This is supported on structural and functional grounds. The structure is an eight-stranded beta barrel.

PAGE5

  • refseq_PAGE5.F2 refseq_PAGE5.R2 107 306
  • NCBIGene 36.3 90737
  • Alternative 5-prime, size difference: 199
  • Exclusion of the protein initiation site
  • Reference transcript: NM_130467

  • pfam GAGE 103aa 8e-06 in ref transcript
    • GAGE protein. This family consists of several GAGE and XAGE proteins which are found exclusively in humans. The function of this family is unknown although they have been implicated in human cancers.

PAIP1

  • refseq_PAIP1.F1 refseq_PAIP1.R1 160 397
  • NCBIGene 36.3 10605
  • Alternative 5-prime, size difference: 237
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006451

  • smart MIF4G 217aa 2e-30 in ref transcript
    • Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press).

PAK4

  • refseq_PAK4.F1 refseq_PAK4.R1 110 336
  • NCBIGene 36.3 10298
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 226
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001014831

  • cd STKc_PAK_II 285aa 1e-173 in ref transcript
    • Serine/threonine kinases (STKs), p21-activated kinase (PAK) subfamily, Group II, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PAKs are Rho family GTPase-regulated kinases that serve as important mediators in the function of Cdc42 (cell division cycle 42) and Rac. PAKs from higher eukaryotes are classified into two groups (I and II), according to their biochemical and structural features. Group II PAKs, also called non-conventional PAKs, include PAK4, PAK5, and PAK6. Group II PAKs contain PBD (p21-binding domain) and catalytic domains, but lack other motifs found in group I PAKs, such as an AID (autoinhibitory domain) and SH3 binding sites. Since group II PAKs do not contain an obvious AID, they may be regulated differently from group I PAKs. While group I PAKs interact with the SH3 containing proteins Nck, Grb2 and PIX, no such binding has been demonstrated for group II PAKs. Some known substrates of group II PAKs are also substrates of group I PAKs such as Raf, BAD, LIMK and GEFH1. Unique group II substrates include MARK/Par-1 and PDZ-RhoGEF. Group II PAKs play important roles in filopodia formation, neuron extension, cytoskeletal organization, and cell survival.
  • cd CRIB_PAK_like 39aa 4e-11 in ref transcript
    • PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. This subgroup of CRIB/PBD-domains is found N-terminal of Serine/Threonine kinase domains in PAK and PAK-like proteins.
  • smart S_TKc 242aa 5e-76 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam PBD 55aa 1e-11 in ref transcript
    • P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
  • COG SPS1 248aa 2e-35 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PAK7

  • refseq_PAK7.F1 refseq_PAK7.R1 159 308
  • NCBIGene 36.3 57144
  • Single exon skipping, size difference: 149
  • Exclusion in 5'UTR
  • Reference transcript: NM_020341

  • cd STKc_PAK_II 285aa 1e-175 in ref transcript
    • Serine/threonine kinases (STKs), p21-activated kinase (PAK) subfamily, Group II, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PAKs are Rho family GTPase-regulated kinases that serve as important mediators in the function of Cdc42 (cell division cycle 42) and Rac. PAKs from higher eukaryotes are classified into two groups (I and II), according to their biochemical and structural features. Group II PAKs, also called non-conventional PAKs, include PAK4, PAK5, and PAK6. Group II PAKs contain PBD (p21-binding domain) and catalytic domains, but lack other motifs found in group I PAKs, such as an AID (autoinhibitory domain) and SH3 binding sites. Since group II PAKs do not contain an obvious AID, they may be regulated differently from group I PAKs. While group I PAKs interact with the SH3 containing proteins Nck, Grb2 and PIX, no such binding has been demonstrated for group II PAKs. Some known substrates of group II PAKs are also substrates of group I PAKs such as Raf, BAD, LIMK and GEFH1. Unique group II substrates include MARK/Par-1 and PDZ-RhoGEF. Group II PAKs play important roles in filopodia formation, neuron extension, cytoskeletal organization, and cell survival.
  • cd CRIB_PAK_like 47aa 2e-11 in ref transcript
    • PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. This subgroup of CRIB/PBD-domains is found N-terminal of Serine/Threonine kinase domains in PAK and PAK-like proteins.
  • smart S_TKc 242aa 3e-75 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam PBD 58aa 1e-12 in ref transcript
    • P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
  • PTZ PTZ00263 245aa 3e-34 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

PALM

  • refseq_PALM.F1 refseq_PALM.R1 146 278
  • NCBIGene 36.3 5064
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002579

  • Changed! pfam Paralemmin 318aa 1e-102 in ref transcript
    • Paralemmin.
  • Changed! pfam Paralemmin 274aa 5e-74 in modified transcript

PALM2-AKAP2

  • refseq_PALM2-AKAP2.F2 refseq_PALM2-AKAP2.R2 216 255
  • NCBIGene 36.3 445815
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007203

PALM2

  • refseq_PALM2.F1 refseq_PALM2.R1 170 272
  • NCBIGene 36.3 114299
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_053016

  • Changed! pfam Paralemmin 336aa 3e-34 in ref transcript
    • Paralemmin.
  • Changed! pfam Paralemmin 302aa 6e-33 in modified transcript

PAM

  • refseq_PAM.F1 refseq_PAM.R1 100 421
  • NCBIGene 36.3 5066
  • Single exon skipping, size difference: 321
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000919

  • pfam Cu2_monoox_C 154aa 1e-69 in ref transcript
    • Copper type II ascorbate-dependent monooxygenase, C-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold.
  • pfam Cu2_monooxygen 138aa 2e-59 in ref transcript
    • Copper type II ascorbate-dependent monooxygenase, N-terminal domain. The N and C-terminal domains of members of this family adopt the same PNGase F-like fold.
  • pfam NHL 28aa 8e-04 in ref transcript
    • NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in the peptidyl-glycine alpha-amidating monooxygenase (PAM), proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localised to the repeats. The human tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is me diated by the NHL repeats.
  • pfam NHL 28aa 0.006 in ref transcript
  • pfam NHL 30aa 0.007 in ref transcript
  • COG COG3391 247aa 6e-07 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam NHL 29aa 0.010 in modified transcript

PANK1

  • refseq_PANK1.F1 refseq_PANK1.R1 152 329
  • NCBIGene 36.3 53354
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_148977

  • Changed! pfam Fumble 356aa 1e-131 in ref transcript
    • Fumble. Fumble is required for cell division in Drosophila. Mutants lacking fumble exhibit abnormalities in bipolar spindle organisation, chromosome segregation, and contractile ring formation. Analyses have demonstrated that encodes three protein isoforms, all of which contain a domain with high similarity to the pantothenate kinases of A. nidulans and mouse. A role of fumble in membrane synthesis has been proposed.
  • Changed! COG PanK 350aa 8e-45 in ref transcript
    • Pantothenate kinase, acetyl-CoA regulated [Coenzyme metabolism].
  • Changed! pfam Fumble 297aa 8e-94 in modified transcript
  • Changed! COG PanK 291aa 5e-22 in modified transcript

PAOX

  • refseq_PAOX.F2 refseq_PAOX.R2 159 317
  • NCBIGene 36.3 196743
  • Single exon skipping, size difference: 158
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_152911

  • Changed! pfam Amino_oxidase 455aa 5e-52 in ref transcript
    • Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. The family also contains phytoene dehydrogenases and related enzymes. In vertebrates MAO plays an important role regulating the intracellular levels of amines via there oxidation; these include various neurotransmitters, neurotoxins and trace amines. In lower eukaryotes such as aspergillus and in bacteria the main role of amine oxidases is to provide a source of ammonium. PAOs in plants, bacteria and protozoa oxidase spermidine and spermine to an aminobutyral, diaminopropane and hydrogen peroxide and are involved in the catabolism of polyamines. Other members of this family include tryptophan 2-monooxygenase, putrescine oxidase, corticosteroid binding proteins and antibacterial glycoproteins.
  • Changed! COG COG1231 232aa 8e-12 in ref transcript
    • Monoamine oxidase [Amino acid transport and metabolism].
  • Changed! pfam Amino_oxidase 391aa 9e-35 in modified transcript
  • Changed! COG COG1231 88aa 6e-08 in modified transcript

PAOX

  • refseq_PAOX.F4 refseq_PAOX.R4 105 358
  • NCBIGene 36.3 196743
  • Single exon skipping, size difference: 253
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_152911

  • Changed! pfam Amino_oxidase 455aa 5e-52 in ref transcript
    • Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. The family also contains phytoene dehydrogenases and related enzymes. In vertebrates MAO plays an important role regulating the intracellular levels of amines via there oxidation; these include various neurotransmitters, neurotoxins and trace amines. In lower eukaryotes such as aspergillus and in bacteria the main role of amine oxidases is to provide a source of ammonium. PAOs in plants, bacteria and protozoa oxidase spermidine and spermine to an aminobutyral, diaminopropane and hydrogen peroxide and are involved in the catabolism of polyamines. Other members of this family include tryptophan 2-monooxygenase, putrescine oxidase, corticosteroid binding proteins and antibacterial glycoproteins.
  • Changed! COG COG1231 232aa 8e-12 in ref transcript
    • Monoamine oxidase [Amino acid transport and metabolism].
  • Changed! pfam Amino_oxidase 267aa 5e-18 in modified transcript

PAPD5

  • refseq_PAPD5.F2 refseq_PAPD5.R2 252 393
  • NCBIGene 36.3 64282
  • Alternative 3-prime, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040284

  • pfam PAP_assoc 61aa 3e-14 in ref transcript
    • Poly(A) polymerase. This domain is found in poly(A) polymerases and has been shown to have polynucleotide adenylyltransferase activity. Proteins in this family have been located to both the nucleus and cytoplasm.
  • pfam NTP_transf_2 77aa 2e-09 in ref transcript
    • Nucleotidyltransferase domain. Members of this family belong to a large family of nucleotidyltransferases. This family includes kanamycin nucleotidyltransferase (KNTase) which is a plasmid-coded enzyme responsible for some types of bacterial resistance to aminoglycosides. KNTase in-activates antibiotics by catalysing the addition of a nucleotidyl group onto the drug.
  • COG TRF4 262aa 3e-44 in ref transcript
    • DNA polymerase sigma [DNA replication, recombination, and repair].

PAQR6

  • refseq_PAQR6.F1 refseq_PAQR6.R1 104 146
  • NCBIGene 36.3 79957
  • Alternative 5-prime, size difference: 42
  • Exclusion in 5'UTR
  • Reference transcript: NM_024897

  • pfam HlyIII 149aa 1e-04 in ref transcript
    • Haemolysin-III related. Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It has been proposed that YOL002c encodes a Saccharomyces cerevisiae protein that plays a key role in metabolic pathways that regulate lipid and phosphate metabolism.

PAQR6

  • refseq_PAQR6.F3 refseq_PAQR6.R3 103 174
  • NCBIGene 36.3 79957
  • Alternative 5-prime, size difference: 71
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024897

  • Changed! pfam HlyIII 149aa 1e-04 in ref transcript
    • Haemolysin-III related. Members of this family are integral membrane proteins. This family includes a protein with hemolytic activity from Bacillus cereus. It is not clear if all the members of this family are hemolysins. It has been proposed that YOL002c encodes a Saccharomyces cerevisiae protein that plays a key role in metabolic pathways that regulate lipid and phosphate metabolism.
  • Changed! pfam HlyIII 165aa 3e-10 in modified transcript

PARL

  • refseq_PARL.F2 refseq_PARL.R2 252 402
  • NCBIGene 36.3 55486
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018622

  • Changed! pfam Rhomboid 142aa 1e-21 in ref transcript
    • Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth factor Spitz, allowing it to activate the Drosophila EGF receptor. Analysis has shown that Rhomboid-1 is an intramembrane serine protease (EC:3.4.21.105). Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite.
  • Changed! COG GlpG 198aa 2e-13 in ref transcript
    • Uncharacterized membrane protein (homolog of Drosophila rhomboid) [General function prediction only].
  • Changed! pfam Rhomboid 100aa 4e-09 in modified transcript
  • Changed! COG GlpG 119aa 5e-04 in modified transcript

PARP3

  • refseq_PARP3.F2 refseq_PARP3.R2 263 385
  • NCBIGene 36.3 10039
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 122
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001003931

  • Changed! cd parp_like 351aa 1e-99 in ref transcript
    • Poly(ADP-ribose) polymerase (parp) catalytic domain catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active. Poly(ADP-ribose)-like polymerases (PARPS 1-3, VPARP, tankyrase) catalyze the addition of up to 100 ADP_ribose units from NAD+. PARPs 1 and 2 are localized in the nucleaus, bind DNA, and are activated by DNA damage. VPARP is part of the vault ribonucleoprotein complex. Tankyrases regulates telomere length through interactions with telomere repeat binding factor 1.
  • Changed! pfam PARP 211aa 5e-53 in ref transcript
    • Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
  • Changed! pfam WGR 64aa 2e-22 in ref transcript
    • WGR domain. This domain is found in a variety of polyA polymerases as well as the Escherichia coli molybdate metabolism regulator and other proteins of unknown function. I have called this domain WGR after the most conserved central motif of the domain. The domain is found in isolation in some proteins and is between 70 and 80 residues in length. I propose that this may be a nucleic acid binding domain.
  • Changed! pfam PARP_reg 136aa 2e-20 in ref transcript
    • Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
  • Changed! COG COG3831 66aa 1e-04 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

PAX2

  • refseq_PAX2.F1 refseq_PAX2.R1 149 232
  • NCBIGene 36.3 5076
  • Single exon skipping, size difference: 83
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_003990

  • cd PAX 127aa 2e-62 in ref transcript
    • Paired Box domain.
  • smart PAX 125aa 1e-69 in ref transcript
    • Paired Box domain.
  • COG COG3415 92aa 0.003 in ref transcript
    • Transposase and inactivated derivatives [DNA replication, recombination, and repair].

PAX2

  • refseq_PAX2.F4 refseq_PAX2.R4 105 124
  • NCBIGene 36.3 5076
  • Alternative 3-prime, size difference: 19
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_003990

  • cd PAX 127aa 2e-62 in ref transcript
    • Paired Box domain.
  • smart PAX 125aa 1e-69 in ref transcript
    • Paired Box domain.
  • COG COG3415 92aa 0.003 in ref transcript
    • Transposase and inactivated derivatives [DNA replication, recombination, and repair].

PAX2

  • refseq_PAX2.F5 refseq_PAX2.R5 162 231
  • NCBIGene 36.3 5076
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003990

  • cd PAX 127aa 2e-62 in ref transcript
    • Paired Box domain.
  • smart PAX 125aa 1e-69 in ref transcript
    • Paired Box domain.
  • COG COG3415 92aa 0.003 in ref transcript
    • Transposase and inactivated derivatives [DNA replication, recombination, and repair].

PAX6

  • refseq_PAX6.F1 refseq_PAX6.R1 237 335
  • NCBIGene 36.3 5080
  • Alternative 5-prime, size difference: 98
  • Inclusion in 5'UTR
  • Reference transcript: NM_001604

  • cd PAX 141aa 2e-58 in ref transcript
    • Paired Box domain.
  • cd homeodomain 57aa 1e-15 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • smart PAX 139aa 2e-63 in ref transcript
    • Paired Box domain.
  • pfam Homeobox 55aa 1e-21 in ref transcript
    • Homeobox domain.
  • COG COG5576 66aa 2e-08 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

PAX6

  • refseq_PAX6.F4 refseq_PAX6.R4 111 153
  • NCBIGene 36.3 5080
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001604

  • Changed! cd PAX 141aa 2e-58 in ref transcript
    • Paired Box domain.
  • cd homeodomain 57aa 1e-15 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • Changed! smart PAX 139aa 2e-63 in ref transcript
    • Paired Box domain.
  • pfam Homeobox 55aa 1e-21 in ref transcript
    • Homeobox domain.
  • COG COG5576 66aa 2e-08 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].
  • Changed! cd PAX 127aa 4e-61 in modified transcript
  • Changed! smart PAX 125aa 7e-66 in modified transcript

PAX8

  • refseq_PAX8.F2 refseq_PAX8.R2 153 232
  • NCBIGene 36.3 7849
  • Alternative 3-prime, size difference: 79
  • Inclusion in 3'UTR
  • Reference transcript: NM_013952

  • cd PAX 127aa 4e-59 in ref transcript
    • Paired Box domain.
  • smart PAX 125aa 5e-65 in ref transcript
    • Paired Box domain.
  • COG COG3415 92aa 0.004 in ref transcript
    • Transposase and inactivated derivatives [DNA replication, recombination, and repair].

PBRM1

  • refseq_PB1.F1 refseq_PB1.R1 220 295
  • NCBIGene 36.3 55193
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018165

  • cd Bromo_polybromo_I 109aa 3e-53 in ref transcript
    • Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_VI 108aa 1e-51 in ref transcript
    • Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_V 105aa 5e-49 in ref transcript
    • Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_III 102aa 8e-49 in ref transcript
    • Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • Changed! cd BAH_polybromo 118aa 3e-48 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • cd Bromo_polybromo_II 103aa 3e-46 in ref transcript
    • Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd BAH_polybromo 119aa 2e-44 in ref transcript
  • cd Bromo_polybromo_IV 95aa 2e-35 in ref transcript
    • Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd HMG-box 44aa 2e-06 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • Changed! pfam BAH 119aa 2e-33 in ref transcript
    • BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
  • smart BROMO 113aa 9e-26 in ref transcript
    • bromo domain.
  • smart BAH 116aa 6e-25 in ref transcript
    • Bromo adjacent homology domain.
  • smart BROMO 107aa 3e-23 in ref transcript
  • smart BROMO 104aa 5e-22 in ref transcript
  • smart BROMO 103aa 1e-21 in ref transcript
  • pfam Bromodomain 87aa 3e-14 in ref transcript
    • Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • smart BROMO 101aa 2e-13 in ref transcript
  • smart HMG 44aa 5e-07 in ref transcript
    • high mobility group.
  • COG COG5076 281aa 4e-20 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • COG COG5076 205aa 2e-16 in ref transcript
  • COG COG5076 136aa 4e-16 in ref transcript
  • Changed! COG COG5076 392aa 4e-09 in ref transcript
  • Changed! cd BAH_polybromo 93aa 3e-29 in modified transcript
  • Changed! pfam BAH 94aa 8e-20 in modified transcript

PBRM1

  • refseq_PB1.F3 refseq_PB1.R3 158 314
  • NCBIGene 36.3 55193
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018165

  • cd Bromo_polybromo_I 109aa 3e-53 in ref transcript
    • Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_VI 108aa 1e-51 in ref transcript
    • Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_V 105aa 5e-49 in ref transcript
    • Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_III 102aa 8e-49 in ref transcript
    • Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd BAH_polybromo 118aa 3e-48 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • cd Bromo_polybromo_II 103aa 3e-46 in ref transcript
    • Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd BAH_polybromo 119aa 2e-44 in ref transcript
  • cd Bromo_polybromo_IV 95aa 2e-35 in ref transcript
    • Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • Changed! cd HMG-box 44aa 2e-06 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • pfam BAH 119aa 2e-33 in ref transcript
    • BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
  • smart BROMO 113aa 9e-26 in ref transcript
    • bromo domain.
  • smart BAH 116aa 6e-25 in ref transcript
    • Bromo adjacent homology domain.
  • smart BROMO 107aa 3e-23 in ref transcript
  • smart BROMO 104aa 5e-22 in ref transcript
  • smart BROMO 103aa 1e-21 in ref transcript
  • pfam Bromodomain 87aa 3e-14 in ref transcript
    • Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • smart BROMO 101aa 2e-13 in ref transcript
  • Changed! smart HMG 44aa 5e-07 in ref transcript
    • high mobility group.
  • COG COG5076 281aa 4e-20 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • COG COG5076 205aa 2e-16 in ref transcript
  • COG COG5076 136aa 4e-16 in ref transcript
  • COG COG5076 392aa 4e-09 in ref transcript
  • Changed! cd HMG-box 47aa 2e-07 in modified transcript
  • Changed! smart HMG 47aa 4e-08 in modified transcript

PBRM1

  • refseq_PB1.F5 refseq_PB1.R5 292 373
  • NCBIGene 36.3 55193
  • Alternative 5-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018165

  • cd Bromo_polybromo_I 109aa 3e-53 in ref transcript
    • Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_VI 108aa 1e-51 in ref transcript
    • Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_V 105aa 5e-49 in ref transcript
    • Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_III 102aa 8e-49 in ref transcript
    • Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd BAH_polybromo 118aa 3e-48 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • cd Bromo_polybromo_II 103aa 3e-46 in ref transcript
    • Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd BAH_polybromo 119aa 2e-44 in ref transcript
  • cd Bromo_polybromo_IV 95aa 2e-35 in ref transcript
    • Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd HMG-box 44aa 2e-06 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • pfam BAH 119aa 2e-33 in ref transcript
    • BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
  • smart BROMO 113aa 9e-26 in ref transcript
    • bromo domain.
  • smart BAH 116aa 6e-25 in ref transcript
    • Bromo adjacent homology domain.
  • smart BROMO 107aa 3e-23 in ref transcript
  • smart BROMO 104aa 5e-22 in ref transcript
  • smart BROMO 103aa 1e-21 in ref transcript
  • pfam Bromodomain 87aa 3e-14 in ref transcript
    • Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • smart BROMO 101aa 2e-13 in ref transcript
  • smart HMG 44aa 5e-07 in ref transcript
    • high mobility group.
  • COG COG5076 281aa 4e-20 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • COG COG5076 205aa 2e-16 in ref transcript
  • COG COG5076 136aa 4e-16 in ref transcript
  • COG COG5076 392aa 4e-09 in ref transcript

PBRM1

  • refseq_PB1.F7 refseq_PB1.R7 150 246
  • NCBIGene 36.3 55193
  • Single exon skipping, size difference: 96
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_018165

  • cd Bromo_polybromo_I 109aa 3e-53 in ref transcript
    • Bromodomain, polybromo repeat I. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_VI 108aa 1e-51 in ref transcript
    • Bromodomain, polybromo repeat VI. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_V 105aa 5e-49 in ref transcript
    • Bromodomain, polybromo repeat V. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd Bromo_polybromo_III 102aa 8e-49 in ref transcript
    • Bromodomain, polybromo repeat III. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd BAH_polybromo 118aa 3e-48 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • cd Bromo_polybromo_II 103aa 3e-46 in ref transcript
    • Bromodomain, polybromo repeat II. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd BAH_polybromo 119aa 2e-44 in ref transcript
  • cd Bromo_polybromo_IV 95aa 2e-35 in ref transcript
    • Bromodomain, polybromo repeat IV. Polybromo is a nuclear protein of unknown function, which contains 6 bromodomains. The human ortholog BAF180 is part of a SWI/SNF chromatin-remodeling complex, and it may carry out the functions of Yeast Rsc-1 and Rsc-2. It was shown that polybromo bromodomains bind to histone H3 at specific acetyl-lysine positions. Bromodomains are found in many chromatin-associated proteins and in nuclear histone acetyltransferases. They interact specifically with acetylated lysine, but not all the bromodomains in polybromo may bind to acetyl-lysine.
  • cd HMG-box 44aa 2e-06 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • pfam BAH 119aa 2e-33 in ref transcript
    • BAH domain. This domain has been called BAH (Bromo adjacent homology) domain and has also been called ELM1 and BAM (Bromo adjacent motif) domain. The function of this domain is unknown but may be involved in protein-protein interaction.
  • smart BROMO 113aa 9e-26 in ref transcript
    • bromo domain.
  • smart BAH 116aa 6e-25 in ref transcript
    • Bromo adjacent homology domain.
  • smart BROMO 107aa 3e-23 in ref transcript
  • smart BROMO 104aa 5e-22 in ref transcript
  • smart BROMO 103aa 1e-21 in ref transcript
  • pfam Bromodomain 87aa 3e-14 in ref transcript
    • Bromodomain. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • smart BROMO 101aa 2e-13 in ref transcript
  • smart HMG 44aa 5e-07 in ref transcript
    • high mobility group.
  • COG COG5076 281aa 4e-20 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • COG COG5076 205aa 2e-16 in ref transcript
  • COG COG5076 136aa 4e-16 in ref transcript
  • COG COG5076 392aa 4e-09 in ref transcript

PCDH11X

  • refseq_PCDH11X.F1 refseq_PCDH11X.R1 105 135
  • NCBIGene 36.3 27328
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032968

  • cd CA 194aa 1e-45 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 201aa 6e-40 in ref transcript
  • cd CA 196aa 1e-37 in ref transcript
  • cd CA 197aa 2e-35 in ref transcript
  • pfam Protocadherin 199aa 6e-67 in ref transcript
    • Protocadherin. The structure of protocadherins is similar to that of classic cadherins (pfam00028), but particularly on the cytoplasmic domains they also have some unique features. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated.
  • smart CA 77aa 5e-19 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • smart CA 77aa 7e-19 in ref transcript
  • pfam Cadherin 80aa 7e-16 in ref transcript
    • Cadherin domain.
  • smart CA 84aa 7e-13 in ref transcript
  • pfam Cadherin 82aa 1e-10 in ref transcript
  • smart CA 79aa 2e-09 in ref transcript
  • pfam Cadherin_2 85aa 6e-06 in ref transcript
    • Cadherin-like. This cadherin domain is usually the most N-terminal copy of the domain.

PCDH8

  • refseq_PCDH8.F2 refseq_PCDH8.R2 126 417
  • NCBIGene 36.3 5100
  • Alternative 5-prime, size difference: 291
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002590

  • cd CA 201aa 4e-43 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 206aa 2e-33 in ref transcript
  • cd CA 238aa 2e-28 in ref transcript
  • cd CA 200aa 5e-20 in ref transcript
  • smart CA 86aa 6e-19 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • smart CA 78aa 5e-17 in ref transcript
  • smart CA 79aa 1e-13 in ref transcript
  • smart CA 84aa 5e-13 in ref transcript
  • pfam Cadherin_2 79aa 6e-09 in ref transcript
    • Cadherin-like. This cadherin domain is usually the most N-terminal copy of the domain.
  • pfam Cadherin 81aa 2e-07 in ref transcript
    • Cadherin domain.

PCDH9

  • refseq_PCDH9.F1 refseq_PCDH9.R1 293 395
  • NCBIGene 36.3 5101
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203487

  • cd CA 195aa 1e-45 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 201aa 4e-39 in ref transcript
  • cd CA 195aa 1e-38 in ref transcript
  • cd CA 204aa 8e-38 in ref transcript
  • cd CA 204aa 5e-35 in ref transcript
  • cd CA 212aa 6e-22 in ref transcript
  • pfam Protocadherin 225aa 9e-65 in ref transcript
    • Protocadherin. The structure of protocadherins is similar to that of classic cadherins (pfam00028), but particularly on the cytoplasmic domains they also have some unique features. They are expressed in a variety of organisms and are found in high concentrations in the brain where they seem to be localised mainly at cell-cell contact sites. Their expression seems to be developmentally regulated.
  • smart CA 77aa 2e-20 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • smart CA 77aa 2e-19 in ref transcript
  • pfam Cadherin 92aa 2e-17 in ref transcript
    • Cadherin domain.
  • smart CA 84aa 7e-14 in ref transcript
  • smart CA 79aa 1e-11 in ref transcript
  • smart CA 79aa 1e-07 in ref transcript
  • pfam Cadherin_2 90aa 4e-07 in ref transcript
    • Cadherin-like. This cadherin domain is usually the most N-terminal copy of the domain.

PCGF6

  • refseq_PCGF6.F2 refseq_PCGF6.R2 129 354
  • NCBIGene 36.3 84108
  • Multiple exon skipping, size difference: 225
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001011663

  • cd RING 43aa 4e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam zf-C3HC4 39aa 1e-06 in ref transcript
    • Zinc finger, C3HC4 type (RING finger). The C3HC4 type zinc-finger (RING finger) is a cysteine-rich domain of 40 to 60 residues that coordinates two zinc ions, and has the consensus sequence: C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C where X is any amino acid. Many proteins containing a RING finger play a key role in the ubiquitination pathway.
  • COG RAD18 49aa 1e-04 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].

PCNXL3

  • refseq_PCNXL3.F1 refseq_PCNXL3.R1 149 365
  • NCBIGene 36.2 399909
  • Single exon skipping, size difference: 216
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_934366

  • pfam Pecanex_C 231aa 1e-125 in ref transcript
    • Pecanex protein (C-terminus). This family consists of C terminal region of the pecanex protein homologues. The pecanex protein is a maternal-effect neurogenic gene found in Drosophila.

MED15

  • refseq_PCQAP.F1 refseq_PCQAP.R1 146 266
  • NCBIGene 36.3 51586
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003891

  • pfam ARC105_Med_act 65aa 7e-29 in ref transcript
    • ARC105 domain. The approx. 70 residue ARC105 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, ARC105 is a critical transducer of gene activation signals that control early metazoan development.

PCTK3

  • refseq_PCTK3.F1 refseq_PCTK3.R1 148 238
  • NCBIGene 36.3 5129
  • Alternative 5-prime, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212503

  • cd S_TKc 282aa 3e-64 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 272aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00024 284aa 1e-44 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

PDCD4

  • refseq_PDCD4.F1 refseq_PDCD4.R1 290 374
  • NCBIGene 36.3 27250
  • Single exon skipping, size difference: 84
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_014456

  • Changed! smart MA3 114aa 1e-29 in ref transcript
    • Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press.
  • Changed! smart MA3 112aa 8e-28 in ref transcript

PDE8A

  • refseq_PDE8A.F1 refseq_PDE8A.R1 169 367
  • NCBIGene 36.2 5151
  • Single exon skipping, size difference: 198
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002605

  • Changed! cd PAS 99aa 3e-06 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • Changed! cd HDc 195aa 4e-05 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • Changed! pfam PDEase_I 255aa 2e-55 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.
  • Changed! pfam PAS 106aa 2e-07 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • pfam Response_reg 117aa 5e-07 in ref transcript
    • Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
  • Changed! COG AtoS 113aa 1e-04 in ref transcript
    • FOG: PAS/PAC domain [Signal transduction mechanisms].

PDE8B

  • refseq_PDE8B.F2 refseq_PDE8B.R2 168 309
  • NCBIGene 36.3 8622
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003719

  • Changed! cd PAS 98aa 3e-07 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd HDc 191aa 0.001 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • pfam PDEase_I 251aa 3e-52 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.
  • pfam PDE8 52aa 2e-23 in ref transcript
    • PDE8 phosphodiesterase. This region is found in members of the PDE8 phosphodiesterase family. It is found with pfam00233.
  • pfam Response_reg 118aa 1e-12 in ref transcript
    • Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
  • Changed! TIGR nifL_nitrog 106aa 5e-08 in ref transcript
    • NifL is a modulator of the nitrogen fixation positive regulator protein NifA, and is therefore a negative regulator. It binds NifA. NifA and NifL are encoded by adjacent genes.
  • Changed! COG AtoS 209aa 5e-05 in ref transcript
    • FOG: PAS/PAC domain [Signal transduction mechanisms].

PDE8B

  • refseq_PDE8B.F3 refseq_PDE8B.R3 104 269
  • NCBIGene 36.3 8622
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003719

  • cd PAS 98aa 3e-07 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd HDc 191aa 0.001 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • pfam PDEase_I 251aa 3e-52 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.
  • pfam PDE8 52aa 2e-23 in ref transcript
    • PDE8 phosphodiesterase. This region is found in members of the PDE8 phosphodiesterase family. It is found with pfam00233.
  • pfam Response_reg 118aa 1e-12 in ref transcript
    • Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
  • TIGR nifL_nitrog 106aa 5e-08 in ref transcript
    • NifL is a modulator of the nitrogen fixation positive regulator protein NifA, and is therefore a negative regulator. It binds NifA. NifA and NifL are encoded by adjacent genes.
  • COG AtoS 209aa 5e-05 in ref transcript
    • FOG: PAS/PAC domain [Signal transduction mechanisms].

PDE8B

  • refseq_PDE8B.F5 refseq_PDE8B.R5 337 397
  • NCBIGene 36.3 8622
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003719

  • cd PAS 98aa 3e-07 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd HDc 191aa 0.001 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • pfam PDEase_I 251aa 3e-52 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.
  • pfam PDE8 52aa 2e-23 in ref transcript
    • PDE8 phosphodiesterase. This region is found in members of the PDE8 phosphodiesterase family. It is found with pfam00233.
  • pfam Response_reg 118aa 1e-12 in ref transcript
    • Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
  • TIGR nifL_nitrog 106aa 5e-08 in ref transcript
    • NifL is a modulator of the nitrogen fixation positive regulator protein NifA, and is therefore a negative regulator. It binds NifA. NifA and NifL are encoded by adjacent genes.
  • COG AtoS 209aa 5e-05 in ref transcript
    • FOG: PAS/PAC domain [Signal transduction mechanisms].

PDE9A

  • refseq_PDE9A.F1 refseq_PDE9A.R9 104 180
  • NCBIGene 36.3 5152
  • Single exon skipping, size difference: 76
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_002606

  • Changed! cd HDc 174aa 6e-10 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • Changed! pfam PDEase_I 223aa 8e-56 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.

PDE9A

  • refseq_PDE9A.F2 refseq_PDE9A.R2 113 293
  • NCBIGene 36.3 5152
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002606

  • cd HDc 174aa 6e-10 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • pfam PDEase_I 223aa 8e-56 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.

PDE9A

  • refseq_PDE9A.F3 refseq_PDE9A.R3 140 211
  • NCBIGene 36.3 5152
  • Single exon skipping, size difference: 71
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002606

  • Changed! cd HDc 174aa 6e-10 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • Changed! pfam PDEase_I 223aa 8e-56 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.

PDE9A

  • refseq_PDE9A.F5 refseq_PDE9A.R5 116 201
  • NCBIGene 36.3 5152
  • Single exon skipping, size difference: 85
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002606

  • Changed! cd HDc 174aa 6e-10 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • Changed! pfam PDEase_I 223aa 8e-56 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.

PDE9A

  • refseq_PDE9A.F8 refseq_PDE9A.R8 113 191
  • NCBIGene 36.3 5152
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002606

  • cd HDc 174aa 6e-10 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • pfam PDEase_I 223aa 8e-56 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.

PDGFA

  • refseq_PDGFA.F1 refseq_PDGFA.R1 143 212
  • NCBIGene 36.3 5154
  • Single exon skipping, size difference: 69
  • Exclusion of the stop codon
  • Reference transcript: NM_002607

  • cd PDGF 89aa 3e-18 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
  • pfam PDGF 84aa 3e-33 in ref transcript
    • Platelet-derived growth factor (PDGF).
  • pfam PDGF_N 75aa 3e-25 in ref transcript
    • Platelet-derived growth factor, N terminal region. This family consists of the amino terminal regions of platelet-derived growth factor (PDGF, pfam00341) A and B chains.

PDGFD

  • refseq_PDGFD.F1 refseq_PDGFD.R1 100 118
  • NCBIGene 36.3 80310
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025208

  • cd CUB 114aa 2e-19 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd PDGF 96aa 2e-17 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
  • smart CUB 106aa 5e-19 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • smart PDGF 92aa 0.002 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.

PDLIM2

  • refseq_PDLIM2.F1 refseq_PDLIM2.R1 106 224
  • NCBIGene 36.3 64236
  • Single exon skipping, size difference: 118
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_176871

  • cd PDZ_signaling 77aa 6e-15 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 74aa 1e-14 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • Changed! pfam LIM 45aa 0.001 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • COG Prc 62aa 6e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

PDLIM5

  • refseq_PDLIM5.F1 refseq_PDLIM5.R1 106 433
  • NCBIGene 36.3 10611
  • Alternative 3-prime, size difference: 327
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006457

  • cd PDZ_signaling 73aa 6e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 74aa 2e-17 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • pfam LIM 56aa 7e-12 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 53aa 3e-09 in ref transcript
  • pfam LIM 56aa 4e-09 in ref transcript
  • COG Prc 59aa 6e-06 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

PDLIM5

  • refseq_PDLIM5.F1 refseq_PDLIM5.R3 312 372
  • NCBIGene 36.3 10611
  • Mutually exclusive exon skipping, size difference: 60
  • Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001011515

  • cd PDZ_signaling 73aa 2e-15 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 74aa 1e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG Prc 59aa 1e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

PDLIM5

  • refseq_PDLIM5.F4 refseq_PDLIM5.R5 100 118
  • NCBIGene 36.3 10611
  • Single exon skipping, size difference: 18
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_006457

  • cd PDZ_signaling 73aa 6e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 74aa 2e-17 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • pfam LIM 56aa 7e-12 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 53aa 3e-09 in ref transcript
  • pfam LIM 56aa 4e-09 in ref transcript
  • COG Prc 59aa 6e-06 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

PDLIM7

  • refseq_PDLIM7.F2 refseq_PDLIM7.R2 160 262
  • NCBIGene 36.3 9260
  • Mutually exclusive exon skipping, size difference: 102
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005451

  • cd PDZ_signaling 75aa 9e-10 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • pfam LIM 56aa 1e-11 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 52aa 5e-11 in ref transcript
  • smart PDZ 80aa 3e-10 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • pfam LIM 53aa 4e-09 in ref transcript
  • Changed! PRK PRK12323 140aa 0.001 in ref transcript
    • DNA polymerase III subunits gamma and tau; Provisional.

PDPK1

  • refseq_PDPK1.F1 refseq_PDPK1.R1 100 481
  • NCBIGene 36.3 5170
  • Multiple exon skipping, size difference: 381
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002613

  • Changed! cd STKc_PDK1 263aa 1e-120 in ref transcript
    • STKc_PDK1: Serine/Threonine Kinases (STKs), Phosphoinositide-dependent kinase 1 (PDK1) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PDK1 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). PDK1 carries an N-terminal catalytic domain and a C-terminal pleckstrin homology (PH) domain that binds phosphoinositides. It phosphorylates the activation loop of AGC kinases that are regulated by PI3K such as PKB, SGK, and PKC, among others, and is crucial for their activation. Thus, it contributes in regulating many processes including metabolism, growth, proliferation, and survival. PDK1 also has the ability to autophosphorylate and is constitutively active in mammalian cells. PDK1 is essential for normal embryo development and is important in regulating cell volume.
  • cd PH_PDK1 88aa 6e-36 in ref transcript
    • 3-Phosphoinositide dependent protein kinase 1 (PDK1) pleckstrin homology (PH) domain. PDK1 contains an N-terminal serine/threonine kinase domain followed by a PH domain. Following binding of the PH domain to PtdIns(3,4,5)P3 and PtdIns(3,4)P2, PDK1 activates kinases such as Akt (PKB). PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • Changed! smart S_TKc 251aa 1e-78 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00263 276aa 1e-64 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! cd STKc_PDK1 135aa 8e-46 in modified transcript
  • Changed! smart S_TKc 118aa 6e-26 in modified transcript
  • Changed! smart STYKc 28aa 8e-04 in modified transcript
    • Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.
  • Changed! PTZ PTZ00263 134aa 5e-20 in modified transcript

PECI

  • refseq_PECI.F2 refseq_PECI.R2 144 215
  • NCBIGene 36.3 10455
  • Single exon skipping, size difference: 71
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006117

  • Changed! cd crotonase-like 195aa 5e-47 in ref transcript
    • Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
  • cd ACBP 76aa 5e-23 in ref transcript
    • Acyl CoA binding protein (ACBP) binds thiol esters of long fatty acids and coenzyme A in a one-to-one binding mode with high specificity and affinity. Acyl-CoAs are important intermediates in fatty lipid synthesis and fatty acid degradation and play a role in regulation of intermediary metabolism and gene regulation. The suggested role of ACBP is to act as a intracellular acyl-CoA transporter and pool former. ACBPs are present in a large group of eukaryotic species and several tissue-specific isoforms have been detected.
  • Changed! TIGR PaaB1 216aa 1e-22 in ref transcript
    • This family of proteins are found within apparent operons for the degradation of phenylacetic acid. These proteins contain the enoyl-CoA hydratase domain as detected by pfam00378. This activity is consistent with current hypotheses for the degradation pathway which involve the ligation of phenylacetate with coenzyme A (paaF), hydroxylation (paaGHIJK), ring-opening (paaN) and degradation of the resulting fatty acid-like compound to a Krebs cycle intermediate (paaABCDE).
  • pfam ACBP 76aa 2e-21 in ref transcript
    • Acyl CoA binding protein.
  • Changed! PRK PRK06688 232aa 3e-46 in ref transcript
    • enoyl-CoA hydratase; Provisional.
  • COG ACB 75aa 2e-16 in ref transcript
    • Acyl-CoA-binding protein [Lipid metabolism].
  • Changed! cd crotonase-like 26aa 0.002 in modified transcript
  • Changed! PRK PRK06688 31aa 0.002 in modified transcript

PER2

  • refseq_PER2.F1 refseq_PER2.R1 110 217
  • NCBIGene 36.2 8864
  • Single exon skipping, size difference: 107
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_022817

  • Changed! cd PAS 102aa 2e-08 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • Changed! pfam PAS_3 85aa 2e-08 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.

PEX10

  • refseq_PEX10.F1 refseq_PEX10.R1 222 282
  • NCBIGene 36.3 5192
  • Alternative 3-prime, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153818

  • cd RING 42aa 1e-07 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • Changed! pfam Pex2_Pex12 169aa 1e-16 in ref transcript
    • Pex2 / Pex12 amino terminal region. This region is found at the N terminal of a number of known and predicted peroxins including Pex2, Pex10 and Pex12. This conserved region is usually associated with a C terminal ring finger (pfam00097) domain.
  • smart RING 38aa 2e-08 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG PEX10 55aa 7e-12 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam Pex2_Pex12 175aa 3e-19 in modified transcript

PFDN5

  • refseq_PFDN5.F1 refseq_PFDN5.R1 142 277
  • NCBIGene 36.3 5204
  • Multiple exon skipping, size difference: 135
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002624

  • Changed! cd Prefoldin_alpha 128aa 2e-25 in ref transcript
    • Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
  • Changed! TIGR TIGR00293 127aa 2e-31 in ref transcript
    • This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and builds of HMMs for the seed alignment with less selective parameters lead to significant hits to large numbers of proteins that contain coiled coil regions. This model is built with a more selective usage of Dirichlet priors.
  • Changed! COG GIM5 130aa 3e-16 in ref transcript
    • Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Prefoldin_alpha 79aa 5e-13 in modified transcript
  • Changed! TIGR TIGR00293 70aa 2e-16 in modified transcript
  • Changed! COG GIM5 67aa 2e-06 in modified transcript

PFN2

  • refseq_PFN2.F2 refseq_PFN2.R2 100 422
  • NCBIGene 36.3 5217
  • Alternative 3-prime, size difference: 322
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002628

  • Changed! cd PROF 137aa 2e-26 in ref transcript
    • Profilin binds actin monomers, membrane polyphosphoinositides such as PI(4,5)P2, and poly-L-proline. Profilin can inhibit actin polymerization into F-actin by binding to monomeric actin (G-actin) and terminal F-actin subunits, but - as a regulator of the cytoskeleton - it may also promote actin polymerization. It plays a role in the assembly of branched actin filament networks, by activating WASP via binding to WASP's proline rich domain. Profilin may link the cytoskeleton with major signalling pathways by interacting with components of the phosphatidylinositol cycle and Ras pathway.
  • Changed! smart PROF 138aa 2e-33 in ref transcript
    • Profilin. Binds actin monomers, membrane polyphosphoinositides and poly-L-proline.
  • Changed! cd PROF 138aa 2e-25 in modified transcript
  • Changed! smart PROF 139aa 2e-32 in modified transcript

CBY1

  • refseq_PGEA1.F1 refseq_PGEA1.R1 151 291
  • NCBIGene 36.3 25776
  • Single exon skipping, size difference: 140
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001002880

PHACTR3

  • refseq_PHACTR3.F2 refseq_PHACTR3.R2 115 325
  • NCBIGene 36.3 116154
  • Single exon skipping, size difference: 210
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080672

PHF1

  • refseq_PHF1.F1 refseq_PHF1.R1 299 394
  • NCBIGene 36.3 5252
  • Single exon skipping, size difference: 95
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024165

  • smart TUDOR 52aa 1e-06 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
  • pfam PHD 53aa 4e-06 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • Changed! COG COG5141 84aa 0.009 in ref transcript
    • PHD zinc finger-containing protein [General function prediction only].

PHF1

  • refseq_PHF1.F3 refseq_PHF1.R3 285 391
  • NCBIGene 36.3 5252
  • Single exon skipping, size difference: 95
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024165

  • smart TUDOR 52aa 1e-06 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
  • pfam PHD 53aa 4e-06 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • Changed! COG COG5141 84aa 0.009 in ref transcript
    • PHD zinc finger-containing protein [General function prediction only].

PHF2

  • refseq_PHF2.F1 refseq_PHF2.R1 100 136
  • NCBIGene 36.2 5253
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 36
  • Inclusion in the protein causing a frameshift, Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005392

  • pfam JmjC 101aa 2e-17 in ref transcript
    • JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
  • Changed! smart PHD 47aa 1e-08 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the R ING finger and the FY VE finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent B ROMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were R ING fingers misidentified as PHD fingers.
  • smart JmjC 64aa 5e-06 in ref transcript
    • A domain family that is part of the cupin metalloenzyme superfamily. Probable enzymes, but of unknown functions, that regulate chromatin reorganisation processes (Clissold and Ponting, in press).
  • Changed! COG TNG2 51aa 0.006 in ref transcript
    • Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics].

PHF20L1

  • refseq_PHF20L1.F2 refseq_PHF20L1.R2 193 271
  • NCBIGene 36.3 51105
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016018

  • cd TUDOR 46aa 2e-06 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • smart TUDOR 54aa 3e-06 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
  • smart MBT 66aa 3e-05 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • pfam PHD 44aa 1e-04 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.

PHF7

  • refseq_PHF7.F2 refseq_PHF7.R2 276 393
  • NCBIGene 36.3 51533
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016483

PHKB

  • refseq_PHKB.F1 refseq_PHKB.R1 113 225
  • NCBIGene 36.3 5257
  • Single exon skipping, size difference: 112
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000293

  • Changed! pfam PHK_AB 1040aa 0.0 in ref transcript
    • Phosphorylase kinase alpha/beta. This family consists of several eukaryotic phosphorylase kinase alpha and beta subunits. Phosphorylase kinase (PHK) is a regulatory enzyme in glycogen metabolism. Mutations in the gene encoding the alpha subunit of PHK (PHKA2) have been shown to be responsible for X-linked liver glycogenosis (XLG). XLG, a frequent type of glycogen storage disease, is characterised by hepatomegaly and growth retardation.

PIB5PA

  • refseq_PIB5PA.F1 refseq_PIB5PA.R1 111 289
  • NCBIGene 36.3 27124
  • Single exon skipping, size difference: 178
  • Exclusion of the protein initiation site
  • Reference transcript: NM_014422

  • Changed! smart IPPc 308aa 9e-74 in ref transcript
    • Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes.
  • pfam CALCOCO1 96aa 0.003 in ref transcript
    • Calcium binding and coiled-coil domain (CALCOCO1) like. Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
  • Changed! COG COG5411 328aa 5e-40 in ref transcript
    • Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms].
  • Changed! smart IPPc 305aa 2e-73 in modified transcript
  • Changed! COG COG5411 283aa 2e-39 in modified transcript

PICALM

  • refseq_PICALM.F2 refseq_PICALM.R2 103 127
  • NCBIGene 36.3 8301
  • Single exon skipping, size difference: 24
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_007166

  • cd ANTH_AP180_CALM 117aa 1e-37 in ref transcript
    • ANTH domain family; composed of adaptor protein 180 (AP180), clathrin assembly lymphoid myeloid leukemia protein (CALM) and similar proteins. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. AP180 and CALM play important roles in clathrin-mediated endocytosis. AP180 is a brain-specific clathrin-binding protein which stimulates clathrin assembly during the recycling of synaptic vesicles. The ANTH domain is structurally similar to the VHS domain and is composed of a superhelix of eight alpha helices. ANTH domains bind both inositol phospholipids and proteins, and contribute to the nucleation and formation of clathrin coats on membranes. ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that the ANTH domain is a universal component of the machinery for clathrin-mediated membrane budding.
  • pfam ANTH 265aa 7e-80 in ref transcript
    • ANTH domain. AP180 is an endocytotic accessory proteins that has been implicated in the formation of clathrin-coated pits. The domain is involved in phosphatidylinositol 4,5-bisphosphate binding and is a universal adaptor for nucleation of clathrin coats.

PICK1

  • refseq_PICK1.F1 refseq_PICK1.R1 173 226
  • NCBIGene 36.3 9463
  • Alternative 5-prime, size difference: 53
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039583

  • cd Arfaptin 215aa 2e-68 in ref transcript
    • Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
  • cd PDZ_signaling 77aa 5e-11 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • pfam Arfaptin 216aa 2e-74 in ref transcript
    • Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.
  • smart PDZ 81aa 2e-12 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG Prc 64aa 5e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

PIGC

  • refseq_PIGC.F1 refseq_PIGC.R1 117 140
  • NCBIGene 36.3 5279
  • Alternative 3-prime, size difference: 23
  • Inclusion in 5'UTR
  • Reference transcript: NM_153747

  • pfam GPI2 210aa 2e-70 in ref transcript
    • Phosphatidylinositol N-acetylglucosaminyltransferase. Glycosylphosphatidylinositol (GPI) represents an important anchoring molecule for cell surface proteins. The first step in its synthesis is the transfer of N-acetylglucosamine (GlcNAc) from UDP-N-acetylglucosamine to phosphatidylinositol (PI). This step involves products of three or four genes in both yeast (GPI1, GPI2 and GPI3) and mammals (GPI1, PIG A, PIG H and PIG C), respectively.

PIGF

  • refseq_PIGF.F1 refseq_PIGF.R1 206 311
  • NCBIGene 36.3 5281
  • Single exon skipping, size difference: 105
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002643

  • Changed! pfam PIG-F 206aa 2e-59 in ref transcript
    • GPI biosynthesis protein family Pig-F. PIG-F is involved in glycosylphosphatidylinositol (GPI) anchor biosynthesis.
  • Changed! pfam PIG-F 182aa 5e-51 in modified transcript

PIGN

  • refseq_PIGN.F1 refseq_PIGN.R1 272 349
  • NCBIGene 36.3 23556
  • Single exon skipping, size difference: 77
  • Exclusion in 5'UTR
  • Reference transcript: NM_176787

  • pfam PigN 188aa 5e-43 in ref transcript
    • Phosphatidylinositolglycan class N (PIG-N). Phosphatidylinositolglycan class N (PIG-N) is a mammalian homologue of the yeast protein MCD4P and is expressed in the endoplasmic reticulum. PIG-N is essential for glycosylphosphatidylinositol anchor synthesis. Glycosylphosphatidylinositol (GPI)-anchored proteins are cell surface-localised proteins that serve many important cellular functions.
  • pfam PigN 202aa 5e-38 in ref transcript

PIGQ

  • refseq_PIGQ.F1 refseq_PIGQ.R1 298 360
  • NCBIGene 36.3 9091
  • Single exon skipping, size difference: 62
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_148920

  • pfam Gpi1 172aa 1e-58 in ref transcript
    • N-acetylglucosaminyl transferase component (Gpi1). Glycosylphosphatidylinositol (GPI) represents an important anchoring molecule for cell surface proteins.The first step in its synthesis is the transfer of N-acetylglucosamine (GlcNAc) from UDP-N-acetylglucosamine to phosphatidylinositol (PI). This chemically simple step is genetically complex because three or four genes are required in both yeast (GPI1, GPI2 and GPI3) and mammals (GPI1, PIG A, PIG H and PIG C), respectively.

PILRA

  • refseq_PILRA.F1 refseq_PILRA.R1 118 337
  • NCBIGene 36.3 29992
  • Single exon skipping, size difference: 219
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013439

  • pfam V-set 115aa 9e-05 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

PILRB

  • refseq_PILRB.F1 refseq_PILRB.R1 150 218
  • NCBIGene 36.3 29990
  • Alternative 3-prime, size difference: 68
  • Inclusion in 5'UTR
  • Reference transcript: NM_013440

  • pfam V-set 92aa 3e-05 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

PILRB

  • refseq_PILRB.F3 refseq_PILRB.R3 114 213
  • NCBIGene 36.3 29990
  • Alternative 3-prime, size difference: 99
  • Exclusion in 5'UTR
  • Reference transcript: NM_013440

  • pfam V-set 92aa 3e-05 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

PIP5K1B

  • refseq_PIP5K1B.F1 refseq_PIP5K1B.R1 258 379
  • NCBIGene 36.2 8395
  • Single exon skipping, size difference: 121
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_003558

  • Changed! cd PIPKc 252aa 4e-81 in ref transcript
    • Phosphatidylinositol phosphate kinases (PIPK) catalyze the phosphorylation of phosphatidylinositol phosphate on the fourth or fifth hydroxyl of the inositol ring, to form phosphatidylinositol bisphosphate. CD alignment includes type II phosphatidylinositol phosphate kinases (PIPKII-beta), type I andII PIPK (-alpha, -beta, and -gamma) kinases and related yeast Fab1p and Mss4p kinases. Signaling by phosphorylated species of phosphatidylinositol regulates secretion, vesicular trafficking, membrane translocation, cell adhesion, chemotaxis, DNA synthesis, and cell cycling. The catalytic core domains of PIPKs are structurally similar to PI3K, PI4K, and cAMP-dependent protein kinases (PKA), the dimerization region is a unique feature of the PIPKs.
  • Changed! cd PIPKc 57aa 5e-15 in ref transcript
  • Changed! pfam PIP5K 289aa 1e-119 in ref transcript
    • Phosphatidylinositol-4-phosphate 5-Kinase. This family contains a region from the common kinase core found in the type I phosphatidylinositol-4-phosphate 5-kinase (PIP5K) family. The family consists of various type I, II and III PIP5K enzymes. PIP5K catalyses the formation of phosphoinositol-4,5-bisphosphate via the phosphorylation of phosphatidylinositol-4-phosphate a precursor in the phosphinositide signaling pathway.
  • Changed! COG MSS4 327aa 4e-27 in ref transcript
    • Phosphatidylinositol-4-phosphate 5-kinase [Signal transduction mechanisms].

PIP5K3

  • refseq_PIP5K3.F2 refseq_PIP5K3.R2 221 393
  • NCBIGene 36.2 200576
  • Single exon skipping, size difference: 172
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015040

  • Changed! cd Fab1_TCP 245aa 1e-104 in ref transcript
    • TCP-1 like domain of the eukaryotic phosphatidylinositol 3-phosphate (PtdIns3P) 5-kinase Fab1. Fab1p is important for vacuole size regulation, presumably by modulating PtdIns(3,5)P2 effector activity. In the human homolog p235/PIKfyve deletion of this domain leads to loss of catalytic activity. However no exact function this domain has been defined. In general, chaperonins are involved in productive folding of proteins.
  • Changed! cd PIPKc 314aa 5e-98 in ref transcript
    • Phosphatidylinositol phosphate kinases (PIPK) catalyze the phosphorylation of phosphatidylinositol phosphate on the fourth or fifth hydroxyl of the inositol ring, to form phosphatidylinositol bisphosphate. CD alignment includes type II phosphatidylinositol phosphate kinases (PIPKII-beta), type I andII PIPK (-alpha, -beta, and -gamma) kinases and related yeast Fab1p and Mss4p kinases. Signaling by phosphorylated species of phosphatidylinositol regulates secretion, vesicular trafficking, membrane translocation, cell adhesion, chemotaxis, DNA synthesis, and cell cycling. The catalytic core domains of PIPKs are structurally similar to PI3K, PI4K, and cAMP-dependent protein kinases (PKA), the dimerization region is a unique feature of the PIPKs.
  • Changed! cd DEP_PIKfyve 82aa 1e-38 in ref transcript
    • DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGEF (GDP/GTP exchange factor) PIKfyve-like proteins. PIKfyve contains N-terminal Fyve finger and DEP domains, a central chaperonin-like domain and a C-terminal PIPK (phosphatidylinositol phosphate kinase) domain. PIKfyve-like proteins are important phosphatidylinositol (3)-monophosphate (PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which plays a major role in multivesicular body (MVB) sorting and control of retrograde traffic from the vacuole back to the endosome and/or Golgi. PIKfyve itself has been shown to be play a role in regulating early-endosome-to-trans-Golgi network (TGN) retrograde trafficking.
  • Changed! cd FYVE 53aa 2e-15 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • Changed! smart PIPKc 238aa 5e-72 in ref transcript
    • Phosphatidylinositol phosphate kinases.
  • Changed! TIGR chap_CCT_gamma 224aa 8e-32 in ref transcript
    • Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT gamma chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
  • Changed! pfam FYVE 60aa 9e-20 in ref transcript
    • FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
  • Changed! smart DEP 73aa 3e-17 in ref transcript
    • Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signalling proteins that contain PH, rasGEF, rhoGEF, rhoGAP, RGS, PDZ domains. DEP domain in Drosophila dishevelled is essential to rescue planar polarity defects and induce JNK signalling (Cell 94, 109-118).
  • Changed! smart PIPKc 56aa 6e-17 in ref transcript
  • Changed! COG MSS4 288aa 9e-54 in ref transcript
    • Phosphatidylinositol-4-phosphate 5-kinase [Signal transduction mechanisms].
  • Changed! COG GroL 219aa 2e-17 in ref transcript
    • Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].

PIR

  • refseq_PIR.F1 FIGF.R4 192 226
  • NCBIGene 36.3 8544
  • Alternative 5-prime, size difference: 34
  • Exclusion in 5'UTR
  • Reference transcript: NM_003662

  • pfam Pirin 98aa 3e-33 in ref transcript
    • Pirin. This family consists of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures. A tomato homologue of human Pirin has been found to be induced during programmed cell death. Human Pirin interacts with Bcl-3 and NFI and hence is probably involved in the regulation of DNA transcription and replication. It appears to be an Fe(II)-containing member of the Cupin superfamily.
  • pfam Pirin_C 107aa 5e-33 in ref transcript
    • Pirin C-terminal cupin domain. This region is found the C-terminal half of the Pirin protein.
  • COG COG1741 275aa 2e-53 in ref transcript
    • Pirin-related protein [General function prediction only].

PITPNC1

  • refseq_PITPNC1.F2 refseq_PITPNC1.R2 263 382
  • NCBIGene 36.3 26207
  • Single exon skipping, size difference: 119
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_012417

  • Changed! pfam IP_trans 244aa 1e-73 in ref transcript
    • Phosphatidylinositol transfer protein. Along with the structurally unrelated Sec14p family (found in pfam00650), this family can bind/exchange one molecule of phosphatidylinositol (PI) or phosphatidylcholine (PC) and thus aids their transfer between different membrane compartments. There are three sub-families - all share an N-terminal PITP-like domain, whose sequence is highly conserved. It is described as consisting of three regions. The N-terminal region is thought to bind the lipid and contains two helices and an eight-stranded, mostly antiparallel beta-sheet. An intervening loop region, which is thought to play a role in protein-protein interactions, separates this from the C-terminal region, which exhibits the greatest sequence variation and may be involved in membrane binding. PITP alpha has a 16-fold greater affinity for PI than PC. Together with PITP beta, it is expressed ubiquitously in all tissues.
  • Changed! pfam IP_trans 239aa 4e-73 in modified transcript

PITX2

  • refseq_PITX2.F2 refseq_PITX2.R2 122 260
  • NCBIGene 36.3 5308
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153426

  • cd homeodomain 59aa 9e-15 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 57aa 5e-21 in ref transcript
    • Homeobox domain.
  • pfam OAR 21aa 0.001 in ref transcript
    • OAR domain.
  • Changed! COG COG5576 114aa 2e-08 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].
  • Changed! COG COG5576 89aa 6e-08 in modified transcript

PJA1

  • refseq_PJA1.F2 refseq_PJA1.R2 100 426
  • NCBIGene 36.3 64219
  • Alternative 3-prime, size difference: 326
  • Exclusion of the protein initiation site
  • Reference transcript: NM_145119

  • cd RING 43aa 5e-08 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart RING 41aa 3e-07 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG COG5540 48aa 1e-08 in ref transcript
    • RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

PKD1L2

  • refseq_PKD1L2.F1 refseq_PKD1L2.R1 192 397
  • NCBIGene 36.2 114780
  • Single exon skipping, size difference: 205
  • Exclusion in 3'UTR
  • Reference transcript: NM_052892

  • cd PLAT_polycystin 120aa 2e-51 in ref transcript
    • PLAT/LH2 domain of polycystin-1 like proteins. Polycystins are a large family of membrane proteins composed of multiple domains, present in fish, invertebrates, mammals, and humans that are widely expressed in various cell types and whose biological functions remain poorly defined. In human, mutations in polycystin-1 (PKD1) and polycystin-2 (PKD2) have been shown to be the cause for autosomal dominant polycystic kidney disease (ADPKD). The generally proposed function of PLAT/LH2 domains is to mediate interaction with lipids or membrane bound proteins.
  • cd CLECT 116aa 2e-17 in ref transcript
    • CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. CTLDs may bind a variety of carbohydrate ligands including mannose, N-acetylglucosamine, galactose, N-acetylgalactosamine, and fucose. Several CTLDs bind to protein ligands, and only some of these binding interactions are Ca2+-dependent; including the CTLDs of Coagulation Factors IX/X (IX/X) and Von Willebrand Factor (VWF) binding proteins, and natural killer cell receptors. C-type lectins, such as lithostathine, and some type II antifreeze glycoproteins function in a Ca2+-independent manner to bind inorganic surfaces. Many proteins in this group contain a single CTLD; these CTLDs associate with each other through several different surfaces to form dimers, trimers, or tetramers, from which ligand-binding sites project in different orientations. Various vertebrate type 1 transmembrane proteins including macrophage mannose receptor, endo180, phospholipase A2 receptor, and dendritic and epithelial cell receptor (DEC205) have extracellular domains containing 8 or more CTLDs; these CTLDs remain in the parent model. In some members (IX/X and VWF binding proteins), a loop extends to the adjoining domain to form a loop-swapped dimer. A similar conformation is seen in the macrophage mannose receptor CRD4's putative non-sugar bound form of the domain in the acid environment of the endosome. Lineage specific expansions of CTLDs have occurred in several animal lineages including Drosophila melanogaster and Caenorhabditis elegans; these CTLDs also remain in the parent model.
  • smart CLECT 126aa 6e-21 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
  • pfam PLAT 105aa 7e-15 in ref transcript
    • PLAT/LH2 domain. This domain is found in a variety of membrane or lipid associated proteins. It is called the PLAT (Polycystin-1, Lipoxygenase, Alpha-Toxin) domain or LH2 (Lipoxygenase homology) domain. The known structure of pancreatic lipase shows this domain binds to procolipase pfam01114, which mediates membrane association. So it appears possible that this domain mediates membrane attachment via other protein binding partners. The structure of this domain is known for many members of the family and is composed of a beta sandwich.
  • pfam GPS 40aa 2e-10 in ref transcript
    • Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • pfam REJ 203aa 2e-06 in ref transcript
    • REJ domain. The REJ (Receptor for Egg Jelly) domain is found in PKD1, and the sperm receptor for egg jelly. The function of this domain is unknown. The domain is 600 amino acids long so is probably composed of multiple structural domains. There are six completely conserved cysteine residues that may form disulphide bridges.

PKIA

  • refseq_PKIA.F1 refseq_PKIA.R1 228 357
  • NCBIGene 36.3 5569
  • Single exon skipping, size difference: 129
  • Exclusion in 5'UTR
  • Reference transcript: NM_006823

  • pfam PKI 74aa 9e-19 in ref transcript
    • cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of its regulatory chains.

PKIB

  • refseq_PKIB.F2 refseq_PKIB.R2 249 316
  • NCBIGene 36.3 5570
  • Single exon skipping, size difference: 67
  • Exclusion in 5'UTR
  • Reference transcript: NM_181795

  • pfam PKI 46aa 2e-08 in ref transcript
    • cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of its regulatory chains.

PKIG

  • refseq_PKIG.F1 refseq_PKIG.R1 275 380
  • NCBIGene 36.3 11142
  • Single exon skipping, size difference: 105
  • Exclusion in 5'UTR
  • Reference transcript: NM_181805

  • pfam PKI 74aa 1e-19 in ref transcript
    • cAMP-dependent protein kinase inhibitor. Members of this family are extremely potent competitive inhibitors of camp-dependent protein kinase activity. These proteins interact with the catalytic subunit of the enzyme after the cAMP-induced dissociation of its regulatory chains.

PKM2

  • refseq_PKM2.F1 refseq_PKM2.R1 145 361
  • NCBIGene 36.3 5315
  • Alternative 5-prime, size difference: 216
  • Exclusion in 5'UTR
  • Reference transcript: NM_182470

  • cd Pyruvate_Kinase 489aa 0.0 in ref transcript
    • Pyruvate kinase (PK): Large allosteric enzyme that regulates glycolysis through binding of the substrate, phosphoenolpyruvate, and one or more allosteric effectors. Like other allosteric enzymes, PK has a high substrate affinity R state and a low affinity T state. PK exists as several different isozymes, depending on organism and tissue type. In mammals, there are four PK isozymes: R, found in red blood cells, L, found in liver, M1, found in skeletal muscle, and M2, found in kidney, adipose tissue, and lung. PK forms a homotetramer, with each subunit containing three domains. The T state to R state transition of PK is more complex than in most allosteric enzymes, involving a concerted rotation of all 3 domains of each monomer in the homotetramer.
  • TIGR pyruv_kin 485aa 0.0 in ref transcript
    • This enzyme is a homotetramer. Some forms are active only in the presence of fructose-1,6-bisphosphate or similar phosphorylated sugars.
  • PRK PRK05826 472aa 1e-158 in ref transcript
    • pyruvate kinase; Provisional.

PKMYT1

  • refseq_PKMYT1.F1 refseq_PKMYT1.R1 106 144
  • NCBIGene 36.3 9088
  • Alternative 3-prime, size difference: 38
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_004203

  • cd S_TKc 248aa 4e-50 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 250aa 2e-48 in ref transcript
    • Protein kinase domain.
  • COG SPS1 261aa 8e-30 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PKP1

  • refseq_PKP1.F1 refseq_PKP1.R1 120 183
  • NCBIGene 36.3 5317
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000299

  • cd ARM 113aa 6e-20 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 129aa 4e-07 in ref transcript
  • smart ARM 41aa 3e-07 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.

PKP2

  • refseq_PKP2.F2 refseq_PKP2.R2 184 316
  • NCBIGene 36.3 5318
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004572

  • Changed! cd ARM 118aa 3e-11 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 130aa 6e-06 in ref transcript
  • Changed! cd ARM 118aa 3e-13 in modified transcript
  • Changed! pfam Arm 42aa 0.001 in modified transcript
    • Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.

PKP4

  • refseq_PKP4.F2 refseq_PKP4.R2 271 400
  • NCBIGene 36.3 8502
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003628

  • cd ARM 123aa 3e-25 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 129aa 1e-11 in ref transcript
  • cd ARM 142aa 1e-04 in ref transcript
  • smart ARM 41aa 4e-07 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • smart ARM 36aa 2e-05 in ref transcript
  • smart ARM 42aa 3e-05 in ref transcript

PLA2G6

  • refseq_PLA2G6.F1 refseq_PLA2G6.R1 109 271
  • NCBIGene 36.3 8398
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003560

  • cd Pat_PNPLA9 314aa 1e-164 in ref transcript
    • Patatin-like phospholipase domain containing protein 9. PNPLA9 is a Ca-independent phospholipase that catalyzes the hydrolysis of glycerophospholipids at the sn-2 position. PNPLA9 is also known as PLA2G6 (phospholipase A2 group VI) or iPLA2beta. PLA2G6 is stimulated by ATP and inhibited by bromoenol lactone (BEL). In humans, PNPLA9 in expressed ubiquitously and is involved in signal transduction, cell proliferation, and apoptotic cell death. Mutations in human PLA2G6 leads to infantile neuroaxonal dystrophy (INAD) and idiopathic neurodegeneration with brain iron accumulation (NBIA). This family includes PLA2G6 from Homo sapiens and Rattus norvegicus.
  • Changed! cd ANK 119aa 1e-24 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 126aa 9e-20 in ref transcript
  • pfam Patatin 185aa 1e-33 in ref transcript
    • Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of lipid acyl hydrolase, catalysing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates.
  • Changed! TIGR trp 146aa 1e-04 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • COG COG3621 227aa 6e-19 in ref transcript
    • Patatin [General function prediction only].
  • COG Arp 204aa 2e-11 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! PTZ PTZ00322 110aa 3e-06 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.
  • COG Arp 196aa 6e-06 in ref transcript
  • Changed! cd ANK 111aa 4e-24 in modified transcript
  • Changed! TIGR trp 101aa 1e-04 in modified transcript
  • Changed! pfam Ank 30aa 0.002 in modified transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • Changed! TIGR trp 234aa 0.008 in modified transcript

PLCB1

  • refseq_PLCB1.F1 refseq_PLCB1.R1 234 352
  • NCBIGene 36.3 23236
  • Single exon skipping, size difference: 118
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015192

  • cd PLCc 149aa 6e-42 in ref transcript
    • Phospholipase C, catalytic domain; Phosphoinositide-specific phospholipases C catalyze hydrolysis of phosphatidylinositol-4,5-bisphosphate (PIP2) to D-myo-inositol-1,4,5-trisphosphate (1,4,5-IP3) and sn-1,2-diacylglycerol (DAG). Both products function as second messengers in eukaryotic signal transduction cascades. 1,4,5-IP3 triggers inflow of calcium from intracellular stores; the membrane resident product DAG controls cellular protein phosphorylation states by activating various protein kinase C isozymes. The enzyme comprises 2 regions (X and Y) connected via a linker which may contain inserted domains, X and Y together form a TIM barrel-like structure containing the active site residues.
  • cd PH_PLC 121aa 2e-25 in ref transcript
    • Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • cd PLCc 102aa 2e-19 in ref transcript
  • cd C2_2 118aa 5e-09 in ref transcript
    • Protein kinase C conserved region 2, subgroup 2; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing phospholipases C and D( PLC-1, PLD) and specific protein kinases C (PKC) subtypes, the C-terminal beta strand occupies the position of what is the N-terminal strand in subgroup 1.
  • pfam PI-PLC-Y 118aa 1e-59 in ref transcript
    • Phosphatidylinositol-specific phospholipase C, Y domain. This associates with pfam00388 to form a single structural unit.
  • pfam PI-PLC-X 147aa 2e-55 in ref transcript
    • Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit.
  • Changed! pfam PLC-beta_C 165aa 1e-41 in ref transcript
    • PLC-beta C terminal. This domain corresponds to the alpha helical C terminal domain of phospholipase C beta.
  • pfam efhand_like 92aa 9e-22 in ref transcript
    • Phosphoinositide-specific phospholipase C, efhand-like. Members of this family are predominantly found in phosphoinositide-specific phospholipase C. They adopt a structure consisting of a core of four alpha helices, in an EF like fold, and are required for functioning of the enzyme.
  • smart C2 100aa 7e-10 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • Changed! pfam PLC-beta_C 118aa 7e-31 in modified transcript

PLOD2

  • refseq_PLOD2.F1 refseq_PLOD2.R1 113 176
  • NCBIGene 36.3 5352
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182943

  • smart P4Hc 173aa 1e-18 in ref transcript
    • Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.

PLP1

  • refseq_PLP1.F1 refseq_PLP1.R1 218 323
  • NCBIGene 36.3 5354
  • Alternative 5-prime, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000533

  • Changed! pfam Myelin_PLP 276aa 1e-108 in ref transcript
    • Myelin proteolipid protein (PLP or lipophilin).
  • Changed! pfam Myelin_PLP 241aa 1e-109 in modified transcript

PLTP

  • refseq_PLTP.F1 refseq_PLTP.R1 149 305
  • NCBIGene 36.3 5360
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006227

  • Changed! cd BPI1 219aa 2e-56 in ref transcript
    • BPI/LBP/CETP N-terminal domain; Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) N-terminal domain; binds to and neutralizes lipopolysaccharides from the outer membrane of Gram-negative bacteria.; Apolar pockets on the concave surface bind a molecule of phosphatidylcholine, primarily by interacting with their acyl chains; this suggests that the pockets may also bind the acyl chains of lipopolysaccharide.
  • cd BPI2 201aa 1e-47 in ref transcript
    • BPI/LBP/CETP C-terminal domain; Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) C-terminal domain; binds to and neutralizes lipopolysaccharides from the outer membrane of Gram-negative bacteria.; Apolar pockets on the concave surface bind a molecule of phosphatidylcholine, primarily by interacting with their acyl chains; this suggests that the pockets may also bind the acyl chains of lipopolysaccharide.
  • pfam LBP_BPI_CETP_C 237aa 7e-78 in ref transcript
    • LBP / BPI / CETP family, C-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar.
  • Changed! smart BPI1 219aa 5e-53 in ref transcript
    • BPI/LBP/CETP N-terminal domain. Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) N-terminal domain.
  • Changed! cd BPI1 167aa 9e-33 in modified transcript
  • Changed! smart BPI1 167aa 9e-35 in modified transcript

PLUNC

  • refseq_PLUNC.F1 refseq_PLUNC.R1 184 239
  • NCBIGene 36.3 51297
  • Alternative 3-prime, size difference: 55
  • Inclusion in 3'UTR
  • Reference transcript: NM_016583

  • cd BPI1 138aa 0.001 in ref transcript
    • BPI/LBP/CETP N-terminal domain; Bactericidal permeability-increasing protein (BPI) / Lipopolysaccharide-binding protein (LBP) / Cholesteryl ester transfer protein (CETP) N-terminal domain; binds to and neutralizes lipopolysaccharides from the outer membrane of Gram-negative bacteria.; Apolar pockets on the concave surface bind a molecule of phosphatidylcholine, primarily by interacting with their acyl chains; this suggests that the pockets may also bind the acyl chains of lipopolysaccharide.
  • pfam LBP_BPI_CETP 141aa 3e-24 in ref transcript
    • LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar.

PML

  • refseq_PML.F2 refseq_PML.R2 139 283
  • NCBIGene 36.3 5371
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033238

  • smart RING 35aa 3e-04 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam zf-B_box 43aa 0.003 in ref transcript
    • B-box zinc finger.
  • TIGR rad18 73aa 0.003 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! pfam zf-B_box 40aa 0.009 in ref transcript

POGZ

  • refseq_POGZ.F1 refseq_POGZ.R1 200 359
  • NCBIGene 36.3 23126
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015100

  • pfam DDE 165aa 2e-15 in ref transcript
    • DDE superfamily endonuclease. This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which contain three carboxylate residues that are believed to be responsible for coordinating metal ions needed for catalysis. The catalytic activity of this enzyme involves DNA cleavage at a specific site followed by a strand transfer reaction. Interestingly this family also includes the CENP-B protein. This domain in that protein appears to have lost the metal binding residues and is unlikely to have endonuclease activity. Centromere Protein B (CENP-B) is a DNA-binding protein localised to the centromere.
  • smart CENPB 62aa 6e-12 in ref transcript
    • Putative DNA-binding domain in centromere protein B, mouse jerky and transposases.

POLDIP3

  • refseq_POLDIP3.F1 refseq_POLDIP3.R1 301 388
  • NCBIGene 36.3 84271
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032311

  • cd RRM 65aa 1e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM 64aa 8e-09 in ref transcript
    • RNA recognition motif.
  • COG COG0724 133aa 2e-04 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

POLR3H

  • refseq_POLR3H.F1 refseq_POLR3H.R1 126 181
  • NCBIGene 36.2 171568
  • Alternative 5-prime, size difference: 55
  • Exclusion in 5'UTR
  • Reference transcript: NM_001018051

POMC

  • refseq_POMC.F1 refseq_POMC.R1 145 195
  • NCBIGene 36.3 5443
  • Single exon skipping, size difference: 50
  • Exclusion in 5'UTR
  • Reference transcript: NM_001035256

  • pfam NPP 45aa 8e-16 in ref transcript
    • Pro-opiomelanocortin, N-terminal region. This family features the N-terminal peptide of pro-opiomelanocortin (NPP). It is thought to represent an important pituitary peptide, given its high yield from pituitary glands, and exhibits a potent in vitro aldosterone-stimulating activity.
  • pfam Op_neuropeptide 29aa 2e-07 in ref transcript
    • Opioids neuropeptide. This family corresponds to the conserved YGG motif that is found in a wide variety of opioid neuropeptides such as enkephalin.
  • pfam ACTH_domain 39aa 1e-04 in ref transcript
    • Corticotropin ACTH domain.

POMZP3

  • refseq_POMZP3.F1 refseq_POMZP3.R1 186 416
  • NCBIGene 36.3 22932
  • Multiple exon skipping, size difference: 230
  • Exclusion in the protein causing a frameshift, Exclusion of the stop codon
  • Reference transcript: NM_012230

  • Changed! smart ZP 65aa 9e-09 in ref transcript
    • Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).

POP5

  • refseq_POP5.F1 refseq_POP5.R1 182 332
  • NCBIGene 36.3 51367
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015918

  • Changed! pfam RNase_P_Rpp14 111aa 4e-21 in ref transcript
    • Rpp14/Pop5 family. tRNA processing enzyme ribonuclease P (RNase P) consists of an RNA molecule associated with at least eight protein subunits, hPop1, Rpp14, Rpp20, Rpp25, Rpp29, Rpp30, Rpp38, and Rpp40. This protein is known as Pop5 in eukaryotes.
  • Changed! COG POP5 114aa 8e-06 in ref transcript
    • RNase P/RNase MRP subunit POP5 [Translation, ribosomal structure and biogenesis].

PORCN

  • refseq_PORCN.F1 refseq_PORCN.R1 155 226
  • NCBIGene 36.3 64840
  • Alternative 3-prime, size difference: 71
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_203475

  • Changed! pfam MBOAT 288aa 8e-29 in ref transcript
    • MBOAT family. The MBOAT (membrane bound O-acyl transferase) family of membrane proteins contains a variety of acyltransferase enzymes. A conserved histidine has been suggested to be the active site residue.
  • Changed! COG COG5202 336aa 5e-09 in ref transcript
    • Predicted membrane protein [Function unknown].

PPA2

  • refseq_PPA2.F2 refseq_PPA2.R2 286 340
  • NCBIGene 36.3 27068
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_176869

  • Changed! cd pyrophosphatase 184aa 1e-48 in ref transcript
    • Inorganic pyrophosphatase. These enzymes hydrolyze inorganic pyrophosphate (PPi) to two molecules of orthophosphates (Pi). The reaction requires bivalent cations. The enzymes in general exist as homooligomers.
  • Changed! pfam Pyrophosphatase 185aa 3e-55 in ref transcript
    • Inorganic pyrophosphatase.
  • Changed! COG Ppa 216aa 7e-33 in ref transcript
    • Inorganic pyrophosphatase [Energy production and conversion].
  • Changed! cd pyrophosphatase 178aa 1e-40 in modified transcript
  • Changed! pfam Pyrophosphatase 171aa 3e-50 in modified transcript
  • Changed! COG Ppa 198aa 7e-29 in modified transcript

PPA2

  • refseq_PPA2.F3 refseq_PPA2.R3 188 275
  • NCBIGene 36.3 27068
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_176869

  • Changed! cd pyrophosphatase 184aa 1e-48 in ref transcript
    • Inorganic pyrophosphatase. These enzymes hydrolyze inorganic pyrophosphate (PPi) to two molecules of orthophosphates (Pi). The reaction requires bivalent cations. The enzymes in general exist as homooligomers.
  • Changed! pfam Pyrophosphatase 185aa 3e-55 in ref transcript
    • Inorganic pyrophosphatase.
  • Changed! COG Ppa 216aa 7e-33 in ref transcript
    • Inorganic pyrophosphatase [Energy production and conversion].
  • Changed! cd pyrophosphatase 155aa 5e-39 in modified transcript
  • Changed! pfam Pyrophosphatase 156aa 4e-45 in modified transcript
  • Changed! COG Ppa 187aa 2e-25 in modified transcript

PPAP2C

  • refseq_PPAP2C.F1 refseq_PPAP2C.R1 127 168
  • NCBIGene 36.3 8612
  • Alternative 5-prime, size difference: 41
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003712

  • Changed! cd PAP2_wunen 131aa 2e-48 in ref transcript
    • PAP2, wunen subfamily. Most likely a family of membrane associated phosphatidic acid phosphatases. Wunen is a drosophila protein expressed in the central nervous system, which provides repellent activity towards primordial germ cells (PGCs), controls the survival of PGCs and is essential in the migration process of these cells towards the somatic gonadal precursors.
  • Changed! pfam PAP2 123aa 4e-19 in ref transcript
    • PAP2 superfamily. This family includes the enzyme type 2 phosphatidic acid phosphatase (PAP2), Glucose-6-phosphatase EC:3.1.3.9, Phosphatidylglycerophosphatase B EC:3.1.3.27 and bacterial acid phosphatase EC:3.1.3.2. The family also includes a variety of haloperoxidases that function by oxidising halides in the presence of hydrogen peroxide to form the corresponding hypohalous acids.
  • Changed! COG PgpB 134aa 0.003 in ref transcript
    • Membrane-associated phospholipid phosphatase [Lipid metabolism].

PPEF1

  • refseq_PPEF1.F1 refseq_PPEF1.R1 195 381
  • NCBIGene 36.3 5475
  • Single exon skipping, size difference: 186
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006240

  • Changed! cd PP2Ac 318aa 5e-63 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
  • cd EFh 66aa 2e-07 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! smart PP2Ac 322aa 2e-91 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
  • pfam PPP5 62aa 3e-07 in ref transcript
    • PPP5. This domain is specific to the PPP5 subfamily of serine/threonine phosphatases.
  • smart EH 67aa 0.001 in ref transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
  • Changed! PTZ PTZ00244 314aa 5e-30 in ref transcript
    • serine/threonine-protein phosphatase PP1; Provisional.
  • COG FRQ1 68aa 2e-05 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! cd PP2Ac 256aa 2e-42 in modified transcript
  • Changed! smart PP2Ac 260aa 1e-64 in modified transcript
  • Changed! PTZ PTZ00244 176aa 5e-22 in modified transcript

PPEF1

  • refseq_PPEF1.F1 refseq_PPEF1.R2 219 303
  • NCBIGene 36.3 5475
  • Alternative 5-prime, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006240

  • Changed! cd PP2Ac 318aa 5e-63 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
  • cd EFh 66aa 2e-07 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! smart PP2Ac 322aa 2e-91 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
  • pfam PPP5 62aa 3e-07 in ref transcript
    • PPP5. This domain is specific to the PPP5 subfamily of serine/threonine phosphatases.
  • smart EH 67aa 0.001 in ref transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
  • Changed! PTZ PTZ00244 314aa 5e-30 in ref transcript
    • serine/threonine-protein phosphatase PP1; Provisional.
  • COG FRQ1 68aa 2e-05 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! cd PP2Ac 290aa 2e-65 in modified transcript
  • Changed! smart PP2Ac 294aa 8e-93 in modified transcript
  • Changed! PTZ PTZ00244 286aa 7e-33 in modified transcript
  • Changed! PTZ PTZ00184 148aa 0.010 in modified transcript
    • calmodulin; Provisional.

PPHLN1

  • refseq_PPHLN1.F1 refseq_PPHLN1.R1 118 283
  • NCBIGene 36.3 51535
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016488

PPHLN1

  • refseq_PPHLN1.F4 refseq_PPHLN1.R4 112 155
  • NCBIGene 36.3 51535
  • Single exon skipping, size difference: 43
  • Inclusion in 5'UTR
  • Reference transcript: NM_016488

PPIL2

  • refseq_PPIL2.F1 refseq_PPIL2.R1 110 248
  • NCBIGene 36.3 23759
  • Alternative 3-prime, size difference: 138
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_148176

  • cd cyclophilin_RING 160aa 9e-76 in ref transcript
    • cyclophilin_RING: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) having a modified RING finger domain. This group includes the nuclear proteins, Human hCyP-60 and Caenorhabditis elegans MOG-6 which, compared to the archetypal cyclophilin Human cyclophilin A exhibit reduced peptidylprolyl cis- trans isomerase activity and lack a residue important for cyclophilin binding. Human hCyP-60 has been shown to physically interact with the proteinase inhibitor peptide eglin c and; C. elegans MOG-6 to physically interact with MEP-1, a nuclear zinc finger protein. MOG-6 has been shown to function in germline sex determination.
  • pfam Pro_isomerase 140aa 4e-35 in ref transcript
    • Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
  • smart Ubox 60aa 1e-12 in ref transcript
    • Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
  • COG PpiB 153aa 8e-40 in ref transcript
    • Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].
  • COG COG1723 117aa 0.010 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

PPIL2

  • refseq_PPIL2.F2 refseq_PPIL2.R2 237 379
  • NCBIGene 36.3 23759
  • Alternative 3-prime, size difference: 4
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_148176

  • cd cyclophilin_RING 160aa 9e-76 in ref transcript
    • cyclophilin_RING: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) having a modified RING finger domain. This group includes the nuclear proteins, Human hCyP-60 and Caenorhabditis elegans MOG-6 which, compared to the archetypal cyclophilin Human cyclophilin A exhibit reduced peptidylprolyl cis- trans isomerase activity and lack a residue important for cyclophilin binding. Human hCyP-60 has been shown to physically interact with the proteinase inhibitor peptide eglin c and; C. elegans MOG-6 to physically interact with MEP-1, a nuclear zinc finger protein. MOG-6 has been shown to function in germline sex determination.
  • pfam Pro_isomerase 140aa 4e-35 in ref transcript
    • Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
  • smart Ubox 60aa 1e-12 in ref transcript
    • Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
  • COG PpiB 153aa 8e-40 in ref transcript
    • Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG COG1723 117aa 0.010 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

PPIL2

  • refseq_PPIL2.F3 refseq_PPIL2.R3 256 398
  • NCBIGene 36.3 23759
  • Alternative 3-prime, size difference: 4
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_148176

  • cd cyclophilin_RING 160aa 9e-76 in ref transcript
    • cyclophilin_RING: cyclophilin-type peptidylprolyl cis- trans isomerases (cyclophilins) having a modified RING finger domain. This group includes the nuclear proteins, Human hCyP-60 and Caenorhabditis elegans MOG-6 which, compared to the archetypal cyclophilin Human cyclophilin A exhibit reduced peptidylprolyl cis- trans isomerase activity and lack a residue important for cyclophilin binding. Human hCyP-60 has been shown to physically interact with the proteinase inhibitor peptide eglin c and; C. elegans MOG-6 to physically interact with MEP-1, a nuclear zinc finger protein. MOG-6 has been shown to function in germline sex determination.
  • pfam Pro_isomerase 140aa 4e-35 in ref transcript
    • Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
  • smart Ubox 60aa 1e-12 in ref transcript
    • Modified RING finger domain. Modified RING finger domain, without the full complement of Zn2+-binding ligands. Probable involvement in E2-dependent ubiquitination.
  • COG PpiB 153aa 8e-40 in ref transcript
    • Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG COG1723 117aa 0.010 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

PPIL3

  • refseq_PPIL3.F2 refseq_PPIL3.R2 105 396
  • NCBIGene 36.3 53938
  • Alternative 5-prime and 3-prime, size difference: 291
  • Inclusion in 5'UTR, Inclusion in 5'UTR
  • Reference transcript: NM_032472

  • cd Cyclophilin_PPIL3_like 158aa 2e-55 in ref transcript
    • Cyclophilin_PPIL3_like. Proteins similar to Human cyclophilin-like peptidylprolyl cis- trans isomerase (PPIL3). Members of this family lack a key residue important for cyclosporin binding: the tryptophan residue corresponding to W121 in human hCyP-18a; most members have a histidine at this position. The exact function of the protein is not known.
  • pfam Pro_isomerase 152aa 5e-26 in ref transcript
    • Cyclophilin type peptidyl-prolyl cis-trans isomerase/CLD. The peptidyl-prolyl cis-trans isomerases, also known as cyclophilins, share this domain of about 109 amino acids. Cyclophilins have been found in all organisms studied so far and catalyse peptidyl-prolyl isomerisation during which the peptide bond preceding proline (the peptidyl-prolyl bond) is stabilised in the cis conformation. Mammalian cyclophilin A (CypA) is a major cellular target for the immunosuppressive drug cyclosporin A (CsA). Other roles for cyclophilins may include chaperone and cell signalling function.
  • COG PpiB 157aa 2e-24 in ref transcript
    • Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family [Posttranslational modification, protein turnover, chaperones].

PPM1A

  • refseq_PPM1A.F2 refseq_PPM1A.R2 279 381
  • NCBIGene 36.3 5494
  • Single exon skipping, size difference: 102
  • Inclusion in 5'UTR
  • Reference transcript: NM_021003

  • cd PP2Cc 269aa 5e-71 in ref transcript
    • Serine/threonine phosphatases, family 2C, catalytic domain; The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
  • smart PP2Cc 273aa 1e-82 in ref transcript
    • Serine/threonine phosphatases, family 2C, catalytic domain. The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
  • pfam PP2C_C 80aa 1e-32 in ref transcript
    • Protein serine/threonine phosphatase 2C, C-terminal domain. Protein phosphatase 2C (PP2C) is involved in regulating cellular responses to stress in various eukaryotes. It consists of two domains: an N-terminal catalytic domain and a C-terminal domain characteristic of mammalian PP2Cs. This domain consists of three antiparallel alpha helices, one of which packs against two corresponding alpha-helices of the N-terminal domain. The C-terminal domain does not seem to play a role in catalysis, but it may provide protein substrate specificity due to the cleft that is created between it and the catalytic domain.
  • PTZ PTZ00224 293aa 2e-45 in ref transcript
    • protein phosphatase 2C; Provisional.

PPP1CA

  • refseq_PPP1CA.F2 refseq_PPP1CA.R2 129 162
  • NCBIGene 36.3 5499
  • Alternative 3-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008709

  • cd PP2Ac 267aa 1e-116 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
  • smart PP2Ac 270aa 1e-121 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
  • Changed! PTZ PTZ00244 281aa 1e-118 in ref transcript
    • serine/threonine-protein phosphatase PP1; Provisional.
  • Changed! PTZ PTZ00244 289aa 1e-120 in modified transcript

PPP1CB

  • refseq_PPP1CB.F2 refseq_PPP1CB.R2 118 405
  • NCBIGene 36.2 5500
  • Multiple exon skipping, size difference: 287
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206876

  • Changed! cd PP2Ac 267aa 1e-114 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, including PP1, PP2A and PP2B (calcineurin) family members.
  • Changed! smart PP2Ac 271aa 1e-120 in ref transcript
    • Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
  • Changed! PTZ PTZ00244 288aa 1e-117 in ref transcript
    • serine/threonine-protein phosphatase PP1; Provisional.
  • Changed! cd PP2Ac 168aa 2e-69 in modified transcript
  • Changed! smart PP2Ac 168aa 4e-73 in modified transcript
  • Changed! PTZ PTZ00244 192aa 7e-72 in modified transcript

PPP1R12B

  • refseq_PPP1R12B.F1 refseq_PPP1R12B.R1 133 314
  • NCBIGene 36.3 4660
  • Single exon skipping, size difference: 181
  • Exclusion of the stop codon
  • Reference transcript: NM_032105

  • cd ANK 103aa 2e-25 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 152aa 7e-20 in ref transcript
  • cd ANK 58aa 1e-06 in ref transcript
  • pfam Ank 33aa 4e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 32aa 2e-05 in ref transcript
  • pfam Ank 31aa 1e-04 in ref transcript
  • TIGR trp 170aa 8e-04 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • pfam Ank 33aa 0.005 in ref transcript
  • COG Arp 125aa 1e-12 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • PTZ PTZ00322 77aa 6e-06 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

PPP1R8

  • refseq_PPP1R8.F2 refseq_PPP1R8.R2 257 377
  • NCBIGene 36.3 5511
  • Alternative 5-prime, size difference: 120
  • Inclusion in 5'UTR
  • Reference transcript: NM_138558

PPP1R8

  • refseq_PPP1R8.F3 refseq_PPP1R8.R3 145 366
  • NCBIGene 36.3 5511
  • Single exon skipping, size difference: 221
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014110

  • Changed! cd FHA 100aa 3e-12 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • Changed! pfam FHA 69aa 8e-13 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • Changed! COG COG1716 89aa 0.001 in ref transcript
    • FOG: FHA domain [Signal transduction mechanisms].
  • Changed! cd FHA 65aa 2e-06 in modified transcript
  • Changed! pfam FHA 42aa 2e-07 in modified transcript

PPP2R2B

  • refseq_PPP2R2B.F1 refseq_PPP2R2B.R1 100 294
  • NCBIGene 36.3 5521
  • Single exon skipping, size difference: 194
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_181674

  • Changed! cd WD40 347aa 4e-07 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! pfam PP2A_B_N 71aa 1e-28 in ref transcript
    • Serine-threonine phosphatase 2A subunit B alpha and beta N-terminal domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is the N-terminal domain of subunit B that is thought to perform substrate recognition function or be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
  • Changed! pfam PP2A_B_subs_rcg 61aa 5e-27 in ref transcript
    • Serine-threonine phosphatase 2A subunit B alpha and beta central domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is a central highly conserved region of subunit B that is thought to perform a substrate recognition function or to be be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found to modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
  • Changed! COG CDC55 417aa 1e-122 in ref transcript
    • Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms].

PPP2R2D

  • refseq_PPP2R2D.F1 refseq_PPP2R2D.R1 177 304
  • NCBIGene 36.3 55844
  • Alternative 3-prime, size difference: 127
  • Exclusion of the protein initiation site
  • Reference transcript: NM_018461

  • Changed! pfam PP2A_B_subs_rcg 61aa 3e-26 in ref transcript
    • Serine-threonine phosphatase 2A subunit B alpha and beta central domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is a central highly conserved region of subunit B that is thought to perform a substrate recognition function or to be be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found to modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
  • Changed! COG CDC55 282aa 9e-80 in ref transcript
    • Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms].
  • Changed! COG CDC55 221aa 2e-52 in modified transcript

PPP2R2D

  • refseq_PPP2R2D.F2 refseq_PPP2R2D.R2 242 355
  • NCBIGene 36.3 55844
  • Single exon skipping, size difference: 113
  • Inclusion in 5'UTR
  • Reference transcript: NM_018461

  • pfam PP2A_B_subs_rcg 61aa 3e-26 in ref transcript
    • Serine-threonine phosphatase 2A subunit B alpha and beta central domain. Protein phosphatase 2A (PP2A) is a serine/threonine phosphatase involved in many aspects of cellular function including the regulation of metabolic enzymes and proteins involved in signal transduction. PP2A is a trimeric enzyme consisting of a central catalytic core, and two subunits, A and B. This is a central highly conserved region of subunit B that is thought to perform a substrate recognition function or to be be responsible for targeting the enzyme complex to the appropriate subcellular compartment. Changes in intracellular PP2A subunit composition have been found to modulate micro-tubule dynamics. It would appear that reduced amounts of neuronal B-alpha-containing PP2A heterotrimers contribute to micro-tubule destabilisation in Alzheimer's disease.
  • COG CDC55 282aa 9e-80 in ref transcript
    • Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms].

PPP2R4

  • refseq_PPP2R4.F1 refseq_PPP2R4.R1 121 226
  • NCBIGene 36.3 5524
  • Single exon skipping, size difference: 105
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_021131

  • Changed! cd PTPA 266aa 1e-116 in ref transcript
    • Phosphotyrosyl phosphatase activator (PTPA) is also known as protein phosphatase 2A (PP2A) phosphatase activator. PTPA is an essential, well conserved protein that stimulates the tyrosyl phosphatase activity of PP2A. It also reactivates the serine/threonine phosphatase activity of an inactive form of PP2A. Together, PTPA and PP2A constitute an ATPase. It has been suggested that PTPA alters the relative specificity of PP2A from phosphoserine/phosphothreonine substrates to phosphotyrosine substrates in an ATP-hydrolysis-dependent manner. Basal expression of PTPA is controlled by the transcription factor Yin Yang1 (YY1). PTPA has been suggested to play a role in the insertion of metals to the PP2A catalytic subunit (PP2Ac) active site, to act as a chaperone, and more recently, to have peptidyl prolyl cis/trans isomerase activity that specifically targets human PP2Ac.
  • Changed! pfam PTPA 299aa 1e-142 in ref transcript
    • Phosphotyrosyl phosphate activator (PTPA) protein. Phosphotyrosyl phosphatase activator (PTPA) proteins stimulate the phosphotyrosyl phosphatase (PTPase) activity of the dimeric form of protein phosphatase 2A (PP2A). PTPase activity in PP2A (in vitro) is relatively low when compared to the better recognised phosphoserine/ threonine protein phosphorylase activity. The specific biological role of PTPA is unknown, Basal expression of PTPA depends on the activity of a ubiquitous transcription factor, Yin Yang 1 (YY1). The tumour suppressor protein p53 can inhibit PTPA expression through an unknown mechanism that negatively controls YY1.
  • Changed! COG LAG1 301aa 3e-85 in ref transcript
    • Phosphotyrosyl phosphatase activator [Cell division and chromosome partitioning / Signal transduction mechanisms].
  • Changed! cd PTPA 301aa 1e-112 in modified transcript
  • Changed! pfam PTPA 334aa 1e-138 in modified transcript
  • Changed! COG LAG1 336aa 2e-82 in modified transcript

PPP2R5C

  • refseq_PPP2R5C.F2 refseq_PPP2R5C.R2 287 404
  • NCBIGene 36.3 5527
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002719

  • pfam B56 409aa 1e-170 in ref transcript
    • Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzyme to act on a diverse array of substrates is largely controlled by the nature of its regulatory B subunit. There are multiple families of B subunits (See also pfam01240), this family is called the B56 family.

PQBP1

  • refseq_PQBP1.F1 refseq_PQBP1.R1 183 316
  • NCBIGene 36.2 10084
  • Alternative 5-prime, size difference: 133
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001032381

  • pfam WW 31aa 0.004 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.

PQLC2

  • refseq_PQLC2.F2 refseq_PQLC2.R2 159 401
  • NCBIGene 36.3 54896
  • Single exon skipping, size difference: 242
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001040125

  • pfam PQ-loop 57aa 4e-06 in ref transcript
    • PQ loop repeat. Members of this family are all membrane bound proteins possessing a pair of repeats each spanning two transmembrane helices connected by a loop. The PQ motif found on loop 2 is critical for the localisation of cystinosin to lysosomes. However, the PQ motif appears not to be a general lysosome-targeting motif. It is thought likely to possess a more general function. Most probably this involves a glutamine residue.

PRAME

  • refseq_PRAME.F1 refseq_PRAME.R1 112 133
  • NCBIGene 36.3 23532
  • Alternative 3-prime, size difference: 21
  • Inclusion in 5'UTR
  • Reference transcript: NM_206956

PRC1

  • refseq_PRC1.F2 refseq_PRC1.R2 154 196
  • NCBIGene 36.3 9055
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003981

  • Changed! pfam MAP65_ASE1 579aa 1e-159 in ref transcript
    • Microtubule associated protein (MAP65/ASE1 family).
  • Changed! pfam MAP65_ASE1 565aa 1e-153 in modified transcript

PRCC

  • refseq_PRCC.F1 refseq_PRCC.R1 126 222
  • NCBIGene 36.3 5546
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005973

  • pfam PRCC_Cterm 41aa 2e-13 in ref transcript
    • Mitotic checkpoint protein PRCC_Cterm. This is the highly conserved C-terminal domain of the renal papillary carcinoma protein PRCC. The function of this domain is not known.

PRCP

  • refseq_PRCP.F2 refseq_PRCP.R2 137 200
  • NCBIGene 36.3 5547
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199418

  • Changed! pfam Peptidase_S28 421aa 1e-84 in ref transcript
    • Serine carboxypeptidase S28. These serine proteases include several eukaryotic enzymes such as lysosomal Pro-X carboxypeptidase, dipeptidyl-peptidase II, and thymus-specific serine peptidase.
  • Changed! pfam Peptidase_S28 423aa 9e-86 in modified transcript

PRDM10

  • refseq_PRDM10.F1 refseq_PRDM10.R1 120 159
  • NCBIGene 36.3 56980
  • Alternative 5-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020228

PRDM15

  • refseq_PRDM15.F2 refseq_PRDM15.R2 179 377
  • NCBIGene 36.3 63977
  • Single exon skipping, size difference: 198
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022115

  • smart SET 109aa 0.003 in ref transcript
    • SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.
  • COG COG5048 170aa 0.002 in ref transcript
    • FOG: Zn-finger [General function prediction only].

PRDM16

  • refseq_PRDM16.F1 refseq_PRDM16.R1 226 283
  • NCBIGene 36.3 63976
  • Alternative 3-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022114

  • smart SET 124aa 9e-09 in ref transcript
    • SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.
  • pfam zf-C2H2 23aa 0.003 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
  • COG COG5048 70aa 0.007 in ref transcript
    • FOG: Zn-finger [General function prediction only].

PRDX1

  • refseq_PRDX1.F1 refseq_PRDX1.R1 118 144
  • NCBIGene 36.3 5052
  • Alternative 5-prime, size difference: 26
  • Exclusion in 5'UTR
  • Reference transcript: NM_002574

  • cd PRX_Typ2cys 173aa 2e-87 in ref transcript
    • Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides. The functional unit of typical 2-cys PRX is a homodimer. A unique intermolecular redox-active disulfide center is utilized for its activity. Upon reaction with peroxides, its peroxidatic cysteine is oxidized into a sulfenic acid intermediate which is resolved by bonding with the resolving cysteine from the other subunit of the homodimer. This intermolecular disulfide bond is then reduced by thioredoxin, tryparedoxin or AhpF. Typical 2-cys PRXs, like 1-cys PRXs, form decamers which are stabilized by reduction of the active site cysteine. Typical 2-cys PRX interacts through beta strands at one edge of the monomer (B-type interface) to form the functional homodimer, and uses an A-type interface (similar to the dimeric interface in atypical 2-cys PRX and PRX5) at the opposite end of the monomer to form the stable decameric (pentamer of dimers) structure.
  • TIGR AhpC 179aa 1e-46 in ref transcript
    • This gene contains two invariant cysteine residues, one near the N-terminus and one near the C-terminus, each followed immediately by a proline residue.
  • PTZ PTZ00253 196aa 5e-85 in ref transcript
    • tryparedoxin peroxidase; Provisional.

PRDX2

  • refseq_PRDX2.F1 refseq_PRDX2.R1 188 342
  • NCBIGene 36.2 7001
  • Single exon skipping, size difference: 154
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005809

  • Changed! cd PRX_Typ2cys 172aa 1e-85 in ref transcript
    • Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides. The functional unit of typical 2-cys PRX is a homodimer. A unique intermolecular redox-active disulfide center is utilized for its activity. Upon reaction with peroxides, its peroxidatic cysteine is oxidized into a sulfenic acid intermediate which is resolved by bonding with the resolving cysteine from the other subunit of the homodimer. This intermolecular disulfide bond is then reduced by thioredoxin, tryparedoxin or AhpF. Typical 2-cys PRXs, like 1-cys PRXs, form decamers which are stabilized by reduction of the active site cysteine. Typical 2-cys PRX interacts through beta strands at one edge of the monomer (B-type interface) to form the functional homodimer, and uses an A-type interface (similar to the dimeric interface in atypical 2-cys PRX and PRX5) at the opposite end of the monomer to form the stable decameric (pentamer of dimers) structure.
  • Changed! TIGR AhpC 181aa 5e-51 in ref transcript
    • This gene contains two invariant cysteine residues, one near the N-terminus and one near the C-terminus, each followed immediately by a proline residue.
  • Changed! PTZ PTZ00253 195aa 2e-80 in ref transcript
    • tryparedoxin peroxidase; Provisional.
  • Changed! cd PRX_Typ2cys 28aa 7e-07 in modified transcript
  • Changed! pfam AhpC-TSA 29aa 7e-04 in modified transcript
    • AhpC/TSA family. This family contains proteins related to alkyl hydroperoxide reductase (AhpC) and thiol specific antioxidant (TSA).
  • Changed! PTZ PTZ00253 35aa 1e-04 in modified transcript

PRDX3

  • refseq_PRDX3.F2 refseq_PRDX3.R2 107 161
  • NCBIGene 36.3 10935
  • Alternative 5-prime and 3-prime, size difference: 54
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006793

  • Changed! cd PRX_Typ2cys 172aa 8e-84 in ref transcript
    • Peroxiredoxin (PRX) family, Typical 2-Cys PRX subfamily; PRXs are thiol-specific antioxidant (TSA) proteins, which confer a protective role in cells through its peroxidase activity by reducing hydrogen peroxide, peroxynitrite, and organic hydroperoxides. The functional unit of typical 2-cys PRX is a homodimer. A unique intermolecular redox-active disulfide center is utilized for its activity. Upon reaction with peroxides, its peroxidatic cysteine is oxidized into a sulfenic acid intermediate which is resolved by bonding with the resolving cysteine from the other subunit of the homodimer. This intermolecular disulfide bond is then reduced by thioredoxin, tryparedoxin or AhpF. Typical 2-cys PRXs, like 1-cys PRXs, form decamers which are stabilized by reduction of the active site cysteine. Typical 2-cys PRX interacts through beta strands at one edge of the monomer (B-type interface) to form the functional homodimer, and uses an A-type interface (similar to the dimeric interface in atypical 2-cys PRX and PRX5) at the opposite end of the monomer to form the stable decameric (pentamer of dimers) structure.
  • Changed! TIGR AhpC 181aa 7e-55 in ref transcript
    • This gene contains two invariant cysteine residues, one near the N-terminus and one near the C-terminus, each followed immediately by a proline residue.
  • Changed! PTZ PTZ00253 197aa 4e-77 in ref transcript
    • tryparedoxin peroxidase; Provisional.
  • Changed! cd PRX_Typ2cys 169aa 6e-83 in modified transcript
  • Changed! TIGR AhpC 177aa 2e-55 in modified transcript
  • Changed! PTZ PTZ00253 190aa 2e-76 in modified transcript

PRDX5

  • refseq_PRDX5.F2 refseq_PRDX5.R2 196 328
  • NCBIGene 36.3 25824
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012094

  • Changed! cd PRX5_like 155aa 8e-51 in ref transcript
    • Peroxiredoxin (PRX) family, PRX5-like subfamily; members are similar to the human protein, PRX5, a homodimeric TRX peroxidase, widely expressed in tissues and found cellularly in mitochondria, peroxisomes and the cytosol. The cellular location of PRX5 suggests that it may have an important antioxidant role in organelles that are major sources of reactive oxygen species (ROS), as well as a role in the control of signal transduction. PRX5 has been shown to reduce hydrogen peroxide, alkyl hydroperoxides and peroxynitrite. As with all other PRXs, the N-terminal peroxidatic cysteine of PRX5 is oxidized into a sulfenic acid intermediate upon reaction with peroxides. Human PRX5 is able to resolve this intermediate by forming an intramolecular disulfide bond with its C-terminal cysteine (the resolving cysteine), which can then be reduced by TRX, just like an atypical 2-cys PRX. This resolving cysteine, however, is not conserved in other members of the subfamily. In such cases, it is assumed that the oxidized cysteine is directly resolved by an external small-molecule or protein reductant, typical of a 1-cys PRX. In the case of the H. influenza PRX5 hybrid, the resolving glutaredoxin domain is on the same protein chain as PRX. PRX5 homodimers show an A-type interface, similar to atypical 2-cys PRXs.
  • Changed! pfam Redoxin 145aa 1e-30 in ref transcript
    • Redoxin. This family of redoxins includes peroxiredoxin, thioredoxin and glutaredoxin proteins.
  • Changed! COG AHP1 159aa 3e-39 in ref transcript
    • Peroxiredoxin [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd PRX5_like 111aa 4e-26 in modified transcript
  • Changed! pfam Redoxin 101aa 2e-12 in modified transcript
  • Changed! COG AHP1 115aa 7e-17 in modified transcript

PRKAA1

  • refseq_PRKAA1.F1 refseq_PRKAA1.R1 182 227
  • NCBIGene 36.3 5562
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206907

  • Changed! cd S_TKc 269aa 6e-70 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 249aa 1e-76 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00263 272aa 6e-37 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! cd S_TKc 254aa 1e-72 in modified transcript
  • Changed! smart S_TKc 234aa 2e-79 in modified transcript
  • Changed! PTZ PTZ00263 257aa 2e-39 in modified transcript

PRKAG1

  • refseq_PRKAG1.F1 refseq_PRKAG1.R1 151 200
  • NCBIGene 36.3 5571
  • Alternative 3-prime, size difference: 49
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_002733

  • Changed! cd CBS_pair_28 120aa 4e-41 in ref transcript
    • The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members), Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase), retinitis pigmentosa (IMP dehydrogenase-1), and homocystinuria (cystathionine beta-synthase).
  • Changed! cd CBS_pair_5 130aa 2e-29 in ref transcript
    • The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members), Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase), retinitis pigmentosa (IMP dehydrogenase-1), and homocystinuria (cystathionine beta-synthase).
  • Changed! cd CBS_pair_14 123aa 1e-11 in ref transcript
    • The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members), Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase), retinitis pigmentosa (IMP dehydrogenase-1), and homocystinuria (cystathionine beta-synthase).
  • Changed! pfam CBS 121aa 1e-16 in ref transcript
    • CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
  • Changed! pfam CBS 132aa 7e-12 in ref transcript
  • Changed! PRK PRK05567 109aa 2e-08 in ref transcript
    • inositol-5'-monophosphate dehydrogenase; Reviewed.
  • Changed! COG COG3448 120aa 3e-06 in ref transcript
    • CBS-domain-containing membrane protein [Signal transduction mechanisms].
  • Changed! COG COG2905 128aa 5e-05 in ref transcript
    • Predicted signal-transduction protein containing cAMP-binding and CBS domains [Signal transduction mechanisms].

PRKAR1A

  • refseq_PRKAR1A.F1 refseq_PRKAR1A.R1 169 200
  • NCBIGene 36.3 5573
  • Alternative 5-prime, size difference: 31
  • Exclusion in 5'UTR
  • Reference transcript: NM_212472

  • cd CAP_ED 116aa 1e-21 in ref transcript
    • effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
  • cd CAP_ED 110aa 4e-20 in ref transcript
  • smart cNMP 119aa 2e-22 in ref transcript
    • Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
  • smart cNMP 115aa 4e-21 in ref transcript
  • smart RIIa 38aa 1e-08 in ref transcript
    • RIIalpha, Regulatory subunit portion of type II PKA R-subunit. RIIalpha, Regulatory subunit portion of type II PKA R-subunit. Contains dimerisation interface and binding site for A-kinase-anchoring proteins (AKAPs).
  • COG Crp 111aa 4e-12 in ref transcript
    • cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
  • COG Crp 127aa 5e-11 in ref transcript

ZMYND8

  • refseq_PRKCBP1.F1 refseq_PRKCBP1.R1 197 272
  • NCBIGene 36.3 23613
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_183047

  • cd Bromo_RACK7 99aa 1e-49 in ref transcript
    • Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BS69_related 81aa 4e-35 in ref transcript
    • The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions. BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • smart BROMO 101aa 3e-15 in ref transcript
    • bromo domain.
  • pfam PWWP 62aa 5e-08 in ref transcript
    • PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
  • pfam PHD 41aa 2e-07 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • pfam zf-MYND 35aa 4e-05 in ref transcript
    • MYND finger.
  • COG COG5076 127aa 8e-07 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].

ZMYND8

  • refseq_PRKCBP1.F3 refseq_PRKCBP1.R3 180 264
  • NCBIGene 36.3 23613
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_183047

  • cd Bromo_RACK7 99aa 1e-49 in ref transcript
    • Bromodomain, RACK7_like subfamily. RACK7 (also called human protein kinase C-binding protein) was identified as a potential tumor suppressor genes, it shares domain architecture with BS69/ZMYND11; both have been implicated in the regulation of cellular proliferation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BS69_related 81aa 4e-35 in ref transcript
    • The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions. BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • smart BROMO 101aa 3e-15 in ref transcript
    • bromo domain.
  • pfam PWWP 62aa 5e-08 in ref transcript
    • PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
  • pfam PHD 41aa 2e-07 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • pfam zf-MYND 35aa 4e-05 in ref transcript
    • MYND finger.
  • COG COG5076 127aa 8e-07 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].

PRMT1

  • refseq_PRMT1.F1 refseq_PRMT1.R1 131 250
  • NCBIGene 36.3 3276
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 119
  • Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001536

  • Changed! cd AdoMet_MTases 101aa 6e-06 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! pfam PrmA 59aa 2e-05 in ref transcript
    • Ribosomal protein L11 methyltransferase (PrmA). This family consists of several Ribosomal protein L11 methyltransferase (EC:2.1.1.-) sequences.
  • Changed! PRK prmA 77aa 5e-06 in ref transcript
    • ribosomal protein L11 methyltransferase; Reviewed.

PRMT2

  • refseq_PRMT2.F2 refseq_PRMT2.R2 123 232
  • NCBIGene 36.3 3275
  • Single exon skipping, size difference: 109
  • Exclusion in 5'UTR
  • Reference transcript: NM_206962

  • cd SH3 50aa 2e-08 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd AdoMet_MTases 101aa 3e-07 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • smart SH3 54aa 2e-10 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • pfam MTS 83aa 3e-06 in ref transcript
    • Methyltransferase small domain. This domain is found in ribosomal RNA small subunit methyltransferase C as well as other methyltransferases.
  • COG COG4076 131aa 4e-07 in ref transcript
    • Predicted RNA methylase [General function prediction only].

PRMT5

  • refseq_PRMT5.F2 refseq_PRMT5.R2 260 345
  • NCBIGene 36.3 10419
  • Alternative 5-prime, size difference: 85
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006109

  • Changed! cd AdoMet_MTases 71aa 4e-04 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! pfam PRMT5 439aa 0.0 in ref transcript
    • PRMT5 arginine-N-methyltransferase. The human homologue of yeast Skb1 (Shk1 kinase-binding protein 1) is PRMT5, an arginine-N-methyltransferase. These proteins appear to be key mitotic regulators. They play a role in Jak signalling in higher eukaryotes.
  • Changed! COG COG4076 168aa 4e-06 in ref transcript
    • Predicted RNA methylase [General function prediction only].

PRO1853

  • refseq_PRO1853.F2 refseq_PRO1853.R2 229 383
  • NCBIGene 36.2 55471
  • Alternative 5-prime, size difference: 154
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_144736

  • Changed! pfam DUF185 247aa 5e-33 in ref transcript
    • Uncharacterized ACR, COG1565. This family contains several uncharacterised proteins. One member has been described as an ATP synthase beta subunit transcription termination factor rho protein.
  • Changed! COG COG1565 357aa 4e-61 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam DUF185 57aa 1e-07 in modified transcript
  • Changed! COG COG1565 106aa 3e-27 in modified transcript

PRPF40B

  • refseq_PRPF40B.F2 refseq_PRPF40B.R2 102 123
  • NCBIGene 36.3 25766
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031698

  • cd WW 30aa 3e-05 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd WW 26aa 2e-04 in ref transcript
  • pfam WW 30aa 4e-06 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
  • smart WW 26aa 5e-05 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • smart FF 49aa 5e-05 in ref transcript
    • Contains two conserved F residues. A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
  • smart FF 49aa 6e-04 in ref transcript
  • Changed! COG PRP40 280aa 2e-18 in ref transcript
    • Splicing factor [RNA processing and modification].
  • COG PRP40 91aa 7e-17 in ref transcript
  • Changed! COG PRP40 299aa 1e-19 in modified transcript

PRR13

  • refseq_PRR13.F1 refseq_PRR13.R1 249 399
  • NCBIGene 36.3 54458
  • Alternative 3-prime, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018457

PRR5

  • refseq_PRR5.F2 refseq_PRR5.R2 231 348
  • NCBIGene 36.3 55615
  • Single exon skipping, size difference: 117
  • Exclusion of the protein initiation site
  • Reference transcript: NM_015366

  • Changed! pfam HbrB 132aa 6e-40 in ref transcript
    • HbrB-like. HbrB is involved hyphal growth and polarity.
  • Changed! pfam HbrB 71aa 4e-22 in modified transcript

PRRX1

  • refseq_PRRX1.F1 refseq_PRRX1.R1 161 233
  • NCBIGene 36.3 5396
  • Single exon skipping, size difference: 72
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_022716

  • cd homeodomain 55aa 2e-14 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 53aa 3e-20 in ref transcript
    • Homeobox domain.
  • COG COG5576 81aa 2e-08 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

PRSS21

  • refseq_PRSS21.F1 refseq_PRSS21.R1 112 154
  • NCBIGene 36.3 10942
  • Alternative 5-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006799

  • Changed! cd Tryp_SPc 242aa 3e-66 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! smart Tryp_SPc 241aa 7e-71 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 255aa 7e-23 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Tryp_SPc 228aa 4e-57 in modified transcript
  • Changed! smart Tryp_SPc 227aa 5e-61 in modified transcript
  • Changed! COG COG5640 241aa 3e-20 in modified transcript

PSMA3

  • refseq_PSMA3.F1 refseq_PSMA3.R1 111 132
  • NCBIGene 36.3 5684
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002788

  • Changed! cd proteasome_alpha_type_3 213aa 1e-109 in ref transcript
    • proteasome_alpha_type_3. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
  • Changed! pfam Proteasome 187aa 2e-47 in ref transcript
    • Proteasome A-type and B-type.
  • pfam Proteasome_A_N 23aa 8e-09 in ref transcript
    • Proteasome subunit A N-terminal signature. This domain is conserved in the A subunits of the proteasome complex proteins.
  • Changed! PRK PRK03996 234aa 1e-44 in ref transcript
    • proteasome subunit alpha; Provisional.
  • Changed! cd proteasome_alpha_type_3 206aa 1e-106 in modified transcript
  • Changed! pfam Proteasome 180aa 3e-44 in modified transcript
  • Changed! PRK PRK03996 227aa 4e-44 in modified transcript

PSMA8

  • refseq_PSMA8.F1 refseq_PSMA8.R1 100 118
  • NCBIGene 36.3 143471
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144662

  • Changed! cd proteasome_alpha_type_7 215aa 1e-102 in ref transcript
    • proteasome_alpha_type_7. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
  • Changed! TIGR arc_protsome_A 233aa 6e-67 in ref transcript
    • This protein family describes the archaeal proteasome alpha subunit, homologous to both the beta subunit and to the alpha and beta subunits of eukaryotic proteasome subunits. This family is universal in the first 29 complete archaeal genomes but occasionally is duplicated.
  • Changed! PRK PRK03996 233aa 5e-71 in ref transcript
    • proteasome subunit alpha; Provisional.
  • Changed! cd proteasome_alpha_type_7 209aa 1e-104 in modified transcript
  • Changed! TIGR arc_protsome_A 227aa 6e-69 in modified transcript
  • Changed! PRK PRK03996 227aa 5e-73 in modified transcript

PSMC3IP

  • refseq_PSMC3IP.F1 refseq_PSMC3IP.R1 145 181
  • NCBIGene 36.3 29893
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016556

  • cd Fur_like 58aa 0.003 in ref transcript
    • Ferric uptake regulator(Fur) and related metalloregulatory proteins; typically iron-dependent, DNA-binding repressors and activators. Ferric uptake regulator (Fur) and related metalloregulatory proteins are iron-dependent, DNA-binding repressors and activators mainly involved in iron metabolism. A general model for Fur repression under iron-rich conditions is that activated Fur (a dimer having one Fe2+ coordinated per monomer) binds to specific DNA sequences (Fur boxes) in the promoter region of iron-responsive genes, hindering access of RNA polymerase, and repressing transcription. Positive regulation by Fur can be direct or indirect, as in the Fur repression of an anti-sense regulatory small RNA. Some members sense metal ions other than Fe2+. For example, the zinc uptake regulator (Zur) responds to Zn2+, the manganese uptake regulator (Mur) responds to Mn2+, and the nickel uptake regulator (Nur) responds to Ni2+. Other members sense signals other than metal ions. For example, PerR, a metal-dependent sensor of hydrogen peroxide. PerR regulates DNA-binding activity through metal-based protein oxidation, and co-ordinates Mn2+ or Fe2+ at its regulatory site. Fur family proteins contain an N-terminal winged-helix DNA-binding domain followed by a dimerization domain; this CD spans both those domains.
  • Changed! pfam TBPIP 167aa 9e-56 in ref transcript
    • Tat binding protein 1(TBP-1)-interacting protein (TBPIP). This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation.
  • Changed! pfam TBPIP 155aa 3e-50 in modified transcript

PSMC4

  • refseq_PSMC4.F1 refseq_PSMC4.R1 111 204
  • NCBIGene 36.3 5704
  • Alternative 3-prime, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006503

  • cd AAA 168aa 2e-16 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! TIGR 26Sp45 347aa 1e-103 in ref transcript
    • Many proteins may score above the trusted cutoff because an internal.
  • Changed! COG RPT1 383aa 1e-137 in ref transcript
    • ATP-dependent 26S proteasome regulatory subunit [Posttranslational modification, protein turnover, chaperones].
  • Changed! TIGR 26Sp45 348aa 1e-104 in modified transcript
  • Changed! COG RPT1 350aa 1e-135 in modified transcript

PSMD10

  • refseq_PSMD10.F1 refseq_PSMD10.R1 194 273
  • NCBIGene 36.3 5716
  • Alternative 5-prime, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002814

  • Changed! cd ANK 124aa 2e-30 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Ank 33aa 6e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 31aa 2e-05 in ref transcript
  • pfam Ank 31aa 3e-05 in ref transcript
  • Changed! TIGR trp 153aa 2e-04 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! COG Arp 156aa 1e-17 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! PTZ PTZ00322 66aa 6e-08 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.
  • Changed! cd ANK 115aa 1e-27 in modified transcript
  • Changed! TIGR trp 103aa 1e-04 in modified transcript
  • Changed! COG Arp 137aa 8e-16 in modified transcript

PSME3

  • refseq_PSME3.F2 refseq_PSME3.R2 156 195
  • NCBIGene 36.3 10197
  • Alternative 3-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_176863

  • Changed! pfam PA28_beta 163aa 1e-68 in ref transcript
    • Proteasome activator pa28 beta subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the beta subunit. The activator complex binds to the 20S proteasome ana simulates peptidase activity in and ATP-independent manner.
  • pfam PA28_alpha 58aa 6e-20 in ref transcript
    • Proteasome activator pa28 alpha subunit. PA28 activator complex (also known as 11s regulator of 20S proteasome) is a ring shaped hexameric structure of alternating alpha and beta subunits. This family represents the alpha subunit. The activator complex binds to the 20S proteasome ana simulates peptidase activity in and ATP-independent manner.
  • Changed! pfam PA28_beta 150aa 1e-70 in modified transcript

PSMF1

  • refseq_PSMF1.F2 refseq_PSMF1.R2 119 202
  • NCBIGene 36.2 9491
  • Single exon skipping, size difference: 83
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_178578

  • Changed! pfam PI31_Prot_Reg 86aa 4e-04 in ref transcript
    • PI31 proteasome regulator. PI31 is a cellular regulator of proteasome formation and of proteasome-mediated antigen processing.

OLA1

  • refseq_PTD004.F1 refseq_PTD004.R1 199 300
  • NCBIGene 36.3 29789
  • Single exon skipping, size difference: 101
  • Exclusion of the protein initiation site
  • Reference transcript: NM_013341

  • Changed! cd YchF 279aa 1e-124 in ref transcript
    • YchF subfamily. YchF is a member of the Obg family, which includes four other subfamilies of GTPases: Obg, DRG, Ygr210, and NOG1. Obg is an essential gene that is involved in DNA replication in C. crescentus and Streptomyces griseus and is associated with the ribosome. Several members of the family, including YchF, possess the TGS domain related to the RNA-binding proteins. Experimental data and genomic analysis suggest that YchF may be part of a nucleoprotein complex and may function as a GTP-dependent translational factor.
  • cd TGS_YchF_C 83aa 2e-46 in ref transcript
    • TGS_YchF_C: This subfamily represents TGS domain-containing YchF GTP-binding protein, a universally conserved GTPase whose function is unknown. The N-terminal domain of the YchF protein belongs to the Obg-like family of GTPases, and some members of the family contain a C-terminal TGS domain. TGS is a small domain of about 50 amino acid residues with a predominantly beta-sheet structure. There is no direct information on the function of the TGS domain, but its presence in two types of regulatory proteins (the GTPases and guanosine polyphosphate phosphohydrolases/synthetases) suggests a ligand (most likely nucleotide)-binding, regulatory role.
  • Changed! TIGR TIGR00092 366aa 8e-92 in ref transcript
    • This predicted GTP-binding protein is found in a single copy in every complete bacterial genome, and is found in Eukaryotes. A more distantly related protein, separated from this model, is found in the archaea. It is known to bind GTP and double-stranded nucleic acid. It is suggested to belong to a nucleoprotein complex and act as a translation factor.
  • Changed! PTZ PTZ00258 373aa 1e-169 in ref transcript
    • GTP-binding protein; Provisional.
  • Changed! cd YchF 141aa 1e-40 in modified transcript
  • Changed! TIGR TIGR00092 226aa 8e-51 in modified transcript
  • Changed! PTZ PTZ00258 228aa 9e-90 in modified transcript

PTER

  • refseq_PTER.F2 refseq_PTER.R2 235 290
  • NCBIGene 36.3 9317
  • Mutually exclusive exon skipping, size difference: 55
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_030664

  • cd PTE 326aa 2e-97 in ref transcript
    • Phosphotriesterase (PTE) catalyzes the hydrolysis of organophosphate nerve agents, including the chemical warfare agents VX, soman, and sarin as well as the insecticide paraoxon. PTE exists as a homodimer with one active site per monomer. The active site is located next to a binuclear metal center, at the C-terminal end of a TIM alpha- beta barrel motif. The native enzyme contains two zinc ions at the active site however these can be replaced with other metals such as cobalt, cadmium, nickel or manganese and the enzyme remains active.
  • pfam PTE 333aa 1e-119 in ref transcript
    • Phosphotriesterase family.
  • COG Php 345aa 3e-53 in ref transcript
    • Predicted metal-dependent hydrolase with the TIM-barrel fold [General function prediction only].

PTGER3

  • refseq_PTGER3.F1 refseq_PTGER3.R1 129 156
  • NCBIGene 36.3 5733
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198712

  • pfam 7tm_1 252aa 3e-10 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

PTGES2

  • refseq_PTGES2.F2 refseq_PTGES2.R2 309 402
  • NCBIGene 36.3 80142
  • Single exon skipping, size difference: 93
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_025072

  • Changed! cd GST_C_mPGES2 149aa 8e-60 in ref transcript
    • GST_C family; microsomal Prostaglandin E synthase Type 2 (mPGES2) subfamily; mPGES2 is a membrane-anchored dimeric protein containing a CXXC motif which catalyzes the isomerization of PGH2 to PGE2. Unlike cytosolic PGE synthase (cPGES) and microsomal PGES Type 1 (mPGES1), mPGES2 does not require glutathione (GSH) for its activity, although its catalytic rate is increased two- to four-fold in the presence of DTT, GSH, or other thiol compounds. PGE2 is widely distributed in various tissues and is implicated in the sleep/wake cycle, relaxation/contraction of smooth muscle, excretion of sodium ions, maintenance of body temperature, and mediation of inflammation. mPGES2 contains an N-terminal hydrophobic domain which is membrane associated and a C-terminal soluble domain with a GST-like structure. The C-terminus contains two structural domains a N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The GST active site is located in a cleft between the two domains.
  • Changed! cd GST_N_mPGES2 75aa 8e-33 in ref transcript
    • GST_N family; microsomal Prostaglandin E synthase Type 2 (mPGES2) subfamily; mPGES2 is a membrane-anchored dimeric protein containing a CXXC motif which catalyzes the isomerization of PGH2 to PGE2. Unlike cytosolic PGE synthase (cPGES) and microsomal PGES Type 1 (mPGES1), mPGES2 does not require glutathione (GSH) for its activity, although its catalytic rate is increased two- to four-fold in the presence of DTT, GSH or other thiol compounds. PGE2 is widely distributed in various tissues and is implicated in the sleep/wake cycle, relaxation/contraction of smooth muscle, excretion of sodium ions, maintenance of body temperature and mediation of inflammation. mPGES2 contains an N-terminal hydrophobic domain which is membrane associated, and a C-terminal soluble domain with a GST-like structure.
  • pfam Glutaredoxin 50aa 5e-08 in ref transcript
    • Glutaredoxin.
  • Changed! COG Gst 140aa 5e-04 in ref transcript
    • Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd GST_N_mPGES2 57aa 4e-24 in modified transcript
  • Changed! COG GrxC 58aa 6e-04 in modified transcript
    • Glutaredoxin and related proteins [Posttranslational modification, protein turnover, chaperones].

PTGFR

  • refseq_PTGFR.F2 refseq_PTGFR.R2 335 406
  • NCBIGene 36.3 5737
  • Single exon skipping, size difference: 71
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_000959

  • Changed! pfam 7tm_1 201aa 1e-12 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 193aa 5e-10 in modified transcript

PTK2B

  • refseq_PTK2B.F1 refseq_PTK2B.R1 159 285
  • NCBIGene 36.3 2185
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_173174

  • cd PTKc_FAK 270aa 1e-146 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Focal Adhesion Kinase. Protein Tyrosine Kinase (PTK) family; Focal Adhesion kinase (FAK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FAK is a cytoplasmic (or nonreceptor) tyr kinase that contains an autophosphorylation site and a FERM domain at the N-terminus, a central tyr kinase domain, proline-rich regions, and a C-terminal FAT (focal adhesion targeting) domain. FAK activity is dependent on integrin-mediated cell adhesion, which facilitates N-terminal autophosphorylation. Full activation is achieved by the phosphorylation of its two adjacent A-loop tyrosines. FAK is important in mediating signaling initiated at sites of cell adhesions and at growth factor receptors. Through diverse molecular interactions, FAK functions as a biosensor or integrator to control cell motility. It is a key regulator of cell survival, proliferation, migration and invasion, and thus plays an important role in the development and progression of cancer. Src binds to autophosphorylated FAK forming the FAK-Src dual kinase complex, which is activated in a wide variety of tumor cells and generates signals promoting growth and metastasis. FAK is being developed as a target for cancer therapy.
  • pfam Pkinase_Tyr 233aa 5e-98 in ref transcript
    • Protein tyrosine kinase.
  • pfam Focal_AT 139aa 6e-59 in ref transcript
    • Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion targeting' (FAT) region is necessary and sufficient for localising FAK to focal adhesions. The crystal structure of FAT shows it forms a four-helix bundle that resembles those found in two other proteins involved in cell adhesion, alpha-catenin and vinculin. The binding of FAT to the focal adhesion protein, paxillin, requires the integrity of the helical bundle, whereas binding to another focal adhesion protein, talin, does not.
  • smart B41 227aa 2e-31 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • COG SPS1 229aa 2e-16 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PTK2B

  • refseq_PTK2B.F3 refseq_PTK2B.R3 102 262
  • NCBIGene 36.3 2185
  • Single exon skipping, size difference: 160
  • Exclusion in 5'UTR
  • Reference transcript: NM_173174

  • cd PTKc_FAK 270aa 1e-146 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Focal Adhesion Kinase. Protein Tyrosine Kinase (PTK) family; Focal Adhesion kinase (FAK); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FAK is a cytoplasmic (or nonreceptor) tyr kinase that contains an autophosphorylation site and a FERM domain at the N-terminus, a central tyr kinase domain, proline-rich regions, and a C-terminal FAT (focal adhesion targeting) domain. FAK activity is dependent on integrin-mediated cell adhesion, which facilitates N-terminal autophosphorylation. Full activation is achieved by the phosphorylation of its two adjacent A-loop tyrosines. FAK is important in mediating signaling initiated at sites of cell adhesions and at growth factor receptors. Through diverse molecular interactions, FAK functions as a biosensor or integrator to control cell motility. It is a key regulator of cell survival, proliferation, migration and invasion, and thus plays an important role in the development and progression of cancer. Src binds to autophosphorylated FAK forming the FAK-Src dual kinase complex, which is activated in a wide variety of tumor cells and generates signals promoting growth and metastasis. FAK is being developed as a target for cancer therapy.
  • pfam Pkinase_Tyr 233aa 5e-98 in ref transcript
    • Protein tyrosine kinase.
  • pfam Focal_AT 139aa 6e-59 in ref transcript
    • Focal adhesion targeting region. Focal adhesion kinase (FAK) is a tyrosine kinase found in focal adhesions, intracellular signaling complexes that are formed following engagement of the extracellular matrix by integrins. The C-terminal 'focal adhesion targeting' (FAT) region is necessary and sufficient for localising FAK to focal adhesions. The crystal structure of FAT shows it forms a four-helix bundle that resembles those found in two other proteins involved in cell adhesion, alpha-catenin and vinculin. The binding of FAT to the focal adhesion protein, paxillin, requires the integrity of the helical bundle, whereas binding to another focal adhesion protein, talin, does not.
  • smart B41 227aa 2e-31 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • COG SPS1 229aa 2e-16 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PTK7

  • refseq_PTK7.F1 refseq_PTK7.R1 181 301
  • NCBIGene 36.3 5754
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002821

  • cd PTK_CCK4 274aa 1e-148 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Colon Carcinoma Kinase 4. Protein Tyrosine Kinase (PTK) family; Colon carcinoma kinase 4 (CCK4); pseudokinase domain. The PTKc (catalytic domain) family, to which this subfamily belongs, includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. CCK4, also called protein tyrosine kinase 7 (PTK7), is an orphan receptor tyr kinase (RTK) containing an extracellular region with seven immunoglobulin domains, a transmembrane segment, and an intracellular inactive pseudokinase domain. Studies in mice reveal that CCK4 is essential for neural development. Mouse embryos containing a truncated CCK4 die perinatally and display craniorachischisis, a severe form of neural tube defect. The mechanism of action of the CCK4 pseudokinase is still unknown. Other pseudokinases such as HER3 rely on the activity of partner RTKs.
  • cd IGcam 85aa 5e-16 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 86aa 1e-13 in ref transcript
  • Changed! cd IGcam 85aa 2e-12 in ref transcript
  • cd IGcam 87aa 3e-10 in ref transcript
  • cd IGcam 79aa 6e-10 in ref transcript
  • cd IGcam 78aa 8e-06 in ref transcript
  • cd IGcam 81aa 3e-04 in ref transcript
  • smart TyrKc 261aa 1e-98 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • Changed! pfam I-set 86aa 9e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 77aa 1e-11 in ref transcript
  • pfam I-set 87aa 2e-11 in ref transcript
  • pfam I-set 92aa 5e-10 in ref transcript
  • smart IGc2 59aa 1e-07 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 71aa 1e-06 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 61aa 3e-06 in ref transcript
  • COG SPS1 265aa 6e-19 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PTK7

  • refseq_PTK7.F4 refseq_PTK7.R4 197 365
  • NCBIGene 36.3 5754
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 168
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002821

  • cd PTK_CCK4 274aa 1e-148 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Colon Carcinoma Kinase 4. Protein Tyrosine Kinase (PTK) family; Colon carcinoma kinase 4 (CCK4); pseudokinase domain. The PTKc (catalytic domain) family, to which this subfamily belongs, includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. CCK4, also called protein tyrosine kinase 7 (PTK7), is an orphan receptor tyr kinase (RTK) containing an extracellular region with seven immunoglobulin domains, a transmembrane segment, and an intracellular inactive pseudokinase domain. Studies in mice reveal that CCK4 is essential for neural development. Mouse embryos containing a truncated CCK4 die perinatally and display craniorachischisis, a severe form of neural tube defect. The mechanism of action of the CCK4 pseudokinase is still unknown. Other pseudokinases such as HER3 rely on the activity of partner RTKs.
  • cd IGcam 85aa 5e-16 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 86aa 1e-13 in ref transcript
  • cd IGcam 85aa 2e-12 in ref transcript
  • Changed! cd IGcam 87aa 3e-10 in ref transcript
  • cd IGcam 79aa 6e-10 in ref transcript
  • cd IGcam 78aa 8e-06 in ref transcript
  • cd IGcam 81aa 3e-04 in ref transcript
  • smart TyrKc 261aa 1e-98 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • pfam I-set 86aa 9e-12 in ref transcript
    • Immunoglobulin I-set domain.
  • Changed! pfam I-set 77aa 1e-11 in ref transcript
  • pfam I-set 87aa 2e-11 in ref transcript
  • pfam I-set 92aa 5e-10 in ref transcript
  • smart IGc2 59aa 1e-07 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 71aa 1e-06 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 61aa 3e-06 in ref transcript
  • COG SPS1 265aa 6e-19 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PTK9

  • refseq_PTK9.F1 refseq_PTK9.R1 109 499
  • NCBIGene 36.2 5756
  • Alternative 5-prime, size difference: 390
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002822

  • Changed! cd ADF 134aa 2e-23 in ref transcript
    • Actin depolymerisation factor/cofilin -like domains; present in a family of essential eukaryotic actin regulatory proteins; these proteins enhance the turnover rate of actin and interact with actin monomers as well as actin filaments.
  • Changed! cd ADF 131aa 8e-23 in ref transcript
  • Changed! smart ADF 133aa 3e-28 in ref transcript
    • Actin depolymerisation factor/cofilin -like domains. Severs actin filaments and binds to actin monomers.
  • Changed! smart ADF 128aa 3e-26 in ref transcript

PTP4A2

  • refseq_PTP4A2.F1 refseq_PTP4A2.R1 156 362
  • NCBIGene 36.3 8073
  • Multiple exon skipping, size difference: 206
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080391

  • Changed! cd DSPc 116aa 4e-08 in ref transcript
    • Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
  • Changed! smart PTPc_motif 86aa 7e-10 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.
  • Changed! PTZ PTZ00242 149aa 1e-53 in ref transcript
    • protein tyrosine phosphatase; Provisional.
  • Changed! PTZ PTZ00242 59aa 3e-13 in modified transcript

PTP4A3

  • refseq_PTP4A3.F2 refseq_PTP4A3.R2 221 296
  • NCBIGene 36.3 11156
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032611

  • Changed! cd DSPc 104aa 2e-07 in ref transcript
    • Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
  • Changed! pfam Y_phosphatase 134aa 8e-11 in ref transcript
    • Protein-tyrosine phosphatase.
  • Changed! PTZ PTZ00242 151aa 9e-53 in ref transcript
    • protein tyrosine phosphatase; Provisional.
  • Changed! cd PTPc 69aa 0.001 in modified transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • Changed! pfam Y_phosphatase 95aa 4e-04 in modified transcript
  • Changed! PTZ PTZ00242 126aa 1e-37 in modified transcript

PTPN2

  • refseq_PTPN2.F1 refseq_PTPN2.R1 260 362
  • NCBIGene 36.3 5771
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080422

  • cd PTPc 230aa 1e-78 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • smart PTPc 270aa 2e-88 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • COG PTP2 235aa 1e-41 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

PTPN5

  • refseq_PTPN5.F1 refseq_PTPN5.R1 238 283
  • NCBIGene 36.3 84867
  • Alternative 3-prime, size difference: 45
  • Inclusion in 5'UTR
  • Reference transcript: NM_032781

  • cd PTPc 229aa 3e-76 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • smart PTPc 254aa 2e-84 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • COG PTP2 237aa 8e-35 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

PTPN5

  • refseq_PTPN5.F4 refseq_PTPN5.R4 280 376
  • NCBIGene 36.3 84867
  • Alternative 3-prime, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032781

  • cd PTPc 229aa 3e-76 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • smart PTPc 254aa 2e-84 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • COG PTP2 237aa 8e-35 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

PTPRA

  • refseq_PTPRA.F1 refseq_PTPRA.R1 146 173
  • NCBIGene 36.3 5786
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002836

  • cd PTPc 231aa 3e-89 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 5e-85 in ref transcript
  • smart PTPc 259aa 1e-97 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 261aa 1e-93 in ref transcript
  • COG PTP2 231aa 2e-43 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 241aa 3e-43 in ref transcript

PTPRC

  • refseq_PTPRC.F1 refseq_PTPRC.R1 200 344
  • NCBIGene 36.3 5788
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002838

  • cd PTPc 232aa 2e-86 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 256aa 2e-71 in ref transcript
  • cd FN3 81aa 2e-06 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 71aa 0.009 in ref transcript
  • smart PTPc 260aa 1e-93 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 283aa 9e-81 in ref transcript
  • pfam fn3 71aa 7e-04 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 81aa 0.003 in ref transcript
  • Changed! pfam Herpes_BLLF1 136aa 0.003 in ref transcript
    • Herpes virus major outer envelope glycoprotein (BLLF1). This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
  • COG PTP2 250aa 3e-42 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 265aa 4e-30 in ref transcript
  • Changed! smart FN3 76aa 0.002 in modified transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • Changed! pfam Herpes_BLLF1 124aa 0.006 in modified transcript

PTPRD

  • refseq_PTPRD.F2 refseq_PTPRD.R2 136 163
  • NCBIGene 36.3 5789
  • Multiple exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002839

  • cd PTPc 230aa 5e-92 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 8e-90 in ref transcript
  • cd FN3 92aa 3e-13 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 90aa 7e-13 in ref transcript
  • Changed! cd IGcam 88aa 2e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 91aa 4e-12 in ref transcript
  • cd FN3 95aa 8e-12 in ref transcript
  • cd FN3 93aa 2e-11 in ref transcript
  • cd FN3 89aa 3e-11 in ref transcript
  • cd FN3 107aa 4e-09 in ref transcript
  • cd FN3 82aa 2e-06 in ref transcript
  • cd IGcam 78aa 1e-05 in ref transcript
  • smart PTPc 256aa 1e-101 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 5e-16 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 85aa 8e-15 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 88aa 2e-13 in ref transcript
  • pfam fn3 83aa 2e-13 in ref transcript
  • smart FN3 79aa 3e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • Changed! smart IGc2 74aa 7e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart FN3 83aa 2e-10 in ref transcript
  • pfam fn3 100aa 5e-10 in ref transcript
  • smart IG_like 79aa 7e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart FN3 81aa 4e-05 in ref transcript
  • COG PTP2 268aa 1e-48 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 281aa 9e-45 in ref transcript
  • Changed! cd IGcam 79aa 4e-13 in modified transcript
  • Changed! smart IGc2 65aa 4e-11 in modified transcript

PTPRD

  • refseq_PTPRD.F3 refseq_PTPRD.R4 211 250
  • NCBIGene 36.3 5789
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002839

  • cd PTPc 230aa 5e-92 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 8e-90 in ref transcript
  • cd FN3 92aa 3e-13 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 90aa 7e-13 in ref transcript
  • cd IGcam 88aa 2e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 91aa 4e-12 in ref transcript
  • cd FN3 95aa 8e-12 in ref transcript
  • cd FN3 93aa 2e-11 in ref transcript
  • cd FN3 89aa 3e-11 in ref transcript
  • cd FN3 107aa 4e-09 in ref transcript
  • cd FN3 82aa 2e-06 in ref transcript
  • Changed! cd IGcam 78aa 1e-05 in ref transcript
  • smart PTPc 256aa 1e-101 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 5e-16 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 85aa 8e-15 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 88aa 2e-13 in ref transcript
  • pfam fn3 83aa 2e-13 in ref transcript
  • smart FN3 79aa 3e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart IGc2 74aa 7e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart FN3 83aa 2e-10 in ref transcript
  • pfam fn3 100aa 5e-10 in ref transcript
  • Changed! smart IG_like 79aa 7e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart FN3 81aa 4e-05 in ref transcript
  • COG PTP2 268aa 1e-48 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 281aa 9e-45 in ref transcript
  • Changed! cd IGcam 83aa 6e-06 in modified transcript
  • Changed! pfam I-set 84aa 6e-08 in modified transcript

PTPRD

  • refseq_PTPRD.F4 refseq_PTPRD.R5 138 165
  • NCBIGene 36.3 5789
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002839

  • cd PTPc 230aa 5e-92 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 8e-90 in ref transcript
  • cd FN3 92aa 3e-13 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 90aa 7e-13 in ref transcript
  • cd IGcam 88aa 2e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 91aa 4e-12 in ref transcript
  • cd FN3 95aa 8e-12 in ref transcript
  • cd FN3 93aa 2e-11 in ref transcript
  • cd FN3 89aa 3e-11 in ref transcript
  • Changed! cd FN3 107aa 4e-09 in ref transcript
  • cd FN3 82aa 2e-06 in ref transcript
  • cd IGcam 78aa 1e-05 in ref transcript
  • smart PTPc 256aa 1e-101 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 5e-16 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 85aa 8e-15 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 88aa 2e-13 in ref transcript
  • pfam fn3 83aa 2e-13 in ref transcript
  • smart FN3 79aa 3e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart IGc2 74aa 7e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart FN3 83aa 2e-10 in ref transcript
  • Changed! pfam fn3 100aa 5e-10 in ref transcript
  • smart IG_like 79aa 7e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart FN3 81aa 4e-05 in ref transcript
  • COG PTP2 268aa 1e-48 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 281aa 9e-45 in ref transcript
  • Changed! cd FN3 98aa 6e-09 in modified transcript
  • Changed! pfam fn3 91aa 5e-09 in modified transcript

PTPRF

  • refseq_PTPRF.F1 refseq_PTPRF.R1 123 150
  • NCBIGene 36.3 5792
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002840

  • cd PTPc 230aa 2e-94 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 4e-89 in ref transcript
  • cd IGcam 79aa 7e-13 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 93aa 2e-12 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 89aa 1e-11 in ref transcript
  • cd IGcam 90aa 2e-11 in ref transcript
  • cd FN3 95aa 5e-10 in ref transcript
  • cd FN3 82aa 3e-09 in ref transcript
  • cd FN3 90aa 3e-09 in ref transcript
  • cd FN3 90aa 1e-08 in ref transcript
  • cd IGcam 83aa 2e-08 in ref transcript
  • Changed! cd FN3 107aa 0.001 in ref transcript
  • smart PTPc 256aa 1e-105 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 256aa 4e-97 in ref transcript
  • pfam I-set 92aa 9e-18 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 79aa 3e-13 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam fn3 88aa 3e-11 in ref transcript
    • Fibronectin type III domain.
  • smart FN3 83aa 7e-11 in ref transcript
  • pfam I-set 84aa 1e-10 in ref transcript
  • smart IGc2 65aa 1e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 83aa 3e-10 in ref transcript
  • pfam fn3 70aa 3e-07 in ref transcript
  • pfam fn3 83aa 7e-07 in ref transcript
  • COG PTP2 272aa 1e-49 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 264aa 5e-47 in ref transcript
  • Changed! cd FN3 98aa 4e-04 in modified transcript

PTPRN2

  • refseq_PTPRN2.F1 refseq_PTPRN2.R1 258 309
  • NCBIGene 36.3 5799
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002847

  • cd PTPc 233aa 1e-79 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • smart PTPc 260aa 6e-93 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • COG PTP2 275aa 7e-40 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

PTPRN2

  • refseq_PTPRN2.F4 refseq_PTPRN2.R4 324 411
  • NCBIGene 36.3 5799
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002847

  • cd PTPc 233aa 1e-79 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • smart PTPc 260aa 6e-93 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • COG PTP2 275aa 7e-40 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

PTPRO

  • refseq_PTPRO.F2 refseq_PTPRO.R2 312 396
  • NCBIGene 36.3 5800
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_030667

  • cd PTPc 229aa 3e-87 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd FN3 93aa 1e-04 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 83aa 0.005 in ref transcript
  • smart PTPc 256aa 1e-97 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • pfam fn3 83aa 8e-05 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 87aa 0.002 in ref transcript
  • smart FN3 83aa 0.005 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • COG PTP2 234aa 1e-45 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

PTPRS

  • refseq_PTPRS.F2 refseq_PTPRS.R2 135 183
  • NCBIGene 36.3 5802
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002850

  • cd PTPc 230aa 5e-95 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 2e-90 in ref transcript
  • cd FN3 90aa 6e-14 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 89aa 2e-12 in ref transcript
  • cd IGcam 87aa 4e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 93aa 4e-11 in ref transcript
  • cd IGcam 91aa 5e-11 in ref transcript
  • cd FN3 90aa 2e-07 in ref transcript
  • cd FN3 80aa 2e-07 in ref transcript
  • cd FN3 106aa 2e-07 in ref transcript
  • cd IGcam 83aa 2e-06 in ref transcript
  • cd FN3 83aa 6e-05 in ref transcript
  • cd FN3 64aa 0.001 in ref transcript
  • smart PTPc 256aa 1e-106 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 2e-17 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 79aa 1e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart FN3 80aa 2e-12 in ref transcript
  • smart FN3 83aa 5e-11 in ref transcript
  • smart IGc2 74aa 1e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 80aa 7e-09 in ref transcript
    • Fibronectin type III domain.
  • smart IG_like 79aa 7e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 91aa 4e-08 in ref transcript
  • pfam fn3 99aa 2e-07 in ref transcript
  • smart FN3 79aa 6e-05 in ref transcript
  • smart FN3 62aa 3e-04 in ref transcript
  • COG PTP2 268aa 6e-51 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 266aa 2e-48 in ref transcript

PTPRS

  • refseq_PTPRS.F3 refseq_PTPRS.R3 136 163
  • NCBIGene 36.3 5802
  • Multiple exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002850

  • cd PTPc 230aa 5e-95 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 2e-90 in ref transcript
  • cd FN3 90aa 6e-14 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 89aa 2e-12 in ref transcript
  • Changed! cd IGcam 87aa 4e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 93aa 4e-11 in ref transcript
  • cd IGcam 91aa 5e-11 in ref transcript
  • cd FN3 90aa 2e-07 in ref transcript
  • cd FN3 80aa 2e-07 in ref transcript
  • cd FN3 106aa 2e-07 in ref transcript
  • cd IGcam 83aa 2e-06 in ref transcript
  • cd FN3 83aa 6e-05 in ref transcript
  • cd FN3 64aa 0.001 in ref transcript
  • smart PTPc 256aa 1e-106 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 2e-17 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 79aa 1e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart FN3 80aa 2e-12 in ref transcript
  • smart FN3 83aa 5e-11 in ref transcript
  • Changed! smart IGc2 74aa 1e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 80aa 7e-09 in ref transcript
    • Fibronectin type III domain.
  • smart IG_like 79aa 7e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 91aa 4e-08 in ref transcript
  • pfam fn3 99aa 2e-07 in ref transcript
  • smart FN3 79aa 6e-05 in ref transcript
  • smart FN3 62aa 3e-04 in ref transcript
  • COG PTP2 268aa 6e-51 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 266aa 2e-48 in ref transcript
  • Changed! cd IGcam 78aa 1e-12 in modified transcript
  • Changed! smart IGc2 65aa 6e-11 in modified transcript

PTPRS

  • refseq_PTPRS.F6 refseq_PTPRS.R6 150 177
  • NCBIGene 36.3 5802
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002850

  • cd PTPc 230aa 5e-95 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 2e-90 in ref transcript
  • cd FN3 90aa 6e-14 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 89aa 2e-12 in ref transcript
  • cd IGcam 87aa 4e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 93aa 4e-11 in ref transcript
  • cd IGcam 91aa 5e-11 in ref transcript
  • cd FN3 90aa 2e-07 in ref transcript
  • cd FN3 80aa 2e-07 in ref transcript
  • Changed! cd FN3 106aa 2e-07 in ref transcript
  • cd IGcam 83aa 2e-06 in ref transcript
  • cd FN3 83aa 6e-05 in ref transcript
  • cd FN3 64aa 0.001 in ref transcript
  • smart PTPc 256aa 1e-106 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 2e-17 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 79aa 1e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart FN3 80aa 2e-12 in ref transcript
  • smart FN3 83aa 5e-11 in ref transcript
  • smart IGc2 74aa 1e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 80aa 7e-09 in ref transcript
    • Fibronectin type III domain.
  • smart IG_like 79aa 7e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 91aa 4e-08 in ref transcript
  • Changed! pfam fn3 99aa 2e-07 in ref transcript
  • smart FN3 79aa 6e-05 in ref transcript
  • smart FN3 62aa 3e-04 in ref transcript
  • COG PTP2 268aa 6e-51 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 266aa 2e-48 in ref transcript
  • Changed! cd FN3 97aa 1e-07 in modified transcript
  • Changed! smart FN3 87aa 5e-07 in modified transcript

PTPRT

  • refseq_PTPRT.F1 refseq_PTPRT.R1 228 285
  • NCBIGene 36.3 11122
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_133170

  • cd PTPc 228aa 6e-92 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 233aa 1e-69 in ref transcript
  • cd MAM 158aa 5e-35 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd FN3 83aa 3e-07 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 91aa 0.005 in ref transcript
  • smart PTPc 255aa 1e-102 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 263aa 4e-76 in ref transcript
  • smart MAM 161aa 1e-47 in ref transcript
    • Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
  • pfam fn3 76aa 2e-07 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 72aa 8e-04 in ref transcript
  • COG PTP2 240aa 1e-46 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 284aa 3e-27 in ref transcript

PTPRU

  • refseq_PTPRU.F1 refseq_PTPRU.R1 128 158
  • NCBIGene 36.3 10076
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005704

  • cd PTPc 221aa 6e-83 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 234aa 6e-67 in ref transcript
  • cd MAM 160aa 9e-41 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd FN3 102aa 6e-09 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 96aa 2e-04 in ref transcript
  • pfam Y_phosphatase 222aa 1e-91 in ref transcript
    • Protein-tyrosine phosphatase.
  • smart PTPc 264aa 2e-72 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart MAM 165aa 3e-48 in ref transcript
    • Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
  • pfam fn3 94aa 2e-09 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 72aa 3e-04 in ref transcript
  • smart FN3 86aa 0.002 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • COG PTP2 219aa 3e-38 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 289aa 2e-22 in ref transcript

PTPRU

  • refseq_PTPRU.F2 refseq_PTPRU.R2 101 119
  • NCBIGene 36.3 10076
  • Single exon skipping, size difference: 18
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_005704

  • Changed! cd PTPc 221aa 6e-83 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 234aa 6e-67 in ref transcript
  • cd MAM 160aa 9e-41 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd FN3 102aa 6e-09 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 96aa 2e-04 in ref transcript
  • Changed! pfam Y_phosphatase 222aa 1e-91 in ref transcript
    • Protein-tyrosine phosphatase.
  • smart PTPc 264aa 2e-72 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart MAM 165aa 3e-48 in ref transcript
    • Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
  • pfam fn3 94aa 2e-09 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 72aa 3e-04 in ref transcript
  • smart FN3 86aa 0.002 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • Changed! COG PTP2 219aa 3e-38 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 289aa 2e-22 in ref transcript
  • Changed! cd PTPc 227aa 7e-81 in modified transcript
  • Changed! pfam Y_phosphatase 228aa 1e-89 in modified transcript
  • Changed! COG PTP2 225aa 4e-38 in modified transcript

PXMP4

  • refseq_PXMP4.F2 refseq_PXMP4.R2 195 394
  • NCBIGene 36.3 11264
  • Single exon skipping, size difference: 199
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007238

  • Changed! pfam Pmp24 171aa 4e-68 in ref transcript
    • Peroxisomal membrane protein 24. Peroxisomes are single membrane bound organelles, present in practically all eukaryotic cells, and involved in a variety of metabolic pathways; the deduced protein is extremely basic, a characteristic of many other peroxisomal intrinsic membrane proteins. They carry two short stretches of hydrophobic residues shown to be necessary for the correct targeting of these proteins. A sequence with these characteristics is found in human PMP24 (amino acid residues 16-30). However, in the absence of experimental data, the involvement of this domain in the targeting of PMP24 remains to be proved. PMP24 was known as Pmp27.
  • Changed! pfam Pmp24 32aa 9e-15 in modified transcript

PYCARD

  • refseq_PYCARD.F1 refseq_PYCARD.R1 272 329
  • NCBIGene 36.3 29108
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013258

  • pfam CARD 84aa 6e-18 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • pfam PAAD_DAPIN 84aa 2e-12 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.

PYHIN1

  • refseq_PYHIN1.F1 refseq_PYHIN1.R1 227 352
  • NCBIGene 36.3 149628
  • Single exon skipping, size difference: 125
  • Exclusion of the stop codon
  • Reference transcript: NM_152501

  • pfam HIN 159aa 3e-51 in ref transcript
    • HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758.
  • pfam PAAD_DAPIN 79aa 6e-17 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.

QRICH1

  • refseq_QRICH1.F2 refseq_QRICH1.R2 337 397
  • NCBIGene 36.3 54870
  • Single exon skipping, size difference: 60
  • Exclusion in 5'UTR
  • Reference transcript: NM_017730

RAB27A

  • refseq_RAB27A.F2 refseq_RAB27A.R2 177 266
  • NCBIGene 36.3 5873
  • Single exon skipping, size difference: 89
  • Exclusion in 5'UTR
  • Reference transcript: NM_183234

  • cd Rab27A 180aa 1e-105 in ref transcript
    • Rab27a subfamily. The Rab27a subfamily consists of Rab27a and its highly homologous isoform, Rab27b. Unlike most Rab proteins whose functions remain poorly defined, Rab27a has many known functions. Rab27a has multiple effector proteins, and depending on which effector it binds, Rab27a has different functions as well as tissue distribution and/or cellular localization. Putative functions have been assigned to Rab27a when associated with the effector proteins Slp1, Slp2, Slp3, Slp4, Slp5, DmSlp, rabphilin, Dm/Ce-rabphilin, Slac2-a, Slac2-b, Slac2-c, Noc2, JFC1, and Munc13-4. Rab27a has been associated with several human diseases, including hemophagocytic syndrome (Griscelli syndrome or GS), Hermansky-Pudlak syndrome, and choroidermia. In the case of GS, a rare, autosomal recessive disease, a Rab27a mutation is directly responsible for the disorder. When Rab27a is localized to the secretory granules of pancreatic beta cells, it is believed to mediate glucose-stimulated insulin secretion, making it a potential target for diabetes therapy. When bound to JFC1 in prostate cells, Rab27a is believed to regulate the exocytosis of prostate- specific markers. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • smart RAB 175aa 1e-66 in ref transcript
    • Rab subfamily of small GTPases. Rab GTPases are implicated in vesicle trafficking.
  • COG COG1100 187aa 2e-24 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

RAB28

  • refseq_RAB28.F2 refseq_RAB28.R2 291 386
  • NCBIGene 36.3 9364
  • Single exon skipping, size difference: 95
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001017979

  • Changed! cd Rab28 209aa 1e-106 in ref transcript
    • Rab28 subfamily. First identified in maize, Rab28 has been shown to be a late embryogenesis-abundant (Lea) protein that is regulated by the plant hormone abcisic acid (ABA). In Arabidopsis, Rab28 is expressed during embryo development and is generally restricted to provascular tissues in mature embryos. Unlike maize Rab28, it is not ABA-inducible. Characterization of the human Rab28 homolog revealed two isoforms, which differ by a 95-base pair insertion, producing an alternative sequence for the 30 amino acids at the C-terminus. The two human isoforms are presumbly the result of alternative splicing. Since they differ at the C-terminus but not in the GTP-binding region, they are predicted to be targeted to different cellular locations. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins.
  • smart RAB 166aa 6e-38 in ref transcript
    • Rab subfamily of small GTPases. Rab GTPases are implicated in vesicle trafficking.
  • Changed! COG COG1100 132aa 6e-20 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].
  • Changed! cd Rab28 205aa 1e-98 in modified transcript
  • Changed! COG COG1100 195aa 5e-20 in modified transcript

RAB37

  • refseq_RAB37.F1 refseq_RAB37.R1 110 467
  • NCBIGene 36.2 326624
  • Alternative 5-prime and 3-prime, size difference: 357
  • Exclusion in 3'UTR, Exclusion in 3'UTR
  • Reference transcript: NM_001006637

  • cd Rab26 190aa 1e-104 in ref transcript
    • Rab26 subfamily. First identified in rat pancreatic acinar cells, Rab26 is believed to play a role in recruiting mature granules to the plasma membrane upon beta-adrenergic stimulation. Rab26 belongs to the Rab functional group III, which are considered key regulators of intracellular vesicle transport during exocytosis. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins.
  • smart RAB 162aa 8e-74 in ref transcript
    • Rab subfamily of small GTPases. Rab GTPases are implicated in vesicle trafficking.
  • PTZ PTZ00099 133aa 7e-31 in ref transcript
    • rab6; Provisional.

RAB3IP

  • refseq_RAB3IP.F1 refseq_RAB3IP.R1 108 208
  • NCBIGene 36.3 117177
  • Single exon skipping, size difference: 100
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_175623

  • pfam Sec2p 125aa 7e-37 in ref transcript
    • GDP/GTP exchange factor Sec2p. In Saccharomyces cerevisiae, Sec2p is a GDP/GTP exchange factor for Sec4p, which is required for vesicular transport at the post-Golgi stage of yeast secretion.
  • COG Smc 87aa 4e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

RAB5C

  • refseq_RAB5C.F1 refseq_RAB5C.R1 257 374
  • NCBIGene 36.3 5878
  • Single exon skipping, size difference: 117
  • Exclusion in 5'UTR
  • Reference transcript: NM_201434

  • cd Rab5_related 163aa 4e-95 in ref transcript
    • Rab5-related subfamily. This subfamily includes Rab5 and Rab22 of mammals, Ypt51/Ypt52/Ypt53 of yeast, and RabF of plants. The members of this subfamily are involved in endocytosis and endocytic-sorting pathways. In mammals, Rab5 GTPases localize to early endosomes and regulate fusion of clathrin-coated vesicles to early endosomes and fusion between early endosomes. In yeast, Ypt51p family members similarly regulate membrane trafficking through prevacuolar compartments. GTPase activating proteins (GAPs) interact with GTP-bound Rab and accelerate the hydrolysis of GTP to GDP. Guanine nucleotide exchange factors (GEFs) interact with GDP-bound Rabs to promote the formation of the GTP-bound state. Rabs are further regulated by guanine nucleotide dissociation inhibitors (GDIs), which facilitate Rab recycling by masking C-terminal lipid binding and promoting cytosolic localization. Most Rab GTPases contain a lipid modification site at the C-terminus, with sequence motifs CC, CXC, or CCX. Lipid binding is essential for membrane attachment, a key feature of most Rab proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • pfam Ras 162aa 2e-71 in ref transcript
    • Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. As regards Rab GTPases, these are important regulators of vesicle formation, motility and fusion. They share a fold in common with all Ras GTPases: this is a six-stranded beta-sheet surrounded by five alpha-helices.
  • COG COG1100 196aa 4e-32 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

ERC1

  • refseq_RAB6IP2.F1 refseq_RAB6IP2.R1 189 321
  • NCBIGene 36.3 23085
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_178040

  • Changed! pfam Cast 829aa 1e-148 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • pfam RBD-FIP 41aa 1e-07 in ref transcript
    • FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
  • COG Smc 340aa 3e-09 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG Smc 318aa 3e-08 in ref transcript
  • COG Smc 298aa 5e-06 in ref transcript
  • Changed! pfam Cast 785aa 1e-151 in modified transcript
  • Changed! COG Smc 328aa 7e-05 in modified transcript

ERC1

  • refseq_RAB6IP2.F3 refseq_RAB6IP2.R3 125 290
  • NCBIGene 36.3 23085
  • Alternative 5-prime, size difference: 165
  • Inclusion in 5'UTR
  • Reference transcript: NM_178040

  • pfam Cast 829aa 1e-148 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • pfam RBD-FIP 41aa 1e-07 in ref transcript
    • FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
  • COG Smc 340aa 3e-09 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 318aa 3e-08 in ref transcript
  • COG Smc 298aa 5e-06 in ref transcript

ERC1

  • refseq_RAB6IP2.F5 refseq_RAB6IP2.R5 242 326
  • NCBIGene 36.3 23085
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_178040

  • Changed! pfam Cast 829aa 1e-148 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • pfam RBD-FIP 41aa 1e-07 in ref transcript
    • FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
  • Changed! COG Smc 340aa 3e-09 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 318aa 3e-08 in ref transcript
  • Changed! COG Smc 298aa 5e-06 in ref transcript
  • Changed! pfam Cast 801aa 1e-154 in modified transcript
  • Changed! COG Smc 365aa 7e-07 in modified transcript
  • Changed! COG COG2433 126aa 0.004 in modified transcript
    • Uncharacterized conserved protein [Function unknown].

ERC1

  • refseq_RAB6IP2.F6 refseq_RAB6IP2.R6 225 290
  • NCBIGene 36.3 23085
  • Single exon skipping, size difference: 65
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_178040

  • pfam Cast 829aa 1e-148 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • Changed! pfam RBD-FIP 41aa 1e-07 in ref transcript
    • FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
  • COG Smc 340aa 3e-09 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 318aa 3e-08 in ref transcript
  • COG Smc 298aa 5e-06 in ref transcript

RAD50

  • refseq_RAD50.F1 refseq_RAD50.R1 104 169
  • NCBIGene 36.3 10111
  • Alternative 5-prime, size difference: 65
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005732

  • Changed! cd ABC_Rad50 150aa 4e-33 in ref transcript
    • The catalytic domains of Rad50 are similar to the ATP-binding cassette of ABC transporters, but are not associated with membrane-spanning domains. The conserved ATP-binding motifs common to Rad50 and the ABC transporter family include the Walker A and Walker B motifs, the Q loop, a histidine residue in the switch region, a D-loop, and a conserved LSGG sequence. This conserved sequence, LSGG, is the most specific and characteristic motif of this family and is thus known as the ABC signature sequence.
  • Changed! cd ABC_Rad50 165aa 1e-27 in ref transcript
  • Changed! TIGR rad50 1311aa 0.0 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! COG SbcC 676aa 8e-18 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! COG SbcC 870aa 5e-14 in ref transcript
  • Changed! cd ABC_Rad50 98aa 2e-27 in modified transcript
  • Changed! TIGR rad50 99aa 8e-53 in modified transcript
  • Changed! COG SbcC 98aa 5e-12 in modified transcript

RALY

  • refseq_RALY.F2 refseq_RALY.R2 170 218
  • NCBIGene 36.3 22913
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016732

  • cd RRM 67aa 3e-11 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 67aa 7e-11 in ref transcript
    • RNA recognition motif.
  • COG COG0724 95aa 1e-04 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

RANBP3

  • refseq_RANBP3.F2 refseq_RANBP3.R2 179 383
  • NCBIGene 36.3 8498
  • Single exon skipping, size difference: 204
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007322

  • cd RanBD 125aa 3e-26 in ref transcript
    • Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
  • smart RanBD 111aa 3e-15 in ref transcript
    • Ran-binding domain. Domain of apporximately 150 residues that stabilises the GTP-bound form of Ran (the Ras-like nuclear small GTPase).
  • COG YRB1 71aa 1e-04 in ref transcript
    • Ran GTPase-activating protein (Ran-binding protein) [Intracellular trafficking and secretion].

RAPH1

  • refseq_RAPH1.F1 refseq_RAPH1.R1 200 356
  • NCBIGene 36.3 65059
  • Multiple exon skipping, size difference: 156
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_213589

  • cd PH_Apbb1ip 113aa 1e-54 in ref transcript
    • Apbb1ip (Amyloid beta (A4) Precursor protein-Binding, family B, member 1 Interacting Protein) pleckstrin homology (PH) domain. Apbb1ip consists of a Ras-associated domain and a PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • cd GRB7_RA 82aa 6e-22 in ref transcript
    • Grb7_RA The RA (RAS-associated like) domain of Grb7. Grb7 is an adaptor molecule that mediates signal transduction from multiple cell surface receptors to various downstream signaling pathways. Grb7 and its related family members Grb10 and Grb14 share a conserved domain architecture that includes an amino-terminal proline-rich region, a central segment termed the GM region (for Grb and Mig) which includes the RA, PIR, and PH domains, and a carboxyl-terminal SH2 domain. Grb7/10/14 family proteins are phosphorylated on serine/threonine as well as tyrosine residues and are mainly localized to the cytoplasm.
  • smart PH 109aa 5e-10 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • smart RA 85aa 5e-10 in ref transcript
    • Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).

RAPSN

  • refseq_RAPSN.F1 refseq_RAPSN.R1 151 328
  • NCBIGene 36.3 5913
  • Multiple exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005055

  • cd TPR 112aa 9e-07 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • pfam Rapsyn_N 80aa 2e-31 in ref transcript
    • Rapsyn N-terminal myristoylation and linker region. Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane.
  • COG COG5540 41aa 1e-04 in ref transcript
    • RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

RASA1

  • refseq_RASA1.F1 refseq_RASA1.R1 118 183
  • NCBIGene 36.3 5921
  • Alternative 5-prime, size difference: 65
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002890

  • Changed! cd RasGAP_p120GAP 315aa 1e-166 in ref transcript
    • p120GAP is a negative regulator of Ras that stimulates hydrolysis of bound GTP to GDP. Once the Ras regulator p120GAP, a member of the GAP protein family, is recruited to the membrane, it is transiently immobilized to interact with Ras-GTP. The down regulation of Ras by p120GAP is a critical step in the regulation of many cellular processes, which is disrupted in approximately 30% of human cancers. p120GAP contains SH2, SH3, PH, calcium- and lipid-binding domains, suggesting its involvement in a complex network of cellular interactions in vivo.
  • Changed! cd PH_RasGAP_CG5898 80aa 2e-20 in ref transcript
    • RAS GTPase-activating protein (GAP) CG5898 Pleckstrin homology (PH) domain. This protein has a domain architecture of SH2-SH3-SH2-PH-C2-Ras_GAP. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPAses, adaptors, a well as cytoskeletal associated molecules and in lipid associated enzymes.
  • Changed! cd SH2 91aa 5e-16 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • Changed! cd SH2 80aa 4e-15 in ref transcript
  • Changed! cd C2 98aa 1e-09 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • Changed! cd SH3 44aa 6e-04 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! smart RasGAP 347aa 1e-107 in ref transcript
    • GTPase-activator protein for Ras-like GTPases. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position. Improved domain limits from structure.
  • Changed! pfam SH2 76aa 8e-23 in ref transcript
    • SH2 domain.
  • Changed! pfam SH2 76aa 2e-20 in ref transcript
  • Changed! smart C2 95aa 2e-09 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • Changed! smart PH 84aa 4e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • Changed! pfam SH3_1 53aa 2e-06 in ref transcript
    • SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
  • Changed! COG IQG1 291aa 7e-04 in ref transcript
    • Protein involved in regulation of cellular morphogenesis/cytokinesis [Cell division and chromosome partitioning / Signal transduction mechanisms].
  • Changed! COG COG5038 99aa 0.003 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].

RASGEF1C

  • refseq_RASGEF1C.F1 refseq_RASGEF1C.R1 171 209
  • NCBIGene 36.2 255426
  • Single exon skipping, size difference: 38
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_175062

  • Changed! cd RasGEF 243aa 4e-48 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
  • cd REM 109aa 4e-11 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal domain (RasGef_N), also called REM domain (Ras exchanger motif). This domain is common in nucleotide exchange factors for Ras-like small GTPases and is typically found immediately N-terminal to the RasGef (Cdc25-like) domain. REM contacts the GTPase and is assumed to participate in the catalytic activity of the exchange factor. Proteins with the REM domain include Sos1 and Sos2, which relay signals from tyrosine-kinase mediated signalling to Ras, RasGRP1-4, RasGRF1,2, CNrasGEF, and RAP-specific nucleotide exchange factors, to name a few.
  • Changed! smart RasGEF 248aa 2e-53 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like small GTPases.
  • pfam RasGEF_N 89aa 4e-07 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal motif. A subset of guanine nucleotide exchange factor for Ras-like small GTPases appear to possess this motif/domain N-terminal to the RasGef (Cdc25-like) domain.
  • Changed! cd RasGEF 147aa 1e-36 in modified transcript
  • Changed! smart RasGEF 147aa 3e-39 in modified transcript

RASGRP2

  • refseq_RASGRP2.F2 refseq_RASGRP2.R2 257 363
  • NCBIGene 36.2 10235
  • Single exon skipping, size difference: 106
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005825

  • Changed! cd RasGEF 234aa 1e-62 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like small GTPases. Small GTP-binding proteins of the Ras superfamily function as molecular switches in fundamental events such as signal transduction, cytoskeleton dynamics and intracellular trafficking. Guanine-nucleotide-exchange factors (GEFs) positively regulate these GTP-binding proteins in response to a variety of signals. GEFs catalyze the dissociation of GDP from the inactive GTP-binding proteins. GTP can then bind and induce structural changes that allow interaction with effectors.
  • Changed! cd REM 112aa 2e-15 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal domain (RasGef_N), also called REM domain (Ras exchanger motif). This domain is common in nucleotide exchange factors for Ras-like small GTPases and is typically found immediately N-terminal to the RasGef (Cdc25-like) domain. REM contacts the GTPase and is assumed to participate in the catalytic activity of the exchange factor. Proteins with the REM domain include Sos1 and Sos2, which relay signals from tyrosine-kinase mediated signalling to Ras, RasGRP1-4, RasGRF1,2, CNrasGEF, and RAP-specific nucleotide exchange factors, to name a few.
  • Changed! cd C1 50aa 4e-11 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • Changed! cd EFh 52aa 1e-05 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! smart RasGEF 234aa 8e-69 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like small GTPases.
  • Changed! smart RasGEFN 115aa 2e-20 in ref transcript
    • Guanine nucleotide exchange factor for Ras-like GTPases; N-terminal motif. A subset of guanine nucleotide exchange factor for Ras-like small GTPases appear to possess this domain N-terminal to the RasGef (Cdc25-like) domain. The recent crystal structureof Sos shows that this domain is alpha-helical and plays a "purely structural role" (Nature 394, 337-343).
  • Changed! pfam C1_1 51aa 4e-10 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
  • Changed! PRK PRK12309 60aa 1e-04 in ref transcript
    • transaldolase/EF-hand domain-containing protein; Provisional.

RASSF5

  • refseq_RASSF5.F1 refseq_RASSF5.R1 262 378
  • NCBIGene 36.3 83593
  • Single exon skipping, size difference: 116
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_182663

  • Changed! cd RASSF1_RA 94aa 5e-35 in ref transcript
    • RASSF1 (also known as RASSF3 and NORE1) is a tumour suppressor protein with a C-terminal Ras-associating (RA) domain that binds Ras. RASSF1 also binds the proapoptotic protein kinase MST1 and is thus thought to regulate the proapoptotic signalling pathway. RASSF1 also associates with microtubule-associated proteins like MAP1B and regulates tubulin polymerization. RASSF1 also binds CDC20 and regulates mitosis by inhibiting the anaphase-promoting complex and preventing degradation of cyclin A and cyclin B until the spindle checkpoint becomes fully operational.
  • cd C1 38aa 9e-05 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • Changed! smart RA 92aa 3e-14 in ref transcript
    • Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
  • pfam C1_1 38aa 4e-05 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
  • Changed! cd RASSF1_RA 59aa 7e-20 in modified transcript
  • Changed! smart RA 56aa 4e-07 in modified transcript

RBBP6

  • refseq_RBBP6.F2 refseq_RBBP6.R2 207 309
  • NCBIGene 36.3 5930
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006910

  • cd RING 44aa 6e-08 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam DWNN 73aa 7e-28 in ref transcript
    • DWNN domain. DWNN is a ubiquitin like domain found at the N terminus of the RBBP6 family of splicing-associated proteins. The DWNN domain is independently expressed in higher vertebrates so it may function as a novel ubiquitin-like modifier of other proteins.
  • smart RING 41aa 2e-08 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • smart ZnF_C2HC 16aa 0.005 in ref transcript
    • zinc finger.
  • COG COG5222 321aa 9e-40 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

RBBP8

  • refseq_RBBP8.F1 refseq_RBBP8.R1 145 226
  • NCBIGene 36.3 5932
  • Alternative 5-prime, size difference: 81
  • Inclusion in 5'UTR
  • Reference transcript: NM_203291

  • pfam CtIP_N 120aa 6e-48 in ref transcript
    • Tumour-suppressor protein CtIP N-terminal domain. CtIP is predominantly a nuclear protein that complexes with both BRCA1 and the BRCA1-associated RING domain protein (BARD1). At the protein level, CtIP expression varies with cell cycle progression in a pattern identical to that of BRCA1. Thus, the steady-state levels of CtIP polypeptides, which remain low in resting cells and G1 cycling cells, increase dramatically as Dividing cells traverse the G1/S boundary. CtIP can potentially modulate the functions ascribed to BRCA1 in transcriptional regulation, DNA repair, and/or cell cycle checkpoint control. This N-terminal domain carries a coiled-coil region and is essential for homodimerisation of the protein. The C-terminal domain is family CtIP_C and carries functionally important CxxC and RHR motifs, absence of which lead cells to grow slowly and show hypersensitivity to genotoxins.
  • pfam SAE2 88aa 5e-14 in ref transcript
    • DNA repair protein SAE2/CtIP. SAE2 is a protein involved in repairing meiotic and mitotic double-strand breaks in DNA. It has been shown to negatively regulate DNA damage checkpoint signalling. SAE2 is homologous to the CtIP proteins in mammals.
  • PRK PRK12704 130aa 0.003 in ref transcript
    • phosphodiesterase; Provisional.

RBBP8

  • refseq_RBBP8.F3 refseq_RBBP8.R3 100 197
  • NCBIGene 36.3 5932
  • Single exon skipping, size difference: 97
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002894

  • pfam CtIP_N 120aa 6e-48 in ref transcript
    • Tumour-suppressor protein CtIP N-terminal domain. CtIP is predominantly a nuclear protein that complexes with both BRCA1 and the BRCA1-associated RING domain protein (BARD1). At the protein level, CtIP expression varies with cell cycle progression in a pattern identical to that of BRCA1. Thus, the steady-state levels of CtIP polypeptides, which remain low in resting cells and G1 cycling cells, increase dramatically as Dividing cells traverse the G1/S boundary. CtIP can potentially modulate the functions ascribed to BRCA1 in transcriptional regulation, DNA repair, and/or cell cycle checkpoint control. This N-terminal domain carries a coiled-coil region and is essential for homodimerisation of the protein. The C-terminal domain is family CtIP_C and carries functionally important CxxC and RHR motifs, absence of which lead cells to grow slowly and show hypersensitivity to genotoxins.
  • Changed! pfam SAE2 88aa 5e-14 in ref transcript
    • DNA repair protein SAE2/CtIP. SAE2 is a protein involved in repairing meiotic and mitotic double-strand breaks in DNA. It has been shown to negatively regulate DNA damage checkpoint signalling. SAE2 is homologous to the CtIP proteins in mammals.
  • PRK PRK12704 130aa 0.003 in ref transcript
    • phosphodiesterase; Provisional.

RBCK1

  • refseq_RBCK1.F1 refseq_RBCK1.R1 127 272
  • NCBIGene 36.3 10616
  • Single exon skipping, size difference: 145
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_031229

  • Changed! cd Hoil1_N 75aa 4e-35 in ref transcript
    • HOIL1_N HOIL-1 (heme-oxidized IRP2 ubiquitin ligase-1) is an E3 ubiquitin-protein ligase that recognizes heme-oxidized IRP2 (iron regulatory protein2) and is thought to affect the turnover of oxidatively damaged proteins. Hoil-1 has an amino-terminal ubiquitin-like domain as well as an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain.

RBM10

  • refseq_RBM10.F1 refseq_RBM10.R1 152 383
  • NCBIGene 36.3 8241
  • Single exon skipping, size difference: 231
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005676

  • Changed! cd RRM 76aa 9e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 81aa 1e-04 in ref transcript
  • pfam G-patch 44aa 3e-13 in ref transcript
    • G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
  • Changed! smart RRM 67aa 5e-09 in ref transcript
    • RNA recognition motif.
  • smart ZnF_RBZ 25aa 2e-07 in ref transcript
    • Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.
  • Changed! TIGR ELAV_HUD_SF 199aa 1e-04 in ref transcript
    • These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
  • smart RRM 77aa 3e-04 in ref transcript
  • Changed! COG COG0724 104aa 0.002 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! cd RRM 72aa 2e-05 in modified transcript
  • Changed! smart RRM 46aa 2e-06 in modified transcript

RBM3

  • refseq_RBM3.F2 refseq_RBM3.R2 120 389
  • NCBIGene 36.2 5935
  • Single exon skipping, size difference: 269
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_006743

  • Changed! cd RRM 75aa 5e-21 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM 71aa 2e-20 in ref transcript
    • RNA recognition motif.
  • Changed! COG COG0724 79aa 1e-11 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! cd RRM 68aa 1e-18 in modified transcript
  • Changed! smart RRM 65aa 4e-18 in modified transcript
  • Changed! COG COG0724 74aa 5e-13 in modified transcript

RBM38

  • refseq_RBM38.F1 refseq_RBM38.R1 132 187
  • NCBIGene 36.3 55544
  • Single exon skipping, size difference: 55
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017495

  • cd RRM 74aa 6e-17 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM 70aa 3e-18 in ref transcript
    • RNA recognition motif.
  • COG COG0724 91aa 1e-14 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

RBM39

  • refseq_RBM39.F2 refseq_RBM39.R2 149 222
  • NCBIGene 36.2 9584
  • Single exon skipping, size difference: 73
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_184234

  • Changed! cd RRM 73aa 1e-20 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! cd RRM 73aa 3e-09 in ref transcript
  • Changed! TIGR SF-CC1 427aa 1e-152 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • Changed! COG COG0724 99aa 9e-15 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

RBM39

  • refseq_RBM39.F3 refseq_RBM39.R3 119 191
  • NCBIGene 36.2 9584
  • Single exon skipping, size difference: 72
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_184234

  • Changed! cd RRM 73aa 1e-20 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! cd RRM 73aa 3e-09 in ref transcript
  • Changed! TIGR SF-CC1 427aa 1e-152 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • Changed! COG COG0724 99aa 9e-15 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

RBM39

  • refseq_RBM39.F5 refseq_RBM39.R5 101 119
  • NCBIGene 36.3 9584
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_184234

  • cd RRM 73aa 1e-20 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 73aa 3e-09 in ref transcript
  • Changed! TIGR SF-CC1 427aa 1e-152 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • COG COG0724 99aa 9e-15 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! TIGR SF-CC1 421aa 1e-150 in modified transcript

RBM9

  • refseq_RBM9.F1 refseq_RBM9.R1 206 258
  • NCBIGene 36.3 23543
  • Single exon skipping, size difference: 40
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001082578

  • cd RRM 72aa 2e-20 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 72aa 2e-20 in ref transcript
    • RNA recognition motif.
  • COG COG0724 86aa 3e-12 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

RBM9

  • refseq_RBM9.F3 refseq_RBM9.R3 105 145
  • NCBIGene 36.3 23543
  • Single exon skipping, size difference: 40
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001082578

  • cd RRM 72aa 2e-20 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 72aa 2e-20 in ref transcript
    • RNA recognition motif.
  • COG COG0724 86aa 3e-12 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

RBMS1

  • refseq_RBMS1.F1 refseq_RBMS1.R1 189 299
  • NCBIGene 36.3 5937
  • Single exon skipping, size difference: 110
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016836

  • Changed! cd RRM 66aa 8e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! cd RRM 51aa 1e-11 in ref transcript
  • Changed! TIGR ELAV_HUD_SF 182aa 5e-22 in ref transcript
    • These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
  • Changed! COG COG0724 175aa 3e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! COG COG0724 113aa 1e-07 in ref transcript

RBMS1

  • refseq_RBMS1.F3 refseq_RBMS1.R3 100 148
  • NCBIGene 36.3 5937
  • Single exon skipping, size difference: 48
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_016836

  • cd RRM 66aa 8e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 51aa 1e-11 in ref transcript
  • TIGR ELAV_HUD_SF 182aa 5e-22 in ref transcript
    • These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
  • Changed! COG COG0724 175aa 3e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 113aa 1e-07 in ref transcript
  • Changed! COG COG0724 187aa 2e-08 in modified transcript

RBMS3

  • refseq_RBMS3.F2 refseq_RBMS3.R2 112 160
  • NCBIGene 36.3 27303
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003793

  • cd RRM 66aa 5e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 51aa 7e-12 in ref transcript
  • TIGR ELAV_HUD_SF 166aa 9e-23 in ref transcript
    • These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
  • COG COG0724 143aa 2e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! COG COG0724 119aa 7e-07 in modified transcript

RBMS3

  • refseq_RBMS3.F4 refseq_RBMS3.R4 111 162
  • NCBIGene 36.3 27303
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003793

  • cd RRM 66aa 5e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 51aa 7e-12 in ref transcript
  • TIGR ELAV_HUD_SF 166aa 9e-23 in ref transcript
    • These proteins contain 3 RNA-recognition motifs (rrm: pfam00076).
  • COG COG0724 143aa 2e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! TIGR PABP-1234 333aa 1e-18 in modified transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • Changed! COG COG0724 119aa 6e-07 in modified transcript

RBPMS

  • refseq_RBPMS.F1 refseq_RBPMS.R1 144 230
  • NCBIGene 36.3 11030
  • Single exon skipping, size difference: 86
  • Exclusion of the stop codon
  • Reference transcript: NM_001008711

  • cd RRM 74aa 1e-08 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam RRM_1 62aa 9e-09 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.

RBPMS

  • refseq_RBPMS.F2 refseq_RBPMS.R2 149 355
  • NCBIGene 36.3 11030
  • Single exon skipping, size difference: 206
  • Inclusion in 3'UTR
  • Reference transcript: NM_001008711

  • cd RRM 74aa 1e-08 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam RRM_1 62aa 9e-09 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.

RBPJ

  • refseq_RBPSUH.F1 refseq_RBPSUH.R1 124 240
  • NCBIGene 36.3 3516
  • Single exon skipping, size difference: 116
  • Exclusion of the protein initiation site
  • Reference transcript: NM_005349

  • cd IPT_RBP-Jkappa 89aa 2e-45 in ref transcript
    • IPT domain of the recombination signal Jkappa binding protein (RBP-Jkappa). RBP-J kappa, was initially considered to be involved in V(D)J recombination because of its DNA binding specificity and structural similarity to site-specific recombinases known as the integrase family. Further studies indicated that RBP-J kappa functions as a repressor of transcription, via destabilization of the general transcription factor IID and recruitment of histone deacetylase complexes.
  • pfam Beta-trefoil 150aa 4e-68 in ref transcript
    • Beta-trefoil. Members of this family of DNA binding domains adopt a beta-trefoil fold, that is, a capped beta-barrel with internal pseudo threefold symmetry. In the DNA-binding protein LAG-1, it also is the site of mutually exclusive interactions with NotchIC (and the viral protein EBNA2) and corepressors (SMRT/N-Cor and CIR).
  • pfam LAG1-DNAbind 132aa 3e-55 in ref transcript
    • LAG1, DNA binding. Members of this family are found in various eukaryotic hypothetical proteins and in the DNA-binding protein LAG-1. They adopt a beta sandwich structure, with nine strands in two beta-sheets, in a Greek-key topology, and allow for DNA binding.
  • pfam TIG 81aa 1e-07 in ref transcript
    • IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.

RBPJ

  • refseq_RBPSUH.F3 refseq_RBPSUH.R1 144 327
  • NCBIGene 36.3 3516
  • Single exon skipping, size difference: 183
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015874

  • Changed! cd IPT_RBP-Jkappa 89aa 2e-45 in ref transcript
    • IPT domain of the recombination signal Jkappa binding protein (RBP-Jkappa). RBP-J kappa, was initially considered to be involved in V(D)J recombination because of its DNA binding specificity and structural similarity to site-specific recombinases known as the integrase family. Further studies indicated that RBP-J kappa functions as a repressor of transcription, via destabilization of the general transcription factor IID and recruitment of histone deacetylase complexes.
  • Changed! pfam Beta-trefoil 150aa 3e-68 in ref transcript
    • Beta-trefoil. Members of this family of DNA binding domains adopt a beta-trefoil fold, that is, a capped beta-barrel with internal pseudo threefold symmetry. In the DNA-binding protein LAG-1, it also is the site of mutually exclusive interactions with NotchIC (and the viral protein EBNA2) and corepressors (SMRT/N-Cor and CIR).
  • Changed! pfam LAG1-DNAbind 132aa 3e-55 in ref transcript
    • LAG1, DNA binding. Members of this family are found in various eukaryotic hypothetical proteins and in the DNA-binding protein LAG-1. They adopt a beta sandwich structure, with nine strands in two beta-sheets, in a Greek-key topology, and allow for DNA binding.
  • Changed! pfam TIG 81aa 2e-07 in ref transcript
    • IPT/TIG domain. This family consists of a domain that has an immunoglobulin like fold. These domains are found in cell surface receptors such as Met and Ron as well as in intracellular transcription factors where it is involved in DNA binding. CAUTION: This family does not currently recognise a significant number of members.

RCCD1

  • refseq_RCCD1.F1 refseq_RCCD1.R1 324 408
  • NCBIGene 36.3 91433
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_033544

  • COG ATS1 181aa 1e-12 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

RECQL5

  • refseq_RECQL5.F2 refseq_RECQL5.R2 197 278
  • NCBIGene 36.2 9400
  • Alternative 3-prime, size difference: 81
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_004259

  • cd HELICc 117aa 4e-20 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! cd DEXDc 125aa 3e-09 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • Changed! TIGR recQ 397aa 2e-92 in ref transcript
    • The ATP-dependent DNA helicase RecQ of E. coli is about 600 residues long. This model represents bacterial proteins with a high degree of similarity in domain architecture and in primary sequence to E. coli RecQ. The model excludes eukaryotic and archaeal proteins with RecQ-like regions, as well as more distantly related bacterial helicases related to RecQ.
  • pfam RecQ5 205aa 7e-87 in ref transcript
    • RecQ helicase protein-like 5 (RecQ5). This family represents a conserved region approximately 200 residues long within eukaryotic RecQ helicase protein-like 5 (RecQ5). The RecQ helicases have been implicated in DNA repair and recombination, and RecQ5 may have an important role in DNA metabolism.
  • Changed! COG RecQ 396aa 4e-96 in ref transcript
    • Superfamily II DNA helicase [DNA replication, recombination, and repair].
  • Changed! cd DEXDc 149aa 1e-16 in modified transcript
  • Changed! TIGR recQ 424aa 1e-109 in modified transcript
  • Changed! COG RecQ 423aa 1e-112 in modified transcript
  • Changed! PRK PRK07003 150aa 0.010 in modified transcript
    • DNA polymerase III subunits gamma and tau; Validated.

RFC2

  • refseq_RFC2.F1 refseq_RFC2.R1 181 283
  • NCBIGene 36.3 5982
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181471

  • Changed! cd AAA 135aa 1e-18 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam Rep_fac_C 89aa 2e-24 in ref transcript
    • Replication factor C. This family includes several subunits of DNA replication factor C.
  • Changed! TIGR dnaX_nterm 278aa 6e-21 in ref transcript
    • This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
  • Changed! PRK rfc 310aa 3e-99 in ref transcript
    • replication factor C small subunit; Reviewed.
  • Changed! cd AAA 101aa 3e-08 in modified transcript
  • Changed! TIGR dnaX_nterm 244aa 2e-12 in modified transcript
  • Changed! PRK rfc 276aa 1e-81 in modified transcript

RFC3

  • refseq_RFC3.F2 refseq_RFC3.R2 139 172
  • NCBIGene 36.2 5983
  • Alternative 3-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002915

  • cd AAA 159aa 3e-08 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam RFC-E_C 95aa 2e-42 in ref transcript
    • Clamp-loader complex subunit E C-terminus. This is the C-terminal domain of RFC-E, one of the five RFC proteins of the clamp loader complex (replication factor-C, RFC) which binds to the DNA sliding clamp (proliferating cell nuclear antigen, PCNA). The five modules of RFC assemble into a right-handed spiral, which results in only three of the five RFC subunits (RFC-A, RFC-B and RFC-C) making contact with PCNA, leaving a wedge-shaped gap between RFC-E and the PCNA clamp-loader complex. The C-terminal is vital for the correct orientation of RFC-E with respect to RFC-A.
  • Changed! TIGR holB 169aa 1e-08 in ref transcript
    • At position 126-127 of the seed alignment, this family lacks the HM motif of gamma/tau; at 132 it has a near-invariant A vs. an invariant F in gamma/tau.
  • Changed! TIGR dnaX_nterm 173aa 0.003 in ref transcript
    • This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
  • Changed! PRK PRK12402 337aa 1e-40 in ref transcript
    • replication factor C small subunit 2; Reviewed.
  • Changed! TIGR holB 168aa 3e-08 in modified transcript
  • Changed! PRK PRK12402 326aa 4e-38 in modified transcript

RFC5

  • refseq_RFC5.F1 refseq_RFC5.R1 106 470
  • NCBIGene 36.3 5985
  • Single exon skipping, size difference: 364
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_007370

  • Changed! cd AAA 142aa 5e-16 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! TIGR dnaX_nterm 267aa 2e-26 in ref transcript
    • This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
  • Changed! PRK rfc 313aa 4e-92 in ref transcript
    • replication factor C small subunit; Reviewed.

TRIM13

  • refseq_RFP2.F1 refseq_RFP2.R1 106 220
  • NCBIGene 36.3 10206
  • Single exon skipping, size difference: 114
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001007278

  • cd RING 53aa 1e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart RING 48aa 2e-07 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam zf-B_box 42aa 2e-06 in ref transcript
    • B-box zinc finger.
  • COG COG5222 84aa 0.002 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

RFWD2

  • refseq_RFWD2.F1 refseq_RFWD2.R1 266 338
  • NCBIGene 36.3 64326
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022457

  • cd WD40 307aa 6e-33 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd RING 42aa 6e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart RING 38aa 2e-07 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam WD40 40aa 0.003 in ref transcript
    • WD domain, G-beta repeat.
  • COG COG2319 284aa 8e-17 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG PEX10 41aa 4e-04 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

RFWD2

  • refseq_RFWD2.F2 refseq_RFWD2.R2 321 381
  • NCBIGene 36.3 64326
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022457

  • cd WD40 307aa 6e-33 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd RING 42aa 6e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart RING 38aa 2e-07 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam WD40 40aa 0.003 in ref transcript
    • WD domain, G-beta repeat.
  • COG COG2319 284aa 8e-17 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG PEX10 41aa 4e-04 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

RFX2

  • refseq_RFX2.F1 refseq_RFX2.R1 282 357
  • NCBIGene 36.3 5990
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000635

  • pfam RFX1_trans_act 156aa 1e-28 in ref transcript
    • RFX1 transcription activation region. The RFX family is a family of winged-helix DNA binding proteins. RFX1 is a regulatory factor essential for expression of MHC class II genes. This region is to found N terminal to the RFX DNA binding region (pfam02257) in some mammalian RFX proteins, and is thought to activate transcription when associated with DNA. Deletion analysis has identified the region 233-351 in human RFX1 as being required for maximal activation.
  • Changed! pfam RFX_DNA_binding 65aa 1e-20 in ref transcript
    • RFX DNA-binding domain. RFX is a regulatory factor which binds to the X box of MHC class II genes and is essential for their expression. The DNA-binding domain of RFX is the central domain of the protein and binds ssDNA as either a monomer or homodimer.
  • Changed! pfam RFX_DNA_binding 83aa 2e-23 in modified transcript

RFXANK

  • refseq_RFXANK.F1 refseq_RFXANK.R1 126 192
  • NCBIGene 36.3 8625
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003721

  • cd ANK 125aa 3e-24 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Ank 28aa 6e-04 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 29aa 8e-04 in ref transcript
  • Changed! COG Arp 192aa 7e-13 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! COG Arp 145aa 2e-12 in modified transcript

RFXANK

  • refseq_RFXANK.F4 refseq_RFXANK.R3 175 244
  • NCBIGene 36.3 8625
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003721

  • cd ANK 125aa 3e-24 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Ank 28aa 6e-04 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 29aa 8e-04 in ref transcript
  • Changed! COG Arp 192aa 7e-13 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! COG Arp 145aa 2e-12 in modified transcript

RGMB

  • refseq_RGMB.F1 refseq_RGMB.R1 132 377
  • NCBIGene 36.3 285704
  • Alternative 3-prime, size difference: 245
  • Inclusion in 5'UTR
  • Reference transcript: NM_001012761

  • pfam RGM_C 214aa 5e-97 in ref transcript
    • Repulsive guidance molecule (RGM) C-terminus. This family consists of several mammalian and one bird sequence from Gallus gallus (Chicken). This family represents the C-terminal region of several sequences but in others it represents the full protein. All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.
  • pfam RGM_N 176aa 5e-50 in ref transcript
    • Repulsive guidance molecule (RGM) N-terminus. This family consists of the N-terminal region of several mammalian and one bird sequence from Gallus gallus (Chicken). All of the mammalian proteins are hypothetical and have no known function but the member from chicken is annotated as being a repulsive guidance molecule (RGM). RGM is a GPI-linked axon guidance molecule of the retinotectal system. RGM is repulsive for a subset of axons, those from the temporal half of the retina. Temporal retinal axons invade the anterior optic tectum in a superficial layer, and encounter RGM expressed in a gradient with increasing concentration along the anterior-posterior axis. Temporal axons are able to receive posterior-dependent information by sensing gradients or concentrations of guidance cues. Thus, RGM is likely to provide positional information for temporal axons invading the optic tectum in the stratum opticum.

RGNEF

  • refseq_RGNEF.F1 refseq_RGNEF.R1 228 360
  • NCBIGene 36.2 64283
  • Single exon skipping, size difference: 132
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: XM_932949

  • cd RhoGEF 193aa 4e-34 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • cd C1 46aa 4e-06 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • smart RhoGEF 190aa 1e-39 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • smart C1 46aa 3e-06 in ref transcript
    • Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
  • COG ROM1 267aa 3e-05 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

RGR

  • refseq_RGR.F2 refseq_RGR.R2 280 394
  • NCBIGene 36.3 5995
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002921

  • Changed! pfam 7tm_1 229aa 7e-11 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 170aa 6e-10 in modified transcript

RGS13

  • refseq_RGS13.F1 refseq_RGS13.R1 117 157
  • NCBIGene 36.3 6003
  • Single exon skipping, size difference: 40
  • Exclusion in 5'UTR
  • Reference transcript: NM_002927

  • pfam RGS 116aa 2e-35 in ref transcript
    • Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.

RHBDF2

  • refseq_RHBDF2.F1 refseq_RHBDF2.R1 192 279
  • NCBIGene 36.3 79651
  • Alternative 3-prime, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024599

  • pfam Rhomboid 124aa 4e-24 in ref transcript
    • Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth factor Spitz, allowing it to activate the Drosophila EGF receptor. Analysis has shown that Rhomboid-1 is an intramembrane serine protease (EC:3.4.21.105). Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite.
  • COG GlpG 133aa 8e-14 in ref transcript
    • Uncharacterized membrane protein (homolog of Drosophila rhomboid) [General function prediction only].

RHCE

  • refseq_RHCE.F1 refseq_RHCE.R1 179 259
  • NCBIGene 36.3 6006
  • Single exon skipping, size difference: 80
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020485

  • Changed! pfam Ammonium_transp 360aa 2e-48 in ref transcript
    • Ammonium Transporter Family.
  • COG AmtB 104aa 1e-04 in ref transcript
    • Ammonia permease [Inorganic ion transport and metabolism].
  • Changed! pfam Ammonium_transp 329aa 9e-49 in modified transcript

RHCE

  • refseq_RHCE.F3 refseq_RHCE.R3 147 281
  • NCBIGene 36.3 6006
  • Single exon skipping, size difference: 134
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020485

  • Changed! pfam Ammonium_transp 360aa 2e-48 in ref transcript
    • Ammonium Transporter Family.
  • Changed! COG AmtB 104aa 1e-04 in ref transcript
    • Ammonia permease [Inorganic ion transport and metabolism].
  • Changed! pfam Ammonium_transp 293aa 1e-48 in modified transcript
  • Changed! COG AmtB 108aa 2e-05 in modified transcript

RHCE

  • refseq_RHCE.F5 refseq_RHCE.R5 105 420
  • NCBIGene 36.3 6006
  • Multiple exon skipping, size difference: 315
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020485

  • Changed! pfam Ammonium_transp 360aa 2e-48 in ref transcript
    • Ammonium Transporter Family.
  • Changed! COG AmtB 104aa 1e-04 in ref transcript
    • Ammonia permease [Inorganic ion transport and metabolism].
  • Changed! pfam Ammonium_transp 150aa 4e-13 in modified transcript
  • Changed! pfam Ammonium_transp 110aa 4e-07 in modified transcript

RHOT1

  • refseq_RHOT1.F1 refseq_RHOT1.R1 111 207
  • NCBIGene 36.3 55288
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001033568

  • cd Miro1 165aa 8e-74 in ref transcript
    • Miro1 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the N-terminal GTPase domain of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
  • Changed! cd Miro2 166aa 8e-71 in ref transcript
    • Miro2 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the putative GTPase domain in the C terminus of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
  • pfam EF_assoc_2 89aa 2e-38 in ref transcript
    • EF hand associated. This region predominantly appears near EF-hands (pfam00036) in GTP-binding proteins. It is found in all three eukaryotic kingdoms.
  • pfam EF_assoc_1 75aa 1e-23 in ref transcript
    • EF hand associated. This region typically appears on the C-terminus of EF hands in GTP-binding proteins such as Arht/Rhot (may be involved in mitochondrial homeostasis and apoptosis). The EF hand associated region is found in yeast, vertebrates and plants.
  • smart RHO 163aa 5e-15 in ref transcript
    • Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
  • pfam Miro 111aa 2e-11 in ref transcript
    • Miro-like protein. Mitochondrial Rho proteins (Miro-1 and Miro-2), are atypical Rho GTPases. They have a unique domain organisation, with tandem GTP-binding domains and two EF hand domains (pfam00036), that may bind calcium. They are also larger than classical small GTPases. It has been proposed that they are involved in mitochondrial homeostasis and apoptosis.
  • COG COG1100 173aa 5e-15 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].
  • Changed! cd Miro2 167aa 1e-71 in modified transcript

RHOT1

  • refseq_RHOT1.F2 refseq_RHOT1.R2 276 400
  • NCBIGene 36.2 55288
  • Single exon skipping, size difference: 124
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001033568

  • Changed! cd Miro1 165aa 8e-74 in ref transcript
    • Miro1 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the N-terminal GTPase domain of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
  • Changed! cd Miro2 166aa 8e-71 in ref transcript
    • Miro2 subfamily. Miro (mitochondrial Rho) proteins have tandem GTP-binding domains separated by a linker region containing putative calcium-binding EF hand motifs. Genes encoding Miro-like proteins were found in several eukaryotic organisms. This CD represents the putative GTPase domain in the C terminus of Miro proteins. These atypical Rho GTPases have roles in mitochondrial homeostasis and apoptosis. Most Rho proteins contain a lipid modification site at the C-terminus; however, Miro is one of few Rho subfamilies that lack this feature.
  • Changed! pfam EF_assoc_2 89aa 2e-38 in ref transcript
    • EF hand associated. This region predominantly appears near EF-hands (pfam00036) in GTP-binding proteins. It is found in all three eukaryotic kingdoms.
  • Changed! pfam EF_assoc_1 75aa 1e-23 in ref transcript
    • EF hand associated. This region typically appears on the C-terminus of EF hands in GTP-binding proteins such as Arht/Rhot (may be involved in mitochondrial homeostasis and apoptosis). The EF hand associated region is found in yeast, vertebrates and plants.
  • Changed! smart RHO 163aa 5e-15 in ref transcript
    • Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
  • Changed! pfam Miro 111aa 2e-11 in ref transcript
    • Miro-like protein. Mitochondrial Rho proteins (Miro-1 and Miro-2), are atypical Rho GTPases. They have a unique domain organisation, with tandem GTP-binding domains and two EF hand domains (pfam00036), that may bind calcium. They are also larger than classical small GTPases. It has been proposed that they are involved in mitochondrial homeostasis and apoptosis.
  • Changed! COG COG1100 173aa 5e-15 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].
  • Changed! cd Miro1 29aa 6e-08 in modified transcript
  • Changed! pfam Miro 27aa 0.007 in modified transcript

RIPK5

  • refseq_RIPK5.F1 refseq_RIPK5.R1 114 249
  • NCBIGene 36.3 25778
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015375

  • Changed! cd S_TKc 245aa 2e-37 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! pfam Pkinase 245aa 2e-38 in ref transcript
    • Protein kinase domain.
  • Changed! COG SPS1 245aa 1e-20 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd S_TKc 200aa 2e-27 in modified transcript
  • Changed! pfam Pkinase 168aa 2e-27 in modified transcript
  • Changed! COG SPS1 176aa 2e-15 in modified transcript

RNASE4

  • refseq_RNASE4.F2 refseq_RNASE4.R2 278 382
  • NCBIGene 36.2 6038
  • Single exon skipping, size difference: 104
  • Exclusion of the stop codon
  • Reference transcript: NM_194430

RNF12

  • refseq_RNF12.F2 refseq_RNF12.R2 231 291
  • NCBIGene 36.3 51132
  • Single exon skipping, size difference: 60
  • Exclusion in 5'UTR
  • Reference transcript: NM_183353

  • cd RING 46aa 1e-10 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart RING 41aa 6e-09 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG COG5540 75aa 2e-09 in ref transcript
    • RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

RNF121

  • refseq_RNF121.F1 refseq_RNF121.R1 105 368
  • NCBIGene 36.3 55298
  • Multiple exon skipping, size difference: 263
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018320

  • Changed! cd RING 54aa 7e-04 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • Changed! smart RING 50aa 0.001 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! COG HRD1 58aa 7e-06 in ref transcript
    • HRD ubiquitin ligase complex, ER membrane component [Posttranslational modification, protein turnover, chaperones].

RNF121

  • refseq_RNF121.F3 refseq_RNF121.R3 161 199
  • NCBIGene 36.3 55298
  • Single exon skipping, size difference: 38
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_018320

  • Changed! cd RING 54aa 7e-04 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • Changed! smart RING 50aa 0.001 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! COG HRD1 58aa 7e-06 in ref transcript
    • HRD ubiquitin ligase complex, ER membrane component [Posttranslational modification, protein turnover, chaperones].

RNF13

  • refseq_RNF13.F2 refseq_RNF13.R2 184 286
  • NCBIGene 36.3 11342
  • Single exon skipping, size difference: 102
  • Exclusion in 5'UTR
  • Reference transcript: NM_007282

  • cd PA_C_RZF_like 158aa 8e-50 in ref transcript
    • PA_C-RZF_ like: Protease-associated (PA) domain C_RZF-like. This group includes various PA domain-containing proteins similar to C-RZF (chicken embryo RING zinc finger) protein. These proteins contain a C3H2C3 RING finger. C-RZF is expressed in embryo cells and is restricted mainly to brain and heart, it is localized to both the nucleus and endosomes. Additional C3H2C3 RING finger proteins belonging to this group, include Arabidopsis ReMembR-H2 protein and mouse sperizin. ReMembR-H2 is likely to be an integral membrane protein, and to traffic through the endosomal pathway. Sperizin is expressed in haploid germ cells and localized in the cytoplasm, it may participate in spermatogenesis. The significance of the PA domain to these proteins has not been ascertained. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate.
  • cd RING 46aa 1e-10 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam PA 92aa 2e-11 in ref transcript
    • PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
  • smart RING 42aa 7e-09 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG COG5540 253aa 1e-11 in ref transcript
    • RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

RNF135

  • refseq_RNF135.F1 refseq_RNF135.R1 120 283
  • NCBIGene 36.3 84282
  • Single exon skipping, size difference: 163
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032322

  • cd RING 44aa 5e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • Changed! smart SPRY 109aa 6e-15 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • Changed! smart RING 42aa 1e-05 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! smart PRY 51aa 0.006 in ref transcript
    • associated with SPRY domains.
  • Changed! COG PEX10 49aa 4e-04 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! TIGR rad18 73aa 8e-06 in modified transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! COG RAD18 69aa 1e-04 in modified transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].

RNF138

  • refseq_RNF138.F1 refseq_RNF138.R1 128 410
  • NCBIGene 36.3 51444
  • Multiple exon skipping, size difference: 282
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016271

  • Changed! cd RING 44aa 2e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • Changed! smart RING 40aa 9e-05 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! COG COG5222 40aa 2e-05 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

RNF14

  • refseq_RNF14.F1 refseq_RNF14.R1 176 350
  • NCBIGene 36.3 9604
  • Single exon skipping, size difference: 174
  • Exclusion in 5'UTR
  • Reference transcript: NM_004290

  • pfam RWD 130aa 3e-14 in ref transcript
    • RWD domain. This domain was identified in WD40 repeat proteins and Ring finger domain proteins. The function of this domain is unknown. GCN2 is the alpha-subunit of the only translation initiation factor (eIF2 alpha) kinase that appears in all eukaryotes. Its function requires an interaction with GCN1 via the domain at its N-terminus, which is termed the RWD domain after three major RWD-containing proteins: RING finger-containing proteins, WD-repeat-containing proteins, and yeast DEAD (DEXD)-like helicases. The structure forms an alpha + beta sandwich fold consisting of two layers: a four-stranded antiparallel beta-sheet, and three side-by-side alpha-helices.
  • pfam IBR 62aa 4e-12 in ref transcript
    • IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.

RNF14

  • refseq_RNF14.F4 refseq_RNF14.R4 217 369
  • NCBIGene 36.3 9604
  • Single exon skipping, size difference: 152
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_183399

  • Changed! pfam RWD 130aa 3e-14 in ref transcript
    • RWD domain. This domain was identified in WD40 repeat proteins and Ring finger domain proteins. The function of this domain is unknown. GCN2 is the alpha-subunit of the only translation initiation factor (eIF2 alpha) kinase that appears in all eukaryotes. Its function requires an interaction with GCN1 via the domain at its N-terminus, which is termed the RWD domain after three major RWD-containing proteins: RING finger-containing proteins, WD-repeat-containing proteins, and yeast DEAD (DEXD)-like helicases. The structure forms an alpha + beta sandwich fold consisting of two layers: a four-stranded antiparallel beta-sheet, and three side-by-side alpha-helices.
  • Changed! pfam IBR 62aa 4e-12 in ref transcript
    • IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.
  • Changed! pfam RWD 45aa 0.002 in modified transcript

RNF34

  • refseq_RNF34.F2 refseq_RNF34.R2 135 201
  • NCBIGene 36.3 80196
  • Single exon skipping, size difference: 66
  • Exclusion of the protein initiation site
  • Reference transcript: NM_194271

RNF38

  • refseq_RNF38.F2 refseq_RNF38.R2 227 377
  • NCBIGene 36.3 152006
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022781

  • cd RING 44aa 6e-07 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart RING 41aa 3e-06 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG COG5540 42aa 7e-10 in ref transcript
    • RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

RNF38

  • refseq_RNF38.F4 refseq_RNF38.R3 252 402
  • NCBIGene 36.3 152006
  • Single exon skipping, size difference: 150
  • Exclusion in 5'UTR
  • Reference transcript: NM_194328

  • cd RING 44aa 8e-07 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart RING 41aa 4e-06 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG COG5540 42aa 5e-10 in ref transcript
    • RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

RNF41

  • refseq_RNF41.F2 refseq_RNF41.R2 221 364
  • NCBIGene 36.3 10193
  • Single exon skipping, size difference: 143
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_194359

  • Changed! cd RING 43aa 7e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • Changed! pfam USP8_interact 179aa 8e-86 in ref transcript
    • USP8 interacting. This domain interacts with the UBP deubiquitinating enzyme USP8.
  • Changed! smart RING 38aa 4e-05 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! COG COG5219 44aa 0.002 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

RNF41

  • refseq_RNF41.F3 refseq_RNF41.R3 189 332
  • NCBIGene 36.3 10193
  • Alternative 5-prime, size difference: 143
  • Exclusion in 5'UTR
  • Reference transcript: NM_005785

  • cd RING 43aa 7e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam USP8_interact 179aa 8e-86 in ref transcript
    • USP8 interacting. This domain interacts with the UBP deubiquitinating enzyme USP8.
  • smart RING 38aa 4e-05 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG COG5219 44aa 0.002 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

RNF8

  • refseq_RNF8.F2 refseq_RNF8.R2 190 395
  • NCBIGene 36.3 9025
  • Single exon skipping, size difference: 205
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003958

  • cd FHA 95aa 2e-11 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • Changed! cd RING 43aa 5e-08 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam FHA 72aa 3e-12 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • Changed! smart RING 38aa 7e-08 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG COG1716 104aa 5e-06 in ref transcript
    • FOG: FHA domain [Signal transduction mechanisms].
  • Changed! COG RAD18 47aa 5e-06 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].

RNH1

  • refseq_RNH1.F2 refseq_RNH1.R2 186 254
  • NCBIGene 36.3 6050
  • Alternative 3-prime, size difference: 68
  • Inclusion in 5'UTR
  • Reference transcript: NM_203386

  • cd LRR_RI 320aa 6e-76 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • smart LRR_RI 28aa 0.009 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.

RNH1

  • refseq_RNH1.F3 refseq_RNH1.R1 163 231
  • NCBIGene 36.3 6050
  • Alternative 3-prime, size difference: 68
  • Inclusion in 5'UTR
  • Reference transcript: NM_203384

  • cd LRR_RI 320aa 6e-76 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • smart LRR_RI 28aa 0.009 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.

RNH1

  • refseq_RNH1.F6 refseq_RNH1.R3 124 297
  • NCBIGene 36.3 6050
  • Single exon skipping, size difference: 173
  • Exclusion in 5'UTR
  • Reference transcript: NM_203387

  • cd LRR_RI 320aa 6e-76 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • smart LRR_RI 28aa 0.009 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.

RP11-564C4.2

  • refseq_RP11-564C4.2.F1 refseq_RP11-564C4.2.R1 102 121
  • NCBIGene 36.2 653308
  • Alternative 5-prime, size difference: 19
  • Exclusion in 5'UTR
  • Reference transcript: XM_933039

  • pfam Ceramidase_alk 134aa 5e-45 in ref transcript
    • Neutral/alkaline non-lysosomal ceramidase. This family represents a group of neutral/alkaline ceramidases found in both bacteria and eukaryotes.

RP11-564C4.2

  • refseq_RP11-564C4.2.F1 refseq_RP11-564C4.2.R3 269 324
  • NCBIGene 36.2 653308
  • Alternative 3-prime, size difference: 74
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_933039

  • Changed! pfam Ceramidase_alk 134aa 5e-45 in ref transcript
    • Neutral/alkaline non-lysosomal ceramidase. This family represents a group of neutral/alkaline ceramidases found in both bacteria and eukaryotes.

RPAIN

  • refseq_RPAIN.F1 refseq_RPAIN.R1 153 217
  • NCBIGene 36.2 84268
  • Single exon skipping, size difference: 64
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001033002

RPE

  • refseq_RPE.F1 refseq_RPE.R1 171 251
  • NCBIGene 36.3 6120
  • Single exon skipping, size difference: 80
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_199229

  • Changed! cd RPE 209aa 3e-84 in ref transcript
    • Ribulose-5-phosphate 3-epimerase (RPE). This enzyme catalyses the interconversion of D-ribulose 5-phosphate (Ru5P) into D-xylulose 5-phosphate, as part of the Calvin cycle (reductive pentose phosphate pathway) in chloroplasts and in the oxidative pentose phosphate pathway. In the Calvin cycle Ru5P is phosphorylated by phosphoribulose kinase to ribulose-1,5-bisphosphate, which in turn is used by RubisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase) to incorporate CO2 as the central step in carbohydrate synthesis.
  • Changed! pfam Ribul_P_3_epim 199aa 5e-62 in ref transcript
    • Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate.
  • Changed! PTZ PTZ00170 210aa 1e-82 in ref transcript
    • D-ribulose-5-phosphate 3-epimerase; Provisional.
  • Changed! cd RPE 37aa 2e-12 in modified transcript
  • Changed! pfam Ribul_P_3_epim 37aa 3e-09 in modified transcript
  • Changed! PRK PRK05581 37aa 1e-11 in modified transcript
    • ribulose-phosphate 3-epimerase; Validated.

RPE

  • refseq_RPE.F3 refseq_RPE.R3 192 246
  • NCBIGene 36.3 6120
  • Single exon skipping, size difference: 54
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_199229

  • Changed! cd RPE 209aa 3e-84 in ref transcript
    • Ribulose-5-phosphate 3-epimerase (RPE). This enzyme catalyses the interconversion of D-ribulose 5-phosphate (Ru5P) into D-xylulose 5-phosphate, as part of the Calvin cycle (reductive pentose phosphate pathway) in chloroplasts and in the oxidative pentose phosphate pathway. In the Calvin cycle Ru5P is phosphorylated by phosphoribulose kinase to ribulose-1,5-bisphosphate, which in turn is used by RubisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase) to incorporate CO2 as the central step in carbohydrate synthesis.
  • Changed! pfam Ribul_P_3_epim 199aa 5e-62 in ref transcript
    • Ribulose-phosphate 3 epimerase family. This enzyme catalyses the conversion of D-ribulose 5-phosphate into D-xylulose 5-phosphate.
  • Changed! PTZ PTZ00170 210aa 1e-82 in ref transcript
    • D-ribulose-5-phosphate 3-epimerase; Provisional.
  • Changed! cd RPE 227aa 7e-82 in modified transcript
  • Changed! TIGR rpe 226aa 1e-59 in modified transcript
    • This family consists of Ribulose-phosphate 3-epimerase, also known as pentose-5-phosphate 3-epimerase (PPE). PPE converts D-ribulose 5-phosphate into D-xylulose 5-phosphate in Calvin's reductive pentose phosphate cycle. It has been found in a wide range of bacteria, archebacteria, fungi and plants.
  • Changed! PTZ PTZ00170 228aa 8e-80 in modified transcript

RPL17

  • refseq_RPL17.F1 refseq_RPL17.R1 111 360
  • NCBIGene 36.3 6139
  • Alternative 3-prime, size difference: 249
  • Inclusion in 5'UTR
  • Reference transcript: NM_001035006

  • cd Ribosomal_L22 136aa 1e-26 in ref transcript
    • Ribosomal protein L22/L17e. L22 (L17 in eukaryotes) is a core protein of the large ribosomal subunit. It is the only ribosomal protein that interacts with all six domains of 23S rRNA, and is one of the proteins important for directing the proper folding and stabilizing the conformation of 23S rRNA. L22 is the largest protein contributor to the surface of the polypeptide exit channel, the tunnel through which the polypeptide product passes. L22 is also one of six proteins located at the putative translocon binding site on the exterior surface of the ribosome.
  • TIGR L22_arch 150aa 6e-67 in ref transcript
    • This model describes the ribosomal protein of the eukaryotic cytosol and of the Archaea, variously designated as L17, L22, and L23. The corresponding bacterial homolog, described by a separate model, is designated L22.
  • PTZ PTZ00178 163aa 2e-73 in ref transcript
    • 60S ribosomal protein L22/L17e; Provisional.

RPL3

  • refseq_RPL3.F1 refseq_RPL3.R1 186 333
  • NCBIGene 36.3 6122
  • Alternative 5-prime and 3-prime, size difference: 147
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000967

  • Changed! TIGR L3_arch 367aa 3e-73 in ref transcript
    • This model describes exclusively the archaeal class of ribosomal protein L3. A separate model (TIGR03625) describes the bacterial/organelle form, and both belong to pfam00297. Eukaryotic proteins are excluded from this model.
  • Changed! PTZ PTZ00103 401aa 0.0 in ref transcript
    • 60S ribosomal protein L3; Provisional.
  • Changed! TIGR L3_arch 318aa 1e-66 in modified transcript
  • Changed! PTZ PTZ00103 352aa 1e-165 in modified transcript

RPL32

  • refseq_RPL32.F1 refseq_RPL32.R1 156 255
  • NCBIGene 36.3 6161
  • Alternative 3-prime, size difference: 99
  • Inclusion in 5'UTR
  • Reference transcript: NM_000994

  • cd Ribosomal_L32_L32e 105aa 8e-43 in ref transcript
    • Ribosomal_L32_L32e: L32 is a protein from the large subunit that contains a surface-exposed globular domain and a finger-like projection that extends into the RNA core to stabilize the tertiary structure. L32 does not appear to play a role in forming the A (aminacyl), P (peptidyl) or E (exit) sites of the ribosome, but does interact with 23S rRNA, which has a "kink-turn" secondary structure motif. L32 is overexpressed in human prostate cancer and has been identified as a stably expressed housekeeping gene in macrophages of human chronic obstructive pulmonary disease (COPD) patients. In Schizosaccharomyces pombe, L32 has also been suggested to play a role as a transcriptional regulator in the nucleus. Found in archaea and eukaryotes, this protein is known as L32 in eukaryotes and L32e in archaea.
  • pfam Ribosomal_L32e 106aa 3e-42 in ref transcript
    • Ribosomal protein L32. This family includes ribosomal protein L32 from eukaryotes and archaebacteria.
  • PTZ PTZ00159 133aa 7e-47 in ref transcript
    • 60S ribosomal protein L32; Provisional.

RPL38

  • refseq_RPL38.F1 refseq_RPL38.R1 133 168
  • NCBIGene 36.3 6169
  • Alternative 5-prime, size difference: 35
  • Exclusion in 5'UTR
  • Reference transcript: NM_000999

  • pfam Ribosomal_L38e 68aa 3e-14 in ref transcript
    • Ribosomal L38e protein family.
  • PTZ PTZ00181 62aa 1e-05 in ref transcript
    • 60S ribosomal protein L38; Provisional.

RPL6

  • refseq_RPL6.F1 refseq_RPL6.R1 219 395
  • NCBIGene 36.3 6128
  • Alternative 5-prime, size difference: 176
  • Exclusion in 5'UTR
  • Reference transcript: NM_001024662

  • pfam Ribosomal_L6e 108aa 6e-39 in ref transcript
    • Ribosomal protein L6e.
  • pfam Ribosomal_L6e_N 32aa 3e-07 in ref transcript
    • Ribosomal protein L6, N-terminal domain.
  • COG RPL14A 109aa 1e-06 in ref transcript
    • Ribosomal protein L14E/L6E/L27E [Translation, ribosomal structure and biogenesis].

RPLP0

  • refseq_RPLP0.F2 refseq_RPLP0.R2 150 210
  • NCBIGene 36.3 6175
  • Alternative 3-prime, size difference: 60
  • Inclusion in 5'UTR
  • Reference transcript: NM_001002

  • cd Ribosomal_P0_L10e 175aa 1e-69 in ref transcript
    • Ribosomal protein L10 family, P0 and L10e subfamily; composed of eukaryotic 60S ribosomal protein P0 and the archaeal P0 homolog, L10e. P0 or L10e forms a tight complex with multiple copies of the small acidic protein L12(e). This complex forms a stalk structure on the large subunit of the ribosome. The stalk is known to contain the binding site for elongation factors G and Tu (EF-G and EF-Tu, respectively); however, there is disagreement as to whether or not L10 is involved in forming the binding site. The stalk is believed to be associated with GTPase activities in protein synthesis. In a neuroblastoma cell line, L10 has been shown to interact with the SH3 domain of Src and to activate the binding of the Nck1 adaptor protein with skeletal proteins such as the Wiskott-Aldrich Syndrome Protein (WASP) and the WASP-interacting protein (WIP). These eukaryotic and archaeal P0 sequences have an additional C-terminal domain homologous with acidic proteins P1 and P2.
  • pfam Ribosomal_L10 103aa 2e-32 in ref transcript
    • Ribosomal protein L10.
  • PTZ PTZ00135 269aa 4e-82 in ref transcript
    • ribosomal phosphoprotein P0; Provisional.

RPLP1

  • refseq_RPLP1.F1 refseq_RPLP1.R1 189 264
  • NCBIGene 36.3 6176
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003

  • Changed! cd Ribosomal_P1 108aa 7e-24 in ref transcript
    • Ribosomal protein P1. This subfamily represents the eukaryotic large ribosomal protein P1. Eukaryotic P1 and P2 are functionally equivalent to the bacterial protein L7/L12, but are not homologous to L7/L12. P1 is located in the L12 stalk, with proteins P2, P0, L11, and 28S rRNA. P1 and P2 are the only proteins in the ribosome to occur as multimers, always appearing as sets of heterodimers. Recent data indicate that eukaryotes have four copies (two heterodimers), while most archaeal species contain six copies of L12p (three homodimers) and bacteria may have four or six copies (two or three homodimers), depending on the species. Experiments using S. cerevisiae P1 and P2 indicate that P1 proteins are positioned more internally with limited reactivity in the C-terminal domains, while P2 proteins seem to be more externally located and are more likely to interact with other cellular components. In lower eukaryotes, P1 and P2 are further subdivided into P1A, P1B, P2A, and P2B, which form P1A/P2B and P1B/P2A heterodimers. Some plant species have a third P-protein, called P3, which is not homologous to P1 and P2. In humans, P1 and P2 are strongly autoimmunogenic. They play a significant role in the etiology and pathogenesis of systemic lupus erythema (SLE). In addition, the ribosome-inactivating protein trichosanthin (TCS) interacts with human P0, P1, and P2, with its primary binding site located in the C-terminal region of P2. TCS inactivates the ribosome by depurinating a specific adenine in the sarcin-ricin loop of 28S rRNA.
  • Changed! pfam Ribosomal_60s 92aa 8e-07 in ref transcript
    • 60s Acidic ribosomal protein. This family includes archaebacterial L12, eukaryotic P0, P1 and P2.
  • Changed! COG RPP1A 53aa 2e-05 in ref transcript
    • Ribosomal protein L12E/L44/L45/RPP1/RPP2 [Translation, ribosomal structure and biogenesis].
  • Changed! cd Ribosomal_P1 83aa 4e-07 in modified transcript

RPS6KA4

  • refseq_RPS6KA4.F1 refseq_RPS6KA4.R1 100 118
  • NCBIGene 36.3 8986
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003942

  • cd STKc_MSK2_N 333aa 1e-178 in ref transcript
    • STKc_MSK2_N: Serine/Threonine Kinases (STKs), Mitogen and stress-activated kinase (MSK) subfamily, MSK2, N-terminal catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MSK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MSKs contain an N-terminal kinase domain (NTD) from the AGC family and a C-terminal kinase domain (CTD) from the CAMK family, similar to 90 kDa ribosomal protein S6 kinases (RSKs). MSKs are activated by two major signaling cascades, the Ras-MAPK and p38 stress kinase pathways, which trigger phosphorylation in the activation loop (A-loop) of the CTD of MSK. The active CTD phosphorylates the hydrophobic motif (HM) of NTD, which facilitates the phosphorylation of the A-loop and activates the NTD, which in turn phosphorylates downstream targets. MSK2 and MSK1 play nonredundant roles in activating histone H3 kinases, which play pivotal roles in compaction of the chromatin fiber. MSK2 is the required H3 kinase in response to stress stimuli and activation of the p38 MAPK pathway. MSK2 also plays a role in the pathogenesis of psoriasis.
  • cd S_TKc 258aa 2e-57 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 259aa 5e-77 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart S_TKc 254aa 3e-62 in ref transcript
  • smart S_TK_X 61aa 2e-08 in ref transcript
    • Extension to Ser/Thr-type protein kinases.
  • PTZ PTZ00263 300aa 2e-74 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • PTZ PTZ00263 255aa 1e-30 in ref transcript

RPS6KB2

  • refseq_RPS6KB2.F2 refseq_RPS6KB2.R2 122 144
  • NCBIGene 36.2 6199
  • Alternative 5-prime, size difference: 22
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003952

  • Changed! cd STKc_p70S6K 323aa 0.0 in ref transcript
    • STKc_p70S6K: Serine/Threonine Kinases (STKs), 70 kDa ribosomal protein S6 kinase (p70S6K) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The p70S6K subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. p70S6K (or S6K) contains only one catalytic kinase domain, unlike p90 ribosomal S6 kinases (RSKs). It acts as a downstream effector of the STK mTOR (mammalian Target of Rapamycin) and plays a role in the regulation of the translation machinery during protein synthesis. p70S6K also plays a pivotal role in regulating cell size and glucose homeostasis. Its targets include S6, the translation initiation factor eIF3, and the insulin receptor substrate IRS-1, among others. Mammals contain two isoforms of p70S6K, named S6K1 and S6K2 (or S6K-beta).
  • Changed! smart S_TKc 252aa 2e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! smart S_TK_X 63aa 2e-15 in ref transcript
    • Extension to Ser/Thr-type protein kinases.
  • Changed! PTZ PTZ00263 321aa 3e-81 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! cd STKc_p70S6K 190aa 1e-109 in modified transcript
  • Changed! smart S_TKc 183aa 1e-65 in modified transcript
  • Changed! PTZ PTZ00263 193aa 8e-54 in modified transcript

RRAGB

  • refseq_RRAGB.F1 refseq_RRAGB.R1 110 194
  • NCBIGene 36.3 10325
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016656

  • Changed! cd Ras_like_GTPase 180aa 8e-09 in ref transcript
    • Ras-like GTPase superfamily. The Ras-like superfamily of small GTPases consists of several families with an extremely high degree of structural and functional similarity. The Ras superfamily is divided into at least four families in eukaryotes: the Ras, Rho, Rab, and Sar1/Arf families. This superfamily also includes proteins like the GTP translation factors, Era-like GTPases, and G-alpha chain of the heterotrimeric G proteins. Members of the Ras superfamily regulate a wide variety of cellular functions: the Ras family regulates gene expression, the Rho family regulates cytoskeletal reorganization and gene expression, the Rab and Sar1/Arf families regulate vesicle trafficking, and the Ran family regulates nucleocytoplasmic transport and microtubule organization. The GTP translation factor family regulate initiation, elongation, termination, and release in translation, and the Era-like GTPase family regulates cell division, sporulation, and DNA replication. Members of the Ras superfamily are identified by the GTP binding site, which is made up of five characteristic sequence motifs, and the switch I and switch II regions.
  • pfam Gtr1_RagA 199aa 6e-74 in ref transcript
    • Gtr1/RagA G protein conserved region. GTR1 was first identified in Saccharomyces cerevisiae as a suppressor of a mutation in RCC1. Biochemical analysis revealed that Gtr1 is in fact a G protein of the Ras family. The RagA/B proteins are the human homologues of Gtr1. Included in this family is the human Rag C, a novel protein that has been shown to interact with RagA/B.
  • Changed! pfam Ras 178aa 2e-05 in ref transcript
    • Ras family. Includes sub-families Ras, Rab, Rac, Ral, Ran, Rap Ypt1 and more. Shares P-loop motif with GTP_EFTU, arf and myosin_head. See pfam00009 pfam00025, pfam00063. As regards Rab GTPases, these are important regulators of vesicle formation, motility and fusion. They share a fold in common with all Ras GTPases: this is a six-stranded beta-sheet surrounded by five alpha-helices.
  • Changed! cd Ras_like_GTPase 152aa 6e-11 in modified transcript
  • Changed! pfam Ras 150aa 7e-07 in modified transcript
  • Changed! COG COG1100 145aa 2e-06 in modified transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

RREB1

  • refseq_RREB1.F1 refseq_RREB1.R1 161 326
  • NCBIGene 36.3 6239
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001003699

RSPO4

  • refseq_RSPO4.F1 refseq_RSPO4.R1 314 500
  • NCBIGene 36.3 343637
  • Single exon skipping, size difference: 186
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001029871

  • Changed! smart FU 41aa 0.006 in modified transcript
    • Furin-like repeats.

RSU1

  • refseq_RSU1.F2 refseq_RSU1.R2 133 245
  • NCBIGene 36.3 6251
  • Single exon skipping, size difference: 112
  • Exclusion of the protein initiation site
  • Reference transcript: NM_012425

  • Changed! cd LRR_RI 154aa 0.002 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • Changed! COG COG4886 181aa 2e-09 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • Changed! cd LRR_RI 114aa 3e-04 in modified transcript
  • Changed! COG COG4886 147aa 2e-10 in modified transcript

RTEL1

  • refseq_RTEL1.F1 refseq_RTEL1.R1 231 401
  • NCBIGene 36.3 51750
  • Alternative 5-prime, size difference: 170
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032957

  • pfam DEAD_2 162aa 1e-61 in ref transcript
    • DEAD_2. This represents a conserved region within a number of RAD3-like DNA-binding helicases that are seemingly ubiquitous - members include proteins of eukaryotic, bacterial and archaeal origin. RAD3 is involved in nucleotide excision repair, and forms part of the transcription factor TFIIH in yeast.
  • TIGR rad3 292aa 4e-60 in ref transcript
    • All proteins in this family for which funcitons are known are DNA-DNA helicases that funciton in the initiation of transcription and nucleotide excision repair as part of the TFIIH complex. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • COG DinG 737aa 1e-53 in ref transcript
    • Rad3-related DNA helicases [Transcription / DNA replication, recombination, and repair].

RTN3

  • refseq_RTN3.F2 refseq_RTN3.R2 308 367
  • NCBIGene 36.3 10313
  • Single exon skipping, size difference: 59
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_201428

  • Changed! pfam Reticulon 171aa 8e-41 in ref transcript
    • Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues.
  • Changed! pfam Reticulon 157aa 4e-36 in modified transcript

SGSM1

  • refseq_RUTBC2.F2 refseq_RUTBC2.R2 100 283
  • NCBIGene 36.3 129049
  • Single exon skipping, size difference: 183
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039948

  • smart TBC 145aa 2e-30 in ref transcript
    • Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
  • pfam RUN 146aa 5e-21 in ref transcript
    • RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
  • COG COG5210 212aa 2e-18 in ref transcript
    • GTPase-activating protein [General function prediction only].

RWDD1

  • refseq_RWDD1.F1 refseq_RWDD1.R1 102 230
  • NCBIGene 36.3 51389
  • Single exon skipping, size difference: 128
  • Exclusion in 5'UTR
  • Reference transcript: NM_016104

SAA1

  • refseq_SAA1.F1 refseq_SAA1.R1 162 309
  • NCBIGene 36.3 6288
  • Alternative 5-prime, size difference: 147
  • Exclusion in 5'UTR
  • Reference transcript: NM_000331

  • pfam SAA 103aa 2e-47 in ref transcript
    • Serum amyloid A protein.

SAR1B

  • refseq_SAR1B.F1 refseq_SAR1B.R1 113 232
  • NCBIGene 36.3 51128
  • Single exon skipping, size difference: 119
  • Exclusion in 5'UTR
  • Reference transcript: NM_001033503

  • cd Sar1 195aa 3e-98 in ref transcript
    • Sar1 subfamily. Sar1 is an essential component of COPII vesicle coats involved in export of cargo from the ER. The GTPase activity of Sar1 functions as a molecular switch to control protein-protein and protein-lipid interactions that direct vesicle budding from the ER. Activation of the GDP to the GTP-bound form of Sar1 involves the membrane-associated guanine nucleotide exchange factor (GEF) Sec12. Sar1 is unlike all Ras superfamily GTPases that use either myristoyl or prenyl groups to direct membrane association and function, in that Sar1 lacks such modification. Instead, Sar1 contains a unique nine-amino-acid N-terminal extension. This extension contains an evolutionarily conserved cluster of bulky hydrophobic amino acids, referred to as the Sar1-N-terminal activation recruitment (STAR) motif. The STAR motif mediates the recruitment of Sar1 to ER membranes and facilitates its interaction with mammalian Sec12 GEF leading to activation.
  • smart SAR 193aa 5e-75 in ref transcript
    • Sar1p-like members of the Ras-family of small GTPases. Yeast SAR1 is an essential gene required for transport of secretory proteins from the endoplasmic reticulum to the Golgi apparatus.
  • PTZ PTZ00133 191aa 3e-23 in ref transcript
    • ADP-ribosylation factor; Provisional.

SC4MOL

  • refseq_SC4MOL.F1 refseq_SC4MOL.R1 123 409
  • NCBIGene 36.3 6307
  • Single exon skipping, size difference: 286
  • Exclusion of the protein initiation site
  • Reference transcript: NM_006745

  • pfam FA_hydroxylase 113aa 3e-18 in ref transcript
    • Fatty acid hydroxylase superfamily. This superfamily includes fatty acid and carotene hydroxylases and sterol desaturases. Beta-carotene hydroxylase is involved in zeaxanthin synthesis by hydroxylating beta-carotene, but the enzyme may be involved in other pathways. This family includes C-5 sterol desaturase and C-4 sterol methyl oxidase. Members of this family are involved in cholesterol biosynthesis and biosynthesis a plant cuticular wax. These enzymes contain two copies of a HXHH motif. Members of this family are integral membrane proteins.
  • Changed! COG ERG3 234aa 6e-24 in ref transcript
    • Sterol desaturase [Lipid metabolism].
  • Changed! COG ERG3 143aa 1e-20 in modified transcript

SCAMP1

  • refseq_SCAMP1.F1 refseq_SCAMP1.R1 200 360
  • NCBIGene 36.2 9522
  • Single exon skipping, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004866

  • Changed! pfam SCAMP 128aa 4e-47 in ref transcript
    • SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cytoplasmic tail. SCAMPs probably function in endocytosis by recruiting EH-domain proteins to the N-terminal NPF repeats but may have additional functions mediated by their other sequences.
  • Changed! pfam SCAMP 42aa 6e-20 in modified transcript

SCAMP3

  • refseq_SCAMP3.F2 refseq_SCAMP3.R2 139 217
  • NCBIGene 36.3 10067
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005698

  • pfam SCAMP 178aa 1e-50 in ref transcript
    • SCAMP family. In vertebrates, secretory carrier membrane proteins (SCAMPs) 1-3 constitute a family of putative membrane-trafficking proteins composed of cytoplasmic N-terminal sequences with NPF repeats, four central transmembrane regions (TMRs), and a cytoplasmic tail. SCAMPs probably function in endocytosis by recruiting EH-domain proteins to the N-terminal NPF repeats but may have additional functions mediated by their other sequences.

SCARF1

  • refseq_SCARF1.F1 refseq_SCARF1.R1 171 208
  • NCBIGene 36.3 8578
  • Alternative 3-prime, size difference: 37
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003693

SCEL

  • refseq_SCEL.F1 refseq_SCEL.R1 337 397
  • NCBIGene 36.3 8796
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144777

  • smart LIM 39aa 7e-04 in ref transcript
    • Zinc-binding domain present in Lin-11, Isl-1, Mec-3. Zinc-binding domain family. Some LIM domains bind protein partners via tyrosine-containing motifs. LIM domains are found in many key regulators of developmental pathways.

SCFD1

  • refseq_SCFD1.F1 refseq_SCFD1.R1 109 180
  • NCBIGene 36.3 23256
  • Single exon skipping, size difference: 71
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016106

  • Changed! pfam Sec1 594aa 1e-167 in ref transcript
    • Sec1 family.
  • Changed! COG SEC1 628aa 2e-79 in ref transcript
    • Proteins involved in synaptic transmission and general secretion, Sec1 family [Intracellular trafficking and secretion].

SCMH1

  • refseq_SCMH1.F1 refseq_SCMH1.R1 221 287
  • NCBIGene 36.3 22955
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031694

  • cd SAM 65aa 2e-08 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • smart MBT 96aa 7e-35 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • smart MBT 99aa 1e-31 in ref transcript
  • pfam SAM_1 64aa 7e-12 in ref transcript
    • SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.

SCMH1

  • refseq_SCMH1.F3 refseq_SCMH1.R3 148 362
  • NCBIGene 36.3 22955
  • Multiple exon skipping, size difference: 214
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031694

  • cd SAM 65aa 2e-08 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • Changed! smart MBT 96aa 7e-35 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • smart MBT 99aa 1e-31 in ref transcript
  • pfam SAM_1 64aa 7e-12 in ref transcript
    • SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
  • Changed! smart MBT 79aa 8e-27 in modified transcript

SCML1

  • refseq_SCML1.F2 refseq_SCML1.R2 151 303
  • NCBIGene 36.3 6322
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 152
  • Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006746

  • cd SAM 66aa 8e-08 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • pfam SAM_1 65aa 1e-10 in ref transcript
    • SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.

SCML1

  • refseq_SCML1.F3 refseq_SCML1.R1 137 218
  • NCBIGene 36.3 6322
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 81
  • Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037540

  • cd SAM 66aa 9e-08 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • pfam SAM_1 65aa 1e-10 in ref transcript
    • SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.

SCNM1

  • refseq_SCNM1.F2 refseq_SCNM1.R2 106 155
  • NCBIGene 36.2 79005
  • Alternative 5-prime, size difference: 49
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_024041

SCP2

  • refseq_SCP2.F1 refseq_SCP2.R1 101 233
  • NCBIGene 36.3 6342
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002979

  • Changed! cd nondecarbox_cond_enzymes 387aa 1e-153 in ref transcript
    • nondecarboxylating condensing enzymes; In general, thiolases catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine. There are 2 functional different classes: thiolase-I (3-ketoacyl-CoA thiolase) and thiolase-II (acetoacetyl-CoA thiolase). Thiolase-I can cleave longer fatty acid molecules and plays an important role in the beta-oxidative degradation of fatty acids. Thiolase-II has a high substrate specificity. Although it can cleave acetoacyl-CoA, its main function is the synthesis of acetoacyl-CoA from two molecules of acetyl-CoA, which gives it importance in several biosynthetic pathways.
  • Changed! TIGR AcCoA-C-Actrans 364aa 4e-26 in ref transcript
    • This model represents a large family of enzymes which catalyze the thiolysis of a linear fatty acid CoA (or acetoacetyl-CoA) using a second CoA molecule to produce acetyl-CoA and a CoA-ester product two carbons shorter (or, alternatively, the condensation of two molecules of acetyl-CoA to produce acetoacetyl-CoA and CoA). This enzyme is also known as "thiolase", "3-ketoacyl-CoA thiolase", "beta-ketothiolase" and "Fatty oxidation complex beta subunit". When catalyzing the degradative reaction on fatty acids the corresponding EC number is 2.3.1.16. The condensation reaction corresponds to 2.3.1.9. Note that the enzymes which catalyze the condensation are generally not involved in fatty acid biosynthesis, which is carried out by a decarboxylating condensation of acetyl and malonyl esters of acyl carrier proteins. Rather, this activity may produce acetoacetyl-CoA for pathways such as IPP biosynthesis in the absence of sufficient fatty acid oxidation.
  • pfam SCP2 88aa 7e-20 in ref transcript
    • SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145.
  • Changed! PRK PRK08256 393aa 0.0 in ref transcript
    • lipid-transfer protein; Provisional.
  • COG COG3154 66aa 2e-07 in ref transcript
    • Putative lipid carrier protein [Lipid metabolism].
  • Changed! cd nondecarbox_cond_enzymes 343aa 1e-129 in modified transcript
  • Changed! TIGR AcCoA-C-Actrans 177aa 5e-17 in modified transcript
  • Changed! PRK PRK08256 349aa 1e-168 in modified transcript

SCP2

  • refseq_SCP2.F3 refseq_SCP2.R3 118 248
  • NCBIGene 36.3 6342
  • Single exon skipping, size difference: 130
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002979

  • cd nondecarbox_cond_enzymes 387aa 1e-153 in ref transcript
    • nondecarboxylating condensing enzymes; In general, thiolases catalyze the reversible thiolytic cleavage of 3-ketoacyl-CoA into acyl-CoA and acetyl-CoA, a 2-step reaction involving a covalent intermediate formed with a catalytic cysteine. There are 2 functional different classes: thiolase-I (3-ketoacyl-CoA thiolase) and thiolase-II (acetoacetyl-CoA thiolase). Thiolase-I can cleave longer fatty acid molecules and plays an important role in the beta-oxidative degradation of fatty acids. Thiolase-II has a high substrate specificity. Although it can cleave acetoacyl-CoA, its main function is the synthesis of acetoacyl-CoA from two molecules of acetyl-CoA, which gives it importance in several biosynthetic pathways.
  • TIGR AcCoA-C-Actrans 364aa 4e-26 in ref transcript
    • This model represents a large family of enzymes which catalyze the thiolysis of a linear fatty acid CoA (or acetoacetyl-CoA) using a second CoA molecule to produce acetyl-CoA and a CoA-ester product two carbons shorter (or, alternatively, the condensation of two molecules of acetyl-CoA to produce acetoacetyl-CoA and CoA). This enzyme is also known as "thiolase", "3-ketoacyl-CoA thiolase", "beta-ketothiolase" and "Fatty oxidation complex beta subunit". When catalyzing the degradative reaction on fatty acids the corresponding EC number is 2.3.1.16. The condensation reaction corresponds to 2.3.1.9. Note that the enzymes which catalyze the condensation are generally not involved in fatty acid biosynthesis, which is carried out by a decarboxylating condensation of acetyl and malonyl esters of acyl carrier proteins. Rather, this activity may produce acetoacetyl-CoA for pathways such as IPP biosynthesis in the absence of sufficient fatty acid oxidation.
  • Changed! pfam SCP2 88aa 7e-20 in ref transcript
    • SCP-2 sterol transfer family. This domain is involved in binding sterols. It is found in the SCP2 protein, as well as the C terminus of the enzyme estradiol 17 beta-dehydrogenase EC:1.1.1.62. The UNC-24 protein contains an SPFH domain pfam01145.
  • PRK PRK08256 393aa 0.0 in ref transcript
    • lipid-transfer protein; Provisional.
  • Changed! COG COG3154 66aa 2e-07 in ref transcript
    • Putative lipid carrier protein [Lipid metabolism].

SCRIB

  • refseq_SCRIB.F1 refseq_SCRIB.R1 199 274
  • NCBIGene 36.3 23513
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182706

  • cd PDZ_signaling 89aa 1e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 88aa 4e-14 in ref transcript
  • cd PDZ_signaling 60aa 6e-12 in ref transcript
  • cd PDZ_signaling 92aa 3e-07 in ref transcript
  • cd LRR_RI 223aa 1e-05 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • smart PDZ 90aa 6e-19 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 91aa 3e-16 in ref transcript
  • smart PDZ 66aa 3e-14 in ref transcript
  • TIGR degP_htrA_DO 172aa 2e-08 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • smart PDZ 94aa 6e-06 in ref transcript
  • COG COG4886 200aa 4e-27 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].
  • COG COG4886 270aa 1e-21 in ref transcript
  • COG Prc 68aa 2e-07 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 47aa 2e-04 in ref transcript

SCYL3

  • refseq_SCYL3.F1 refseq_SCYL3.R1 236 398
  • NCBIGene 36.3 57147
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181093

  • cd S_TKc 211aa 5e-08 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 220aa 2e-07 in ref transcript
    • Protein kinase domain.

SDCCAG3

  • refseq_SDCCAG3.F1 refseq_SDCCAG3.R1 209 278
  • NCBIGene 36.3 10807
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039707

  • pfam SMC_N 162aa 4e-06 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.

SDF4

  • refseq_SDF4.F2 refseq_SDF4.R2 198 314
  • NCBIGene 36.3 51150
  • Alternative 3-prime, size difference: 116
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_016176

  • cd EFh 61aa 0.002 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! cd EFh 54aa 0.003 in ref transcript
  • Changed! COG FRQ1 89aa 0.009 in modified transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

SDHC

  • refseq_SDHC.F1 refseq_SDHC.R1 116 218
  • NCBIGene 36.3 6391
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003001

  • Changed! cd SQR_TypeC_SdhC 119aa 5e-24 in ref transcript
    • Succinate:quinone oxidoreductase (SQR) Type C subfamily, Succinate dehydrogenase C (SdhC) subunit; composed of bacterial SdhC and eukaryotic large cytochrome b binding (CybL) proteins. SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol. Members of this family reduce high potential quinones such as ubiquinone. SQR is also called succinate dehydrogenase or Complex II, and is part of the citric acid cycle and the aerobic respiratory chain. SQR is composed of a flavoprotein catalytic subunit, an iron-sulfur protein and one or two hydrophobic transmembrane subunits. Proteins in this subfamily are classified as Type C SQRs because they contain two transmembrane subunits and one heme group. The heme and quinone binding sites reside in the transmembrane subunits. The SdhC or CybL protein is one of the two transmembrane subunits of bacterial and eukaryotic SQRs. The two-electron oxidation of succinate in the flavoprotein active site is coupled to the two-electron reduction of quinone in the membrane anchor subunits via electron transport through FAD and three iron-sulfur centers. The reversible reduction of quinone is an essential feature of respiration, allowing transfer of electrons between respiratory complexes.
  • Changed! pfam Sdh_cyt 105aa 2e-22 in ref transcript
    • Succinate dehydrogenase cytochrome b subunit.
  • Changed! COG SdhC 103aa 7e-12 in ref transcript
    • Succinate dehydrogenase/fumarate reductase, cytochrome b subunit [Energy production and conversion].
  • Changed! cd SQR_TypeC_SdhC 116aa 3e-18 in modified transcript
  • Changed! pfam Sdh_cyt 100aa 4e-16 in modified transcript
  • Changed! COG SdhC 101aa 2e-07 in modified transcript

SDHC

  • refseq_SDHC.F4 refseq_SDHC.R4 214 378
  • NCBIGene 36.3 6391
  • Single exon skipping, size difference: 164
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003001

  • Changed! cd SQR_TypeC_SdhC 119aa 5e-24 in ref transcript
    • Succinate:quinone oxidoreductase (SQR) Type C subfamily, Succinate dehydrogenase C (SdhC) subunit; composed of bacterial SdhC and eukaryotic large cytochrome b binding (CybL) proteins. SQR catalyzes the oxidation of succinate to fumarate coupled to the reduction of quinone to quinol. Members of this family reduce high potential quinones such as ubiquinone. SQR is also called succinate dehydrogenase or Complex II, and is part of the citric acid cycle and the aerobic respiratory chain. SQR is composed of a flavoprotein catalytic subunit, an iron-sulfur protein and one or two hydrophobic transmembrane subunits. Proteins in this subfamily are classified as Type C SQRs because they contain two transmembrane subunits and one heme group. The heme and quinone binding sites reside in the transmembrane subunits. The SdhC or CybL protein is one of the two transmembrane subunits of bacterial and eukaryotic SQRs. The two-electron oxidation of succinate in the flavoprotein active site is coupled to the two-electron reduction of quinone in the membrane anchor subunits via electron transport through FAD and three iron-sulfur centers. The reversible reduction of quinone is an essential feature of respiration, allowing transfer of electrons between respiratory complexes.
  • Changed! pfam Sdh_cyt 105aa 2e-22 in ref transcript
    • Succinate dehydrogenase cytochrome b subunit.
  • Changed! COG SdhC 103aa 7e-12 in ref transcript
    • Succinate dehydrogenase/fumarate reductase, cytochrome b subunit [Energy production and conversion].
  • Changed! cd SQR_TypeC_SdhC 30aa 4e-08 in modified transcript
  • Changed! pfam Sdh_cyt 31aa 3e-08 in modified transcript
  • Changed! COG SdhC 29aa 5e-04 in modified transcript

SEC13L1

  • refseq_SEC13L1.F2 refseq_SEC13L1.R2 154 188
  • NCBIGene 36.2 6396
  • Single exon skipping, size difference: 34
  • Exclusion of the protein initiation site
  • Reference transcript: NM_030673

  • cd WD40 280aa 3e-23 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • smart WD40 38aa 0.001 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • pfam WD40 42aa 0.002 in ref transcript
    • WD domain, G-beta repeat.
  • COG COG2319 296aa 1e-16 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

SEC23B

  • refseq_SEC23B.F1 refseq_SEC23B.R1 102 120
  • NCBIGene 36.3 10483
  • Alternative 5-prime, size difference: 18
  • Exclusion in 5'UTR
  • Reference transcript: NM_032985

  • cd Sec23-like 265aa 1e-125 in ref transcript
    • Sec23-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 23 is very similar to Sec24. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup lack the consensus MIDAS motif but have the overall Para-Rossmann type fold that is characteristic of this superfamily.
  • pfam Sec23_trunk 267aa 5e-81 in ref transcript
    • Sec23/Sec24 trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
  • pfam Sec23_helical 101aa 3e-36 in ref transcript
    • Sec23/Sec24 helical domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.
  • pfam Sec23_BS 104aa 1e-23 in ref transcript
    • Sec23/Sec24 beta-sandwich domain.
  • pfam zf-Sec23_Sec24 42aa 2e-13 in ref transcript
    • Sec23/Sec24 zinc finger. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.
  • pfam Gelsolin 87aa 1e-11 in ref transcript
    • Gelsolin repeat.
  • COG SEC23 758aa 0.0 in ref transcript
    • Vesicle coat complex COPII, subunit SEC23 [Intracellular trafficking and secretion].

SEC24C

  • refseq_SEC24C.F1 refseq_SEC24C.R1 236 323
  • NCBIGene 36.3 9632
  • Single exon skipping, size difference: 87
  • Exclusion in 5'UTR
  • Reference transcript: NM_004922

  • cd Sec24-like 245aa 3e-93 in ref transcript
    • Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.
  • pfam Sec23_trunk 245aa 4e-96 in ref transcript
    • Sec23/Sec24 trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
  • pfam Sec23_helical 104aa 3e-29 in ref transcript
    • Sec23/Sec24 helical domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.
  • pfam Sec23_BS 84aa 2e-25 in ref transcript
    • Sec23/Sec24 beta-sandwich domain.
  • pfam zf-Sec23_Sec24 40aa 7e-15 in ref transcript
    • Sec23/Sec24 zinc finger. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.
  • pfam Gelsolin 31aa 7e-05 in ref transcript
    • Gelsolin repeat.
  • pfam PAT1 173aa 1e-04 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
  • COG COG5028 910aa 1e-132 in ref transcript
    • Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion].

SEC31A

  • refseq_SEC31L1.F1 refseq_SEC31L1.R1 118 235
  • NCBIGene 36.3 22872
  • Multiple exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077207

  • cd WD40 244aa 4e-22 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • COG COG2319 306aa 6e-15 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

SEMA6D

  • refseq_SEMA6D.F1 refseq_SEMA6D.R1 106 145
  • NCBIGene 36.3 80031
  • Alternative 3-prime, size difference: 39
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_153618

  • pfam Sema 418aa 1e-121 in ref transcript
    • Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
  • Changed! pfam PSI 45aa 7e-07 in ref transcript
    • Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
  • pfam DUF1043 111aa 0.007 in ref transcript
    • Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.
  • Changed! pfam PSI 35aa 1e-05 in modified transcript

SEMA6D

  • refseq_SEMA6D.F3 refseq_SEMA6D.R3 220 388
  • NCBIGene 36.3 80031
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153618

  • pfam Sema 418aa 1e-121 in ref transcript
    • Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
  • pfam PSI 45aa 7e-07 in ref transcript
    • Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
  • pfam DUF1043 111aa 0.007 in ref transcript
    • Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.

SEP15

  • refseq_SEP15.F1 refseq_SEP15.R1 160 210
  • NCBIGene 36.3 9403
  • Single exon skipping, size difference: 50
  • Exclusion in 3'UTR
  • Reference transcript: NM_004261

SEPN1

  • refseq_SEPN1.F1 refseq_SEPN1.R1 115 217
  • NCBIGene 36.3 57190
  • Single exon skipping, size difference: 102
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_206926

SEPT10

  • refseq_SEPT10.F1 refseq_SEPT10.R1 320 389
  • NCBIGene 36.3 151011
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144710

  • cd CDC_Septin 269aa 1e-112 in ref transcript
    • CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
  • pfam Septin 268aa 8e-90 in ref transcript
    • Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
  • COG CDC3 354aa 1e-81 in ref transcript
    • Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].

SEPT2

  • refseq_SEPT2.F2 refseq_SEPT2.R2 219 320
  • NCBIGene 36.3 4735
  • Single exon skipping, size difference: 101
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008491

  • cd CDC_Septin 275aa 1e-127 in ref transcript
    • CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
  • pfam Septin 280aa 1e-139 in ref transcript
    • Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
  • COG CDC3 312aa 1e-92 in ref transcript
    • Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].

SEPT2

  • refseq_SEPT2.F3 refseq_SEPT2.R1 128 213
  • NCBIGene 36.3 4735
  • Single exon skipping, size difference: 85
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008492

  • cd CDC_Septin 275aa 1e-127 in ref transcript
    • CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
  • pfam Septin 280aa 1e-139 in ref transcript
    • Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
  • COG CDC3 312aa 1e-92 in ref transcript
    • Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].

SEPT6

  • refseq_SEPT6.F1 refseq_SEPT6.R1 253 298
  • NCBIGene 36.3 23157
  • Single exon skipping, size difference: 45
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_145802

  • cd CDC_Septin 269aa 1e-117 in ref transcript
    • CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
  • pfam Septin 270aa 2e-88 in ref transcript
    • Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
  • COG CDC3 322aa 5e-77 in ref transcript
    • Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].

SERBP1

  • refseq_SERBP1.F1 refseq_SERBP1.R1 100 118
  • NCBIGene 36.3 26135
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001018067

  • Changed! pfam HABP4_PAI-RBP1 125aa 2e-16 in ref transcript
    • Hyaluronan / mRNA binding family. This family includes the HABP4 family of hyaluronan-binding proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1. HABP4 has been observed to bind hyaluronan (a glucosaminoglycan), but it is not known whether this is its primary role in vivo. It has also been observed to bind RNA, but with a lower affinity than that for hyaluronan. PAI-1 mRNA-binding protein specifically binds the mRNA of type-1 plasminogen activator inhibitor (PAI-1), and is thought to be involved in regulation of mRNA stability. However, in both cases, the sequence motifs predicted to be important for ligand binding are not conserved throughout the family, so it is not known whether members of this family share a common function.
  • Changed! pfam HABP4_PAI-RBP1 119aa 5e-18 in modified transcript

SERPINA1

  • refseq_SERPINA1.F1 refseq_SERPINA1.R1 100 414
  • NCBIGene 36.3 5265
  • Multiple exon skipping, size difference: 314
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001002236

  • cd alpha-1-antitrypsin_like 358aa 1e-154 in ref transcript
    • alpha-1-antitrypsin_like. This family contains a variety of different members of clade A of the serpin superfamily. They include the classical serine proteinase inhibitors, alpha-1-antitrypsin and alpha-1-antichymotrypsin, protein C inhibitor, kallistatin, and noninhibitory serpins, like corticosteroid and thyroxin binding globulins. In general, SERine Proteinase INhibitors (serpins) exhibit conformational polymorphism shifting from native to cleaved, latent, delta, or polymorphic forms. Many serpins, such as antitrypsin and antichymotrypsin, function as serine protease inhibitors which regulate blood coagulation cascades. Non-inhibitory serpins perform many diverse functions such as chaperoning proteins or transporting hormones. Serpins are of medical interest because mutants have been associated with blood clotting disorders, emphysema, cirrhosis, and dementia.
  • pfam Serpin 373aa 1e-158 in ref transcript
    • Serpin (serine protease inhibitor). Structure is a multi-domain fold containing a bundle of helices and a beta sandwich.
  • COG COG4826 359aa 2e-52 in ref transcript
    • Serine protease inhibitor [Posttranslational modification, protein turnover, chaperones].

SETD4

  • refseq_SETD4.F1 refseq_SETD4.R1 113 207
  • NCBIGene 36.2 54093
  • Single exon skipping, size difference: 94
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_017438

  • Changed! pfam Rubis-subs-bind 119aa 2e-21 in ref transcript
    • Rubisco LSMT substrate-binding. Members of this family adopt a multihelical structure, with an irregular array of long and short alpha-helices. They allow binding of the protein to substrate, such as the N-terminal tails of histones H3 and H4 and the large subunit of the Rubisco holoenzyme complex.

SETD4

  • refseq_SETD4.F3 refseq_SETD4.R3 150 199
  • NCBIGene 36.2 54093
  • Alternative 3-prime, size difference: 49
  • Inclusion in 5'UTR
  • Reference transcript: NM_017438

  • pfam Rubis-subs-bind 119aa 2e-21 in ref transcript
    • Rubisco LSMT substrate-binding. Members of this family adopt a multihelical structure, with an irregular array of long and short alpha-helices. They allow binding of the protein to substrate, such as the N-terminal tails of histones H3 and H4 and the large subunit of the Rubisco holoenzyme complex.

SETD5

  • refseq_SETD5.F2 refseq_SETD5.R2 181 220
  • NCBIGene 36.2 55209
  • Single exon skipping, size difference: 39
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: XM_931455

  • pfam SET 102aa 4e-21 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • COG COG2940 100aa 7e-05 in ref transcript
    • Proteins containing SET domain [General function prediction only].

SETD5

  • refseq_SETD5.F3 refseq_SETD5.R3 263 320
  • NCBIGene 36.2 55209
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_931455

  • pfam SET 102aa 4e-21 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • COG COG2940 100aa 7e-05 in ref transcript
    • Proteins containing SET domain [General function prediction only].

SETD5

  • refseq_SETD5.F5 refseq_SETD5.R5 148 373
  • NCBIGene 36.2 55209
  • Single exon skipping, size difference: 225
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_931455

  • Changed! pfam SET 102aa 4e-21 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • Changed! COG COG2940 100aa 7e-05 in ref transcript
    • Proteins containing SET domain [General function prediction only].

SEZ6L2

  • refseq_SEZ6L2.F1 refseq_SEZ6L2.R1 221 260
  • NCBIGene 36.3 26470
  • Single exon skipping, size difference: 39
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_201575

  • cd CUB 91aa 6e-11 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 69aa 7e-11 in ref transcript
  • cd CCP 57aa 4e-09 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CUB 113aa 3e-08 in ref transcript
  • cd CCP 59aa 3e-08 in ref transcript
  • cd CCP 60aa 2e-07 in ref transcript
  • cd CCP 56aa 2e-06 in ref transcript
  • pfam CUB 91aa 2e-08 in ref transcript
    • CUB domain.
  • smart CCP 56aa 3e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 56aa 5e-08 in ref transcript
  • pfam Sushi 58aa 1e-07 in ref transcript
    • Sushi domain (SCR repeat).
  • smart CCP 60aa 3e-07 in ref transcript
  • smart CUB 59aa 3e-06 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • smart CUB 102aa 5e-04 in ref transcript
  • pfam Sushi 57aa 0.003 in ref transcript

SF3A1

  • refseq_SF3A1.F1 refseq_SF3A1.R1 121 316
  • NCBIGene 36.3 10291
  • Alternative 5-prime and 3-prime, size difference: 195
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005877

  • cd SF3a120_C 76aa 4e-32 in ref transcript
    • SF3a120_C Mammalian splicing factor SF3a consists of three subunits of 60, 66, and 120 kDa and functions early during pre-mRNA splicing by converting the U2 snRNP to its active form. The 120kDa subunit (SF3a120) has a carboxy-terminal ubiquitin-like domain and two SWAP (suppressor-of-white-apricot) domains, referred to collectively as the SURP module, at its amino-terminus.
  • smart SWAP 54aa 4e-21 in ref transcript
    • Suppressor-of-White-APricot splicing regulator. domain present in regulators which are responsible for pre-mRNA splicing processes.
  • Changed! smart SWAP 54aa 3e-17 in ref transcript
  • pfam ubiquitin 67aa 1e-11 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
  • PTZ PTZ00044 69aa 0.002 in ref transcript
    • ubiquitin; Provisional.
  • Changed! smart SWAP 47aa 4e-14 in modified transcript

SFI1

  • refseq_SFI1.F1 refseq_SFI1.R1 253 346
  • NCBIGene 36.3 9814
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001007467

  • Changed! pfam Sfi1 551aa 3e-05 in ref transcript
    • Sfi1 spindle body protein. This is a family of fungal spindle pole body proteins that play a role in spindle body duplication. They contain binding sites for calmodulin-like proteins called centrins which are present in microtubule-organising centres.
  • Changed! pfam Sfi1 568aa 3e-07 in modified transcript

SFMBT1

  • refseq_SFMBT1.F1 refseq_SFMBT1.R1 205 324
  • NCBIGene 36.3 51460
  • Single exon skipping, size difference: 119
  • Exclusion in 5'UTR
  • Reference transcript: NM_001005158

  • cd SAM 63aa 3e-07 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • smart MBT 98aa 1e-30 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • smart MBT 94aa 4e-28 in ref transcript
  • smart MBT 91aa 2e-23 in ref transcript
  • smart MBT 102aa 1e-21 in ref transcript
  • smart SAM 65aa 7e-06 in ref transcript
    • Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.

SFRS5

  • refseq_SFRS5.F1 refseq_SFRS5.R1 228 357
  • NCBIGene 36.3 6430
  • Alternative 5-prime, size difference: 129
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039465

  • cd RRM 67aa 6e-12 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 68aa 3e-08 in ref transcript
  • pfam RRM_1 63aa 4e-12 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
  • smart RRM_2 68aa 8e-09 in ref transcript
    • RNA recognition motif.
  • TIGR PABP-1234 168aa 3e-07 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • COG COG0724 72aa 2e-06 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 69aa 0.001 in ref transcript

SFXN4

  • refseq_SFXN4.F2 refseq_SFXN4.R2 271 392
  • NCBIGene 36.2 119559
  • Single exon skipping, size difference: 121
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_213649

  • Changed! TIGR mtc 298aa 2e-27 in ref transcript
    • The MTC family consists of a limited number of homologues, all from eukaryotes. A single member of the family has been functionally characterized, the tricarboxylate carrier from rat liver mitochondria. The rat liver mitochondrial tricarboxylate carrier has been reported to transport citrate, cis-aconitate, threo-D-isocitrate, D- and L-tartrate, malate, succinate and phosphoenolpyruvate. It presumably functions by a proton symport mechanism.
  • Changed! TIGR mtc 271aa 2e-25 in modified transcript

SFXN4

  • refseq_SFXN4.F3 refseq_SFXN4.R3 103 130
  • NCBIGene 36.2 119559
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_213649

  • Changed! TIGR mtc 298aa 2e-27 in ref transcript
    • The MTC family consists of a limited number of homologues, all from eukaryotes. A single member of the family has been functionally characterized, the tricarboxylate carrier from rat liver mitochondria. The rat liver mitochondrial tricarboxylate carrier has been reported to transport citrate, cis-aconitate, threo-D-isocitrate, D- and L-tartrate, malate, succinate and phosphoenolpyruvate. It presumably functions by a proton symport mechanism.
  • Changed! TIGR mtc 289aa 1e-26 in modified transcript

SGK3

  • refseq_SGK3.F2 refseq_SGK3.R2 162 258
  • NCBIGene 36.3 23678
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001033578

  • Changed! cd STKc_SGK3 325aa 0.0 in ref transcript
    • STKc_SGK3: Serine/Threonine Kinases (STKs), Serum- and Glucocorticoid-induced Kinase (SGK) subfamily, SGK3 isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The SGK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. There are three isoforms of SGK, named SGK1, SGK2, and SGK3 (also called cytokine-independent survival kinase CISK). SGK3 is expressed in most tissues and is most abundant in the embryo and adult heart and spleen. It was originally discovered in a screen for antiapoptotic genes. It phosphorylates and inhibits the proapoptotic proteins, Bad and FKHRL1. SGK3 also regulates many transporters, ion channels, and receptors. It plays a critical role in hair follicle morphogenesis and hair cycling.
  • cd PX_CISK 109aa 9e-53 in ref transcript
    • The phosphoinositide binding Phox Homology Domain of Cytokine-Independent Survival Kinase. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Cytokine-independent survival kinase (CISK), also called Serum- and Glucocorticoid-induced Kinase 3 (SGK3), plays a role in cell growth and survival. It is expressed in most tissues and is most abundant in the embryo and adult heart and spleen. It was originally discovered in a screen for antiapoptotic genes. It phosphorylates and inhibits the proapoptotic proteins, Bad and FKHRL1. CISK/SGK3 also regulates many transporters, ion channels, and receptors. It plays a critical role in hair follicle morphogenesis and hair cycling. N-terminal to a catalytic kinase domain, CISK contains a PX domain which binds highly phosphorylated PIs, directs membrane localization, and regulates the enzyme's activity.
  • Changed! smart S_TKc 248aa 7e-79 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam PX 104aa 7e-19 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • smart S_TK_X 67aa 9e-10 in ref transcript
    • Extension to Ser/Thr-type protein kinases.
  • Changed! PTZ PTZ00263 293aa 9e-76 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • COG COG5391 113aa 1e-08 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
  • Changed! cd STKc_SGK3 293aa 1e-158 in modified transcript
  • Changed! smart S_TKc 216aa 3e-58 in modified transcript
  • Changed! PTZ PTZ00263 261aa 2e-57 in modified transcript

SH3BP5

  • refseq_SH3BP5.F1 refseq_SH3BP5.R1 134 202
  • NCBIGene 36.3 9467
  • Alternative 3-prime, size difference: 68
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004844

  • Changed! pfam SH3BP5 233aa 3e-82 in ref transcript
    • SH3 domain-binding protein 5 (SH3BP5). This family consists of several eukaryotic SH3 domain-binding protein 5 or c-Jun N-terminal kinase (JNK)-interacting proteins (SH3BP5 or Sab). Sab binds to and serves as a substrate for JNK in vitro, and has been found to interact with the Src homology 3 (SH3) domain of Bruton's tyrosine kinase (Btk). Inspection of the sequence of Sab reveals the presence of two putative mitogen-activated protein kinase interaction motifs (KIMs) similar to that found in the JNK docking domain of the c-Jun transcription factor, and four potential serine-proline JNK phosphorylation sites in the C-terminal half of the molecule.
  • Changed! PRK PRK10929 193aa 0.008 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! pfam SH3BP5 27aa 9e-05 in modified transcript

SHANK2

  • refseq_SHANK2.F1 refseq_SHANK2.R1 100 121
  • NCBIGene 36.3 22941
  • Single exon skipping, size difference: 21
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_012309

  • cd SAM 62aa 2e-16 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • cd PDZ_signaling 94aa 3e-15 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • pfam SAM_1 61aa 1e-18 in ref transcript
    • SAM domain (Sterile alpha motif). It has been suggested that SAM is an evolutionarily conserved protein binding domain that is involved in the regulation of numerous developmental processes in diverse eukaryotes. The SAM domain can potentially function as a protein interaction module through its ability to homo- and heterooligomerise with other SAM domains.
  • smart PDZ 95aa 8e-13 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.

SHOX2

  • refseq_SHOX2.F2 refseq_SHOX2.R2 186 222
  • NCBIGene 36.3 6474
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006884

  • cd homeodomain 59aa 2e-15 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 57aa 8e-22 in ref transcript
    • Homeobox domain.
  • COG COG5576 60aa 1e-09 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

PUF60

  • refseq_SIAHBP1.F1 refseq_SIAHBP1.R1 138 189
  • NCBIGene 36.3 22827
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_078480

  • cd RRM 75aa 3e-17 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 75aa 1e-15 in ref transcript
  • cd RRM 74aa 3e-05 in ref transcript
  • Changed! TIGR half-pint 295aa 1e-132 in ref transcript
    • In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.
  • TIGR half-pint 127aa 2e-59 in ref transcript
  • TIGR half-pint 56aa 7e-13 in ref transcript
  • Changed! COG COG0724 258aa 1e-16 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! TIGR half-pint 278aa 1e-130 in modified transcript
  • Changed! COG COG0724 160aa 3e-15 in modified transcript

SIGLEC6

  • refseq_SIGLEC6.F1 refseq_SIGLEC6.R1 130 306
  • NCBIGene 36.3 946
  • Multiple exon skipping, size difference: 176
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001245

  • cd IGcam 80aa 8e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam I-set 83aa 8e-08 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam V-set 105aa 3e-07 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • pfam C2-set_2 74aa 0.003 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

SIGLEC6

  • refseq_SIGLEC6.F4 refseq_SIGLEC6.R4 228 276
  • NCBIGene 36.3 946
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001245

  • cd IGcam 80aa 8e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam I-set 83aa 8e-08 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam V-set 105aa 3e-07 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • pfam C2-set_2 74aa 0.003 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

SIGLEC7

  • refseq_SIGLEC7.F1 refseq_SIGLEC7.R1 206 485
  • NCBIGene 36.3 27036
  • Single exon skipping, size difference: 279
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014385

  • cd IGcam 74aa 1e-04 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • smart IG_like 78aa 2e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam V-set 99aa 1e-07 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • Changed! pfam C2-set_2 74aa 7e-05 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

SIL1

  • refseq_SIL1.F1 refseq_SIL1.R1 151 239
  • NCBIGene 36.3 64374
  • Single exon skipping, size difference: 88
  • Exclusion in 5'UTR
  • Reference transcript: NM_001037633

SIP1

  • refseq_SIP1.F1 refseq_SIP1.R1 107 166
  • NCBIGene 36.3 8487
  • Single exon skipping, size difference: 59
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003616

  • Changed! pfam SIP1 247aa 5e-97 in ref transcript
    • Survival motor neuron (SMN) interacting protein 1 (SIP1). Survival motor neuron (SMN) interacting protein 1 (SIP1) interacts with SMN protein and plays a crucial role in the biogenesis of spliceosomes. There is evidence that the protein is linked to spinal muscular atrophy (SMA) and amyotrophic lateral sclerosis(ALS) in humans.
  • Changed! pfam SIP1 221aa 2e-85 in modified transcript

SIP1

  • refseq_SIP1.F3 refseq_SIP1.R3 216 261
  • NCBIGene 36.3 8487
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003616

  • Changed! pfam SIP1 247aa 5e-97 in ref transcript
    • Survival motor neuron (SMN) interacting protein 1 (SIP1). Survival motor neuron (SMN) interacting protein 1 (SIP1) interacts with SMN protein and plays a crucial role in the biogenesis of spliceosomes. There is evidence that the protein is linked to spinal muscular atrophy (SMA) and amyotrophic lateral sclerosis(ALS) in humans.
  • Changed! pfam SIP1 232aa 1e-91 in modified transcript

SIRPB2

  • refseq_SIRPB2.F1 refseq_SIRPB2.R1 106 400
  • NCBIGene 36.2 284759
  • Alternative 5-prime, size difference: 294
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_209363

  • cd IGcam 84aa 5e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam V-set 107aa 3e-10 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • Changed! pfam V-set 113aa 7e-10 in ref transcript

SIRPG

  • refseq_SIRPG.F2 refseq_SIRPG.R2 103 436
  • NCBIGene 36.3 55423
  • Single exon skipping, size difference: 333
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018556

  • Changed! cd IGc 87aa 2e-06 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGc 89aa 5e-05 in ref transcript
  • cd IGv 98aa 7e-05 in ref transcript
    • Immunoglobulin domain variable region (v) subfamily; members of the IGv subfamily are components of immunoglobulins and T-cell receptors. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. Within the variable domain, there are regions of even more variability called the hypervariable or complementarity-determining regions (CDRs) which are responsible for antigen binding. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! pfam V-set 90aa 3e-11 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • smart IGc1 73aa 5e-07 in ref transcript
    • Immunoglobulin C-Type.
  • Changed! smart IGc1 70aa 9e-06 in ref transcript
  • Changed! pfam V-set 103aa 8e-12 in modified transcript

SIRT2

  • refseq_SIRT2.F1 refseq_SIRT2.R1 183 230
  • NCBIGene 36.3 22933
  • Single exon skipping, size difference: 47
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_012237

  • Changed! cd SIRT1 255aa 1e-108 in ref transcript
    • SIRT1: Eukaryotic group (class1) which includes human sirtuins SIRT1-3 and yeast Hst1-4; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, and life span. The most-studied function, gene silencing, involves the inactivation of chromosome domains containing key regulatory genes by packaging them into a specialized chromatin structure that is inaccessible to DNA-binding proteins. The nuclear SIRT1 has been shown to target the p53 tumor suppressor protein for deacetylation to suppress DNA damage, and the cytoplasmic SIRT2 homolog has been shown to target alpha-tubulin for deacetylation for the maintenance of cell integrity.
  • Changed! pfam SIR2 182aa 3e-69 in ref transcript
    • Sir2 family. This region is characteristic of Silent information regulator 2 (Sir2) proteins, or sirtuins. These are protein deacetylases that depend on nicotine adenine dinucleotide (NAD). They are found in many subcellular locations, including the nucleus, cytoplasm and mitochondria. Eukaryotic forms play in important role in the regulation of transcriptional repression. Moreover, they are involved in microtubule organisation and DNA damage repair processes.
  • Changed! COG SIR2 206aa 3e-48 in ref transcript
    • NAD-dependent protein deacetylases, SIR2 family [Transcription].

SIRT3

  • refseq_SIRT3.F1 refseq_SIRT3.R1 138 284
  • NCBIGene 36.3 23410
  • Alternative 5-prime, size difference: 146
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_012239

  • Changed! cd SIRT1 236aa 2e-94 in ref transcript
    • SIRT1: Eukaryotic group (class1) which includes human sirtuins SIRT1-3 and yeast Hst1-4; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, and life span. The most-studied function, gene silencing, involves the inactivation of chromosome domains containing key regulatory genes by packaging them into a specialized chromatin structure that is inaccessible to DNA-binding proteins. The nuclear SIRT1 has been shown to target the p53 tumor suppressor protein for deacetylation to suppress DNA damage, and the cytoplasmic SIRT2 homolog has been shown to target alpha-tubulin for deacetylation for the maintenance of cell integrity.
  • Changed! pfam SIR2 179aa 3e-56 in ref transcript
    • Sir2 family. This region is characteristic of Silent information regulator 2 (Sir2) proteins, or sirtuins. These are protein deacetylases that depend on nicotine adenine dinucleotide (NAD). They are found in many subcellular locations, including the nucleus, cytoplasm and mitochondria. Eukaryotic forms play in important role in the regulation of transcriptional repression. Moreover, they are involved in microtubule organisation and DNA damage repair processes.
  • Changed! COG SIR2 253aa 4e-47 in ref transcript
    • NAD-dependent protein deacetylases, SIR2 family [Transcription].

SIVA1

  • refseq_SIVA.F1 refseq_SIVA.R1 118 313
  • NCBIGene 36.3 10572
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006427

  • Changed! pfam Siva 175aa 4e-84 in ref transcript
    • Cd27 binding protein (Siva). Siva binds to the CD27 cytoplasmic tail. It has a DD homology region, a box-B-like ring finger, and a zinc finger-like domain. Overexpression of Siva in various cell lines induces apoptosis, suggesting an important role for Siva in the CD27-transduced apoptotic pathway. Siva-1 binds to and inhibits BCL-X(L)-mediated protection against UV radiation-induced apoptosis. Indeed, the unique amphipathic helical region (SAH) present in Siva-1 is required for its binding to BCL-X(L) and sensitising cells to UV radiation. Natural complexes of Siva-1/BCL-X(L) are detected in HUT78 and murine thymocyte, suggesting a potential role for Siva-1 in regulating T cell homeostasis. This family contains both Siva-1 and the shorter Siva-2 lacking the sequence coded by exon 2. It has been suggested that Siva-2 could regulate the function of Siva-1.
  • Changed! pfam Siva 71aa 2e-29 in modified transcript

SKIP

  • refseq_SKIP__51763.F1 refseq_SKIP__51763.R1 171 402
  • NCBIGene 36.3 51763
  • Single exon skipping, size difference: 231
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016532

  • Changed! smart IPPc 314aa 3e-82 in ref transcript
    • Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes.
  • Changed! pfam CALCOCO1 70aa 8e-04 in ref transcript
    • Calcium binding and coiled-coil domain (CALCOCO1) like. Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
  • Changed! COG COG5411 277aa 2e-31 in ref transcript
    • Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms].

SLA2

  • refseq_SLA2.F1 refseq_SLA2.R1 149 199
  • NCBIGene 36.3 84174
  • Alternative 3-prime, size difference: 50
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032214

  • Changed! cd SH2 99aa 2e-18 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • cd SH3 54aa 0.004 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam SH2 83aa 1e-17 in ref transcript
    • SH2 domain.
  • pfam SH3_1 55aa 3e-05 in ref transcript
    • SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
  • Changed! cd SH2 85aa 6e-16 in modified transcript

SEPSECS

  • refseq_SLA_LP.F2 refseq_SLA_LP.R2 123 278
  • NCBIGene 36.3 51091
  • Single exon skipping, size difference: 155
  • Exclusion of the protein initiation site
  • Reference transcript: NM_153825

  • cd AAT_like 182aa 5e-04 in ref transcript
    • Aspartate aminotransferase family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). Pyridoxal phosphate combines with an alpha-amino acid to form a compound called a Schiff base or aldimine intermediate, which depending on the reaction, is the substrate in four kinds of reactions (1) transamination (movement of amino groups), (2) racemization (redistribution of enantiomers), (3) decarboxylation (removing COOH groups), and (4) various side-chain reactions depending on the enzyme involved. Pyridoxal phosphate (PLP) dependent enzymes were previously classified into alpha, beta and gamma classes, based on the chemical characteristics (carbon atom involved) of the reaction they catalyzed. The availability of several structures allowed a comprehensive analysis of the evolutionary classification of PLP dependent enzymes, and it was found that the functional classification did not always agree with the evolutionary history of these enzymes. The major groups in this CD corresponds to Aspartate aminotransferase a, b and c, Tyrosine, Alanine, Aromatic-amino-acid, Glutamine phenylpyruvate, 1-Aminocyclopropane-1-carboxylate synthase, Histidinol-phosphate, gene products of malY and cobC, Valine-pyruvate aminotransferase and Rhizopine catabolism regulatory protein.
  • Changed! TIGR selenium_SpcS 398aa 0.0 in ref transcript
    • In the archaea and eukaryotes, the conversion of the mischarged serine to selenocysteine (Sec) on its tRNA is accomplished in two steps. This enzyme, O-phosphoseryl-tRNA(Sec) selenium transferase, acts second, after a phosphophorylation step catalyzed by a homolog of the bacterial SelA protein.
  • COG GadB 342aa 3e-16 in ref transcript
    • Glutamate decarboxylase and related PLP-dependent proteins [Amino acid transport and metabolism].
  • Changed! TIGR selenium_SpcS 369aa 0.0 in modified transcript

SLC22A1

  • refseq_SLC22A1.F1 refseq_SLC22A1.R1 149 262
  • NCBIGene 36.3 6580
  • Single exon skipping, size difference: 113
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003057

  • cd MFS 166aa 2e-16 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • Changed! cd MFS 143aa 4e-08 in ref transcript
  • Changed! TIGR 2A0119 513aa 1e-180 in ref transcript
  • COG AraJ 124aa 5e-09 in ref transcript
    • Arabinose efflux permease [Carbohydrate transport and metabolism].
  • Changed! PRK xylE 301aa 3e-07 in ref transcript
    • D-xylose transporter XylE; Provisional.
  • Changed! cd MFS 124aa 1e-05 in modified transcript
  • Changed! TIGR 2A0119 454aa 1e-166 in modified transcript
  • Changed! PRK xylE 241aa 1e-07 in modified transcript

SLC22A12

  • refseq_SLC22A12.F2 refseq_SLC22A12.R2 197 352
  • NCBIGene 36.3 116085
  • Single exon skipping, size difference: 155
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_144585

  • Changed! cd MFS 158aa 1e-13 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • Changed! cd MFS 141aa 0.004 in ref transcript
  • Changed! TIGR 2A0119 516aa 1e-103 in ref transcript
  • Changed! COG AraJ 93aa 6e-05 in ref transcript
    • Arabinose efflux permease [Carbohydrate transport and metabolism].
  • Changed! TIGR 2A0119 159aa 2e-22 in modified transcript

SLC22A17

  • refseq_SLC22A17.F1 refseq_SLC22A17.R1 306 360
  • NCBIGene 36.3 51310
  • Alternative 5-prime, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020372

  • Changed! cd MFS 75aa 0.004 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • Changed! TIGR 2A0119 374aa 2e-21 in ref transcript
  • Changed! PRK xylE 158aa 4e-06 in ref transcript
    • D-xylose transporter XylE; Provisional.
  • Changed! cd MFS 118aa 2e-04 in modified transcript
  • Changed! TIGR 2A0119 356aa 1e-23 in modified transcript
  • Changed! PRK xylE 213aa 7e-08 in modified transcript

SLC22A6

  • refseq_SLC22A6.F1 refseq_SLC22A6.R1 121 253
  • NCBIGene 36.3 9356
  • Alternative 3-prime, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004790

  • cd MFS 123aa 6e-14 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • Changed! cd MFS 171aa 2e-09 in ref transcript
  • Changed! TIGR 2A0119 505aa 0.0 in ref transcript
  • Changed! PRK xylE 429aa 3e-08 in ref transcript
    • D-xylose transporter XylE; Provisional.
  • Changed! cd MFS 122aa 0.003 in modified transcript
  • Changed! TIGR 2A0119 461aa 1e-172 in modified transcript
  • Changed! COG AraJ 67aa 3e-07 in modified transcript
    • Arabinose efflux permease [Carbohydrate transport and metabolism].
  • Changed! PRK PRK03893 128aa 1e-05 in modified transcript
    • putative sialic acid transporter; Provisional.
  • Changed! PRK xylE 353aa 5e-05 in modified transcript

SLC22A6

  • refseq_SLC22A6.F2 refseq_SLC22A6.R2 149 188
  • NCBIGene 36.3 9356
  • Alternative 5-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004790

  • cd MFS 123aa 6e-14 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • cd MFS 171aa 2e-09 in ref transcript
  • TIGR 2A0119 505aa 0.0 in ref transcript
  • Changed! PRK xylE 429aa 3e-08 in ref transcript
    • D-xylose transporter XylE; Provisional.
  • Changed! PRK xylE 430aa 1e-07 in modified transcript

SLC24A4

  • refseq_SLC24A4.F1 refseq_SLC24A4.R1 100 157
  • NCBIGene 36.3 123041
  • Alternative 5-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153646

  • TIGR 2A1904 173aa 3e-59 in ref transcript
  • TIGR 2A1904 193aa 2e-45 in ref transcript
  • COG ECM27 132aa 1e-15 in ref transcript
    • Ca2+/Na+ antiporter [Inorganic ion transport and metabolism].
  • COG ECM27 151aa 2e-15 in ref transcript

SLC25A25

  • refseq_SLC25A25.F2 refseq_SLC25A25.R2 183 226
  • NCBIGene 36.3 114789
  • Alternative 5-prime, size difference: 43
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_052901

  • Changed! cd EFh 60aa 4e-09 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! pfam Mito_carr 93aa 7e-24 in ref transcript
    • Mitochondrial carrier protein.
  • Changed! pfam Mito_carr 89aa 2e-23 in ref transcript
  • Changed! pfam Mito_carr 92aa 1e-16 in ref transcript
  • Changed! PTZ PTZ00169 269aa 1e-29 in ref transcript
    • ADP/ATP transporter on adenylate translocase; Provisional.
  • Changed! COG FRQ1 113aa 9e-11 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

SLC25A26

  • refseq_SLC25A26.F1 refseq_SLC25A26.R1 349 434
  • NCBIGene 36.2 115286
  • Single exon skipping, size difference: 85
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_173471

  • Changed! pfam Mito_carr 83aa 4e-11 in ref transcript
    • Mitochondrial carrier protein.
  • Changed! pfam Mito_carr 89aa 2e-10 in ref transcript
  • Changed! PTZ PTZ00168 168aa 1e-17 in ref transcript
    • mitochondrial carrier protein; Provisional.
  • Changed! pfam Mito_carr 61aa 8e-08 in modified transcript
  • Changed! PTZ PTZ00168 61aa 4e-06 in modified transcript

SLC25A29

  • refseq_SLC25A29.F2 refseq_SLC25A29.R2 246 290
  • NCBIGene 36.2 123096
  • Single exon skipping, size difference: 44
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_152333

SLC25A3

  • refseq_SLC25A3.F1 refseq_SLC25A3.R1 100 391
  • NCBIGene 36.2 5250
  • Alternative 5-prime, size difference: 291
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_005888

  • Changed! pfam Mito_carr 75aa 2e-16 in ref transcript
    • Mitochondrial carrier protein.
  • Changed! pfam Mito_carr 83aa 6e-16 in ref transcript
  • Changed! pfam Mito_carr 78aa 2e-06 in ref transcript
  • Changed! PTZ PTZ00169 259aa 4e-08 in ref transcript
    • ADP/ATP transporter on adenylate translocase; Provisional.

SLC26A1

  • refseq_SLC26A1.F2 refseq_SLC26A1.R2 104 319
  • NCBIGene 36.3 10861
  • Single exon skipping, size difference: 215
  • Exclusion in 5'UTR
  • Reference transcript: NM_213613

  • cd STAS_SulP_like_sulfate_transporter 142aa 1e-14 in ref transcript
    • Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
  • TIGR sulP 629aa 1e-145 in ref transcript
    • (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3- (out).
  • COG SUL1 640aa 3e-54 in ref transcript
    • Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].

SLC26A10

  • refseq_SLC26A10.F1 refseq_SLC26A10.R1 305 400
  • NCBIGene 36.2 65012
  • Single exon skipping, size difference: 107
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_133489

  • Changed! cd STAS_SulP_like_sulfate_transporter 123aa 4e-08 in ref transcript
    • Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
  • Changed! TIGR sulP 496aa 2e-49 in ref transcript
    • (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3- (out).
  • Changed! COG SUL1 374aa 8e-31 in ref transcript
    • Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].
  • Changed! TIGR sulP 395aa 1e-46 in modified transcript
  • Changed! COG SUL1 397aa 1e-29 in modified transcript

SLC26A10

  • refseq_SLC26A10.F3 refseq_SLC26A10.R4 155 262
  • NCBIGene 36.2 65012
  • Single exon skipping, size difference: 107
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_133489

  • Changed! cd STAS_SulP_like_sulfate_transporter 123aa 4e-08 in ref transcript
    • Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
  • Changed! TIGR sulP 496aa 2e-49 in ref transcript
    • (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3- (out).
  • Changed! COG SUL1 374aa 8e-31 in ref transcript
    • Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].
  • Changed! TIGR sulP 395aa 1e-46 in modified transcript
  • Changed! COG SUL1 397aa 1e-29 in modified transcript

SLC26A8

  • refseq_SLC26A8.F1 refseq_SLC26A8.R1 101 416
  • NCBIGene 36.3 116369
  • Multiple exon skipping, size difference: 315
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_052961

  • cd STAS_SulP_like_sulfate_transporter 58aa 1e-08 in ref transcript
    • Sulphate Transporter and Anti-Sigma factor antagonist domain of SulP-like sulfate transporters, plays a role in the function and regulation of the transport activity, proposed general NTP binding function. The SulP family is a large and diverse family of anion transporters, with members from eubacteria, plants, fungi, and mammals. They contain 10 to 14 transmembrane helices which form the catalytic core of the protein and a C-terminal extension, the STAS (Sulphate Transporter and AntiSigma factor antagonist) domain which plays a role in the function and regulation of the transport activity. The STAS domain is found in the C-terminal region of sulphate transporters and bacterial anti-sigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
  • Changed! TIGR sulP 502aa 2e-71 in ref transcript
    • (2) SO42- (out) + nHCO3- (in) SO42- (in) + nHCO3- (out).
  • pfam STAS 72aa 2e-06 in ref transcript
    • STAS domain. The STAS (after Sulphate Transporter and AntiSigma factor antagonist) domain is found in the C terminal region of Sulphate transporters and bacterial antisigma factor antagonists. It has been suggested that this domain may have a general NTP binding function.
  • Changed! COG SUL1 504aa 3e-53 in ref transcript
    • Sulfate permease and related transporters (MFS superfamily) [Inorganic ion transport and metabolism].
  • COG SpoIIAA 61aa 0.009 in ref transcript
    • Anti-anti-sigma regulatory factor (antagonist of anti-sigma factor) [Signal transduction mechanisms].
  • Changed! TIGR sulP 288aa 2e-41 in modified transcript
  • Changed! TIGR sulP 134aa 4e-14 in modified transcript
  • Changed! COG SUL1 399aa 8e-32 in modified transcript

SLC2A11

  • refseq_SLC2A11.F1 refseq_SLC2A11.R1 119 230
  • NCBIGene 36.3 66035
  • Single exon skipping, size difference: 111
  • Exclusion of the protein initiation site
  • Reference transcript: NM_030807

  • cd MFS 119aa 1e-04 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • cd MFS 181aa 0.002 in ref transcript
  • Changed! pfam Sugar_tr 454aa 2e-63 in ref transcript
    • Sugar (and other) transporter.
  • PRK xylE 378aa 6e-19 in ref transcript
    • D-xylose transporter XylE; Provisional.
  • Changed! pfam Sugar_tr 445aa 3e-62 in modified transcript

SLC30A10

  • refseq_SLC30A10.F1 refseq_SLC30A10.R1 104 142
  • NCBIGene 36.2 55532
  • Alternative 3-prime, size difference: 38
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_018713

  • Changed! TIGR CDF 153aa 2e-28 in ref transcript
    • This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
  • TIGR CDF 94aa 1e-18 in ref transcript
  • Changed! COG CzcD 152aa 9e-30 in ref transcript
    • Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism].
  • COG CzcD 102aa 5e-23 in ref transcript

SLC30A2

  • refseq_SLC30A2.F1 refseq_SLC30A2.R1 143 290
  • NCBIGene 36.3 7780
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001004434

  • Changed! TIGR CDF 275aa 2e-69 in ref transcript
    • This model describes a broadly distributed family of transporters, a number of which have been shown to transport divalent cations of cobalt, cadmium and/or zinc. The family has six predicted transmembrane domains. Members of the family are variable in length because of variably sized inserts, often containing low-complexity sequence.
  • Changed! COG CzcD 304aa 4e-53 in ref transcript
    • Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism].
  • Changed! TIGR CDF 218aa 2e-46 in modified transcript
  • Changed! COG CzcD 216aa 2e-32 in modified transcript

SLC37A3

  • refseq_SLC37A3.F1 refseq_SLC37A3.R1 101 403
  • NCBIGene 36.3 84255
  • Multiple exon skipping, size difference: 302
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_207113

  • cd MFS 144aa 6e-13 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • Changed! cd MFS 185aa 3e-09 in ref transcript
  • Changed! TIGR 2A0104 371aa 3e-24 in ref transcript
  • Changed! COG UhpC 466aa 3e-42 in ref transcript
    • Sugar phosphate permease [Carbohydrate transport and metabolism].
  • Changed! TIGR 2A0104 258aa 2e-17 in modified transcript
  • Changed! COG UhpC 328aa 4e-29 in modified transcript

SLC3A2

  • refseq_SLC3A2.F1 refseq_SLC3A2.R1 268 361
  • NCBIGene 36.3 6520
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012661

  • TIGR trehalose_treC 136aa 4e-16 in ref transcript
    • Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
  • pfam Alpha-amylase 244aa 8e-14 in ref transcript
    • Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain.
  • TIGR treS_nterm 120aa 3e-08 in ref transcript
    • Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PMID:15378530).
  • COG AmyA 340aa 3e-18 in ref transcript
    • Glycosidases [Carbohydrate transport and metabolism].

SLC41A3

  • refseq_SLC41A3.F1 refseq_SLC41A3.R1 287 395
  • NCBIGene 36.3 54946
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008485

  • Changed! pfam MgtE 136aa 2e-14 in ref transcript
    • Divalent cation transporter. This region is the integral membrane part of the eubacterial MgtE family of magnesium transporters. Related regions are found also in archaebacterial and eukaryotic proteins. All the archaebacterial and eukaryotic examples have two copies of the region. This suggests that the eubacterial examples may act as dimers. Members of this family probably transport Mg2+ or other divalent cations into the cell. The alignment contains two highly conserved aspartates that may be involved in cation binding (Bateman A unpubl.).
  • pfam MgtE 132aa 0.007 in ref transcript
  • COG COG1824 170aa 1e-05 in ref transcript
    • Permease, similar to cation transporters [Inorganic ion transport and metabolism].
  • Changed! COG COG1824 171aa 1e-05 in ref transcript
  • Changed! pfam MgtE 65aa 3e-07 in modified transcript
  • Changed! COG COG1824 55aa 0.010 in modified transcript

SLC4A3

  • refseq_SLC4A3.F1 refseq_SLC4A3.R1 189 270
  • NCBIGene 36.3 6508
  • Alternative 3-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201574

  • TIGR ae 910aa 0.0 in ref transcript
    • They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.

SLC4A5

  • refseq_SLC4A5.F2 refseq_SLC4A5.R2 229 277
  • NCBIGene 36.3 57835
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021196

  • Changed! TIGR ae 957aa 0.0 in ref transcript
    • They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
  • Changed! TIGR ae 941aa 0.0 in modified transcript

SLC6A20

  • refseq_SLC6A20.F1 refseq_SLC6A20.R1 243 354
  • NCBIGene 36.3 54716
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020208

  • Changed! pfam SNF 576aa 1e-123 in ref transcript
    • Sodium:neurotransmitter symporter family.
  • Changed! COG COG0733 300aa 3e-35 in ref transcript
    • Na+-dependent transporters of the SNF family [General function prediction only].
  • COG COG0733 164aa 6e-08 in ref transcript
  • Changed! pfam SNF 539aa 1e-102 in modified transcript
  • Changed! COG COG0733 117aa 2e-20 in modified transcript
  • Changed! COG COG0733 68aa 2e-07 in modified transcript

SLC6A9

  • refseq_SLC6A9.F2 refseq_SLC6A9.R2 241 403
  • NCBIGene 36.3 6536
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201649

  • pfam SNF 551aa 0.0 in ref transcript
    • Sodium:neurotransmitter symporter family.
  • COG COG0733 495aa 1e-55 in ref transcript
    • Na+-dependent transporters of the SNF family [General function prediction only].

SMARCA1

  • refseq_SMARCA1.F1 refseq_SMARCA1.R1 157 220
  • NCBIGene 36.2 6594
  • Alternative 5-prime and 3-prime, size difference: 63
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003069

  • cd HELICc 144aa 1e-21 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • Changed! cd DEXDc 140aa 4e-19 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • cd SANT 40aa 6e-04 in ref transcript
    • 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
  • Changed! pfam SNF2_N 271aa 1e-108 in ref transcript
    • SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
  • pfam SLIDE 110aa 4e-42 in ref transcript
    • SLIDE. The SLIDE domain adopts a secondary structure comprising a main core of three alpha-helices. It has a role in DNA binding, contacting DNA target sites similar to c-Myb (pfam00249) repeats or homeodomains.
  • pfam HAND 81aa 3e-26 in ref transcript
    • HAND. The HAND domain adopts a secondary structure consisting of four alpha helices, three of which (H2, H3, H4) form an L-like configuration. Helix H2 runs antiparallel to helices H3 and H4, packing closely against helix H4, whilst helix H1 reposes in the concave surface formed by these three helices and runs perpendicular to them. The domain confers DNA and nucleosome binding properties to the protein.
  • smart HELICc 97aa 1e-18 in ref transcript
    • helicase superfamily c-terminal domain.
  • smart SANT 42aa 9e-05 in ref transcript
    • SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
  • Changed! COG HepA 493aa 3e-86 in ref transcript
    • Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
  • Changed! cd DEXDc 116aa 1e-12 in modified transcript
  • Changed! pfam SNF2_N 250aa 2e-93 in modified transcript
  • Changed! COG HepA 472aa 3e-76 in modified transcript

SMARCA2

  • refseq_SMARCA2.F1 refseq_SMARCA2.R1 238 292
  • NCBIGene 36.3 6595
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003070

  • Changed! cd Bromo_SNF2L2 125aa 6e-47 in ref transcript
    • Bromodomain, SNF2L2-like subfamily, specific to animals. SNF2L2 (SNF2-alpha) or SWI/SNF-related matrix-associated actin-dependent regulator of chromatin subfamily A member 2 is a global transcriptional activator, which cooperates with nuclear hormone receptors to boost transcriptional activation. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd HELICc 133aa 2e-20 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • cd DEXDc 140aa 3e-18 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • pfam SNF2_N 286aa 3e-95 in ref transcript
    • SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
  • Changed! smart BROMO 126aa 8e-24 in ref transcript
    • bromo domain.
  • pfam Helicase_C 81aa 5e-18 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • smart HSA 58aa 5e-13 in ref transcript
    • domain in helicases and associated with SANT domains.
  • pfam BRK 45aa 4e-12 in ref transcript
    • BRK domain. The function of this domain is unknown. It is often found associated with helicases and transcription factors.
  • pfam QLQ 37aa 4e-10 in ref transcript
    • QLQ. The QLQ domain is named after the conserved Gln, Leu, Gln motif. The QLQ domain is found at the N-terminus of SWI2/SNF2 protein, which has been shown to be involved in protein-protein interactions. This domain has thus been postulated to be involved in mediating protein interactions.
  • COG HepA 502aa 2e-82 in ref transcript
    • Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
  • Changed! COG COG5076 91aa 4e-12 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • Changed! cd Bromo_SNF2L2 107aa 3e-50 in modified transcript
  • Changed! smart BROMO 108aa 4e-26 in modified transcript
  • Changed! COG COG5076 122aa 6e-14 in modified transcript

SMARCB1

  • refseq_SMARCB1.F2 refseq_SMARCB1.R2 133 160
  • NCBIGene 36.3 6598
  • Alternative 5-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003073

  • pfam SNF5 219aa 5e-93 in ref transcript
    • SNF5 / SMARCB1 / INI1. SNF5 is a component of the yeast SWI/SNF complex, which is an ATP-dependent nucleosome-remodelling complex that regulates the transcription of a subset of yeast genes. SNF5 is a key component of all SWI/SNF-class complexes characterised so far. This family consists of the conserved region of SNF5, including a direct repeat motif. SNF5 is essential for the assembly promoter targeting and chromatin remodelling activity of the SWI-SNF complex. SNF5 is also known as SMARCB1, for SWI/SNF-related, matrix-associated, actin-dependent regulator of chromatin, subfamily b, member 1, and also INI1 for integrase interactor 1. Loss-of function mutations in SNF5 are thought to contribute to oncogenesis in malignant rhabdoid tumours (MRTs).

SMARCC2

  • refseq_SMARCC2.F1 refseq_SMARCC2.R1 137 230
  • NCBIGene 36.3 6601
  • Single exon skipping, size difference: 93
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_003075

  • pfam SWIRM 85aa 3e-29 in ref transcript
    • SWIRM domain. This SWIRM domain is a small alpha-helical domain of about 85 amino acid residues found in chromosomal proteins. It contains a helix-turn helix motif and binds to DNA.
  • smart CHROMO 36aa 0.001 in ref transcript
    • Chromatin organization modifier domain.
  • Changed! COG RSC8 193aa 2e-49 in ref transcript
    • RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription].
  • Changed! COG RSC8 114aa 4e-17 in ref transcript
  • COG RSC8 42aa 3e-04 in ref transcript
  • COG MDN1 173aa 0.007 in ref transcript
    • AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].
  • Changed! COG RSC8 381aa 2e-67 in modified transcript

SMARCC2

  • refseq_SMARCC2.F3 refseq_SMARCC2.R3 112 457
  • NCBIGene 36.3 6601
  • Alternative 5-prime, size difference: 345
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003075

  • pfam SWIRM 85aa 3e-29 in ref transcript
    • SWIRM domain. This SWIRM domain is a small alpha-helical domain of about 85 amino acid residues found in chromosomal proteins. It contains a helix-turn helix motif and binds to DNA.
  • smart CHROMO 36aa 0.001 in ref transcript
    • Chromatin organization modifier domain.
  • COG RSC8 193aa 2e-49 in ref transcript
    • RSC chromatin remodeling complex subunit RSC8 [Chromatin structure and dynamics / Transcription].
  • COG RSC8 114aa 4e-17 in ref transcript
  • COG RSC8 42aa 3e-04 in ref transcript
  • COG MDN1 173aa 0.007 in ref transcript
    • AAA ATPase containing von Willebrand factor type A (vWA) domain [General function prediction only].

SMARCD1

  • refseq_SMARCD1.F1 refseq_SMARCD1.R1 199 322
  • NCBIGene 36.3 6602
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003076

  • smart SWIB 77aa 3e-24 in ref transcript
    • SWI complex, BAF60b domains.
  • Changed! pfam Bindin 60aa 0.009 in ref transcript
    • Bindin.
  • COG COG5531 204aa 4e-15 in ref transcript
    • SWIB-domain-containing proteins implicated in chromatin remodeling [Chromatin structure and dynamics].

SMC4

  • refseq_SMC4.F2 refseq_SMC4.R2 220 295
  • NCBIGene 36.2 10051
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001002800

  • cd ABC_SMC4_euk 160aa 1e-51 in ref transcript
    • Eukaryotic SMC4 proteins; SMC proteins are large (approximately 110 to 170 kDa), and each is arranged into five recognizable domains. Amino-acid sequence homology of SMC proteins between species is largely confined to the amino- and carboxy-terminal globular domains. The amino-terminal domain contains a 'Walker A' nucleotide-binding domain (GxxGxGKS/T, in the single-letter amino-acid code), which by mutational studies has been shown to be essential in several proteins. The carboxy-terminal domain contains a sequence (the DA-box) that resembles a 'Walker B' motif, and a motif with homology to the signature sequence of the ATP-binding cassette (ABC) family of ATPases. The sequence homology within the carboxy-terminal domain is relatively high within the SMC1-SMC4 group, whereas SMC5 and SMC6 show some divergence in both of these sequences. In eukaryotic cells, the proteins are found as heterodimers of SMC1 paired with SMC3, SMC2 with SMC4, and SMC5 with SMC6 (formerly known as Rad18).
  • cd ABC_SMC4_euk 95aa 9e-51 in ref transcript
  • pfam SMC_N 1192aa 0.0 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • COG Smc 1201aa 1e-123 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

SMG7

  • refseq_SMG7.F1 refseq_SMG7.R1 256 394
  • NCBIGene 36.3 9887
  • Alternative 5-prime, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_173156

  • cd TPR 79aa 0.003 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • pfam EST1 119aa 6e-25 in ref transcript
    • Telomerase activating protein Est1. Est1 is a protein which recruits or activates telomerase at the site of polymerisation.

SMG7

  • refseq_SMG7.F3 refseq_SMG7.R3 101 133
  • NCBIGene 36.3 9887
  • Single exon skipping, size difference: 32
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_173156

  • Changed! cd TPR 79aa 0.003 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • Changed! pfam EST1 119aa 6e-25 in ref transcript
    • Telomerase activating protein Est1. Est1 is a protein which recruits or activates telomerase at the site of polymerisation.

SMN2

  • refseq_SMN2.F1 refseq_SMN2.R1 132 186
  • NCBIGene 36.3 6607
  • Single exon skipping, size difference: 54
  • Exclusion of the stop codon
  • Reference transcript: NM_017411

  • cd TUDOR 48aa 2e-08 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • pfam SMN 166aa 3e-40 in ref transcript
    • Survival motor neuron protein (SMN). This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.
  • Changed! pfam SMN 35aa 2e-11 in ref transcript
  • Changed! pfam SMN 27aa 2e-08 in modified transcript

SMOX

  • refseq_SMOX.F1 refseq_SMOX.R1 263 353
  • NCBIGene 36.3 54498
  • Single exon skipping, size difference: 90
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_175839

  • Changed! pfam Amino_oxidase 500aa 1e-53 in ref transcript
    • Flavin containing amine oxidoreductase. This family consists of various amine oxidases, including maze polyamine oxidase (PAO) and various flavin containing monoamine oxidases (MAO). The aligned region includes the flavin binding site of these enzymes. The family also contains phytoene dehydrogenases and related enzymes. In vertebrates MAO plays an important role regulating the intracellular levels of amines via there oxidation; these include various neurotransmitters, neurotoxins and trace amines. In lower eukaryotes such as aspergillus and in bacteria the main role of amine oxidases is to provide a source of ammonium. PAOs in plants, bacteria and protozoa oxidase spermidine and spermine to an aminobutyral, diaminopropane and hydrogen peroxide and are involved in the catabolism of polyamines. Other members of this family include tryptophan 2-monooxygenase, putrescine oxidase, corticosteroid binding proteins and antibacterial glycoproteins.
  • Changed! COG COG1231 241aa 1e-08 in ref transcript
    • Monoamine oxidase [Amino acid transport and metabolism].
  • COG COG1233 72aa 3e-04 in ref transcript
    • Phytoene dehydrogenase and related proteins [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam Amino_oxidase 530aa 9e-51 in modified transcript
  • Changed! COG COG1231 181aa 5e-06 in modified transcript

SMPD1

  • refseq_SMPD1.F2 refseq_SMPD1.R2 217 349
  • NCBIGene 36.3 6609
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 132
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000543

  • smart SapB 74aa 1e-06 in ref transcript
    • Saposin (B) Domains. Present in multiple copies in prosaposin and in pulmonary surfactant-associated protein B. In plant aspartic proteinases, a saposin domain is circularly permuted. This causes the prediction algorithm to predict two such domains, where only one is truly present.
  • Changed! pfam Metallophos 97aa 0.006 in ref transcript
    • Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacterial SbcD or yeast MRE11. The most conserved regions in this superfamily centre around the metal chelating residues.
  • COG COG0535 101aa 0.003 in ref transcript
    • Predicted Fe-S oxidoreductases [General function prediction only].
  • Changed! pfam Metallophos 151aa 1e-06 in modified transcript

SMPD4

  • refseq_SMPD4.F1 refseq_SMPD4.R1 118 256
  • NCBIGene 36.2 55627
  • Alternative 3-prime, size difference: 138
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_017751

SMPD4

  • refseq_SMPD4.F4 refseq_SMPD4.R4 300 387
  • NCBIGene 36.3 55627
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017951

SMTN

  • refseq_SMTN.F2 refseq_SMTN.R2 186 282
  • NCBIGene 36.3 6525
  • Alternative 5-prime, size difference: 96
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_134270

  • Changed! cd CH 128aa 1e-19 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • Changed! pfam CH 128aa 6e-23 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • Changed! COG SAC6 92aa 1e-12 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • Changed! cd CH 101aa 9e-21 in modified transcript
  • Changed! pfam CH 101aa 9e-24 in modified transcript
  • Changed! COG SAC6 102aa 2e-14 in modified transcript

SMURF1

  • refseq_SMURF1.F2 refseq_SMURF1.R2 270 348
  • NCBIGene 36.3 57154
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020429

  • cd HECTc 356aa 1e-129 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • cd C2 105aa 8e-12 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd WW 30aa 2e-06 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd WW 31aa 1e-04 in ref transcript
  • pfam HECT 306aa 1e-125 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • pfam C2 85aa 7e-15 in ref transcript
    • C2 domain.
  • smart WW 32aa 7e-08 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • smart WW 32aa 1e-05 in ref transcript
  • Changed! COG HUL4 530aa 1e-133 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG COG5038 89aa 0.001 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
  • Changed! COG HUL4 504aa 1e-133 in modified transcript
  • Changed! COG HUL4 301aa 3e-04 in modified transcript

SNAP23

  • refseq_SNAP23.F1 refseq_SNAP23.R1 188 347
  • NCBIGene 36.3 8773
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003825

  • cd t_SNARE 60aa 3e-07 in ref transcript
    • Soluble NSF (N-ethylmaleimide-sensitive fusion protein)-Attachment protein (SNAP) REceptor domain; these alpha-helical motifs form twisted and parallel heterotetrameric helix bundles; the core complex contains one helix from a protein that is anchored in the vesicle membrane (synaptobrevin), one helix from a protein of the target membrane (syntaxin), and two helices from another protein anchored in the target membrane (SNAP-25); their interaction forms a core which is composed of a polar zero layer, a flanking leucine-zipper layer acts as a water tight shield to isolate ionic interactions in the zero layer from the surrounding solvent.
  • Changed! cd t_SNARE 59aa 0.006 in ref transcript
  • pfam SNARE 58aa 5e-09 in ref transcript
    • SNARE domain. Most if not all vesicular membrane fusion events in eukaryotic cells are believed to be mediated by a conserved fusion machinery, the SNARE [soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptors] machinery. The SNARE domain is thought to act as a protein-protein interaction module in the assembly of a SNARE protein complex.
  • Changed! pfam SNAP-25 62aa 2e-08 in ref transcript
    • SNAP-25 family. SNAP-25 (synaptosome-associated protein 25 kDa) proteins are components of SNARE complexes. Members of this family contain a cluster of cysteine residues that can be palmitoylated for membrane attachment.
  • smart t_SNARE 66aa 2e-05 in ref transcript
    • Helical region found in SNAREs. All alpha-helical motifs that form twisted and parallel four-helix bundles in target soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptor proteins. This motif found in "Q-SNAREs".

SNCA

  • refseq_SNCA.F2 refseq_SNCA.R2 283 367
  • NCBIGene 36.3 6622
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000345

  • Changed! pfam Synuclein 132aa 8e-26 in ref transcript
    • Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid plaques in Alzheimer's patients.
  • Changed! pfam Synuclein 107aa 1e-17 in modified transcript

SNCB

  • refseq_SNCB.F1 refseq_SNCB.R1 204 361
  • NCBIGene 36.3 6620
  • Single exon skipping, size difference: 157
  • Exclusion in 5'UTR
  • Reference transcript: NM_001001502

  • pfam Synuclein 92aa 8e-26 in ref transcript
    • Synuclein. There are three types of synucleins in humans, these are called alpha, beta and gamma. Alpha synuclein has been found mutated in families with autosomal dominant Parkinson's disease. A peptide of alpha synuclein has also been found in amyloid plaques in Alzheimer's patients.

SNRP70

  • refseq_SNRP70.F2 refseq_SNRP70.R2 109 181
  • NCBIGene 36.2 6625
  • Single exon skipping, size difference: 72
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_003089

  • Changed! cd RRM 74aa 5e-17 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM 71aa 2e-16 in ref transcript
    • RNA recognition motif.
  • Changed! COG COG0724 91aa 1e-10 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! cd RRM 53aa 3e-12 in modified transcript
  • Changed! smart RRM 51aa 1e-11 in modified transcript
  • Changed! COG COG0724 68aa 8e-08 in modified transcript

SNRPB

  • refseq_SNRPB.F1 refseq_SNRPB.R1 160 306
  • NCBIGene 36.3 6628
  • Alternative 3-prime, size difference: 146
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_198216

  • cd Sm_B 79aa 2e-40 in ref transcript
    • The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit B heterodimerizes with subunit D3 and three such heterodimers form a hexameric ring structure with alternating B and D3 subunits. The D3 - B heterodimer also assembles into a heptameric ring containing D1, D2, E, F, and G subunits. Sm-like proteins exist in archaea as well as prokaryotes which form heptameric and hexameric ring structures similar to those found in eukaryotes.
  • smart Sm 76aa 2e-16 in ref transcript
    • snRNP Sm proteins. small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing.
  • COG LSM1 82aa 3e-08 in ref transcript
    • Small nuclear ribonucleoprotein (snRNP) homolog [Transcription].

SNRPB2

  • refseq_SNRPB2.F1 refseq_SNRPB2.R1 169 269
  • NCBIGene 36.3 6629
  • Alternative 5-prime, size difference: 100
  • Exclusion in 5'UTR
  • Reference transcript: NM_003092

  • cd RRM 76aa 2e-11 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 60aa 5e-08 in ref transcript
  • smart RRM_2 75aa 3e-11 in ref transcript
    • RNA recognition motif.
  • pfam RRM_1 60aa 3e-10 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
  • COG COG0724 95aa 5e-07 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 89aa 0.002 in ref transcript

SNRPD2

  • refseq_SNRPD2.F1 refseq_SNRPD2.R1 157 272
  • NCBIGene 36.3 6633
  • Single exon skipping, size difference: 115
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_004597

  • Changed! cd Sm_D2 88aa 2e-35 in ref transcript
    • The eukaryotic Sm and Sm-like (LSm) proteins associate with RNA to form core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet. Sm subunit D2 heterodimerizes with subunit D1 and three such heterodimers form a hexameric ring structure with alternating D1 and D2 subunits. The D1 - D2 heterodimer also assembles into a heptameric ring containing D2, D3, E, F, and G subunits. Sm-like proteins exist in archaea as well as prokaryotes which form heptameric and hexameric ring structures similar to those found in eukaryotes.
  • Changed! smart Sm 72aa 1e-10 in ref transcript
    • snRNP Sm proteins. small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing.
  • Changed! COG LSM1 82aa 2e-07 in ref transcript
    • Small nuclear ribonucleoprotein (snRNP) homolog [Transcription].

SNX1

  • refseq_SNX1.F1 refseq_SNX1.R1 301 496
  • NCBIGene 36.3 6642
  • Multiple exon skipping, size difference: 195
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003099

  • Changed! cd PX_SNX1 123aa 3e-64 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 1. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX1 is both membrane associated and a cytosolic protein that exists as a tetramer in protein complexes. It can associate reversibly with membranes of the endosomal compartment, thereby coating these vesicles. SNX1 is a component of the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi. The retromer consists of a cargo-recognition subcomplex and a subcomplex formed by a dimer of sorting nexins (SNX1 and/or SNX2), which ensures efficient cargo sorting by facilitating proper membrane localization of the cargo-recognition subcomplex. SNX1 contains a Bin/Amphiphysin/Rvs (BAR) domain C-terminal to the PX domain. The PX domain of SNX1 specifically binds phosphatidylinositol-3-phosphate (PI3P) and PI(3,5)P2, while the BAR domain detects membrane curvature. Both domains help determine the specific membrane-targeting of SNX1, which is localized to a microdomain in early endosomes where it regulates cation-independent mannose-6-phosphate receptor retrieval to the trans Golgi network.
  • pfam Vps5 234aa 6e-81 in ref transcript
    • Vps5 C terminal like. Vps5 is a sorting nexin that functions in membrane trafficking. This is the C terminal dimerisation domain.
  • Changed! smart PX 118aa 8e-24 in ref transcript
    • PhoX homologous domain, present in p47phox and p40phox. Eukaryotic domain of unknown function present in phox proteins, PLD isoforms, a PI3K isoform.
  • Changed! pfam Sorting_nexin 122aa 1e-22 in ref transcript
    • Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2.
  • Changed! COG COG5391 355aa 7e-08 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
  • Changed! cd PX_SNX1 113aa 2e-59 in modified transcript
  • Changed! smart PX 111aa 5e-21 in modified transcript
  • Changed! pfam Sorting_nexin 81aa 3e-14 in modified transcript
  • Changed! COG COG5391 392aa 9e-09 in modified transcript

SNX1

  • refseq_SNX1.F2 refseq_SNX1.R2 189 333
  • NCBIGene 36.3 6642
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003099

  • cd PX_SNX1 123aa 3e-64 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 1. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX1 is both membrane associated and a cytosolic protein that exists as a tetramer in protein complexes. It can associate reversibly with membranes of the endosomal compartment, thereby coating these vesicles. SNX1 is a component of the retromer complex, a membrane coat multimeric complex required for endosomal retrieval of lysosomal hydrolase receptors to the Golgi. The retromer consists of a cargo-recognition subcomplex and a subcomplex formed by a dimer of sorting nexins (SNX1 and/or SNX2), which ensures efficient cargo sorting by facilitating proper membrane localization of the cargo-recognition subcomplex. SNX1 contains a Bin/Amphiphysin/Rvs (BAR) domain C-terminal to the PX domain. The PX domain of SNX1 specifically binds phosphatidylinositol-3-phosphate (PI3P) and PI(3,5)P2, while the BAR domain detects membrane curvature. Both domains help determine the specific membrane-targeting of SNX1, which is localized to a microdomain in early endosomes where it regulates cation-independent mannose-6-phosphate receptor retrieval to the trans Golgi network.
  • Changed! pfam Vps5 234aa 6e-81 in ref transcript
    • Vps5 C terminal like. Vps5 is a sorting nexin that functions in membrane trafficking. This is the C terminal dimerisation domain.
  • smart PX 118aa 8e-24 in ref transcript
    • PhoX homologous domain, present in p47phox and p40phox. Eukaryotic domain of unknown function present in phox proteins, PLD isoforms, a PI3K isoform.
  • pfam Sorting_nexin 122aa 1e-22 in ref transcript
    • Sorting nexin, N-terminal domain. These proteins bins to the cytoplasmic domain of plasma membrane receptors. and are involved in endocytic protein trafficking. The N-terminal domain appears to be specific to sorting nexins 1 and 2.
  • Changed! COG COG5391 355aa 7e-08 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
  • Changed! pfam Vps5 186aa 2e-56 in modified transcript
  • Changed! COG COG5391 109aa 8e-07 in modified transcript

SNX11

  • refseq_SNX11.F1 refseq_SNX11.R1 114 174
  • NCBIGene 36.3 29916
  • Single exon skipping, size difference: 60
  • Exclusion in 5'UTR
  • Reference transcript: NM_152244

  • cd PX_SNX10 92aa 7e-37 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 10. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX10 may be involved in the regulation of endosome homeostasis. Its expression induces the formation of giant vacuoles in mammalian cells.
  • pfam PX 91aa 2e-20 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • COG COG5391 93aa 3e-09 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].

SNX14

  • refseq_SNX14.F1 refseq_SNX14.R1 136 163
  • NCBIGene 36.3 57231
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153816

  • cd PX_SNX14 123aa 3e-50 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 14. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX14 may be involved in recruiting other proteins to the membrane via protein-protein and protein-ligand interaction. It is expressed in the embryonic nervous system of mice, and is co-expressed in the motoneurons and the anterior pituary with Islet-1. SNX14 shows a similar domain architecture as SNX13, containing an N-terminal PXA domain, a regulator of G protein signaling (RGS) domain, a PX domain, and a C-terminal domain that is conserved in some SNXs.
  • pfam Nexin_C 106aa 5e-25 in ref transcript
    • Sorting nexin C terminal. This region is found a the C terminal of proteins belonging to the sorting nexin family. It is found on proteins which also contain pfam00787.
  • smart PXA 156aa 6e-22 in ref transcript
    • Domain associated with PX domains. unpubl. observations.
  • pfam PX 105aa 2e-17 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • smart RGS 128aa 2e-07 in ref transcript
    • Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
  • COG COG5391 140aa 0.002 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].

SNX15

  • refseq_SNX15.F1 refseq_SNX15.R1 141 399
  • NCBIGene 36.3 29907
  • Single exon skipping, size difference: 258
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013306

  • cd PX_SNX15 118aa 9e-51 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 15. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX15 contains an N-terminal PX domain and a C-terminal Microtubule Interacting and Trafficking (MIT) domain. It plays a role in protein trafficking processes in the endocytic pathway and the trans-Golgi network. The PX domain of SNX15 interacts with the PDGF receptor and is responsible for the membrane association of the protein.
  • Changed! cd MIT_SNX15 75aa 1e-19 in ref transcript
    • MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
  • Changed! smart MIT 77aa 4e-18 in ref transcript
    • Microtubule Interacting and Trafficking molecule domain.
  • pfam PX 112aa 2e-12 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • Changed! cd MIT_SNX15 74aa 2e-10 in modified transcript
  • Changed! smart MIT 74aa 9e-09 in modified transcript

SNX16

  • refseq_SNX16.F2 refseq_SNX16.R2 269 356
  • NCBIGene 36.3 64089
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152836

  • Changed! cd PX_SNX16 110aa 2e-57 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 16. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX16 contains a central PX domain followed by a coiled-coil region. SNX16 is localized in early and recycling endosomes through the binding of its PX domain to phosphatidylinositol-3-phosphate (PI3P). It plays a role in epidermal growth factor (EGF) signaling by regulating EGF receptor membrane trafficking.
  • Changed! pfam PX 89aa 3e-16 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • Changed! COG COG5391 148aa 7e-05 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
  • Changed! cd PX_SNX16 81aa 3e-36 in modified transcript
  • Changed! pfam PX 57aa 3e-07 in modified transcript

SNX7

  • refseq_SNX7.F2 refseq_SNX7.R2 222 375
  • NCBIGene 36.3 51375
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015976

  • cd PX_SNX7 116aa 2e-62 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 7. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX7 harbors a Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature, C-terminal to the PX domain, similar to the sorting nexins SNX1-2, SNX4-6, SNX8, SNX30, and SNX32. Both domains have been shown to determine the specific membrane-targeting of SNX1. The specific function of SNX7 has yet to be elucidated.
  • pfam PX 114aa 1e-21 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • Changed! COG COG5391 374aa 2e-12 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
  • Changed! COG COG5391 236aa 5e-12 in modified transcript

SNX7

  • refseq_SNX7.F3 refseq_SNX7.R3 283 403
  • NCBIGene 36.3 51375
  • Single exon skipping, size difference: 120
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015976

  • Changed! cd PX_SNX7 116aa 2e-62 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 7. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX7 harbors a Bin/Amphiphysin/Rvs (BAR) domain, which detects membrane curvature, C-terminal to the PX domain, similar to the sorting nexins SNX1-2, SNX4-6, SNX8, SNX30, and SNX32. Both domains have been shown to determine the specific membrane-targeting of SNX1. The specific function of SNX7 has yet to be elucidated.
  • Changed! pfam PX 114aa 1e-21 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • Changed! COG COG5391 374aa 2e-12 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].

SOCS4

  • refseq_SOCS4.F1 refseq_SOCS4.R1 236 365
  • NCBIGene 36.3 122809
  • Single exon skipping, size difference: 129
  • Exclusion in 5'UTR
  • Reference transcript: NM_199421

  • cd SOCS_SOCS4 56aa 8e-27 in ref transcript
    • SOCS (suppressors of cytokine signaling) box of SOCS4-like proteins. Together with CIS1, the CIS/SOCS family of proteins is characterized by the presence of a C-terminal SOCS box and a central SH2 domain. The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
  • cd SH2 96aa 1e-11 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • smart SH2 85aa 2e-14 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
  • smart SOCS 40aa 9e-09 in ref transcript
    • suppressors of cytokine signalling. suppressors of cytokine signalling.

SORBS1

  • refseq_SORBS1.F1 refseq_SORBS1.R1 182 389
  • NCBIGene 36.3 10580
  • Alternative 3-prime, size difference: 207
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034954

  • cd SH3 54aa 3e-12 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 52aa 8e-11 in ref transcript
  • cd SH3 52aa 6e-10 in ref transcript
  • smart SH3 59aa 4e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 53aa 3e-12 in ref transcript
  • smart SH3 59aa 7e-11 in ref transcript
  • pfam Sorb 41aa 4e-10 in ref transcript
    • Sorbin homologous domain.

SORBS1

  • refseq_SORBS1.F4 refseq_SORBS1.R4 195 363
  • NCBIGene 36.3 10580
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034954

  • cd SH3 54aa 3e-12 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 52aa 8e-11 in ref transcript
  • cd SH3 52aa 6e-10 in ref transcript
  • smart SH3 59aa 4e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 53aa 3e-12 in ref transcript
  • smart SH3 59aa 7e-11 in ref transcript
  • pfam Sorb 41aa 4e-10 in ref transcript
    • Sorbin homologous domain.

SORBS1

  • refseq_SORBS1.F6 refseq_SORBS1.R6 189 285
  • NCBIGene 36.3 10580
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034954

  • cd SH3 54aa 3e-12 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 52aa 8e-11 in ref transcript
  • cd SH3 52aa 6e-10 in ref transcript
  • smart SH3 59aa 4e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 53aa 3e-12 in ref transcript
  • smart SH3 59aa 7e-11 in ref transcript
  • pfam Sorb 41aa 4e-10 in ref transcript
    • Sorbin homologous domain.

SORBS1

  • refseq_SORBS1.F7 refseq_SORBS1.R7 130 157
  • NCBIGene 36.3 10580
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034954

  • cd SH3 54aa 3e-12 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 52aa 8e-11 in ref transcript
  • cd SH3 52aa 6e-10 in ref transcript
  • smart SH3 59aa 4e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 53aa 3e-12 in ref transcript
  • smart SH3 59aa 7e-11 in ref transcript
  • pfam Sorb 41aa 4e-10 in ref transcript
    • Sorbin homologous domain.

SORBS2

  • refseq_SORBS2.F1 refseq_SORBS2.R1 204 273
  • NCBIGene 36.3 8470
  • Alternative 3-prime, size difference: 69
  • Exclusion in 5'UTR
  • Reference transcript: NM_021069

  • cd SH3 52aa 8e-14 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 54aa 9e-12 in ref transcript
  • cd SH3 54aa 3e-11 in ref transcript
  • pfam Sorb 50aa 9e-21 in ref transcript
    • Sorbin homologous domain.
  • smart SH3 53aa 5e-16 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 59aa 2e-12 in ref transcript
  • smart SH3 59aa 2e-12 in ref transcript

SORBS2

  • refseq_SORBS2.F2 refseq_SORBS2.R2 217 358
  • NCBIGene 36.3 8470
  • Multiple exon skipping, size difference: 141
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_021069

  • cd SH3 52aa 8e-14 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 54aa 9e-12 in ref transcript
  • cd SH3 54aa 3e-11 in ref transcript
  • pfam Sorb 50aa 9e-21 in ref transcript
    • Sorbin homologous domain.
  • smart SH3 53aa 5e-16 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 59aa 2e-12 in ref transcript
  • smart SH3 59aa 2e-12 in ref transcript

SORCS1

  • refseq_SORCS1.F1 refseq_SORCS1.R1 268 377
  • NCBIGene 36.3 114815
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 109
  • Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001013031

  • smart VPS10 598aa 0.0 in ref transcript
    • VPS10 domain.
  • pfam PKD 80aa 8e-06 in ref transcript
    • PKD domain. This domain was first identified in the Polycystic kidney disease protein PKD1. This domain has been predicted to contain an Ig-like fold.

SOX6

  • refseq_SOX6.F1 refseq_SOX6.R1 227 269
  • NCBIGene 36.3 55553
  • Alternative 5-prime, size difference: 42
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_033326

  • cd SOX-TCF_HMG-box 64aa 1e-24 in ref transcript
    • SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
  • smart HMG 64aa 5e-16 in ref transcript
    • high mobility group.
  • COG NHP6B 52aa 2e-06 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

SOX6

  • refseq_SOX6.F4 refseq_SOX6.R4 137 260
  • NCBIGene 36.3 55553
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033326

  • cd SOX-TCF_HMG-box 64aa 1e-24 in ref transcript
    • SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
  • smart HMG 64aa 5e-16 in ref transcript
    • high mobility group.
  • COG NHP6B 52aa 2e-06 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

SOX6

  • refseq_SOX6.F5 refseq_SOX6.R5 185 245
  • NCBIGene 36.3 55553
  • Alternative 3-prime, size difference: 60
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_033326

  • cd SOX-TCF_HMG-box 64aa 1e-24 in ref transcript
    • SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
  • smart HMG 64aa 5e-16 in ref transcript
    • high mobility group.
  • COG NHP6B 52aa 2e-06 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

SP110

  • refseq_SP110.F2 refseq_SP110.R2 295 367
  • NCBIGene 36.3 3431
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080424

  • Changed! cd Bromo_SP100C_like 101aa 8e-39 in ref transcript
    • Bromodomain, SP100C_like subfamily. The SP100C protein is a splice variant of SP100, a major component of PML-SP100 nuclear bodies (NBs), which are poorly understood. It is covalently modified by SUMO-1 and may play a role in processes at the chromatin level. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • pfam SAND 82aa 7e-29 in ref transcript
    • SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation.
  • pfam Sp100 101aa 3e-26 in ref transcript
    • Sp100 domain. The function of this domain is unknown. It is about 105 amino acid residues in length and is predicted to be predominantly alpha helical. This domain is usually found at the amino terminus of protein that contain a SAND domain pfam01342.
  • Changed! smart BROMO 98aa 5e-09 in ref transcript
    • bromo domain.
  • Changed! cd Bromo_SP100C_like 110aa 2e-38 in modified transcript
  • Changed! smart BROMO 74aa 1e-06 in modified transcript

SPAST

  • refseq_SPAST.F1 refseq_SPAST.R1 231 327
  • NCBIGene 36.3 6683
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014946

  • cd MIT_spastin 80aa 2e-22 in ref transcript
    • MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
  • cd AAA 164aa 5e-21 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam AAA 186aa 3e-50 in ref transcript
    • ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes.
  • smart MIT 78aa 3e-13 in ref transcript
    • Microtubule Interacting and Trafficking molecule domain.
  • pfam Mg_chelatase 54aa 3e-05 in ref transcript
    • Magnesium chelatase, subunit ChlI. Magnesium-chelatase is a three-component enzyme that catalyses the insertion of Mg2+ into protoporphyrin IX. This is the first unique step in the synthesis of (bacterio)chlorophyll. Due to this, it is thought that Mg-chelatase has an important role in channelling inter- mediates into the (bacterio)chlorophyll branch in response to conditions suitable for photosynthetic growth. ChlI and BchD have molecular weight between 38-42 kDa.
  • COG SpoVK 268aa 7e-60 in ref transcript
    • ATPases of the AAA+ class [Posttranslational modification, protein turnover, chaperones].

SPATA7

  • refseq_SPATA7.F1 refseq_SPATA7.R1 110 206
  • NCBIGene 36.3 55812
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018418

SPHK1

  • refseq_SPHK1.F1 refseq_SPHK1.R1 107 470
  • NCBIGene 36.3 8877
  • Alternative 5-prime, size difference: 363
  • Exclusion of the protein initiation site
  • Reference transcript: NM_182965

  • pfam DAGK_cat 104aa 1e-19 in ref transcript
    • Diacylglycerol kinase catalytic domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues. YegS is the Escherichia coli protein in this family whose crystal structure reveals an active site in the inter-domain cleft formed by four conserved sequence motifs, revealing a novel metal-binding site. The residues of this site are conserved across the family.
  • COG LCB5 333aa 4e-18 in ref transcript
    • Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].

SPHK1

  • refseq_SPHK1.F3 refseq_SPHK1.R3 164 206
  • NCBIGene 36.3 8877
  • Alternative 3-prime, size difference: 42
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_182965

  • pfam DAGK_cat 104aa 1e-19 in ref transcript
    • Diacylglycerol kinase catalytic domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. The catalytic domain is assumed from the finding of bacterial homologues. YegS is the Escherichia coli protein in this family whose crystal structure reveals an active site in the inter-domain cleft formed by four conserved sequence motifs, revealing a novel metal-binding site. The residues of this site are conserved across the family.
  • COG LCB5 333aa 4e-18 in ref transcript
    • Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].

SPINT1

  • refseq_SPINT1.F1 refseq_SPINT1.R1 192 240
  • NCBIGene 36.3 6692
  • Alternative 5-prime, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181642

  • cd KU 52aa 8e-12 in ref transcript
    • BPTI/Kunitz family of serine protease inhibitors; Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure.
  • cd KU 54aa 1e-11 in ref transcript
  • cd LDLa 35aa 7e-06 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • pfam MANEC 92aa 3e-31 in ref transcript
    • MANEC domain. This region of similarity, comprising 8 conserved cysteines, is found in the N-terminal region of several membrane-associated and extracellular proteins. Although formerly called MANSC (for motif at N terminus with seven cysteines) it has now been renamed by MANEC (motif at N terminus with eight cysteines) by Richard Mitter and Stephen Fitzgerald after the discovery of an eighth conserved cysteine. It is postulated that this domain may play a role in the formation of protein complexes involving various protease activators and inhibitors.
  • pfam Kunitz_BPTI 53aa 3e-14 in ref transcript
    • Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.
  • pfam Kunitz_BPTI 52aa 4e-13 in ref transcript
  • smart LDLa 33aa 3e-06 in ref transcript
    • Low-density lipoprotein receptor domain class A. Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.

SPO11

  • refseq_SPO11.F2 refseq_SPO11.R2 263 377
  • NCBIGene 36.3 23626
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012444

  • cd TOPRIM_TopoIIB_SPO 163aa 2e-53 in ref transcript
    • TOPRIM_TopoIIB_SPO: topoisomerase-primase (TOPRIM) nucleotidyl transferase/hydrolase domain of the type found in the type IIB family of DNA topoisomerases and Spo11. This subgroup contains proteins similar to Sulfolobus shibatae topoisomerase VI (TopoVI) and Saccharomyces cerevisiae meiotic recombination factor: Spo11. Type II DNA topoisomerases catalyze the ATP-dependent transport of one DNA duplex through another, in the process generating transient double strand breaks via covalent attachments to both DNA strands at the 5' positions. TopoVI enzymes are heterotetramers found in archaea and plants. Spo11 plays a role in generating the double strand breaks that initiate homologous recombination during meiosis. S. shibatae TopoVI relaxes both positive and negative supercoils, and in addition has a strong decatenase activity. The TOPRIM domain has two conserved motifs, one of which centers at a conserved glutamate and the other one at two conserved aspartates (DxD. For topoisomerases the conserved glutamate is believed to act as a general base in strand joining and, as a general acid in strand cleavage. The DXD motif may co-ordinate Mg2+, a cofactor required for full catalytic function.
  • pfam SPO11_like 42aa 3e-19 in ref transcript
    • SPO11 homologue.
  • pfam TP6A_N 68aa 7e-19 in ref transcript
    • Type IIB DNA topoisomerase. Type II DNA topoisomerases are ubiquitous enzymes that catalyse the ATP-dependent transport of one DNA duplex through a second DNA segment via a transient double-strand break. Type II DNA topoisomerases are now subdivided into two sub-families, type IIA and IIB DNA topoisomerases. TP6A_N is present in type IIB topoisomerase and is thought to be involved in DNA binding owing to its sequence similarity to Escherichia coli catabolite activator protein (CAP).
  • Changed! PRK PRK04342 354aa 4e-65 in ref transcript
    • DNA topoisomerase VI subunit A; Provisional.
  • Changed! PRK PRK04342 291aa 2e-63 in modified transcript

SPOP

  • refseq_SPOP.F1 refseq_SPOP.R1 118 240
  • NCBIGene 36.3 8405
  • Single exon skipping, size difference: 122
  • Exclusion in 5'UTR
  • Reference transcript: NM_001007226

  • cd MATH_SPOP 139aa 8e-70 in ref transcript
    • Speckle-type POZ protein (SPOP) family, MATH domain; composed of proteins with similarity to human SPOP. SPOP was isolated as a novel antigen recognized by serum from a scleroderma patient, whose overexpression in COS cells results in a discrete speckled pattern in the nuclei. It contains an N-terminal MATH domain and a C-terminal BTB (also called POZ) domain. Together with Cul3, SPOP constitutes an ubiquitin E3 ligase which is able to ubiquitinate the PcG protein BMI1, the variant histone macroH2A1 and the death domain-associated protein Daxx. Therefore, SPOP may be involved in the regulation of these proteins and may play a role in transcriptional regulation, apoptosis and X-chromosome inactivation. Cul3 binds to the BTB domain of SPOP whereas Daxx and the macroH2A1 nonhistone region have been shown to bind to the MATH domain. Both MATH and BTB domains are necessary for the nuclear speckled accumulation of SPOP. There are many proteins, mostly uncharacterized, containing both MATH and BTB domains from C. elegans and plants which are excluded from this family.
  • pfam BTB 108aa 3e-21 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • smart MATH 102aa 8e-10 in ref transcript
    • meprin and TRAF homology.

SPOP

  • refseq_SPOP.F2 refseq_SPOP.R2 121 172
  • NCBIGene 36.3 8405
  • Single exon skipping, size difference: 51
  • Exclusion in 5'UTR
  • Reference transcript: NM_001007230

  • cd MATH_SPOP 139aa 8e-70 in ref transcript
    • Speckle-type POZ protein (SPOP) family, MATH domain; composed of proteins with similarity to human SPOP. SPOP was isolated as a novel antigen recognized by serum from a scleroderma patient, whose overexpression in COS cells results in a discrete speckled pattern in the nuclei. It contains an N-terminal MATH domain and a C-terminal BTB (also called POZ) domain. Together with Cul3, SPOP constitutes an ubiquitin E3 ligase which is able to ubiquitinate the PcG protein BMI1, the variant histone macroH2A1 and the death domain-associated protein Daxx. Therefore, SPOP may be involved in the regulation of these proteins and may play a role in transcriptional regulation, apoptosis and X-chromosome inactivation. Cul3 binds to the BTB domain of SPOP whereas Daxx and the macroH2A1 nonhistone region have been shown to bind to the MATH domain. Both MATH and BTB domains are necessary for the nuclear speckled accumulation of SPOP. There are many proteins, mostly uncharacterized, containing both MATH and BTB domains from C. elegans and plants which are excluded from this family.
  • pfam BTB 108aa 3e-21 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • smart MATH 102aa 8e-10 in ref transcript
    • meprin and TRAF homology.

SREBF1

  • refseq_SREBF1.F1 refseq_SREBF1.R1 182 272
  • NCBIGene 36.3 6720
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005291

  • cd HLH 58aa 9e-12 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam HLH 51aa 2e-12 in ref transcript
    • Helix-loop-helix DNA-binding domain.

SS18

  • refseq_SS18.F1 refseq_SS18.R1 115 208
  • NCBIGene 36.3 6760
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001007559

  • pfam SSXT 66aa 4e-20 in ref transcript
    • SSXT protein (N-terminal region). The SSXT or SS18 protein is involved in synovial sarcoma in humans. A SYT-SSX fusion gene resulting from the chromosomal translocation t(X;18) (p11;q11) is characteristic of synovial sarcomas. This translocation fuses the SSXT (SYT) gene from chromosome 18 to either of two homologous genes at Xp11, SSX1 or SSX2.

SSBP3

  • refseq_SSBP3.F1 refseq_SSBP3.R1 126 207
  • NCBIGene 36.3 23648
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145716

  • pfam SSDP 144aa 2e-06 in ref transcript
    • Single-stranded DNA binding protein, SSDP. This is a family of eukaryotic single-stranded DNA binding proteins with specificity to a pyrimidine-rich element found in the promoter region of the alpha2(I) collagen gene.

SSBP4

  • refseq_SSBP4.F1 refseq_SSBP4.R1 100 166
  • NCBIGene 36.3 170463
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032627

  • pfam SSDP 149aa 7e-15 in ref transcript
    • Single-stranded DNA binding protein, SSDP. This is a family of eukaryotic single-stranded DNA binding proteins with specificity to a pyrimidine-rich element found in the promoter region of the alpha2(I) collagen gene.
  • Changed! pfam PAT1 157aa 3e-07 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
  • Changed! PRK PRK07764 149aa 1e-04 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.
  • Changed! PRK PRK07764 137aa 5e-04 in modified transcript

SSH3

  • refseq_SSH3.F1 refseq_SSH3.R1 246 344
  • NCBIGene 36.2 54961
  • Alternative 3-prime, size difference: 98
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017857

  • cd DSPc 136aa 2e-37 in ref transcript
    • Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
  • smart DSPc 135aa 3e-36 in ref transcript
    • Dual specificity phosphatase, catalytic domain.
  • pfam DEK_C 54aa 4e-12 in ref transcript
    • DEK C terminal domain. DEK is a chromatin associated protein that is linked with cancers and autoimmune disease. This domain is found at the C terminal of DEK and is of clinical importance since it can reverse the characteristic abnormal DNA-mutagen sensitivity in fibroblasts from ataxia-telangiectasia (A-T) patients. The structure of this domain shows it to be homologous to the E2F/DP transcription factor family. This domain is also found in chitin synthase proteins, and in protein phosphatases.
  • PRK PRK12361 131aa 7e-12 in ref transcript
    • hypothetical protein; Provisional.

SSX4

  • refseq_SSX4B.F1 refseq_SSX4B.R1 121 257
  • NCBIGene 36.3 6759
  • Single exon skipping, size difference: 136
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005636

  • Changed! pfam SSXRD 34aa 2e-08 in ref transcript
    • SSXRD motif. SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localise to this region.
  • smart KRAB 60aa 1e-05 in ref transcript
    • krueppel associated box.

ST3GAL3

  • refseq_ST3GAL3.F1 refseq_ST3GAL3.R1 107 401
  • NCBIGene 36.3 6487
  • Multiple exon skipping, size difference: 294
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_174963

  • Changed! pfam Glyco_transf_29 218aa 4e-38 in ref transcript
    • Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.
  • Changed! pfam Glyco_transf_29 85aa 4e-19 in modified transcript

ST3GAL3

  • refseq_ST3GAL3.F3 refseq_ST3GAL3.R3 134 179
  • NCBIGene 36.3 6487
  • Alternative 5-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_174963

  • pfam Glyco_transf_29 218aa 4e-38 in ref transcript
    • Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.

ST3GAL3

  • refseq_ST3GAL3.F5 refseq_ST3GAL3.R5 109 271
  • NCBIGene 36.3 6487
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_174963

  • pfam Glyco_transf_29 218aa 4e-38 in ref transcript
    • Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.

ST6GALNAC4

  • refseq_ST6GALNAC4.F1 refseq_ST6GALNAC4.R1 137 224
  • NCBIGene 36.3 27090
  • Single exon skipping, size difference: 87
  • Exclusion of the protein initiation site
  • Reference transcript: NM_175039

  • Changed! pfam Glyco_transf_29 273aa 2e-69 in ref transcript
    • Glycosyltransferase family 29 (sialyltransferase). Members of this family belong to glycosyltransferase family 29.
  • Changed! pfam Glyco_transf_29 201aa 3e-61 in modified transcript

ST7

  • refseq_ST7.F1 refseq_ST7.R1 197 266
  • NCBIGene 36.3 7982
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021908

  • Changed! pfam ST7 535aa 0.0 in ref transcript
    • ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
  • Changed! pfam ST7 512aa 0.0 in modified transcript

ST7L

  • refseq_ST7L.F1 refseq_ST7L.R1 166 259
  • NCBIGene 36.3 54879
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017744

  • Changed! pfam ST7 502aa 0.0 in ref transcript
    • ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
  • Changed! pfam ST7 471aa 0.0 in modified transcript

ST7L

  • refseq_ST7L.F2 refseq_ST7L.R2 101 152
  • NCBIGene 36.3 54879
  • Alternative 3-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017744

  • Changed! pfam ST7 502aa 0.0 in ref transcript
    • ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
  • Changed! pfam ST7 485aa 0.0 in modified transcript

STAP2

  • refseq_STAP2.F2 refseq_STAP2.R2 223 361
  • NCBIGene 36.3 55620
  • Alternative 3-prime, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017720

  • cd SH2 96aa 8e-05 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • pfam SH2 78aa 0.003 in ref transcript
    • SH2 domain.

STARD5

  • refseq_STARD5.F2 refseq_STARD5.R2 108 158
  • NCBIGene 36.2 80765
  • Single exon skipping, size difference: 50
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_181900

  • Changed! cd START 211aa 8e-31 in ref transcript
    • START(STeroidogenic Acute Regulatory (STAR) related lipid Transfer) Domain. These domains are 200-210 amino acid in length and occur in proteins involved in lipid transport (phosphatidylcholine) and metabolism, signal transduction, and transcriptional regulation. The most striking feature of the START domain structure is a predominantly hydrophobic tunnel extending nearly the entire protein and used to binding a single molecule of large lipophilic compounds, like cholesterol.
  • Changed! pfam START 206aa 2e-17 in ref transcript
    • START domain.

STATH

  • refseq_STATH.F1 refseq_STATH.R1 112 142
  • NCBIGene 36.3 6779
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003154

  • Changed! pfam Statherin 19aa 5e-05 in ref transcript
    • Statherin. Statherin functions biologically to inhibit the nucleation and growth of calcium phosphate minerals. The N-terminus of statherin is highly charge, the glutamic acids of which have been shown to be important in the recognition hydroxyapatite.

STAU1

  • refseq_STAU1.F1 refseq_STAU1.R1 101 119
  • NCBIGene 36.3 6780
  • Alternative 3-prime, size difference: 18
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_017453

  • cd DSRM 67aa 1e-12 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • smart DSRM 67aa 3e-14 in ref transcript
    • Double-stranded RNA binding motif.
  • Changed! smart DSRM 32aa 0.001 in ref transcript
  • PRK rnc 72aa 1e-10 in ref transcript
    • ribonuclease III; Reviewed.

STAU1

  • refseq_STAU1.F2 refseq_STAU1.R2 107 396
  • NCBIGene 36.3 6780
  • Single exon skipping, size difference: 289
  • Exclusion of the protein initiation site
  • Reference transcript: NM_017453

  • cd DSRM 67aa 1e-12 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • smart DSRM 67aa 3e-14 in ref transcript
    • Double-stranded RNA binding motif.
  • smart DSRM 32aa 0.001 in ref transcript
  • PRK rnc 72aa 1e-10 in ref transcript
    • ribonuclease III; Reviewed.

STRN4

  • refseq_STRN4.F1 refseq_STRN4.R1 118 139
  • NCBIGene 36.3 29888
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039877

  • cd WD40 306aa 2e-42 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam Striatin 134aa 3e-31 in ref transcript
    • Striatin family. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (pfam00400) repeat domain. It acts as a scaffold protein and is involved in signalling pathways.
  • smart WD40 40aa 1e-08 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 40aa 1e-06 in ref transcript
  • smart WD40 40aa 6e-06 in ref transcript
  • COG COG2319 303aa 5e-25 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

STX16

  • refseq_STX16.F1 refseq_STX16.R1 112 175
  • NCBIGene 36.3 8675
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 63
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001433

  • cd t_SNARE 47aa 3e-06 in ref transcript
    • Soluble NSF (N-ethylmaleimide-sensitive fusion protein)-Attachment protein (SNAP) REceptor domain; these alpha-helical motifs form twisted and parallel heterotetrameric helix bundles; the core complex contains one helix from a protein that is anchored in the vesicle membrane (synaptobrevin), one helix from a protein of the target membrane (syntaxin), and two helices from another protein anchored in the target membrane (SNAP-25); their interaction forms a core which is composed of a polar zero layer, a flanking leucine-zipper layer acts as a water tight shield to isolate ionic interactions in the zero layer from the surrounding solvent.
  • pfam Syntaxin 107aa 6e-13 in ref transcript
    • Syntaxin. Syntaxins are the prototype family of SNARE proteins. They usually consist of three main regions - a C-terminal transmembrane region, a central SNARE domain which is characteristic of and conserved in all syntaxins (pfam05739), and an N-terminal domain that is featured in this entry. This domain varies between syntaxin isoforms; in syntaxin 1A it is found as three alpha-helices with a left-handed twist. It may fold back on the SNARE domain to allow the molecule to adopt a 'closed' configuration that prevents formation of the core fusion complex - it thus has an auto-inhibitory role. The function of syntaxins is determined by their localisation. They are involved in neuronal exocytosis, ER-Golgi transport and Golgi-endosome transport, for example. They also interact with other proteins as well as those involved in SNARE complexes. These include vesicle coat proteins, Rab GTPases, and tethering factors.
  • pfam SNARE 52aa 6e-10 in ref transcript
    • SNARE domain. Most if not all vesicular membrane fusion events in eukaryotic cells are believed to be mediated by a conserved fusion machinery, the SNARE [soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptors] machinery. The SNARE domain is thought to act as a protein-protein interaction module in the assembly of a SNARE protein complex.
  • COG COG5325 227aa 3e-15 in ref transcript
    • t-SNARE complex subunit, syntaxin [Intracellular trafficking and secretion].

STX2

  • refseq_STX2.F1 refseq_STX2.R1 192 318
  • NCBIGene 36.3 2054
  • Single exon skipping, size difference: 126
  • Exclusion of the stop codon
  • Reference transcript: NM_194356

  • cd SynN 152aa 1e-29 in ref transcript
    • Syntaxin N-terminus domain; syntaxins are nervous system-specific proteins implicated in the docking of synaptic vesicles with the presynaptic plasma membrane; they are a family of receptors for intracellular transport vesicles; each target membrane may be identified by a specific member of the syntaxin family; syntaxins contain a moderately well conserved amino-terminal domain, called Habc, whose structure is an antiparallel three-helix bundle; a linker of about 30 amino acids connects this to the carboxy-terminal region, designated H3 (t_SNARE), of the syntaxin cytoplasmic domain; the highly conserved H3 region forms a single, long alpha-helix when it is part of the core SNARE complex and anchors the protein on the cytoplasmic surface of cellular membranes; H3 is not included in defining this domain.
  • cd t_SNARE 60aa 2e-07 in ref transcript
    • Soluble NSF (N-ethylmaleimide-sensitive fusion protein)-Attachment protein (SNAP) REceptor domain; these alpha-helical motifs form twisted and parallel heterotetrameric helix bundles; the core complex contains one helix from a protein that is anchored in the vesicle membrane (synaptobrevin), one helix from a protein of the target membrane (syntaxin), and two helices from another protein anchored in the target membrane (SNAP-25); their interaction forms a core which is composed of a polar zero layer, a flanking leucine-zipper layer acts as a water tight shield to isolate ionic interactions in the zero layer from the surrounding solvent.
  • smart SynN 119aa 4e-23 in ref transcript
    • Syntaxin N-terminal domain. Three-helix domain that (in Sso1p) slows the rate of its reaction with the SNAP-25 homologue Sec9p.
  • pfam SNARE 62aa 6e-13 in ref transcript
    • SNARE domain. Most if not all vesicular membrane fusion events in eukaryotic cells are believed to be mediated by a conserved fusion machinery, the SNARE [soluble N-ethylmaleimide-sensitive factor (NSF) attachment protein (SNAP) receptors] machinery. The SNARE domain is thought to act as a protein-protein interaction module in the assembly of a SNARE protein complex.
  • Changed! COG COG5074 234aa 9e-15 in ref transcript
    • t-SNARE complex subunit, syntaxin [Intracellular trafficking and secretion].
  • Changed! COG COG5074 235aa 7e-14 in modified transcript

STXBP1

  • refseq_STXBP1.F1 refseq_STXBP1.R1 227 353
  • NCBIGene 36.3 6812
  • Single exon skipping, size difference: 126
  • Exclusion of the stop codon
  • Reference transcript: NM_003165

  • Changed! pfam Sec1 549aa 1e-180 in ref transcript
    • Sec1 family.
  • Changed! COG SEC1 543aa 7e-55 in ref transcript
    • Proteins involved in synaptic transmission and general secretion, Sec1 family [Intracellular trafficking and secretion].
  • Changed! pfam Sec1 543aa 1e-178 in modified transcript
  • Changed! COG SEC1 535aa 2e-53 in modified transcript

SULF2

  • refseq_SULF2.F1 refseq_SULF2.R1 159 213
  • NCBIGene 36.3 55959
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018837

  • pfam Sulfatase 352aa 3e-34 in ref transcript
    • Sulfatase.
  • COG AslA 353aa 2e-29 in ref transcript
    • Arylsulfatase A and related enzymes [Inorganic ion transport and metabolism].

SULT1A1

  • refseq_SULT1A1.F1 refseq_SULT1A1.R1 129 160
  • NCBIGene 36.3 6817
  • Alternative 3-prime, size difference: 31
  • Inclusion in 5'UTR
  • Reference transcript: NM_001055

  • pfam Sulfotransfer_1 250aa 7e-77 in ref transcript
    • Sulfotransferase domain.

SULT1C2

  • refseq_SULT1C1.F1 refseq_SULT1C1.R1 160 193
  • NCBIGene 36.3 6819
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 33
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_176825

  • Changed! pfam Sulfotransfer_1 261aa 7e-86 in ref transcript
    • Sulfotransferase domain.
  • Changed! pfam Sulfotransfer_1 250aa 2e-95 in modified transcript

SUMO1

  • refseq_SUMO1.F1 refseq_SUMO1.R1 288 363
  • NCBIGene 36.3 7341
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003352

  • Changed! cd Sumo 87aa 2e-31 in ref transcript
    • Small ubiquitin-related modifier (SUMO) proteins are conjugated to numerous intracellular targets and serve to modulate protein interaction, localization, activity or stability. SUMO (also known as "Smt3" and "sentrin" in other organisms) is linked to several different pathways, including nucleocytoplasmic transport. Attachment of SUMO to targets proteins is stimulated by PIAS (Protein inhibitor of activated STATs) proteins which serve as E3-like ligases.
  • pfam ubiquitin 66aa 2e-10 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
  • Changed! COG SMT3 96aa 2e-19 in ref transcript
    • Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Sumo 71aa 1e-27 in modified transcript
  • Changed! COG SMT3 72aa 4e-17 in modified transcript

SUOX

  • refseq_SUOX.F1 refseq_SUOX.R1 121 247
  • NCBIGene 36.3 6821
  • Single exon skipping, size difference: 126
  • Exclusion in 5'UTR
  • Reference transcript: NM_000456

  • cd eukary_SO_Moco 359aa 1e-145 in ref transcript
    • molybdopterin binding domain of sulfite oxidase (SO). SO catalyzes the terminal reaction in the oxidative degradation of the sulfur-containing amino acids cysteine and methionine. Common features of all known members of the sulfite oxidase (SO) family of molybdopterin binding domains are that they contain one single pterin cofactor and part of the coordination of the metal (Mo) is a cysteine ligand of the protein and that they catalyze the transfer of an oxygen to or from a lone pair of electrons on the substrate.
  • pfam Mo-co_dimer 127aa 5e-46 in ref transcript
    • Mo-co oxidoreductase dimerisation domain. This domain is found in molybdopterin cofactor (Mo-co) oxidoreductases. It is involved in dimer formation, and has an Ig-fold structure.
  • pfam Oxidored_molyb 183aa 1e-32 in ref transcript
    • Oxidoreductase molybdopterin binding domain. This domain is found in a variety of oxidoreductases. This domain binds to a molybdopterin cofactor. Xanthine dehydrogenases, that also bind molybdopterin, have essentially no similarity.
  • pfam Cyt-b5 77aa 1e-14 in ref transcript
    • Cytochrome b5-like Heme/Steroid binding domain. This family includes heme binding domains from a diverse range of proteins. This family also includes proteins that bind to steroids. The family includes progesterone receptors. Many members of this subfamily are membrane anchored by an N-terminal transmembrane alpha helix. This family also includes a domain in some chitin synthases. There is no known ligand for this domain in the chitin synthases.
  • COG COG2041 237aa 5e-18 in ref transcript
    • Sulfite oxidase and related enzymes [General function prediction only].
  • COG CYB5 90aa 2e-05 in ref transcript
    • Cytochrome b involved in lipid metabolism [Energy production and conversion / Lipid metabolism].

SUPT3H

  • refseq_SUPT3H.F1 refseq_SUPT3H.R1 243 428
  • NCBIGene 36.3 8464
  • Multiple exon skipping, size difference: 185
  • Exclusion of the protein initiation site, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181356

  • Changed! pfam TFIID-18kDa 82aa 5e-18 in ref transcript
    • Transcription initiation factor IID, 18kD subunit. This family includes the Spt3 yeast transcription factors and the 18kD subunit from human transcription initiation factor IID (TFIID-18). Determination of the crystal structure reveals an atypical histone fold.
  • Changed! pfam TFIID-18kDa 91aa 2e-22 in modified transcript

SVIL

  • refseq_SVIL.F1 refseq_SVIL.R1 160 256
  • NCBIGene 36.3 6840
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021738

  • smart GEL 95aa 5e-17 in ref transcript
    • Gelsolin homology domain. Gelsolin/severin/villin homology domain. Calcium-binding and actin-binding. Both intra- and extracellular domains.
  • smart GEL 97aa 7e-11 in ref transcript
  • smart VHP 36aa 7e-11 in ref transcript
    • Villin headpiece domain.
  • smart GEL 107aa 5e-10 in ref transcript
  • smart GEL 96aa 4e-05 in ref transcript
  • smart GEL 99aa 0.002 in ref transcript

SYF2

  • refseq_SYF2.F1 refseq_SYF2.R1 196 322
  • NCBIGene 36.3 25949
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015484

  • Changed! pfam SYF2 145aa 6e-46 in ref transcript
    • SYF2 splicing factor. Proteins in this family are involved in cell cycle progression and pre-mRNA splicing.
  • Changed! pfam SYF2 152aa 4e-49 in modified transcript

SYN3

  • refseq_SYN3.F1 refseq_SYN3.R1 109 401
  • NCBIGene 36.3 8224
  • Single exon skipping, size difference: 292
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003490

  • pfam Synapsin_C 203aa 1e-119 in ref transcript
    • Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown.
  • pfam Synapsin 105aa 2e-52 in ref transcript
    • Synapsin, N-terminal domain.
  • pfam Synapsin_N 29aa 1e-10 in ref transcript
    • Synapsin N-terminal. This highly conserved domain of synapsin proteins has a serine at position 9 or 10 which is a phosphorylation site. The domain appears to be the part of the molecule that binds to calmodulin.
  • Changed! COG RimK 227aa 0.001 in ref transcript
    • Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis].
  • Changed! COG RimK 138aa 0.002 in modified transcript

SYNE1

  • refseq_SYNE1.F2 refseq_SYNE1.R2 104 125
  • NCBIGene 36.3 23345
  • Single exon skipping, size difference: 21
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_182961

  • cd SPEC 211aa 6e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • Changed! cd CH 107aa 1e-13 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 216aa 6e-11 in ref transcript
  • cd SPEC 210aa 2e-09 in ref transcript
  • cd CH 105aa 8e-09 in ref transcript
  • cd SPEC 219aa 1e-07 in ref transcript
  • cd SPEC 217aa 4e-07 in ref transcript
  • cd SPEC 217aa 7e-07 in ref transcript
  • cd SPEC 215aa 8e-07 in ref transcript
  • cd SPEC 206aa 2e-05 in ref transcript
  • cd SPEC 212aa 3e-05 in ref transcript
  • cd SPEC 210aa 7e-05 in ref transcript
  • cd SPEC 227aa 8e-05 in ref transcript
  • cd SPEC 216aa 1e-04 in ref transcript
  • cd SPEC 217aa 3e-04 in ref transcript
  • cd SPEC 223aa 7e-04 in ref transcript
  • cd SPEC 206aa 0.003 in ref transcript
  • cd SPEC 212aa 0.003 in ref transcript
  • cd SPEC 208aa 0.004 in ref transcript
  • Changed! smart CH 103aa 1e-14 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam KASH 60aa 1e-13 in ref transcript
    • Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
  • pfam CH 104aa 9e-11 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • pfam SMC_N 808aa 2e-09 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • pfam SMC_N 755aa 2e-06 in ref transcript
  • smart SPEC 100aa 2e-06 in ref transcript
    • Spectrin repeats.
  • pfam Spectrin 98aa 8e-05 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • pfam SMC_N 247aa 3e-04 in ref transcript
  • pfam SMC_N 199aa 5e-04 in ref transcript
  • TIGR SMC_prok_B 254aa 6e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 103aa 0.001 in ref transcript
  • smart SPEC 97aa 0.002 in ref transcript
  • pfam SMC_N 247aa 0.003 in ref transcript
  • pfam Spectrin 101aa 0.005 in ref transcript
  • pfam SMC_N 296aa 0.009 in ref transcript
  • Changed! COG SAC6 259aa 3e-20 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 854aa 2e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 819aa 6e-06 in ref transcript
  • COG Smc 303aa 0.002 in ref transcript
  • PRK PRK02224 459aa 0.002 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! cd CH 114aa 1e-11 in modified transcript
  • Changed! smart CH 110aa 2e-12 in modified transcript
  • Changed! TIGR SMC_prok_B 839aa 0.010 in modified transcript
  • Changed! COG SAC6 266aa 3e-18 in modified transcript

SYNE1

  • refseq_SYNE1.F3 refseq_SYNE1.R3 251 317
  • NCBIGene 36.3 23345
  • Alternative 5-prime, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182961

  • cd SPEC 211aa 6e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd CH 107aa 1e-13 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 216aa 6e-11 in ref transcript
  • cd SPEC 210aa 2e-09 in ref transcript
  • cd CH 105aa 8e-09 in ref transcript
  • cd SPEC 219aa 1e-07 in ref transcript
  • cd SPEC 217aa 4e-07 in ref transcript
  • cd SPEC 217aa 7e-07 in ref transcript
  • cd SPEC 215aa 8e-07 in ref transcript
  • cd SPEC 206aa 2e-05 in ref transcript
  • cd SPEC 212aa 3e-05 in ref transcript
  • cd SPEC 210aa 7e-05 in ref transcript
  • cd SPEC 227aa 8e-05 in ref transcript
  • cd SPEC 216aa 1e-04 in ref transcript
  • cd SPEC 217aa 3e-04 in ref transcript
  • cd SPEC 223aa 7e-04 in ref transcript
  • cd SPEC 206aa 0.003 in ref transcript
  • cd SPEC 212aa 0.003 in ref transcript
  • cd SPEC 208aa 0.004 in ref transcript
  • smart CH 103aa 1e-14 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam KASH 60aa 1e-13 in ref transcript
    • Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
  • pfam CH 104aa 9e-11 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • pfam SMC_N 808aa 2e-09 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • pfam SMC_N 755aa 2e-06 in ref transcript
  • smart SPEC 100aa 2e-06 in ref transcript
    • Spectrin repeats.
  • pfam Spectrin 98aa 8e-05 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • pfam SMC_N 247aa 3e-04 in ref transcript
  • pfam SMC_N 199aa 5e-04 in ref transcript
  • TIGR SMC_prok_B 254aa 6e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 103aa 0.001 in ref transcript
  • smart SPEC 97aa 0.002 in ref transcript
  • pfam SMC_N 247aa 0.003 in ref transcript
  • pfam Spectrin 101aa 0.005 in ref transcript
  • pfam SMC_N 296aa 0.009 in ref transcript
  • COG SAC6 259aa 3e-20 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 854aa 2e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG Smc 819aa 6e-06 in ref transcript
  • COG Smc 303aa 0.002 in ref transcript
  • PRK PRK02224 459aa 0.002 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! TIGR SMC_prok_A 180aa 0.005 in modified transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! TIGR SMC_prok_B 839aa 0.010 in modified transcript
  • Changed! COG Smc 760aa 4e-06 in modified transcript

SYNE1

  • refseq_SYNE1.F5 refseq_SYNE1.R5 157 226
  • NCBIGene 36.3 23345
  • Single exon skipping, size difference: 69
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_182961

  • cd SPEC 211aa 6e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd CH 107aa 1e-13 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 216aa 6e-11 in ref transcript
  • cd SPEC 210aa 2e-09 in ref transcript
  • cd CH 105aa 8e-09 in ref transcript
  • cd SPEC 219aa 1e-07 in ref transcript
  • cd SPEC 217aa 4e-07 in ref transcript
  • cd SPEC 217aa 7e-07 in ref transcript
  • cd SPEC 215aa 8e-07 in ref transcript
  • cd SPEC 206aa 2e-05 in ref transcript
  • cd SPEC 212aa 3e-05 in ref transcript
  • cd SPEC 210aa 7e-05 in ref transcript
  • cd SPEC 227aa 8e-05 in ref transcript
  • cd SPEC 216aa 1e-04 in ref transcript
  • cd SPEC 217aa 3e-04 in ref transcript
  • cd SPEC 223aa 7e-04 in ref transcript
  • cd SPEC 206aa 0.003 in ref transcript
  • cd SPEC 212aa 0.003 in ref transcript
  • cd SPEC 208aa 0.004 in ref transcript
  • smart CH 103aa 1e-14 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam KASH 60aa 1e-13 in ref transcript
    • Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
  • pfam CH 104aa 9e-11 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • pfam SMC_N 808aa 2e-09 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • pfam SMC_N 755aa 2e-06 in ref transcript
  • smart SPEC 100aa 2e-06 in ref transcript
    • Spectrin repeats.
  • pfam Spectrin 98aa 8e-05 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • pfam SMC_N 247aa 3e-04 in ref transcript
  • pfam SMC_N 199aa 5e-04 in ref transcript
  • TIGR SMC_prok_B 254aa 6e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 103aa 0.001 in ref transcript
  • smart SPEC 97aa 0.002 in ref transcript
  • pfam SMC_N 247aa 0.003 in ref transcript
  • pfam Spectrin 101aa 0.005 in ref transcript
  • pfam SMC_N 296aa 0.009 in ref transcript
  • COG SAC6 259aa 3e-20 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 854aa 2e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 819aa 6e-06 in ref transcript
  • COG Smc 303aa 0.002 in ref transcript
  • PRK PRK02224 459aa 0.002 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! TIGR SMC_prok_B 839aa 0.010 in modified transcript

SYNE1

  • refseq_SYNE1.F8 refseq_SYNE1.R8 236 404
  • NCBIGene 36.3 23345
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182961

  • cd SPEC 211aa 6e-14 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd CH 107aa 1e-13 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 216aa 6e-11 in ref transcript
  • cd SPEC 210aa 2e-09 in ref transcript
  • cd CH 105aa 8e-09 in ref transcript
  • Changed! cd SPEC 219aa 1e-07 in ref transcript
  • cd SPEC 217aa 4e-07 in ref transcript
  • cd SPEC 217aa 7e-07 in ref transcript
  • cd SPEC 215aa 8e-07 in ref transcript
  • cd SPEC 206aa 2e-05 in ref transcript
  • cd SPEC 212aa 3e-05 in ref transcript
  • cd SPEC 210aa 7e-05 in ref transcript
  • Changed! cd SPEC 227aa 8e-05 in ref transcript
  • cd SPEC 216aa 1e-04 in ref transcript
  • cd SPEC 217aa 3e-04 in ref transcript
  • cd SPEC 223aa 7e-04 in ref transcript
  • cd SPEC 206aa 0.003 in ref transcript
  • cd SPEC 212aa 0.003 in ref transcript
  • cd SPEC 208aa 0.004 in ref transcript
  • smart CH 103aa 1e-14 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam KASH 60aa 1e-13 in ref transcript
    • Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
  • pfam CH 104aa 9e-11 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • pfam SMC_N 808aa 2e-09 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • pfam SMC_N 755aa 2e-06 in ref transcript
  • smart SPEC 100aa 2e-06 in ref transcript
    • Spectrin repeats.
  • pfam Spectrin 98aa 8e-05 in ref transcript
    • Spectrin repeat. Spectrin repeats are found in several proteins involved in cytoskeletal structure. These include spectrin, alpha-actinin and dystrophin. The sequence repeat used in this family is taken from the structural repeat in reference. The spectrin repeat forms a three helix bundle. The second helix is interrupted by proline in some sequences. The repeats are defined by a characteristic tryptophan (W) residue at position 17 in helix A and a leucine (L) at 2 residues from the carboxyl end of helix C.
  • pfam SMC_N 247aa 3e-04 in ref transcript
  • pfam SMC_N 199aa 5e-04 in ref transcript
  • TIGR SMC_prok_B 254aa 6e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 103aa 0.001 in ref transcript
  • smart SPEC 97aa 0.002 in ref transcript
  • pfam SMC_N 247aa 0.003 in ref transcript
  • pfam Spectrin 101aa 0.005 in ref transcript
  • pfam SMC_N 296aa 0.009 in ref transcript
  • COG SAC6 259aa 3e-20 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 854aa 2e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 819aa 6e-06 in ref transcript
  • COG Smc 303aa 0.002 in ref transcript
  • PRK PRK02224 459aa 0.002 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! pfam SMC_N 200aa 0.002 in modified transcript
  • Changed! pfam SMC_N 336aa 0.003 in modified transcript
  • Changed! TIGR SMC_prok_B 839aa 0.010 in modified transcript
  • Changed! COG SbcC 570aa 4e-04 in modified transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].

SYNJ1

  • refseq_SYNJ1.F2 refseq_SYNJ1.R2 174 222
  • NCBIGene 36.3 8867
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003895

  • cd RRM 59aa 3e-04 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam Syja_N 289aa 1e-113 in ref transcript
    • SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin.
  • smart IPPc 344aa 1e-110 in ref transcript
    • Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes.
  • pfam DUF1866 143aa 7e-55 in ref transcript
    • Domain of unknown function (DUF1866). This domain, found in Synaptojanin, has no known function.
  • COG COG5329 442aa 4e-80 in ref transcript
    • Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms].
  • COG COG5411 362aa 1e-56 in ref transcript
    • Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms].

SYNJ1

  • refseq_SYNJ1.F4 refseq_SYNJ1.R4 136 160
  • NCBIGene 36.3 8867
  • Single exon skipping, size difference: 24
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_003895

  • cd RRM 59aa 3e-04 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam Syja_N 289aa 1e-113 in ref transcript
    • SacI homology domain. This Pfam family represents a protein domain which shows homology to the yeast protein SacI. The SacI homology domain is most notably found at the amino terminal of the inositol 5'-phosphatase synaptojanin.
  • smart IPPc 344aa 1e-110 in ref transcript
    • Inositol polyphosphate phosphatase, catalytic domain homologues. Mg(2+)-dependent/Li(+)-sensitive enzymes.
  • pfam DUF1866 143aa 7e-55 in ref transcript
    • Domain of unknown function (DUF1866). This domain, found in Synaptojanin, has no known function.
  • COG COG5329 442aa 4e-80 in ref transcript
    • Phosphoinositide polyphosphatase (Sac family) [Signal transduction mechanisms].
  • Changed! COG COG5411 362aa 1e-56 in ref transcript
    • Phosphatidylinositol 5-phosphate phosphatase [Signal transduction mechanisms].
  • Changed! COG COG5411 369aa 5e-58 in modified transcript

SYTL2

  • refseq_SYTL2.F1 refseq_SYTL2.R1 203 323
  • NCBIGene 36.3 54843
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206927

  • cd C2_1 127aa 9e-17 in ref transcript
    • Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
  • cd C2_1 128aa 4e-15 in ref transcript
  • pfam C2 90aa 2e-10 in ref transcript
    • C2 domain.
  • pfam C2 88aa 7e-10 in ref transcript

TAC1

  • refseq_TAC1.F1 refseq_TAC1.R1 106 151
  • NCBIGene 36.3 6863
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003182

TAC1

  • refseq_TAC1.F2 refseq_TAC1.R2 114 168
  • NCBIGene 36.3 6863
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003182

TAC3

  • refseq_TAC3.F1 refseq_TAC3.R1 127 244
  • NCBIGene 36.3 6866
  • Single exon skipping, size difference: 117
  • Exclusion of the stop codon
  • Reference transcript: NM_001006667

  • pfam Neurokinin_B 55aa 4e-20 in ref transcript
    • Neurokinin B.

TACC2

  • refseq_TACC2.F1 refseq_TACC2.R1 124 214
  • NCBIGene 36.3 10579
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206862

  • pfam TACC 206aa 2e-92 in ref transcript
    • Transforming acidic coiled-coil-containing protein (TACC). This family contains the proteins TACC 1, 2 and 3 the genes for which are found concentrated in the centrosomes of eukaryotic and may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1). The functional homologue (Alp7) in Schizosaccharomyces pombe has been shown to be required for organisation of bipolar spindles.
  • COG Smc 200aa 7e-10 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • PRK PRK07003 236aa 2e-04 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.

TACC2

  • refseq_TACC2.F4 refseq_TACC2.R4 123 258
  • NCBIGene 36.3 10579
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206862

  • pfam TACC 206aa 2e-92 in ref transcript
    • Transforming acidic coiled-coil-containing protein (TACC). This family contains the proteins TACC 1, 2 and 3 the genes for which are found concentrated in the centrosomes of eukaryotic and may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1). The functional homologue (Alp7) in Schizosaccharomyces pombe has been shown to be required for organisation of bipolar spindles.
  • COG Smc 200aa 7e-10 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • PRK PRK07003 236aa 2e-04 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.

TAF1

  • refseq_TAF1.F2 refseq_TAF1.R2 229 292
  • NCBIGene 36.3 6872
  • Alternative 3-prime, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004606

  • cd Bromo_TFIID 113aa 7e-51 in ref transcript
    • Bromodomain, TFIID-like subfamily. Human TAFII250 (or TAF250) is the largest subunit of TFIID, a large multi-domain complex, which initiates the assembly of the transcription machinery. TAFII250 contains two bromodomains that specifically bind to acetylated histone H4. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd Bromo_TFIID 89aa 2e-37 in ref transcript
  • smart BROMO 108aa 2e-27 in ref transcript
    • bromo domain.
  • smart BROMO 96aa 1e-20 in ref transcript
  • pfam TBP-binding 61aa 4e-11 in ref transcript
    • TATA box-binding protein binding. Members of this family adopt a structure consisting of three alpha helices and a beta-hairpin. They bind to TATA box-binding protein (TBP), inhibiting TBP interaction with the TATA element, thereby resulting in shutting down of gene transcription.
  • COG TAF1 742aa 3e-73 in ref transcript
    • Transcription initiation factor TFIID, subunit TAF1 [Transcription].
  • COG COG5076 210aa 1e-16 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].

TAF1A

  • refseq_TAF1A.F1 refseq_TAF1A.R1 121 291
  • NCBIGene 36.3 9015
  • Single exon skipping, size difference: 170
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005681

TAF1C

  • refseq_TAF1C.F1 refseq_TAF1C.R1 118 279
  • NCBIGene 36.3 9013
  • Alternative 3-prime, size difference: 161
  • Exclusion of the protein initiation site
  • Reference transcript: NM_005679

TAF6

  • refseq_TAF6.F1 refseq_TAF6.R1 126 450
  • NCBIGene 36.3 6878
  • Alternative 5-prime, size difference: 324
  • Exclusion in 5'UTR
  • Reference transcript: NM_005641

  • pfam DUF1546 95aa 7e-35 in ref transcript
    • Protein of unknown function (DUF1546). Associated with pfam02969 in Transcription initiation factor TFIID subunit 6 (TAF6).
  • pfam TAF 66aa 1e-25 in ref transcript
    • TATA box binding protein associated factor (TAF). TAF proteins adopt a histone-like fold.
  • COG TAF6 400aa 2e-72 in ref transcript
    • Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) [Transcription].

TAF6

  • refseq_TAF6.F3 refseq_TAF6.R3 170 200
  • NCBIGene 36.3 6878
  • Alternative 5-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139315

  • pfam DUF1546 95aa 7e-35 in ref transcript
    • Protein of unknown function (DUF1546). Associated with pfam02969 in Transcription initiation factor TFIID subunit 6 (TAF6).
  • pfam TAF 66aa 1e-25 in ref transcript
    • TATA box binding protein associated factor (TAF). TAF proteins adopt a histone-like fold.
  • COG TAF6 400aa 2e-72 in ref transcript
    • Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) [Transcription].

TAF6

  • refseq_TAF6.F6 refseq_TAF6.R6 149 179
  • NCBIGene 36.3 6878
  • Alternative 5-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139315

  • pfam DUF1546 95aa 7e-35 in ref transcript
    • Protein of unknown function (DUF1546). Associated with pfam02969 in Transcription initiation factor TFIID subunit 6 (TAF6).
  • Changed! pfam TAF 66aa 1e-25 in ref transcript
    • TATA box binding protein associated factor (TAF). TAF proteins adopt a histone-like fold.
  • Changed! COG TAF6 400aa 2e-72 in ref transcript
    • Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) [Transcription].
  • Changed! pfam TAF 56aa 2e-17 in modified transcript
  • Changed! COG TAF6 390aa 6e-66 in modified transcript

TAGAP

  • refseq_TAGAP.F1 refseq_TAGAP.R1 201 368
  • NCBIGene 36.3 117289
  • Single exon skipping, size difference: 167
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_054114

  • Changed! cd RhoGAP_ARHGAP20 197aa 7e-66 in ref transcript
    • RhoGAP_ARHGAP20: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of ArhGAP20-like proteins. ArhGAP20, also known as KIAA1391 and RA-RhoGAP, contains a RhoGAP, a RA, and a PH domain, and ANXL repeats. ArhGAP20 is activated by Rap1 and induces inactivation of Rho, which in turn leads to neurite outgrowth. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • Changed! pfam RhoGAP 142aa 5e-31 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.

TAGLN

  • refseq_TAGLN.F1 refseq_TAGLN.R1 100 497
  • NCBIGene 36.3 6876
  • Alternative 5-prime, size difference: 397
  • Exclusion in 5'UTR
  • Reference transcript: NM_001001522

  • cd CH 113aa 5e-16 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • smart CH 107aa 1e-16 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam Calponin 26aa 7e-06 in ref transcript
    • Calponin family repeat.
  • COG SCP1 177aa 1e-16 in ref transcript
    • Calponin [Cytoskeleton].

TAGLN3

  • refseq_TAGLN3.F1 refseq_TAGLN3.R1 167 341
  • NCBIGene 36.3 29114
  • Alternative 3-prime, size difference: 174
  • Inclusion in 5'UTR
  • Reference transcript: NM_001008272

  • cd CH 109aa 1e-17 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • smart CH 106aa 3e-19 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam Calponin 25aa 1e-05 in ref transcript
    • Calponin family repeat.
  • COG SCP1 176aa 3e-18 in ref transcript
    • Calponin [Cytoskeleton].

TAS1R1

  • refseq_TAS1R1.F2 refseq_TAS1R1.R2 103 437
  • NCBIGene 36.3 80835
  • Multiple exon skipping, size difference: 334
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_138697

  • Changed! cd PBP1_Taste_receptor 455aa 1e-175 in ref transcript
    • Ligand-binding domain of the T1R taste receptor. The T1R is a member of the family C receptors within the G-protein coupled receptor superfamily, which also includes the metabotropic glutamate receptors, GABAb receptors, the calcium-sensing receptor (CaSR), the V2R pheromone receptors, and a small group of uncharacterized orphan receptors.
  • Changed! pfam ANF_receptor 384aa 1e-48 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • Changed! pfam 7tm_3 235aa 8e-34 in ref transcript
    • 7 transmembrane sweet-taste receptor of 3 GCPR. This is a domain of seven transmembrane regions that forms the C-terminus of some subclass 3 G-coupled-protein receptors. It is often associated with a downstream cysteine-rich linker domain, NCD3G pfam07562, which is the human sweet-taste receptor, and the N-terminal domain, ANF_receptor pfam01094. The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness.
  • Changed! pfam NCD3G 54aa 7e-16 in ref transcript
    • Nine Cysteines Domain of family 3 GPCR. This conserved sequence contains several highly-conserved Cys residues that are predicted to form disulphide bridges. It is predicted to lie outside the cell membrane, tethered to the pfam00003 in several receptor proteins.
  • Changed! COG LivK 280aa 3e-15 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
  • Changed! cd PBP1_Taste_receptor 389aa 1e-147 in modified transcript
  • Changed! pfam ANF_receptor 283aa 4e-45 in modified transcript
  • Changed! COG LivK 212aa 1e-14 in modified transcript

TAZ

  • refseq_TAZ.F1 refseq_TAZ.R1 101 207
  • NCBIGene 36.2 6901
  • Alternative 5-prime, size difference: 106
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000116

  • Changed! pfam Acyltransferase 175aa 3e-18 in ref transcript
    • Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
  • Changed! PRK PRK08633 232aa 4e-06 in ref transcript
    • 2-acyl-glycerophospho-ethanolamine acyltransferase; Validated.
  • Changed! pfam Acyltransferase 46aa 0.001 in modified transcript

TAZ

  • refseq_TAZ.F2 refseq_TAZ.R2 100 190
  • NCBIGene 36.3 6901
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000116

  • Changed! pfam Acyltransferase 175aa 3e-18 in ref transcript
    • Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
  • Changed! PRK PRK08633 232aa 4e-06 in ref transcript
    • 2-acyl-glycerophospho-ethanolamine acyltransferase; Validated.
  • Changed! pfam Acyltransferase 145aa 5e-21 in modified transcript
  • Changed! PRK PRK08633 202aa 7e-09 in modified transcript

TAZ

  • refseq_TAZ.F3 refseq_TAZ.R3 128 170
  • NCBIGene 36.3 6901
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000116

  • Changed! pfam Acyltransferase 175aa 3e-18 in ref transcript
    • Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
  • Changed! PRK PRK08633 232aa 4e-06 in ref transcript
    • 2-acyl-glycerophospho-ethanolamine acyltransferase; Validated.
  • Changed! pfam Acyltransferase 141aa 3e-12 in modified transcript
  • Changed! PRK PRK08633 218aa 4e-04 in modified transcript

TBC1D9B

  • refseq_TBC1D9B.F1 refseq_TBC1D9B.R1 129 180
  • NCBIGene 36.3 23061
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198868

  • pfam TBC 210aa 3e-49 in ref transcript
    • TBC domain. Identification of a TBC domain in GYP6_YEAST and GYP7_YEAST, which are GTPase activator proteins of yeast Ypt6 and Ypt7, imply that these domains are GTPase activator proteins of Rab-like small GTPases.
  • pfam GRAM 59aa 4e-16 in ref transcript
    • GRAM domain. The GRAM domain is found in in glucosyltransferases, myotubularins and other putative membrane-associated proteins.
  • pfam GRAM 59aa 7e-13 in ref transcript
  • COG COG5210 343aa 2e-40 in ref transcript
    • GTPase-activating protein [General function prediction only].

TBCD

  • refseq_TBCD.F1 refseq_TBCD.R1 289 340
  • NCBIGene 36.2 6904
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005993

  • COG CIN1 277aa 4e-20 in ref transcript
    • Beta-tubulin folding cofactor D [Posttranslational modification, protein turnover, chaperones / Cytoskeleton].
  • COG CIN1 63aa 2e-04 in ref transcript
  • Changed! COG CIN1 109aa 0.003 in ref transcript

TBL1Y

  • refseq_TBL1Y.F1 refseq_TBL1Y.R1 114 209
  • NCBIGene 36.3 90665
  • Single exon skipping, size difference: 95
  • Exclusion in 5'UTR
  • Reference transcript: NM_033284

  • cd WD40 307aa 8e-63 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • smart WD40 40aa 1e-07 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 38aa 2e-07 in ref transcript
  • pfam LisH 27aa 2e-06 in ref transcript
    • LisH. The LisH (lis homology) domain mediates protein dimerisation and tetramerisation. The LisH domain is found in Sif2, a component of the Set3 complex which is responsible for repressing meiotic genes. It has been shown that the LisH domain helps mediate interaction with components of the Set3 complex.
  • pfam WD40 35aa 1e-04 in ref transcript
    • WD domain, G-beta repeat.
  • smart WD40 46aa 2e-04 in ref transcript
  • smart WD40 40aa 0.001 in ref transcript
  • pfam eIF2A 209aa 0.002 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • COG COG2319 378aa 5e-30 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

TBL1Y

  • refseq_TBL1Y.F2 refseq_TBL1Y.R2 134 165
  • NCBIGene 36.3 90665
  • Single exon skipping, size difference: 31
  • Exclusion in 5'UTR
  • Reference transcript: NM_033284

  • cd WD40 307aa 8e-63 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • smart WD40 40aa 1e-07 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 38aa 2e-07 in ref transcript
  • pfam LisH 27aa 2e-06 in ref transcript
    • LisH. The LisH (lis homology) domain mediates protein dimerisation and tetramerisation. The LisH domain is found in Sif2, a component of the Set3 complex which is responsible for repressing meiotic genes. It has been shown that the LisH domain helps mediate interaction with components of the Set3 complex.
  • pfam WD40 35aa 1e-04 in ref transcript
    • WD domain, G-beta repeat.
  • smart WD40 46aa 2e-04 in ref transcript
  • smart WD40 40aa 0.001 in ref transcript
  • pfam eIF2A 209aa 0.002 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • COG COG2319 378aa 5e-30 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

TBRG4

  • refseq_TBRG4.F2 refseq_TBRG4.R2 102 432
  • NCBIGene 36.3 9238
  • Multiple exon skipping, size difference: 330
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004749

  • pfam FAST_1 71aa 3e-17 in ref transcript
    • FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
  • pfam RAP 58aa 4e-12 in ref transcript
    • RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.
  • pfam FAST_2 84aa 3e-11 in ref transcript
    • FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.

TBRG4

  • refseq_TBRG4.F3 refseq_TBRG4.R3 115 136
  • NCBIGene 36.3 9238
  • Alternative 5-prime, size difference: 21
  • Exclusion in 5'UTR
  • Reference transcript: NM_004749

  • pfam FAST_1 71aa 3e-17 in ref transcript
    • FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
  • pfam RAP 58aa 4e-12 in ref transcript
    • RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.
  • pfam FAST_2 84aa 3e-11 in ref transcript
    • FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.

TBX5

  • refseq_TBX5.F1 refseq_TBX5.R1 166 351
  • NCBIGene 36.3 6910
  • Single exon skipping, size difference: 185
  • Exclusion of the protein initiation site
  • Reference transcript: NM_000192

  • cd TBOX 189aa 5e-91 in ref transcript
    • T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
  • pfam T-box 183aa 9e-90 in ref transcript
    • T-box. The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.

TBXAS1

  • refseq_TBXAS1.F1 refseq_TBXAS1.R1 157 320
  • NCBIGene 36.3 6916
  • Single exon skipping, size difference: 163
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001061

  • Changed! pfam p450 482aa 2e-71 in ref transcript
    • Cytochrome P450. Cytochrome P450s are haem-thiolate proteins involved in the oxidative degradation of various compounds. They are particularly well known for their role in the degradation of environmental toxins and mutagens. They can be divided into 4 classes, according to the method by which electrons from NAD(P)H are delivered to the catalytic site. Sequence conservation is relatively low within the family - there are only 3 absolutely conserved residues - but their general topography and structural fold are highly conserved. The conserved core is composed of a coil termed the 'meander', a four-helix bundle, helices J and K, and two sets of beta-sheets. These constitute the haem-binding loop (with an absolutely conserved cysteine that serves as the 5th ligand for the haem iron), the proton-transfer groove and the absolutely conserved EXXR motif in helix K. While prokaryotic P450s are soluble proteins, most eukaryotic P450s are associated with microsomal membranes. their general enzymatic function is to catalyse regiospecific and stereospecific oxidation of non-activated hydrocarbons at physiological temperatures.
  • Changed! COG CypX 478aa 3e-38 in ref transcript
    • Cytochrome P450 [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam p450 410aa 2e-57 in modified transcript
  • Changed! COG CypX 410aa 3e-32 in modified transcript

TCEA1

  • refseq_TCEA1.F1 refseq_TCEA1.R1 269 332
  • NCBIGene 36.3 6917
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006756

  • Changed! cd TFIIS_I 77aa 7e-16 in ref transcript
    • N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a domain found in elongin A and CRSP70; likely to be involved in transcription; domain I from TFIIS interacts with RNA polymerase II holoenzyme.
  • Changed! TIGR TFSII 298aa 4e-89 in ref transcript
    • This model represents eukaryotic transcription elongation factor S-II. This protein allows stalled RNA transcription complexes to perform a cleavage of the nascent RNA and restart at the newly generated 3-prime end.
  • COG RPB9 68aa 4e-11 in ref transcript
    • DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription].
  • Changed! cd TFIIS_I 56aa 1e-05 in modified transcript
  • Changed! TIGR TFSII 277aa 9e-77 in modified transcript

TCEAL8

  • refseq_TCEAL8.F1 refseq_TCEAL8.R1 228 301
  • NCBIGene 36.3 90843
  • Single exon skipping, size difference: 73
  • Exclusion in 5'UTR
  • Reference transcript: NM_153333

  • pfam TFA 111aa 3e-08 in ref transcript
    • Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.

TCERG1

  • refseq_TCERG1.F1 refseq_TCERG1.R1 130 193
  • NCBIGene 36.3 10915
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006706

  • cd WW 28aa 2e-04 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • cd WW 25aa 0.004 in ref transcript
  • cd WW 30aa 0.008 in ref transcript
  • pfam FF 52aa 5e-09 in ref transcript
    • FF domain. This domain has been predicted to be involved in protein-protein interaction. This domain was recently shown to bind the hyperphosphorylated C-terminal repeat domain of RNA polymerase II, confirming its role in protein-protein interactions.
  • smart FF 54aa 4e-07 in ref transcript
    • Contains two conserved F residues. A novel motif that often accompanies WW domains. Often contains two conserved Phe (F) residues.
  • pfam FF 61aa 1e-05 in ref transcript
  • smart FF 54aa 1e-05 in ref transcript
  • smart WW 28aa 2e-05 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • smart FF 56aa 2e-04 in ref transcript
  • pfam WW 27aa 5e-04 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.
  • pfam FF 48aa 6e-04 in ref transcript
  • pfam WW 25aa 0.001 in ref transcript
  • COG PRP40 350aa 3e-09 in ref transcript
    • Splicing factor [RNA processing and modification].
  • COG PRP40 38aa 4e-04 in ref transcript
  • COG SbcC 223aa 0.005 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].

TCF12

  • refseq_TCF12.F2 refseq_TCF12.R2 219 291
  • NCBIGene 36.3 6938
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207037

  • cd HLH 61aa 1e-06 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • smart HLH 54aa 7e-10 in ref transcript
    • helix loop helix domain.

TCF20

  • refseq_TCF20.F2 refseq_TCF20.R2 101 229
  • NCBIGene 36.3 6942
  • Single exon skipping, size difference: 128
  • Exclusion of the stop codon
  • Reference transcript: NM_005650

TCF7

  • refseq_TCF7.F1 refseq_TCF7.R1 100 196
  • NCBIGene 36.3 6932
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 96
  • Inclusion in the protein causing a frameshift, Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_003202

  • cd SOX-TCF_HMG-box 71aa 9e-24 in ref transcript
    • SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
  • pfam CTNNB1_binding 187aa 6e-33 in ref transcript
    • N-terminal CTNNB1 binding. This region tends to appear at the N-terminus of proteins also containing DNA-binding HMG (high mobility group) boxes (pfam00505) and appears to bind the armadillo repeat of CTNNB1 (beta-catenin), forming a stable complex. Signaling by Wnt through TCF/LCF is involved in developmental patterning, induction of neural tissues, cell fate decisions and stem cell differentiation. Isoforms of HMG T-cell factors lacking the N-terminal CTNNB1-binding domain cannot fulfill their role as transcriptional activators in T-cell differentiation.
  • pfam HMG_box 68aa 5e-18 in ref transcript
    • HMG (high mobility group) box.
  • Changed! PTZ PTZ00199 57aa 4e-04 in ref transcript
    • high mobility group protein; Provisional.
  • Changed! COG NHP6B 96aa 2e-04 in modified transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

TCL6

  • refseq_TCL6.F1 refseq_TCL6.R1 118 254
  • NCBIGene 36.3 27004
  • Single exon skipping, size difference: 136
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020554

TCOF1

  • refseq_TCOF1.F2 refseq_TCOF1.R2 217 331
  • NCBIGene 36.3 6949
  • Single exon skipping, size difference: 114
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001008656

  • pfam Treacle 321aa 9e-36 in ref transcript
    • Treacher Collins syndrome protein Treacle.

TCP1

  • refseq_TCP1.F1 refseq_TCP1.R1 137 223
  • NCBIGene 36.3 6950
  • Single exon skipping, size difference: 86
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_030752

  • Changed! cd TCP1_alpha 527aa 0.0 in ref transcript
    • TCP-1 (CTT or eukaryotic type II) chaperonin family, alpha subunit. Chaperonins are involved in productive folding of proteins. They share a common general morphology, a double toroid of 2 stacked rings. In contrast to bacterial group I chaperonins (GroEL), each ring of the eukaryotic cytosolic chaperonin (CTT) consists of eight different, but homologous subunits. Their common function is to sequester nonnative proteins inside their central cavity and promote folding by using energy derived from ATP hydrolysis. The best studied in vivo substrates of CTT are actin and tubulin.
  • Changed! TIGR chap_CCT_alpha 536aa 0.0 in ref transcript
    • Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT alpha chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
  • Changed! COG GroL 533aa 1e-111 in ref transcript
    • Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].

TDP1

  • refseq_TDP1.F1 refseq_TDP1.R1 125 348
  • NCBIGene 36.3 55775
  • Single exon skipping, size difference: 223
  • Exclusion in 5'UTR
  • Reference transcript: NM_018319

  • pfam Tyr-DNA_phospho 422aa 1e-104 in ref transcript
    • Tyrosyl-DNA phosphodiesterase. Covalent intermediates between topoisomerase I and DNA can become dead-end complexes that lead to cell death. Tyrosyl-DNA phosphodiesterase can hydrolyse the bond between topoisomerase I and DNA.

TEAD4

  • refseq_TEAD4.F2 refseq_TEAD4.R2 106 235
  • NCBIGene 36.3 7004
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003213

  • Changed! pfam TEA 353aa 1e-130 in ref transcript
    • TEA/ATTS domain family.
  • Changed! pfam TEA 310aa 1e-128 in modified transcript

TEAD4

  • refseq_TEAD4.F3 refseq_TEAD4.R3 109 364
  • NCBIGene 36.3 7004
  • Single exon skipping, size difference: 255
  • Exclusion of the protein initiation site
  • Reference transcript: NM_003213

  • Changed! pfam TEA 353aa 1e-130 in ref transcript
    • TEA/ATTS domain family.
  • Changed! pfam TEA 270aa 1e-107 in modified transcript

TEPP

  • refseq_TEPP.F2 refseq_TEPP.R2 106 187
  • NCBIGene 36.3 374739
  • Alternative 3-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199046

TEX11

  • refseq_TEX11.F1 refseq_TEX11.R1 205 327
  • NCBIGene 36.3 56159
  • Single exon skipping, size difference: 122
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001003811

  • pfam SPO22 257aa 1e-58 in ref transcript
    • Meiosis protein SPO22/ZIP4 like. SPO22/ZIP4 in yeast is a meiosis specific protein involved in sporulation. It has been shown to regulate crossover distribution by promoting synaptonemal complex formation.

TEX14

  • refseq_TEX14.F1 refseq_TEX14.R1 123 243
  • NCBIGene 36.3 56155
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198393

  • cd S_TKc 234aa 5e-19 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd ANK 109aa 5e-17 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Pkinase_Tyr 214aa 9e-23 in ref transcript
    • Protein tyrosine kinase.
  • COG SPS1 328aa 1e-08 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG Arp 141aa 5e-08 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

TFEC

  • refseq_TFEC.F2 refseq_TFEC.R2 255 342
  • NCBIGene 36.3 22797
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012252

  • cd HLH 61aa 1e-05 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • Changed! pfam HLH 54aa 3e-04 in ref transcript
    • Helix-loop-helix DNA-binding domain.
  • Changed! smart HLH 53aa 2e-04 in modified transcript
    • helix loop helix domain.

TFG

  • refseq_TFG.F2 refseq_TFG.R2 171 461
  • NCBIGene 36.3 10342
  • Alternative 5-prime, size difference: 290
  • Exclusion in 5'UTR
  • Reference transcript: NM_006070

  • cd PB1_TFG 81aa 4e-41 in ref transcript
    • The PB1 domain found in TFG protein, an oncogenic gene product and fusion partner to nerve growth factor tyrosine kinase receptor TrkA and to the tyrosine kinase ALK. The PB1 domain is a modular domain mediating specific protein-protein interaction in many critical cell processes, such as osteoclastogenesis, angiogenesis, early cardiovascular development and cell polarity. A canonical PB1-PB1 interaction, which involves heterodimerization of two PB1 domains, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The PB1 domains of TFG represent a type I/II PB1 domain. The physiological function of TFG remains unknown.
  • smart PB1 77aa 5e-16 in ref transcript
    • PB1 domain. Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate.

TFIP11

  • refseq_TFIP11.F2 refseq_TFIP11.R2 267 344
  • NCBIGene 36.3 24144
  • Single exon skipping, size difference: 77
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008697

  • pfam TFP11 235aa 1e-106 in ref transcript
    • Tuftelin interacting protein 11. This family contains tuftelin interacting protein 11 which has been identified as both a nuclear and cytoplasmic protein, and has been implicated in the secretory pathway. Sip1, a septin interacting protein is also a member of this family. Proteins in this family often contain pfam01595.
  • pfam G-patch 45aa 4e-13 in ref transcript
    • G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.

TGIF1

  • refseq_TGIF.F2 refseq_TGIF.R2 119 303
  • NCBIGene 36.3 7050
  • Single exon skipping, size difference: 184
  • Exclusion of the protein initiation site
  • Reference transcript: NM_173208

  • cd homeodomain 53aa 0.003 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • smart HOX 53aa 0.001 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.

TGM5

  • refseq_TGM5.F1 refseq_TGM5.R1 142 388
  • NCBIGene 36.3 9333
  • Single exon skipping, size difference: 246
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201631

  • pfam Transglut_C 97aa 9e-23 in ref transcript
    • Transglutaminase family, C-terminal ig like domain.
  • Changed! pfam Transglut_N 117aa 3e-22 in ref transcript
    • Transglutaminase family.
  • smart TGc 94aa 4e-20 in ref transcript
    • Transglutaminase/protease-like homologues. Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
  • pfam Transglut_C 100aa 4e-16 in ref transcript
  • COG COG1305 98aa 3e-04 in ref transcript
    • Transglutaminase-like enzymes, putative cysteine proteases [Amino acid transport and metabolism].
  • Changed! pfam Transglut_N 61aa 1e-11 in modified transcript

THADA

  • refseq_THADA.F2 refseq_THADA.R2 121 303
  • NCBIGene 36.2 63892
  • Multiple exon skipping, size difference: 182
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022065

  • pfam DUF2428 304aa 3e-68 in ref transcript
    • Putative death-receptor fusion protein (DUF2428). This is a family of proteins conserved from plants to humans. The function is not known. Several members have been annotated as being HEAT repeat-containing proteins while others are designated as death-receptor interacting proteins, but neither of these could be confirmed.
  • Changed! COG COG5543 475aa 5e-37 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! COG COG5543 239aa 2e-19 in modified transcript

THAP1

  • refseq_THAP1.F1 refseq_THAP1.R1 109 305
  • NCBIGene 36.3 55145
  • Single exon skipping, size difference: 196
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_018105

  • Changed! pfam THAP 79aa 2e-16 in ref transcript
    • THAP domain. The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologs of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes.

THEG

  • refseq_THEG.F1 refseq_THEG.R1 131 203
  • NCBIGene 36.3 51298
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016585

THOC5

  • refseq_THOC5.F2 refseq_THOC5.R2 108 224
  • NCBIGene 36.3 8563
  • Single exon skipping, size difference: 116
  • Exclusion in 5'UTR
  • Reference transcript: NM_001002877

  • pfam FimP 358aa 1e-128 in ref transcript
    • Fms-interacting protein. This entry carries part of the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C-terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes.

THOC5

  • refseq_THOC5.F4 refseq_THOC5.R3 197 313
  • NCBIGene 36.3 8563
  • Single exon skipping, size difference: 116
  • Exclusion in 5'UTR
  • Reference transcript: NM_001002878

  • pfam FimP 358aa 1e-128 in ref transcript
    • Fms-interacting protein. This entry carries part of the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C-terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes.

THSD1

  • refseq_THSD1.F1 refseq_THSD1.R1 228 387
  • NCBIGene 36.3 55901
  • Single exon skipping, size difference: 159
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018676

  • Changed! smart TSP1 50aa 5e-10 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.

THSD3

  • refseq_THSD3.F2 refseq_THSD3.R2 243 348
  • NCBIGene 36.3 145501
  • Single exon skipping, size difference: 105
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_199296

  • Changed! pfam AMOP 158aa 2e-40 in ref transcript
    • AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges.
  • Changed! smart TSP1 41aa 3e-06 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.

THSD3

  • refseq_THSD3.F3 refseq_THSD3.R3 100 446
  • NCBIGene 36.3 145501
  • Single exon skipping, size difference: 346
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_199296

  • Changed! pfam AMOP 158aa 2e-40 in ref transcript
    • AMOP domain. This domain may have a role in cell adhesion. It is called the AMOP domain after Adhesion associated domain in MUC4 and Other Proteins. This domain is extracellular and contains a number of cysteines that probably form disulphide bridges.
  • Changed! smart TSP1 41aa 3e-06 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.

THYN1

  • refseq_THYN1.F1 refseq_THYN1.R1 170 321
  • NCBIGene 36.3 29087
  • Single exon skipping, size difference: 151
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014174

  • Changed! pfam DUF55 134aa 3e-29 in ref transcript
    • Protein of unknown function DUF55. This family of proteins have no known function.
  • Changed! COG COG2947 169aa 3e-46 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam DUF55 66aa 4e-16 in modified transcript
  • Changed! COG COG2947 105aa 2e-34 in modified transcript

TIAL1

  • refseq_TIAL1.F1 refseq_TIAL1.R1 281 332
  • NCBIGene 36.3 7073
  • Alternative 3-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001033925

  • cd RRM 74aa 2e-19 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! cd RRM 90aa 2e-16 in ref transcript
  • cd RRM 69aa 3e-13 in ref transcript
  • Changed! TIGR PABP-1234 275aa 3e-28 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • Changed! COG COG0724 277aa 2e-20 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! cd RRM 73aa 3e-18 in modified transcript
  • Changed! TIGR PABP-1234 258aa 3e-29 in modified transcript
  • Changed! COG COG0724 192aa 3e-20 in modified transcript

TJP1

  • refseq_TJP1.F1 refseq_TJP1.R1 145 385
  • NCBIGene 36.3 7082
  • Single exon skipping, size difference: 240
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003257

  • cd PDZ_signaling 86aa 3e-14 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 79aa 1e-10 in ref transcript
  • cd PDZ_signaling 78aa 7e-10 in ref transcript
  • pfam ZU5 103aa 7e-35 in ref transcript
    • ZU5 domain. Domain present in ZO-1 and Unc5-like netrin receptors Domain of unknown function.
  • smart GuKc 176aa 5e-34 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • smart PDZ 91aa 4e-14 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 81aa 1e-12 in ref transcript
  • pfam PDZ 77aa 3e-12 in ref transcript
    • PDZ domain (Also known as DHR or GLGF). PDZ domains are found in diverse signaling proteins.
  • pfam SH3_2 62aa 3e-06 in ref transcript
    • Variant SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
  • TIGR degP_htrA_DO 214aa 7e-06 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • TIGR degP_htrA_DO 131aa 0.004 in ref transcript
  • COG Prc 85aa 5e-08 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 60aa 0.006 in ref transcript
  • COG Prc 58aa 0.009 in ref transcript

TLR8

  • refseq_TLR8.F1 refseq_TLR8.R1 142 279
  • NCBIGene 36.2 51311
  • Single exon skipping, size difference: 137
  • Exclusion of the protein initiation site
  • Reference transcript: NM_016610

  • cd LRR_RI 214aa 1e-06 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • pfam TIR 139aa 4e-15 in ref transcript
    • TIR domain. The Toll/interleukin-1 receptor (TIR) homology domain is an intracellular signalling domain found in MyD88, interleukin 1 receptor and the Toll receptor. It contains three highly-conserved regions, and mediates protein-protein interactions between the Toll-like receptors (TLRs) and signal-transduction components. TIR-like motifs are also found in plant proteins thought to be involved in resistance to disease. When activated, TIR domains recruit cytoplasmic adaptor proteins MyD88 and TOLLIP (Toll interacting protein). In turn, these associate with various kinases to set off signalling cascades.
  • TIGR PCC 84aa 2e-06 in ref transcript
    • Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.

TM2D2

  • refseq_TM2D2.F1 refseq_TM2D2.R1 100 489
  • NCBIGene 36.3 83877
  • Alternative 3-prime, size difference: 389
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_078473

  • Changed! pfam TM2 50aa 2e-11 in ref transcript
    • TM2 domain. This family is composed of a pair of transmembrane alpha helices connected by a short linker. The function of this domain is unknown, however it occurs in a wide range or protein contexts.
  • Changed! PHA PHA01886 53aa 9e-04 in ref transcript
    • TM2 domain-containing protein.

TM2D2

  • refseq_TM2D2.F3 refseq_TM2D2.R3 172 378
  • NCBIGene 36.3 83877
  • Alternative 5-prime, size difference: 206
  • Exclusion in 5'UTR
  • Reference transcript: NM_031940

  • pfam TM2 50aa 1e-10 in ref transcript
    • TM2 domain. This family is composed of a pair of transmembrane alpha helices connected by a short linker. The function of this domain is unknown, however it occurs in a wide range or protein contexts.
  • PHA PHA01886 53aa 0.001 in ref transcript
    • TM2 domain-containing protein.

TM2D3

  • refseq_TM2D3.F1 refseq_TM2D3.R1 101 179
  • NCBIGene 36.3 80213
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_078474

  • pfam TM2 50aa 2e-06 in ref transcript
    • TM2 domain. This family is composed of a pair of transmembrane alpha helices connected by a short linker. The function of this domain is unknown, however it occurs in a wide range or protein contexts.

TM6SF2

  • refseq_TM6SF2.F1 refseq_TM6SF2.R1 162 287
  • NCBIGene 36.2 53345
  • Single exon skipping, size difference: 125
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001001524

TMEM10

  • refseq_TMEM10.F1 refseq_TMEM10.R1 105 144
  • NCBIGene 36.3 93377
  • Single exon skipping, size difference: 39
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_033207

TMEM107

  • refseq_TMEM107.F1 refseq_TMEM107.R1 101 119
  • NCBIGene 36.3 84314
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032354

TMEM150

  • refseq_TMEM150.F1 refseq_TMEM150.R1 151 219
  • NCBIGene 36.3 129303
  • Single exon skipping, size difference: 68
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001031738

  • Changed! pfam Frag1 229aa 2e-44 in ref transcript
    • Frag1/DRAM/Sfk1 family. This family includes Frag1, DRAM and Sfk1 proteins. Frag1 (FGF receptor activating protein 1) is a protein that is conserved from fungi to humans. There are four potential iso-prenylation sites throughout the peptide, viz CILW, CIIW and CIGL. Frag1 is a membrane-spanning protein that is ubiquitously expressed in adult tissues suggesting an important cellular function. Dram is a family of proteins conserved from nematodes to humans with six hydrophobic transmembrane regions and an Endoplasmic Reticulum signal peptide. It is a lysosomal protein that induces macro-autophagy as an effector of p53-mediated death, where p53 is the tumour-suppressor gene that is frequently mutated in cancer. Expression of Dram is stress-induced. This region is also part of a family of small plasma membrane proteins, referred to as Sfk1, that may act together with or upstream of Stt4p to generate normal levels of the essential phospholipid PI4P, thus allowing proper localisation of Stt4p to the actin cytoskeleton.
  • Changed! pfam Frag1 66aa 2e-07 in modified transcript

TMEM44

  • refseq_TMEM44.F1 refseq_TMEM44.R1 108 141
  • NCBIGene 36.3 93109
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138399

TMEM93

  • refseq_TMEM93.F1 refseq_TMEM93.R1 229 342
  • NCBIGene 36.3 83460
  • Alternative 5-prime, size difference: 113
  • Exclusion in 5'UTR
  • Reference transcript: NM_001014764

  • pfam DUF786 110aa 5e-25 in ref transcript
    • Protein of unknown function (DUF786). This family consists of several eukaryotic proteins of unknown function.

TMEM98

  • refseq_TMEM98.F1 refseq_TMEM98.R1 199 275
  • NCBIGene 36.3 26022
  • Single exon skipping, size difference: 76
  • Exclusion in 5'UTR
  • Reference transcript: NM_015544

TMPRSS4

  • refseq_TMPRSS4.F1 refseq_TMPRSS4.R1 150 293
  • NCBIGene 36.2 56649
  • Single exon skipping, size difference: 143
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_019894

  • Changed! cd Tryp_SPc 228aa 2e-68 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • cd LDLa 35aa 0.002 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • Changed! smart Tryp_SPc 226aa 2e-77 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • smart SR 91aa 9e-07 in ref transcript
    • Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
  • pfam Ldl_recept_a 22aa 0.005 in ref transcript
    • Low-density lipoprotein receptor domain class A.
  • Changed! COG COG5640 228aa 8e-26 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Tryp_SPc 132aa 1e-32 in modified transcript
  • Changed! smart Tryp_SPc 134aa 1e-36 in modified transcript
  • Changed! COG COG5640 133aa 2e-08 in modified transcript

TMUB2

  • refseq_TMUB2.F1 refseq_TMUB2.R1 112 401
  • NCBIGene 36.2 79089
  • Alternative 5-prime, size difference: 289
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_177441

  • Changed! cd UBL 63aa 1e-05 in ref transcript
    • UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
  • Changed! pfam ubiquitin 51aa 5e-08 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.

TNFRSF18

  • refseq_TNFRSF18.F1 refseq_TNFRSF18.R1 101 122
  • NCBIGene 36.3 8784
  • Alternative 5-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004195

TNFRSF25

  • refseq_TNFRSF25.F1 refseq_TNFRSF25.R1 153 180
  • NCBIGene 36.3 8718
  • Alternative 5-prime, size difference: 7
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_148965

  • Changed! cd TNFR 107aa 4e-13 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! smart DEATH 85aa 6e-12 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • Changed! cd TNFR 88aa 3e-09 in modified transcript

TNFRSF25

  • refseq_TNFRSF25.F1 refseq_TNFRSF25.R5 107 127
  • NCBIGene 36.3 8718
  • Alternative 3-prime, size difference: 20
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_148965

  • Changed! cd TNFR 107aa 4e-13 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! smart DEATH 85aa 6e-12 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • Changed! cd TNFR 88aa 2e-08 in modified transcript

TNFRSF25

  • refseq_TNFRSF25.F4 refseq_TNFRSF25.R4 235 370
  • NCBIGene 36.3 8718
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_148965

  • Changed! cd TNFR 107aa 4e-13 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! smart DEATH 85aa 6e-12 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • Changed! cd TNFR 100aa 0.001 in modified transcript
  • Changed! pfam Death 80aa 6e-12 in modified transcript
    • Death domain.

TNFSF13

  • refseq_TNFSF13.F2 refseq_TNFSF13.R2 217 265
  • NCBIGene 36.3 8741
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003808

  • Changed! cd TNF 132aa 2e-18 in ref transcript
    • Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
  • pfam TNF 112aa 3e-17 in ref transcript
    • TNF(Tumour Necrosis Factor) family.
  • Changed! cd TNF 118aa 2e-17 in modified transcript

TNFSF14

  • refseq_TNFSF14.F2 refseq_TNFSF14.R2 123 231
  • NCBIGene 36.3 8740
  • Alternative 5-prime, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003807

  • cd TNF 145aa 2e-23 in ref transcript
    • Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
  • smart TNF 129aa 2e-25 in ref transcript
    • Tumour necrosis factor family. Family of cytokines that form homotrimeric or heterotrimeric complexes. TNF mediates mature T-cell receptor-induced apoptosis through the p75 TNF receptor.

TNK2

  • refseq_TNK2.F1 refseq_TNK2.R1 328 418
  • NCBIGene 36.3 10188
  • Single exon skipping, size difference: 90
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001010938

  • cd PTKc_Ack_like 258aa 1e-147 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Activated Cdc42-associated kinase. Protein Tyrosine Kinase (PTK) family; Activated Cdc42-associated kinase (Ack) subfamily; catalytic (c) domain. Ack subfamily members include Ack1, thirty-eight-negative kinase 1 (Tnk1), and similar proteins. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Ack subfamily members are cytoplasmic (or nonreceptor) tyr kinases containing an N-terminal catalytic domain, an SH3 domain, a Cdc42-binding CRIB domain, and a proline-rich region. They are mainly expressed in brain and skeletal tissues and are involved in the regulation of cell adhesion and growth, receptor degradation, and axonal guidance. Ack1 is also associated with androgen-independent prostate cancer progression. Tnk1 regulates TNFalpha signaling and may play an important role in cell death.
  • cd SH3 52aa 6e-06 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart TyrKc 260aa 1e-105 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • pfam GTPase_binding 68aa 2e-24 in ref transcript
    • GTPase binding. The GTPase binding domain binds to the G protein Cdc42, inhibiting both its intrinsic and stimulated GTPase activity. The domain is largely unstructured in the absence of Cdc42.
  • smart SH3 54aa 2e-06 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • COG SPS1 266aa 6e-25 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! COG DedD 114aa 0.010 in modified transcript
    • Uncharacterized protein conserved in bacteria [Function unknown].

TNK2

  • refseq_TNK2.F3 refseq_TNK2.R3 197 242
  • NCBIGene 36.3 10188
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001010938

  • cd PTKc_Ack_like 258aa 1e-147 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Activated Cdc42-associated kinase. Protein Tyrosine Kinase (PTK) family; Activated Cdc42-associated kinase (Ack) subfamily; catalytic (c) domain. Ack subfamily members include Ack1, thirty-eight-negative kinase 1 (Tnk1), and similar proteins. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Ack subfamily members are cytoplasmic (or nonreceptor) tyr kinases containing an N-terminal catalytic domain, an SH3 domain, a Cdc42-binding CRIB domain, and a proline-rich region. They are mainly expressed in brain and skeletal tissues and are involved in the regulation of cell adhesion and growth, receptor degradation, and axonal guidance. Ack1 is also associated with androgen-independent prostate cancer progression. Tnk1 regulates TNFalpha signaling and may play an important role in cell death.
  • cd SH3 52aa 6e-06 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart TyrKc 260aa 1e-105 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • pfam GTPase_binding 68aa 2e-24 in ref transcript
    • GTPase binding. The GTPase binding domain binds to the G protein Cdc42, inhibiting both its intrinsic and stimulated GTPase activity. The domain is largely unstructured in the absence of Cdc42.
  • smart SH3 54aa 2e-06 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • COG SPS1 266aa 6e-25 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

TNRC5

  • refseq_TNRC5.F1 refseq_TNRC5.R1 171 295
  • NCBIGene 36.2 10695
  • Single exon skipping, size difference: 124
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006586

TNRC6A

  • refseq_TNRC6A.F1 refseq_TNRC6A.R1 148 273
  • NCBIGene 36.2 27327
  • Single exon skipping, size difference: 125
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_020847

  • Changed! cd RRM 68aa 3e-05 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! pfam Ago_hook 69aa 6e-11 in ref transcript
    • Argonaute hook. This region has been called the argonaute hook. It has been shown to bind to the Piwi domain pfam02171 of Argnonaute proteins.

TPD52L1

  • refseq_TPD52L1.F1 refseq_TPD52L1.R1 285 346
  • NCBIGene 36.3 7164
  • Single exon skipping, size difference: 61
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003287

  • Changed! pfam TPD52 185aa 4e-39 in ref transcript
    • Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
  • Changed! pfam TPD52 128aa 7e-27 in modified transcript

TPD52L2

  • refseq_TPD52L2.F1 refseq_TPD52L2.R1 108 135
  • NCBIGene 36.3 7165
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199360

  • Changed! pfam TPD52 163aa 4e-26 in ref transcript
    • Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
  • Changed! pfam TPD52 154aa 7e-27 in modified transcript

TPM1

  • refseq_TPM1.F1 refseq_TPM1.R1 151 199
  • NCBIGene 36.3 7168
  • Alternative 5-prime, size difference: 48
  • Exclusion of the stop codon
  • Reference transcript: NM_000366

  • Changed! pfam Tropomyosin 236aa 3e-45 in ref transcript
    • Tropomyosin.
  • Changed! pfam Tropomyosin 237aa 1e-45 in modified transcript

TPO

  • refseq_TPO.F1 refseq_TPO.R1 225 396
  • NCBIGene 36.3 7173
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000547

  • cd CCP 54aa 3e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd EGF_CA 44aa 3e-06 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • Changed! pfam An_peroxidase 559aa 0.0 in ref transcript
    • Animal haem peroxidase.
  • pfam EGF_CA 43aa 2e-09 in ref transcript
    • Calcium binding EGF domain.
  • smart CCP 53aa 3e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • Changed! pfam An_peroxidase 502aa 1e-163 in modified transcript

TPO

  • refseq_TPO.F4 refseq_TPO.R4 115 247
  • NCBIGene 36.3 7173
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000547

  • cd CCP 54aa 3e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • Changed! cd EGF_CA 44aa 3e-06 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • pfam An_peroxidase 559aa 0.0 in ref transcript
    • Animal haem peroxidase.
  • Changed! pfam EGF_CA 43aa 2e-09 in ref transcript
    • Calcium binding EGF domain.
  • smart CCP 53aa 3e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.

TPO

  • refseq_TPO.F6 refseq_TPO.R6 221 351
  • NCBIGene 36.2 7173
  • Single exon skipping, size difference: 130
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000547

  • cd CCP 54aa 3e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd EGF_CA 44aa 3e-06 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • pfam An_peroxidase 559aa 0.0 in ref transcript
    • Animal haem peroxidase.
  • pfam EGF_CA 43aa 2e-09 in ref transcript
    • Calcium binding EGF domain.
  • smart CCP 53aa 3e-06 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.

TPTE

  • refseq_TPTE.F2 refseq_TPTE.R2 213 273
  • NCBIGene 36.3 7179
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199259

  • cd PTPc 53aa 0.002 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • pfam PTEN_C2 131aa 5e-30 in ref transcript
    • C2 domain of PTEN tumour-suppressor protein. This is the C2 domain-like domain, in greek key form, of the PTEN protein, phosphatidyl-inositol triphosphate phosphatase, and it is the C-terminus. This domain may well include a CBR3 loop which means it plays a central role in membrane binding. This domain associates across an extensive interface with the N-terminal phosphatase domain DSPc (pfam00782) suggesting that the C2 domain productively positions the catalytic part of the protein on the membrane 1].
  • smart PTPc_motif 59aa 2e-04 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.
  • pfam Y_phosphatase 108aa 7e-04 in ref transcript
    • Protein-tyrosine phosphatase.
  • COG CDC14 87aa 2e-06 in ref transcript
    • Predicted protein-tyrosine phosphatase [Signal transduction mechanisms].

TRAF3IP2

  • refseq_TRAF3IP2.F2 refseq_TRAF3IP2.R2 283 394
  • NCBIGene 36.3 10758
  • Single exon skipping, size difference: 111
  • Exclusion of the protein initiation site
  • Reference transcript: NM_147200

  • pfam SEFIR 139aa 1e-33 in ref transcript
    • SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.

TRAF6

  • refseq_TRAF6.F2 refseq_TRAF6.R2 296 389
  • NCBIGene 36.3 7189
  • Single exon skipping, size difference: 93
  • Exclusion in 5'UTR
  • Reference transcript: NM_145803

  • cd MATH_TRAF6 150aa 2e-67 in ref transcript
    • Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF6 subfamily, TRAF domain, C-terminal MATH subdomain; composed of proteins with similarity to human TRAF6, including the Drosophila protein DTRAF2. TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF6 is the most divergent in its TRAF domain among the mammalian TRAFs. In addition to mediating TNFR family signaling, it is also an essential signaling molecule of the interleukin-1/Toll-like receptor superfamily. Whereas other TRAF molecules display similar and overlapping TNFR-binding specificities, TRAF6 binds completely different sites on receptors such as CD40 and RANK. TRAF6 serves as a molecular bridge between innate and adaptive immunity and plays a central role in osteoimmunology. DTRAF2, as an activator of nuclear factor-kappaB, plays a pivotal role in Drosophila development and innate immunity. TRAF6 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
  • cd RING 39aa 3e-04 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam MATH 143aa 1e-18 in ref transcript
    • MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
  • pfam zf-TRAF 58aa 9e-14 in ref transcript
    • TRAF-type zinc finger.
  • smart RING 38aa 2e-06 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam zf-TRAF 57aa 1e-04 in ref transcript
  • pfam Myosin_tail_1 46aa 0.007 in ref transcript
    • Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
  • COG COG5222 65aa 1e-04 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

TRAF7

  • refseq_TRAF7.F1 refseq_TRAF7.R1 145 194
  • NCBIGene 36.2 84231
  • Alternative 5-prime and 3-prime, size difference: 49
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032271

  • Changed! cd WD40 281aa 4e-47 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! pfam WD40 39aa 1e-06 in ref transcript
    • WD domain, G-beta repeat.
  • Changed! TIGR rad18 34aa 4e-05 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! pfam WD40 36aa 0.001 in ref transcript
  • Changed! pfam WD40 41aa 0.002 in ref transcript
  • Changed! pfam SMC_N 79aa 0.002 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! COG COG2319 330aa 2e-18 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG COG5152 36aa 4e-04 in ref transcript
    • Uncharacterized conserved protein, contains RING and CCCH-type Zn-fingers [General function prediction only].

TRAPPC2

  • refseq_TRAPPC2.F1 refseq_TRAPPC2.R1 109 251
  • NCBIGene 36.3 6399
  • Single exon skipping, size difference: 142
  • Exclusion in 5'UTR
  • Reference transcript: NM_001011658

  • pfam Sedlin_N 128aa 9e-49 in ref transcript
    • Sedlin, N-terminal conserved region. Mutations in this protein are associated with the X-linked spondyloepiphyseal dysplasia tarda syndrome (OMIM:313400). This family represents an N-terminal conserved region.
  • COG TRS20 133aa 4e-32 in ref transcript
    • Subunit of TRAPP, an ER-Golgi tethering complex [Cell motility and secretion].

TRERF1

  • refseq_TRERF1.F1 refseq_TRERF1.R1 150 200
  • NCBIGene 36.2 55809
  • Alternative 5-prime, size difference: 50
  • Inclusion in 5'UTR
  • Reference transcript: NM_033502

  • pfam ELM2 57aa 9e-10 in ref transcript
    • ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
  • smart SANT 46aa 0.002 in ref transcript
    • SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.

TRERF1

  • refseq_TRERF1.F3 refseq_TRERF1.R3 167 203
  • NCBIGene 36.2 55809
  • Alternative 3-prime, size difference: 36
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_033502

  • pfam ELM2 57aa 9e-10 in ref transcript
    • ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
  • smart SANT 46aa 0.002 in ref transcript
    • SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.

TRERF1

  • refseq_TRERF1.F6 refseq_TRERF1.R6 165 414
  • NCBIGene 36.2 55809
  • Single exon skipping, size difference: 249
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033502

  • pfam ELM2 57aa 9e-10 in ref transcript
    • ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
  • smart SANT 46aa 0.002 in ref transcript
    • SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.

TREX1

  • refseq_TREX1.F1 refseq_TREX1.R1 157 242
  • NCBIGene 36.2 11277
  • Single exon skipping, size difference: 85
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_130384

ATRIP

  • refseq_TREX1.F4 refseq_TREX1.R4 316 397
  • NCBIGene 36.3 84126
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130384

RNF216

  • refseq_TRIAD3.F1 refseq_TRIAD3.R1 193 364
  • NCBIGene 36.3 54476
  • Alternative 3-prime, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207111

TRIM17

  • refseq_TRIM17.F1 refseq_TRIM17.R1 116 152
  • NCBIGene 36.3 51127
  • Alternative 3-prime, size difference: 36
  • Inclusion in 5'UTR
  • Reference transcript: NM_001024940

  • cd RING 53aa 6e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 38aa 2e-05 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • smart SPRY 121aa 5e-25 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • pfam zf-B_box 41aa 2e-07 in ref transcript
    • B-box zinc finger.
  • smart PRY 53aa 2e-07 in ref transcript
    • associated with SPRY domains.
  • smart RING 50aa 2e-06 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG PEX10 51aa 1e-04 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

TRIM24

  • refseq_TRIM24.F1 refseq_TRIM24.R1 159 261
  • NCBIGene 36.3 8805
  • Alternative 5-prime, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015905

  • cd Bromo_tif1_like 108aa 4e-46 in ref transcript
    • Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of the tripartite motif (TRIM) protein family, which is characterized by a particular domain architecture. It functions by recruiting coactivators and/or corepressors to modulate transcription. Vertebrate Tif1-gamma, also labeled E3 ubiquitin-protein ligase TRIM33, plays a role in the control of hematopoiesis. Its homologue in Xenopus laevis, Ectodermin, has been shown to function in germ-layer specification and control of cell growth during embryogenesis. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BBOX 39aa 1e-07 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • cd BAH_plant_2 62aa 0.006 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • smart BBC 127aa 6e-32 in ref transcript
    • B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
  • smart BROMO 103aa 2e-29 in ref transcript
    • bromo domain.
  • pfam zf-B_box 42aa 6e-10 in ref transcript
    • B-box zinc finger.
  • pfam PHD 44aa 4e-08 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • pfam zf-B_box 46aa 0.003 in ref transcript
  • COG COG5076 125aa 3e-14 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].

TRIM3

  • refseq_TRIM3.F1 refseq_TRIM3.R1 203 368
  • NCBIGene 36.3 10612
  • Single exon skipping, size difference: 165
  • Exclusion in 5'UTR
  • Reference transcript: NM_006458

  • cd RING 44aa 2e-08 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 37aa 3e-06 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • cd WD40 261aa 9e-06 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • smart BBC 127aa 2e-32 in ref transcript
    • B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
  • smart IG_FLMN 98aa 2e-22 in ref transcript
    • Filamin-type immunoglobulin domains. These form a rod-like structure in the actin-binding cytoskeleton protein, filamin. The C-terminal repeats of filamin bind beta1-integrin (CD29).
  • smart BBOX 42aa 4e-10 in ref transcript
    • B-Box-type zinc finger.
  • smart RING 41aa 7e-10 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • TIGR rad18 83aa 2e-07 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • pfam NHL 28aa 2e-06 in ref transcript
    • NHL repeat. The NHL (NCL-1, HT2A and LIN-41) repeat is found in multiple tandem copies. It is about 40 residues long and resembles the WD repeat pfam00400. The repeats have a catalytic activity in the peptidyl-glycine alpha-amidating monooxygenase (PAM), proteolysis has shown that the Peptidyl-alpha-hydroxyglycine alpha-amidating lyase (PAL) activity is localised to the repeats. The human tripartite motif-containing protein 32 interacts with the activation domain of Tat. This interaction is me diated by the NHL repeats.
  • pfam NHL 28aa 1e-04 in ref transcript
  • pfam NHL 28aa 3e-04 in ref transcript
  • pfam NHL 28aa 0.002 in ref transcript
  • pfam NHL 28aa 0.003 in ref transcript
  • COG COG3391 246aa 1e-11 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • COG RAD18 79aa 3e-05 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].
  • PRK PRK08268 82aa 0.006 in ref transcript
    • 3-hydroxybutyryl-CoA dehydrogenase; Validated.

TRIM31

  • refseq_TRIM31.F1 refseq_TRIM31.R1 110 177
  • NCBIGene 36.2 11074
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 67
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007028

  • cd RING 45aa 1e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 37aa 7e-04 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • smart RING 41aa 2e-08 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam zf-B_box 42aa 2e-06 in ref transcript
    • B-box zinc finger.
  • COG PEX10 50aa 4e-05 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

TRIM33

  • refseq_TRIM33.F1 refseq_TRIM33.R1 229 280
  • NCBIGene 36.3 51592
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015906

  • Changed! cd Bromo_tif1_like 126aa 8e-48 in ref transcript
    • Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of the tripartite motif (TRIM) protein family, which is characterized by a particular domain architecture. It functions by recruiting coactivators and/or corepressors to modulate transcription. Vertebrate Tif1-gamma, also labeled E3 ubiquitin-protein ligase TRIM33, plays a role in the control of hematopoiesis. Its homologue in Xenopus laevis, Ectodermin, has been shown to function in germ-layer specification and control of cell growth during embryogenesis. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BBOX 39aa 2e-07 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • smart BBC 127aa 2e-27 in ref transcript
    • B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
  • Changed! smart BROMO 122aa 3e-24 in ref transcript
    • bromo domain.
  • pfam zf-B_box 42aa 2e-09 in ref transcript
    • B-box zinc finger.
  • pfam PHD 44aa 3e-07 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • smart BBOX 37aa 0.004 in ref transcript
    • B-Box-type zinc finger.
  • Changed! COG COG5076 121aa 3e-09 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • Changed! cd Bromo_tif1_like 109aa 2e-50 in modified transcript
  • Changed! smart BROMO 105aa 1e-27 in modified transcript
  • Changed! COG COG5076 104aa 3e-12 in modified transcript

TRIM39

  • refseq_TRIM39.F1 refseq_TRIM39.R1 124 214
  • NCBIGene 36.3 56658
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021253

  • cd RING 45aa 5e-08 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 38aa 4e-05 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • smart SPRY 109aa 3e-31 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart PRY 52aa 3e-17 in ref transcript
    • associated with SPRY domains.
  • smart RING 41aa 1e-09 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! TIGR alt_F1F0_F0_B 178aa 1e-05 in ref transcript
    • CC and in principle may run in either direction. This model represents the F0 subunit B of this apparent second ATP synthase.
  • pfam zf-B_box 38aa 1e-04 in ref transcript
    • B-box zinc finger.
  • COG RAD18 151aa 1e-06 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].
  • Changed! PRK PRK02224 125aa 0.001 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! TIGR SMC_prok_A 148aa 8e-05 in modified transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! PRK PRK02224 202aa 3e-04 in modified transcript

TRIM4

  • refseq_TRIM4.F1 refseq_TRIM4.R1 224 302
  • NCBIGene 36.3 89122
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033017

  • cd RING 45aa 8e-08 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 38aa 1e-05 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • smart SPRY 117aa 4e-13 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart RING 41aa 5e-08 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam zf-B_box 38aa 2e-05 in ref transcript
    • B-box zinc finger.
  • smart PRY 70aa 0.002 in ref transcript
    • associated with SPRY domains.
  • COG RAD18 69aa 6e-05 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].

TRIM54

  • refseq_TRIM54.F1 refseq_TRIM54.R1 172 298
  • NCBIGene 36.3 57159
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032546

  • cd RING 60aa 3e-05 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam zf-C3HC4 25aa 0.010 in ref transcript
    • Zinc finger, C3HC4 type (RING finger). The C3HC4 type zinc-finger (RING finger) is a cysteine-rich domain of 40 to 60 residues that coordinates two zinc ions, and has the consensus sequence: C-X2-C-X(9-39)-C-X(1-3)-H-X(2-3)-C-X2-C-X(4-48)-C-X2-C where X is any amino acid. Many proteins containing a RING finger play a key role in the ubiquitination pathway.

TRIM55

  • refseq_TRIM55.F2 refseq_TRIM55.R2 205 293
  • NCBIGene 36.3 84675
  • Single exon skipping, size difference: 88
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_184085

  • cd RING 60aa 4e-06 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 40aa 0.007 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • pfam zf-B_box 40aa 2e-04 in ref transcript
    • B-box zinc finger.
  • Changed! smart RING 56aa 0.001 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • smart BBC 127aa 0.005 in ref transcript
    • B-Box C-terminal domain. Coiled coil region C-terminal to (some) B-Box domains.
  • Changed! smart RING 35aa 0.002 in modified transcript

TRIM7

  • refseq_TRIM7.F1 refseq_TRIM7.R1 121 398
  • NCBIGene 36.3 81786
  • Alternative 5-prime, size difference: 277
  • Exclusion in 5'UTR
  • Reference transcript: NM_203296

  • smart SPRY 107aa 5e-22 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart PRY 48aa 5e-11 in ref transcript
    • associated with SPRY domains.

TRIM7

  • refseq_TRIM7.F2 refseq_TRIM7.R2 226 294
  • NCBIGene 36.3 81786
  • Alternative 5-prime, size difference: 68
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_203293

  • cd RING 56aa 1e-05 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BBOX 37aa 1e-04 in ref transcript
    • B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
  • Changed! smart SPRY 107aa 2e-22 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • Changed! smart PRY 48aa 2e-10 in ref transcript
    • associated with SPRY domains.
  • Changed! smart RING 53aa 2e-06 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • pfam zf-B_box 37aa 3e-06 in ref transcript
    • B-box zinc finger.
  • COG RAD18 65aa 8e-05 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].
  • Changed! PRK PRK08476 53aa 0.001 in modified transcript
    • F0F1 ATP synthase subunit B'; Validated.

TRNT1

  • refseq_TRNT1.F1 refseq_TRNT1.R1 283 343
  • NCBIGene 36.2 51095
  • Alternative 5-prime, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016000

  • pfam PolyA_pol 124aa 3e-32 in ref transcript
    • Poly A polymerase head domain. This family includes nucleic acid independent RNA polymerases, such as Poly(A) polymerase, which adds the poly (A) tail to mRNA EC:2.7.7.19. This family also includes the tRNA nucleotidyltransferase that adds the CCA to the 3' of the tRNA EC:2.7.7.25.
  • Changed! COG PcnB 394aa 2e-51 in ref transcript
    • tRNA nucleotidyltransferase/poly(A) polymerase [Translation, ribosomal structure and biogenesis].
  • Changed! COG PcnB 222aa 3e-49 in modified transcript

TRPC4AP

  • refseq_TRPC4AP.F1 refseq_TRPC4AP.R1 133 157
  • NCBIGene 36.3 26133
  • Alternative 3-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015638

TRPM3

  • refseq_TRPM3.F2 refseq_TRPM3.R2 197 233
  • NCBIGene 36.3 80036
  • Single exon skipping, size difference: 36
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001007471

  • pfam Ion_trans 122aa 4e-07 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • TIGR trp 320aa 4e-05 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).

TRPM3

  • refseq_TRPM3.F3 refseq_TRPM3.R3 149 179
  • NCBIGene 36.3 80036
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001007471

  • pfam Ion_trans 122aa 4e-07 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • TIGR trp 320aa 4e-05 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).

TRPM3

  • refseq_TRPM3.F6 refseq_TRPM3.R6 130 205
  • NCBIGene 36.3 80036
  • Single exon skipping, size difference: 75
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001007471

  • pfam Ion_trans 122aa 4e-07 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • TIGR trp 320aa 4e-05 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).

TRPT1

  • refseq_TRPT1.F2 refseq_TRPT1.R2 101 185
  • NCBIGene 36.3 83707
  • Single exon skipping, size difference: 84
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001033678

  • Changed! pfam PTS_2-RNA 179aa 2e-33 in ref transcript
    • RNA 2'-phosphotransferase, Tpt1 / KptA family. Tpt1 catalyses the last step of tRNA splicing in yeast. It transfers the splice junction 2'-phosphate from ligated tRNA to NAD, to produce ADP-ribose 1"-2"-cyclic phosphate. This is presumed to be followed by a transesterification step to release the RNA. The first step of this reaction is similar to that catalysed by some bacterial toxins. Escherichia coli KptA and mouse Tpt1 are likely to use the same reaction mechanism.
  • Changed! PTZ PTZ00315 180aa 7e-39 in ref transcript
    • 2'-phosphotransferase; Provisional.
  • Changed! pfam PTS_2-RNA 154aa 2e-27 in modified transcript
  • Changed! PTZ PTZ00315 150aa 1e-31 in modified transcript

TRPV4

  • refseq_TRPV4.F2 refseq_TRPV4.R2 165 345
  • NCBIGene 36.3 59341
  • Single exon skipping, size difference: 180
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021625

  • Changed! cd ANK 155aa 6e-11 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! TIGR trp 651aa 1e-137 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! cd ANK 148aa 2e-10 in modified transcript
  • Changed! TIGR trp 591aa 1e-119 in modified transcript

TSGA10

  • refseq_TSGA10.F1 refseq_TSGA10.R1 155 199
  • NCBIGene 36.3 80705
  • Alternative 3-prime, size difference: 44
  • Inclusion in 5'UTR
  • Reference transcript: NM_182911

  • TIGR SMC_prok_B 333aa 2e-12 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • TIGR SMC_prok_B 298aa 9e-09 in ref transcript
  • pfam Macoilin 150aa 0.009 in ref transcript
    • Transmembrane protein. This entry is a highly conserved protein present in eukaryotes.
  • COG Smc 363aa 5e-10 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG COG4372 204aa 2e-05 in ref transcript
    • Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown].

TSPAN17

  • refseq_TSPAN17.F1 refseq_TSPAN17.R1 125 157
  • NCBIGene 36.3 26262
  • Alternative 5-prime, size difference: 32
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_012171

  • cd TM4SF9_like_LEL 124aa 7e-59 in ref transcript
    • Tetraspanin, extracellular domain or large extracellular loop (LEL), TM4SF9_like subfamily. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This subfamily contaions transmembrane 4 superfamily 9 (TM4SF9) or Tetraspanin-5 and related proteins. TM4SF9 is strongly expressed witin the central nervous system, and expression levels appear to correlate with differentiation status of particular neurons, hinting at a role in neuronal maturation.
  • Changed! pfam Tetraspannin 252aa 2e-41 in ref transcript
    • Tetraspanin family.
  • Changed! pfam Tetraspannin 245aa 1e-38 in modified transcript

TSPAN3

  • refseq_TSPAN3.F2 refseq_TSPAN3.R2 187 262
  • NCBIGene 36.3 10099
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005724

  • Changed! cd TM4SF8_like_LEL 107aa 6e-48 in ref transcript
    • Tetraspanin, extracellular domain or large extracellular loop (LEL), TM4SF8_like subfamily. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This subfamily contaions transmembrane 4 superfamily 8 (TM4SF8) or Tspan-3 and related proteins. Tspan-3 has been reported to form a complex with integrin beta1 and OSP/claudin-11, which may be involved in oligodendrocyte proliferation and migration.
  • Changed! pfam Tetraspannin 228aa 1e-35 in ref transcript
    • Tetraspanin family.
  • Changed! cd TM4SF8_like_LEL 102aa 8e-44 in modified transcript
  • Changed! pfam Tetraspannin 203aa 8e-32 in modified transcript

TSPAN4

  • refseq_TSPAN4.F1 refseq_TSPAN4.R1 117 217
  • NCBIGene 36.3 7106
  • Single exon skipping, size difference: 100
  • Exclusion in 5'UTR
  • Reference transcript: NM_003271

  • cd NET-5_like_LEL 98aa 1e-40 in ref transcript
    • Tetraspanin, extracellular domain or large extracellular loop (LEL), NET-5_like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This sub-family contains proteins similar to human tetraspan NET-5.
  • pfam Tetraspannin 222aa 3e-35 in ref transcript
    • Tetraspanin family.

TSPO

  • refseq_TSPO.F1 refseq_TSPO.R1 189 400
  • NCBIGene 36.3 706
  • Single exon skipping, size difference: 211
  • Exclusion of the protein initiation site
  • Reference transcript: NM_000714

  • Changed! pfam TspO_MBR 152aa 1e-43 in ref transcript
    • TspO/MBR family. Tryptophan-rich sensory protein (TspO) is an integral membrane protein that acts as a negative regulator of the expression of specific photosynthesis genes in response to oxygen/light. It is involved in the efflux of porphyrin intermediates from the cell. This reduces the activity of coproporphyrinogen III oxidase, which is thought to lead to the accumulation of a putative repressor molecule that inhibits the expression of specific photosynthesis genes. Several conserved aromatic residues are necessary for TspO function: they are thought to be involved in binding porphyrin intermediates. In, the rat mitochondrial peripheral benzodiazepine receptor (MBR) was shown to not only retain its structure within a bacterial outer membrane, but also to be able to functionally substitute for TspO in TspO- mutants, and to act in a similar manner to TspO in its in situ location: the outer mitochondrial membrane. The biological significance of MBR remains unclear, however. It is thought to be involved in a variety of cellular functions, including cholesterol transport in steroidogenic tissues.
  • Changed! COG COG3476 150aa 5e-19 in ref transcript
    • Tryptophan-rich sensory protein (mitochondrial benzodiazepine receptor homolog) [Signal transduction mechanisms].

TTC8

  • refseq_TTC8.F2 refseq_TTC8.R2 291 381
  • NCBIGene 36.3 123016
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144596

  • cd TPR 93aa 1e-08 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • cd TPR 90aa 1e-07 in ref transcript
  • TIGR PEP_TPR_lipo 148aa 5e-06 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • TIGR type_IV_pilW 161aa 8e-04 in ref transcript
    • Members of this family are designated PilF in ref (PMID:8973346) and PilW in ref (PMID:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
  • COG PilF 202aa 0.001 in ref transcript
    • Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].

TTC8

  • refseq_TTC8.F3 refseq_TTC8.R3 102 132
  • NCBIGene 36.3 123016
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144596

  • cd TPR 93aa 1e-08 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • cd TPR 90aa 1e-07 in ref transcript
  • TIGR PEP_TPR_lipo 148aa 5e-06 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • TIGR type_IV_pilW 161aa 8e-04 in ref transcript
    • Members of this family are designated PilF in ref (PMID:8973346) and PilW in ref (PMID:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
  • COG PilF 202aa 0.001 in ref transcript
    • Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion].

TTLL1

  • refseq_TTLL1.F1 refseq_TTLL1.R1 199 271
  • NCBIGene 36.3 25809
  • Single exon skipping, size difference: 72
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_012263

  • Changed! pfam TTL 306aa 1e-100 in ref transcript
    • Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.

TTLL3

  • refseq_TTLL3.F1 refseq_TTLL3.R1 228 370
  • NCBIGene 36.3 26140
  • Single exon skipping, size difference: 142
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001025930

  • pfam TTL 292aa 7e-90 in ref transcript
    • Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.

TTYH1

  • refseq_TTYH1.F1 refseq_TTYH1.R1 162 208
  • NCBIGene 36.3 57348
  • Single exon skipping, size difference: 46
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001005367

  • Changed! pfam Tweety 398aa 2e-71 in ref transcript
    • Tweety. The tweety (tty) gene has not been characterised at the protein level. However, it is thought to form a membrane protein with five potential membrane-spanning regions. A number of potential functions have been suggested.
  • Changed! pfam Tweety 397aa 3e-71 in modified transcript

TUSC3

  • refseq_TUSC3.F1 refseq_TUSC3.R1 159 224
  • NCBIGene 36.3 7991
  • Single exon skipping, size difference: 65
  • Exclusion of the stop codon
  • Reference transcript: NM_006765

  • pfam OST3_OST6 308aa 1e-119 in ref transcript
    • OST3 / OST6 family. The proteins in this family are part of a complex of eight ER proteins that transfers core oligosaccharide from dolichol carrier to Asn-X-Ser/Thr motifs. This family includes both OST3 and OST6, each of which contains four predicted transmembrane helices. Disruption of OST3 and OST6 leads to a defect in the assembly of the complex. Hence, the function of these genes seems to be essential for recruiting a fully active complex necessary for efficient N-glycosylation.

TXNRD1

  • refseq_TXNRD1.F2 refseq_TXNRD1.R2 279 389
  • NCBIGene 36.3 7296
  • Single exon skipping, size difference: 110
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003330

  • Changed! cd GRX_GRXh_1_2_like 47aa 5e-06 in ref transcript
    • Glutaredoxin (GRX) family, GRX human class 1 and 2 (h_1_2)-like subfamily; composed of proteins similar to human GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH -> GSH reductase -> GSH -> GRX -> protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known including human GRX1 and GRX2, which are members of this subfamily. Also included in this subfamily are the N-terminal GRX domains of proteins similar to human thioredoxin reductase 1 and 3.
  • Changed! TIGR TGR 487aa 0.0 in ref transcript
    • This homodimeric, FAD-containing member of the pyridine nucleotide disulfide oxidoreductase family contains a C-terminal motif Cys-SeCys-Gly, where SeCys is selenocysteine encoded by TGA (in some sequence reports interpreted as a stop codon). In some members of this subfamily, Cys-SeCys-Gly is replaced by Cys-Cys-Gly. The reach of the selenium atom at the C-term arm of the protein is proposed to allow broad substrate specificity.
  • Changed! PTZ PTZ00052 487aa 1e-137 in ref transcript
    • thioredoxin reductase; Provisional.

TXNRD1

  • refseq_TXNRD1.F3 refseq_TXNRD1.R3 156 393
  • NCBIGene 36.3 7296
  • Alternative 5-prime, size difference: 237
  • Exclusion of the protein initiation site
  • Reference transcript: NM_003330

  • Changed! cd GRX_GRXh_1_2_like 47aa 5e-06 in ref transcript
    • Glutaredoxin (GRX) family, GRX human class 1 and 2 (h_1_2)-like subfamily; composed of proteins similar to human GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH -> GSH reductase -> GSH -> GRX -> protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known including human GRX1 and GRX2, which are members of this subfamily. Also included in this subfamily are the N-terminal GRX domains of proteins similar to human thioredoxin reductase 1 and 3.
  • TIGR TGR 487aa 0.0 in ref transcript
    • This homodimeric, FAD-containing member of the pyridine nucleotide disulfide oxidoreductase family contains a C-terminal motif Cys-SeCys-Gly, where SeCys is selenocysteine encoded by TGA (in some sequence reports interpreted as a stop codon). In some members of this subfamily, Cys-SeCys-Gly is replaced by Cys-Cys-Gly. The reach of the selenium atom at the C-term arm of the protein is proposed to allow broad substrate specificity.
  • PTZ PTZ00052 487aa 1e-137 in ref transcript
    • thioredoxin reductase; Provisional.

TYSND1

  • refseq_TYSND1.F1 refseq_TYSND1.R1 103 420
  • NCBIGene 36.3 219743
  • Multiple exon skipping, size difference: 317
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_173555

  • Changed! TIGR protease_degS 165aa 4e-07 in ref transcript
    • This family consists of the periplasmic serine protease DegS (HhoB), a shorter paralog of protease DO (HtrA, DegP) and DegQ (HhoA). It is found in E. coli and several other Proteobacteria of the gamma subdivision. It contains a trypsin domain and a single copy of PDZ domain (in contrast to DegP with two copies). A critical role of this DegS is to sense stress in the periplasm and partially degrade an inhibitor of sigma(E).
  • Changed! COG DegQ 173aa 2e-05 in ref transcript
    • Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [Posttranslational modification, protein turnover, chaperones].

U2AF1

  • refseq_U2AF1.F1 refseq_U2AF1.R1 100 167
  • NCBIGene 36.3 7307
  • Single exon skipping, size difference: 67
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_006758

  • Changed! cd RRM 44aa 3e-04 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM_1 44aa 6e-09 in ref transcript
    • RNA recognition motif.
  • pfam zf-CCCH 27aa 0.010 in ref transcript
    • Zinc finger C-x8-C-x5-C-x3-H type (and similar).

U2AF1

  • refseq_U2AF1.F4 refseq_U2AF1.R4 167 234
  • NCBIGene 36.3 7307
  • Single exon skipping, size difference: 67
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001025203

  • Changed! cd RRM 44aa 3e-04 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM_1 44aa 7e-09 in ref transcript
    • RNA recognition motif.
  • Changed! pfam zf-CCCH 27aa 0.006 in modified transcript
    • Zinc finger C-x8-C-x5-C-x3-H type (and similar).

U2AF1L4

  • refseq_U2AF1L4.F1 refseq_U2AF1L4.R1 102 160
  • NCBIGene 36.3 199746
  • Alternative 5-prime, size difference: 58
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_144987

  • cd RRM 63aa 1e-05 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM_1 64aa 2e-11 in ref transcript
    • RNA recognition motif.
  • Changed! pfam zf-CCCH 27aa 0.002 in ref transcript
    • Zinc finger C-x8-C-x5-C-x3-H type (and similar).
  • Changed! smart RRM_1 66aa 2e-09 in modified transcript

UBA52

  • refseq_UBA52.F1 refseq_UBA52.R1 112 134
  • NCBIGene 36.3 7311
  • Alternative 5-prime, size difference: 22
  • Exclusion in 5'UTR
  • Reference transcript: NM_001033930

  • cd Ubiquitin 76aa 7e-37 in ref transcript
    • Ubiquitin (includes Ubq/RPL40e and Ubq/RPS27a fusions as well as homopolymeric multiubiquitin protein chains).
  • pfam ubiquitin 69aa 2e-25 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
  • pfam Ribosomal_L40e 52aa 2e-18 in ref transcript
    • Ribosomal L40e family. Bovine L40 has been identified as a secondary RNA binding protein. L40 is fused to a ubiquitin protein.
  • PTZ PTZ00044 76aa 7e-20 in ref transcript
    • ubiquitin; Provisional.
  • COG RPL40A 49aa 3e-07 in ref transcript
    • Ribosomal protein L40E [Translation, ribosomal structure and biogenesis].

UBASH3A

  • refseq_UBASH3A.F1 refseq_UBASH3A.R1 192 306
  • NCBIGene 36.3 53347
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018961

  • cd HP_PGM_like 213aa 2e-09 in ref transcript
    • Histidine phosphatase domain found in phosphoglycerate mutases and related proteins, mostly phosphatases; contains a His residue which is phosphorylated during the reaction. Subgroup of the catalytic domain of a functionally diverse set of proteins, most of which are phosphatases. The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. This subgroup contains cofactor-dependent and cofactor-independent phosphoglycerate mutases (dPGM, and BPGM respectively), fructose-2,6-bisphosphatase (F26BP)ase, Sts-1, SixA, and related proteins. Functions include roles in metabolism, signaling, or regulation, for example, F26BPase affects glycolysis and gluconeogenesis through controlling the concentration of F26BP; BPGM controls the concentration of 2,3-BPG (the main allosteric effector of hemoglobin in human blood cells); human Sts-1 is a T-cell regulator; Escherichia coli Six A participates in the ArcB-dependent His-to-Asp phosphorelay signaling system. Deficiency and mutation in many of the human members result in disease, for example erythrocyte BPGM deficiency is a disease associated with a decrease in the concentration of 2,3-BPG.
  • cd SH3 57aa 1e-06 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam PGAM 143aa 2e-14 in ref transcript
    • Phosphoglycerate mutase family. Y019_MYCTU and YK23_YEAST are not included in the Prosite entry. However these sequences are significantly similar and contain identical active site residues.
  • smart SH3 57aa 8e-07 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • COG GpmB 189aa 4e-12 in ref transcript
    • Fructose-2,6-bisphosphatase [Carbohydrate transport and metabolism].
  • COG UBP14 41aa 2e-05 in ref transcript
    • Isopeptidase T [Posttranslational modification, protein turnover, chaperones].

UBA3

  • refseq_UBE1C.F1 refseq_UBE1C.R1 134 176
  • NCBIGene 36.3 9039
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003968

  • cd Uba3_RUB 298aa 1e-157 in ref transcript
    • Ubiquitin activating enzyme (E1) subunit UBA3. UBA3 is part of the heterodimeric activating enzyme (E1), specific for the Rub family of ubiquitin-like proteins (Ubls). E1 enzymes are part of a conjugation cascade to attach Ub or Ubls, covalently to substrate proteins. consisting of activating (E1), conjugating (E2), and/or ligating (E3) enzymes. E1 activates ubiquitin(-like) by C-terminal adenylation, and subsequently forms a highly reactive thioester bond between its catalytic cysteine and Ubls C-terminus. E1 also associates with E2 and promotes ubiquitin transfer to the E2's catalytic cysteine. Post-translational modification by Rub family of ubiquitin-like proteins (Ublps) activates SCF ubiquitin ligases and is involved in cell cycle control, signaling and embryogenesis. UBA3 contains both the nucleotide-binding motif involved in adenylation and the catalytic cysteine involved in the thioester intermediate and Ublp transfer to E2.
  • pfam ThiF 144aa 1e-38 in ref transcript
    • ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family.
  • pfam E2_bind 89aa 4e-25 in ref transcript
    • E2 binding domain. E1 and E2 enzymes play a central role in ubiquitin and ubiquitin-like protein transfer cascades. This is an E2 binding domain that is found on NEDD8 activating E1 enzyme. The domain resembles ubiquitin, and recruits the catalytic core of the E2 enzyme Ubc12 in a similar manner to that in which ubiquitin interacts with ubiquitin binding domains.
  • pfam UBACT 66aa 4e-22 in ref transcript
    • Repeat in ubiquitin-activating (UBA) protein.
  • COG ThiF 167aa 2e-34 in ref transcript
    • Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Coenzyme metabolism].

UBA5

  • refseq_UBE1DC1.F1 refseq_UBE1DC1.R1 97 447
  • NCBIGene 36.3 79876
  • Alternative 5-prime, size difference: 350
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_024818

  • Changed! cd ThiF_MoeB_HesA_family 246aa 2e-53 in ref transcript
    • ThiF_MoeB_HesA. Family of E1-like enzymes involved in molybdopterin and thiamine biosynthesis family. The common reaction mechanism catalyzed by MoeB and ThiF, like other E1 enzymes, begins with a nucleophilic attack of the C-terminal carboxylate of MoaD and ThiS, respectively, on the alpha-phosphate of an ATP molecule bound at the active site of the activating enzymes, leading to the formation of a high-energy acyladenylate intermediate and subsequently to the formation of a thiocarboxylate at the C termini of MoaD and ThiS. MoeB, as the MPT synthase (MoaE/MoaD complex) sulfurase, is involved in the biosynthesis of the molybdenum cofactor, a derivative of the tricyclic pterin, molybdopterin (MPT). ThiF catalyzes the adenylation of ThiS, as part of the biosynthesis pathway of thiamin pyrophosphate (vitamin B1).
  • Changed! pfam ThiF 134aa 4e-23 in ref transcript
    • ThiF family. This family contains a repeated domain in ubiquitin activating enzyme E1 and members of the bacterial ThiF/MoeB/HesA family.
  • Changed! COG ThiF 249aa 7e-20 in ref transcript
    • Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Coenzyme metabolism].

UBE2A

  • refseq_UBE2A.F1 refseq_UBE2A.R1 102 192
  • NCBIGene 36.3 7319
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003336

  • Changed! cd UBCc 135aa 6e-50 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 138aa 3e-58 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 151aa 4e-52 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 105aa 1e-30 in modified transcript
  • Changed! pfam UQ_con 108aa 6e-37 in modified transcript
  • Changed! COG COG5078 121aa 4e-33 in modified transcript

UBE2C

  • refseq_UBE2C.F2 refseq_UBE2C.R2 210 264
  • NCBIGene 36.3 11065
  • Mutually exclusive exon skipping, size difference: 54
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007019

  • Changed! cd UBCc 138aa 6e-45 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 137aa 5e-53 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 140aa 3e-43 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 120aa 8e-13 in modified transcript
  • Changed! pfam UQ_con 119aa 1e-12 in modified transcript
  • Changed! COG COG5078 122aa 3e-07 in modified transcript

UBE2D2

  • refseq_UBE2D2.F1 refseq_UBE2D2.R1 251 415
  • NCBIGene 36.3 7322
  • Single exon skipping, size difference: 164
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_003339

  • Changed! cd UBCc 139aa 1e-50 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 138aa 3e-57 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 147aa 5e-56 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2D3

  • refseq_UBE2D3.F2 refseq_UBE2D3.R2 102 254
  • NCBIGene 36.3 7323
  • Single exon skipping, size difference: 152
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_181893

  • Changed! cd UBCc 137aa 2e-49 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 137aa 4e-55 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 147aa 2e-53 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2D3

  • refseq_UBE2D3.F3 refseq_UBE2D3.R3 184 392
  • NCBIGene 36.3 7323
  • Alternative 3-prime, size difference: 208
  • Inclusion in 5'UTR
  • Reference transcript: NM_003340

  • cd UBCc 139aa 4e-50 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • pfam UQ_con 138aa 4e-56 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • COG COG5078 147aa 7e-55 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2D3

  • refseq_UBE2D3.F5 refseq_UBE2D3.R5 101 151
  • NCBIGene 36.3 7323
  • Single exon skipping, size difference: 50
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_181893

  • Changed! cd UBCc 137aa 2e-49 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 137aa 4e-55 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 147aa 2e-53 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 137aa 8e-50 in modified transcript
  • Changed! pfam UQ_con 137aa 1e-55 in modified transcript
  • Changed! COG COG5078 147aa 3e-54 in modified transcript

UBE2D3

  • refseq_UBE2D3.F7 refseq_UBE2D3.R3 187 395
  • NCBIGene 36.3 7323
  • Alternative 3-prime, size difference: 208
  • Inclusion in 5'UTR
  • Reference transcript: NM_181891

  • cd UBCc 139aa 4e-50 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • pfam UQ_con 138aa 4e-56 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • COG COG5078 147aa 7e-55 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2E1

  • refseq_UBE2E1.F1 refseq_UBE2E1.R1 196 247
  • NCBIGene 36.3 7324
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003341

  • Changed! cd UBCc 139aa 2e-47 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 138aa 4e-54 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 147aa 1e-48 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 121aa 3e-41 in modified transcript
  • Changed! pfam UQ_con 121aa 4e-48 in modified transcript
  • Changed! COG COG5078 125aa 2e-42 in modified transcript

UBE2G1

  • refseq_UBE2G1.F1 refseq_UBE2G1.R1 118 395
  • NCBIGene 36.2 7326
  • Multiple exon skipping, size difference: 277
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003342

  • Changed! cd UBCc 143aa 2e-41 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 143aa 4e-48 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 146aa 9e-45 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 30aa 0.002 in modified transcript
  • Changed! smart UBCc 31aa 3e-05 in modified transcript
    • Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 30aa 4e-05 in modified transcript

UBE2G2

  • refseq_UBE2G2.F1 refseq_UBE2G2.R1 106 221
  • NCBIGene 36.3 7327
  • Single exon skipping, size difference: 115
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_003343

  • Changed! cd UBCc 153aa 3e-43 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 152aa 4e-50 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 163aa 5e-48 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2H

  • refseq_UBE2H.F2 refseq_UBE2H.R2 248 341
  • NCBIGene 36.3 7328
  • Multiple exon skipping, size difference: 93
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003344

  • Changed! cd UBCc 104aa 4e-30 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 104aa 2e-34 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 131aa 2e-29 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 73aa 6e-14 in modified transcript
  • Changed! pfam UQ_con 73aa 8e-17 in modified transcript
  • Changed! COG COG5078 100aa 7e-12 in modified transcript

UBE2I

  • refseq_UBE2I.F1 refseq_UBE2I.R1 104 361
  • NCBIGene 36.3 7329
  • Single exon skipping, size difference: 257
  • Exclusion in 5'UTR
  • Reference transcript: NM_194259

  • cd UBCc 146aa 5e-40 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • pfam UQ_con 145aa 1e-53 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • COG COG5078 156aa 4e-45 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2L3

  • refseq_UBE2L3.F2 refseq_UBE2L3.R2 283 392
  • NCBIGene 36.3 7332
  • Single exon skipping, size difference: 109
  • Exclusion in 5'UTR
  • Reference transcript: NM_198157

UBE2V1

  • refseq_UBE2V1.F1 refseq_UBE2V1.R1 147 296
  • NCBIGene 36.3 7335
  • Single exon skipping, size difference: 149
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_199144

  • Changed! cd UBCc 117aa 1e-11 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! smart UBCc 128aa 3e-25 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic domain homologues. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. This pathway functions in regulating many fundamental processes required for cell viability.TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 131aa 4e-16 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2W

  • refseq_UBE2W.F1 refseq_UBE2W.R1 198 301
  • NCBIGene 36.2 55284
  • Single exon skipping, size difference: 103
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001001481

  • Changed! cd UBCc 115aa 3e-25 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 142aa 3e-30 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 114aa 2e-25 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBE2W

  • refseq_UBE2W.F2 refseq_UBE2W.R2 107 140
  • NCBIGene 36.3 55284
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001481

  • Changed! cd UBCc 115aa 3e-25 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • pfam UQ_con 142aa 3e-30 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 114aa 2e-25 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 114aa 3e-25 in modified transcript
  • Changed! COG COG5078 114aa 2e-25 in modified transcript

UBE3B

  • refseq_UBE3B.F1 refseq_UBE3B.R1 106 451
  • NCBIGene 36.3 89910
  • Alternative 5-prime, size difference: 345
  • Exclusion in 5'UTR
  • Reference transcript: NM_130466

  • cd HECTc 385aa 1e-113 in ref transcript
    • HECT domain; C-terminal catalytic domain of a subclass of Ubiquitin-protein ligase (E3). It binds specific ubiquitin-conjugating enzymes (E2), accepts ubiquitin from E2, transfers ubiquitin to substrate lysine side chains, and transfers additional ubiquitin molecules to the end of growing ubiquitin chains.
  • pfam HECT 331aa 1e-94 in ref transcript
    • HECT-domain (ubiquitin-transferase). The name HECT comes from Homologous to the E6-AP Carboxyl Terminus.
  • COG HUL4 399aa 2e-79 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].

UBL7

  • refseq_UBL7.F1 refseq_UBL7.R1 167 238
  • NCBIGene 36.3 84993
  • Alternative 5-prime, size difference: 71
  • Exclusion in 5'UTR
  • Reference transcript: NM_032907

  • cd BMSC_UbP_N 75aa 1e-29 in ref transcript
    • BMSC_UbP (bone marrow stromal cell-derived ubiquitin-like protein) has an N-terminal ubiquitin-like (UBQ) domain and a C-terminal ubiquitin-associated (UBA) domain, a domain architecture similar to those of the UBIN, Chap1, and ubiquilin proteins. This CD represents the N-terminal ubiquitin-like domain.
  • pfam ubiquitin 54aa 1e-07 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
  • PTZ PTZ00044 34aa 0.001 in ref transcript
    • ubiquitin; Provisional.

UBOX5

  • refseq_UBOX5.F2 refseq_UBOX5.R2 142 304
  • NCBIGene 36.3 22888
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014948

  • cd RING 50aa 0.005 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam U-box 81aa 2e-23 in ref transcript
    • U-box domain. This domain is related to the Ring finger pfam00097 but lacks the zinc binding residues.
  • COG UFD2 79aa 1e-05 in ref transcript
    • Ubiquitin fusion degradation protein 2 [Posttranslational modification, protein turnover, chaperones].

UBQLN1

  • refseq_UBQLN1.F1 refseq_UBQLN1.R1 100 184
  • NCBIGene 36.3 29979
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013438

  • cd hPLIC_N 58aa 4e-24 in ref transcript
    • hPLIC-1 and hPLIC-2 (human homologs of the yeast ubiquitin-like Dsk2 protein) are type2 UBL's (ubiquitin-like) proteins that are thought to serve as adaptors that link the ubiquitination machinery to the proteasome. The hPLIC's have an N-terminal UBL domain that binds the S5a subunit of the proteasome and a C-terminal UBA (ubiquitin-associated) domain that binds a ubiquitylated protein.
  • cd UBA 38aa 6e-04 in ref transcript
    • Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
  • pfam ubiquitin 59aa 3e-12 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
  • smart UBA 37aa 4e-04 in ref transcript
    • Ubiquitin associated domain. Present in Rad23, SNF1-like kinases. The newly-found UBA in p62 is known to bind ubiquitin.

UBXD5

  • refseq_UBXD5.F2 refseq_UBXD5.R2 132 231
  • NCBIGene 36.3 91544
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_183008

  • pfam SEP 72aa 6e-19 in ref transcript
    • SEP domain. The SEP domain is named after Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47. In p47, the SEP domain has been shown to bind to and inhibit the cysteine protease cathepsin L. Most SEP domains are succeeded closely by a UBX domain.
  • PRK bchH 138aa 0.007 in ref transcript
    • magnesium chelatase subunit H; Provisional.

UBXD5

  • refseq_UBXD5.F3 refseq_UBXD5.R3 121 246
  • NCBIGene 36.3 91544
  • Single exon skipping, size difference: 125
  • Exclusion in 5'UTR
  • Reference transcript: NM_183008

  • pfam SEP 72aa 6e-19 in ref transcript
    • SEP domain. The SEP domain is named after Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47. In p47, the SEP domain has been shown to bind to and inhibit the cysteine protease cathepsin L. Most SEP domains are succeeded closely by a UBX domain.
  • PRK bchH 138aa 0.007 in ref transcript
    • magnesium chelatase subunit H; Provisional.

UCRC

  • refseq_UCRC.F1 refseq_UCRC.R1 297 356
  • NCBIGene 36.3 29796
  • Alternative 5-prime, size difference: 59
  • Exclusion in 5'UTR
  • Reference transcript: NM_001003684

UFD1L

  • refseq_UFD1L.F1 refseq_UFD1L.R1 175 208
  • NCBIGene 36.3 7353
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005659

  • Changed! pfam UFD1 181aa 6e-88 in ref transcript
    • Ubiquitin fusion degradation protein UFD1. Post-translational ubiquitin-protein conjugates are recognised for degradation by the ubiquitin fusion degradation (UFD) pathway. Several proteins involved in this pathway have been identified. This family includes UFD1, a 40kD protein that is essential for vegetative cell viability. The human UFD1 gene is expressed at high levels during embryogenesis, especially in the eyes and in the inner ear primordia and is thought to be important in the determination of ectoderm-derived structures, including neural crest cells. In addition, this gene is deleted in the CATCH-22 (cardiac defects, abnormal facies, thymic hypoplasia, cleft palate and hypocalcaemia with deletions on chromosome 22) syndrome. This clinical syndrome is associated with a variety of developmental defects, all characterised by microdeletions on 22q11.2. Two such developmental defects are the DiGeorge syndrome OMIM:188400, and the velo-cardio- facial syndrome OMIM:145410. Several of the abnormalities associated with these conditions are thought to be due to defective neural crest cell differentiation.
  • Changed! COG UFD1 285aa 1e-55 in ref transcript
    • Ubiquitin fusion-degradation protein [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam UFD1 170aa 9e-79 in modified transcript
  • Changed! COG UFD1 274aa 1e-48 in modified transcript

UGCGL1

  • refseq_UGCGL1.F1 refseq_UGCGL1.R1 175 356
  • NCBIGene 36.3 56886
  • Alternative 5-prime, size difference: 181
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_020120

  • Changed! cd GT8_HUGT1_C_like 248aa 1e-145 in ref transcript
    • The C-terminal domain of HUGT1-like is highly homologous to the GT 8 family. C-terminal domain of glycoprotein glucosyltransferase (UGT). UGT is a large glycoprotein whose C-terminus contains the catalytic activity. This catalytic C-terminal domain is highly homologous to Glycosyltransferase Family 8 (GT 8) and contains the DXD motif that coordinates donor sugar binding, characteristic for Family 8 glycosyltransferases. GT 8 proteins are retaining enzymes based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed. The non-catalytic N-terminal portion of the human UTG1 (HUGT1) has been shown to monitor the protein folding status and activate its glucosyltransferase activity.
  • Changed! pfam UDP-g_GGTase 209aa 3e-79 in ref transcript
    • UDP-glucose:Glycoprotein Glucosyltransferase. The N-terminal region of this group of proteins is required for correct folding of the ER UDP-Glc: glucosyltransferase.

UCHL5IP

  • refseq_UIP1.F1 refseq_UIP1.R1 112 289
  • NCBIGene 36.3 55559
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207107

UIP1

  • refseq_UIP1.F3 refseq_UIP1.R3 135 253
  • NCBIGene 36.2 55559
  • Alternative 3-prime, size difference: 118
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_207107

UNC45A

  • refseq_UNC45A.F2 refseq_UNC45A.R2 158 203
  • NCBIGene 36.3 55898
  • Alternative 5-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018671

  • cd TPR 103aa 1e-12 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • cd ARM 120aa 3e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • TIGR 3a0801s09 93aa 5e-10 in ref transcript
  • COG NrfG 95aa 0.001 in ref transcript
    • FOG: TPR repeat [General function prediction only].

UNG2

  • refseq_UNG2.F1 refseq_UNG2.R1 102 478
  • NCBIGene 36.2 10309
  • Alternative 5-prime, size difference: 376
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_021147

  • Changed! cd CYCLIN 90aa 9e-11 in ref transcript
    • Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
  • Changed! pfam Cyclin_N 125aa 5e-24 in ref transcript
    • Cyclin, N-terminal domain. Cyclins regulate cyclin dependent kinases (CDKs). One member is a Uracil-DNA glycosylase that is related to other cyclins. Cyclins contain two domains of similar all-alpha fold, of which this family corresponds with the N-terminal domain.
  • Changed! COG COG5024 175aa 3e-20 in ref transcript
    • Cyclin [Cell division and chromosome partitioning].

UPF3A

  • refseq_UPF3A.F1 refseq_UPF3A.R1 341 440
  • NCBIGene 36.3 65110
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_023011

  • Changed! pfam Smg4_UPF3 163aa 2e-48 in ref transcript
    • Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3.
  • Changed! pfam Smg4_UPF3 130aa 2e-35 in modified transcript

UPK3B

  • refseq_UPK3B.F2 refseq_UPK3B.R2 185 265
  • NCBIGene 36.3 80761
  • Single exon skipping, size difference: 80
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_030570

USF1

  • refseq_USF1.F2 refseq_USF1.R2 165 196
  • NCBIGene 36.3 7391
  • Alternative 5-prime, size difference: 31
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007122

  • Changed! cd HLH 63aa 2e-10 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • Changed! pfam HLH 56aa 3e-12 in ref transcript
    • Helix-loop-helix DNA-binding domain.

USF2

  • refseq_USF2.F1 refseq_USF2.R1 155 356
  • NCBIGene 36.3 7392
  • Single exon skipping, size difference: 201
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003367

  • cd HLH 63aa 7e-11 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam HLH 56aa 5e-12 in ref transcript
    • Helix-loop-helix DNA-binding domain.
  • TIGR DNA_bind_RsfA 63aa 0.006 in ref transcript
    • In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.

USH1C

  • refseq_USH1C.F1 refseq_USH1C.R1 114 223
  • NCBIGene 36.3 10083
  • Single exon skipping, size difference: 109
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_153676

  • cd PDZ_signaling 81aa 7e-17 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 81aa 1e-14 in ref transcript
  • cd PDZ_signaling 88aa 3e-08 in ref transcript
  • smart PDZ 83aa 9e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 72aa 8e-14 in ref transcript
  • TIGR degP_htrA_DO 194aa 2e-10 in ref transcript
    • This family consists of a set proteins various designated DegP, heat shock protein HtrA, and protease DO. The ortholog in Pseudomonas aeruginosa is designated MucD and is found in an operon that controls mucoid phenotype. This family also includes the DegQ (HhoA) paralog in E. coli which can rescue a DegP mutant, but not the smaller DegS paralog, which cannot. Members of this family are located in the periplasm and have separable functions as both protease and chaperone. Members have a trypsin domain and two copies of a PDZ domain. This protein protects bacteria from thermal and other stresses and may be important for the survival of bacterial pathogens.// The chaperone function is dominant at low temperatures, whereas the proteolytic activity is turned on at elevated temperatures.
  • smart PDZ 87aa 6e-09 in ref transcript
  • PRK PRK10942 172aa 2e-05 in ref transcript
    • serine endoprotease; Provisional.

USP1

  • refseq_USP1.F1 refseq_USP1.R1 170 265
  • NCBIGene 36.3 7398
  • Alternative 5-prime, size difference: 95
  • Exclusion in 5'UTR
  • Reference transcript: NM_001017415

  • cd Peptidase_C19O 176aa 2e-84 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19O 208aa 5e-46 in ref transcript
  • cd Peptidase_C19O 38aa 0.001 in ref transcript
  • pfam UCH 196aa 1e-26 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • pfam UCH 157aa 7e-18 in ref transcript
  • COG UBP5 177aa 7e-11 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • COG UBP5 150aa 1e-07 in ref transcript

USP14

  • refseq_USP14.F1 refseq_USP14.R1 129 234
  • NCBIGene 36.3 9097
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005151

  • cd Peptidase_C19A 376aa 1e-113 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyse bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • Changed! cd UBP_N 70aa 8e-05 in ref transcript
    • The UBP (ubiquitin processing protease) domain (also referred to as USP which stands for "ubiquitin-specific protease") is present at in a large family of cysteine proteases that specifically cleave ubiquitin conjugates. This family includes Rpn11, UBP6 (USP14), USP7 (HAUSP). This domain is closely related to the amino-terminal ubiquitin-like domain of BAG1 (Bcl2-associated anthanogene1) protein and is found only in eukaryotes.
  • pfam UCH 379aa 4e-59 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • Changed! smart UBQ 63aa 3e-06 in ref transcript
    • Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
  • Changed! COG UBP12 130aa 4e-07 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • COG COG5077 117aa 3e-06 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBP_N 64aa 0.009 in modified transcript
  • Changed! smart UBQ 53aa 1e-05 in modified transcript
  • Changed! COG UBP12 114aa 4e-07 in modified transcript

USP16

  • refseq_USP16.F2 refseq_USP16.R2 181 244
  • NCBIGene 36.3 10600
  • Single exon skipping, size difference: 63
  • Exclusion in 5'UTR
  • Reference transcript: NM_001032410

  • cd Peptidase_C19K 198aa 7e-64 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19K 206aa 1e-37 in ref transcript
  • pfam UCH 196aa 7e-32 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • pfam UCH 200aa 7e-27 in ref transcript
  • pfam zf-UBP 82aa 7e-08 in ref transcript
    • Zn-finger in ubiquitin-hydrolases and other protein.
  • COG UBP12 204aa 3e-17 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 203aa 1e-16 in ref transcript

USP20

  • refseq_USP20.F1 refseq_USP20.R1 122 220
  • NCBIGene 36.3 10868
  • Alternative 5-prime, size difference: 98
  • Exclusion in 5'UTR
  • Reference transcript: NM_001008563

  • cd Peptidase_C19R 241aa 7e-49 in ref transcript
    • A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19D 103aa 3e-12 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19M 209aa 6e-09 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • pfam UCH 240aa 4e-47 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • pfam UCH 106aa 2e-25 in ref transcript
  • smart DUSP 84aa 2e-25 in ref transcript
    • Domain in ubiquitin-specific proteases.
  • smart DUSP 82aa 2e-19 in ref transcript
  • pfam zf-UBP 64aa 8e-07 in ref transcript
    • Zn-finger in ubiquitin-hydrolases and other protein.
  • COG UBP12 164aa 4e-29 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 108aa 2e-17 in ref transcript
  • COG UBP14 238aa 4e-13 in ref transcript
    • Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 62aa 8e-05 in ref transcript

USP21

  • refseq_USP21.F1 refseq_USP21.R1 250 391
  • NCBIGene 36.3 27005
  • Single exon skipping, size difference: 141
  • Exclusion in 5'UTR
  • Reference transcript: NM_001014443

  • cd Peptidase_C19R 259aa 1e-60 in ref transcript
    • A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19R 22aa 5e-06 in ref transcript
  • cd Peptidase_C19F 52aa 4e-04 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • pfam UCH 344aa 1e-80 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • COG UBP5 346aa 2e-50 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].

USP33

  • refseq_USP33.F1 refseq_USP33.R1 278 457
  • NCBIGene 36.3 23032
  • Single exon skipping, size difference: 179
  • Exclusion of the protein initiation site
  • Reference transcript: NM_015017

  • cd Peptidase_C19R 241aa 7e-50 in ref transcript
    • A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19E 100aa 7e-13 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19M 215aa 5e-11 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • pfam UCH 259aa 4e-49 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • smart DUSP 83aa 1e-24 in ref transcript
    • Domain in ubiquitin-specific proteases.
  • pfam UCH 102aa 2e-23 in ref transcript
  • smart DUSP 70aa 1e-12 in ref transcript
  • pfam zf-UBP 73aa 1e-07 in ref transcript
    • Zn-finger in ubiquitin-hydrolases and other protein.
  • COG UBP12 155aa 1e-30 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 102aa 1e-14 in ref transcript
  • COG UBP14 245aa 8e-14 in ref transcript
    • Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 63aa 6e-05 in ref transcript

USP33

  • refseq_USP33.F3 refseq_USP33.R3 136 160
  • NCBIGene 36.3 23032
  • Alternative 3-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015017

  • Changed! cd Peptidase_C19R 241aa 7e-50 in ref transcript
    • A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19E 100aa 7e-13 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19M 215aa 5e-11 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • Changed! pfam UCH 259aa 4e-49 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • smart DUSP 83aa 1e-24 in ref transcript
    • Domain in ubiquitin-specific proteases.
  • pfam UCH 102aa 2e-23 in ref transcript
  • smart DUSP 70aa 1e-12 in ref transcript
  • pfam zf-UBP 73aa 1e-07 in ref transcript
    • Zn-finger in ubiquitin-hydrolases and other protein.
  • COG UBP12 155aa 1e-30 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 102aa 1e-14 in ref transcript
  • COG UBP14 245aa 8e-14 in ref transcript
    • Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 63aa 6e-05 in ref transcript
  • Changed! cd Peptidase_C19R 233aa 7e-51 in modified transcript
  • Changed! pfam UCH 251aa 5e-50 in modified transcript
  • Changed! COG UBP14 180aa 0.002 in modified transcript

USP4

  • refseq_USP4.F1 refseq_USP4.R1 226 367
  • NCBIGene 36.3 7375
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003363

  • cd Peptidase_C19R 146aa 1e-47 in ref transcript
    • A subfamily of peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19E 170aa 1e-17 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • pfam DUF1055 135aa 2e-57 in ref transcript
    • Domain of Unknown Function (DUF1055). This region is found in Ubiquitin-specific proteases.
  • pfam UCH 150aa 4e-49 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • pfam UCH 181aa 1e-43 in ref transcript
  • smart DUSP 97aa 1e-25 in ref transcript
    • Domain in ubiquitin-specific proteases.
  • Changed! COG UBP12 274aa 3e-65 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 149aa 1e-47 in ref transcript
  • Changed! COG UBP12 150aa 0.009 in ref transcript
  • Changed! COG UBP12 502aa 3e-71 in modified transcript

USP45

  • refseq_USP45.F1 refseq_USP45.R1 209 383
  • NCBIGene 36.2 85015
  • Multiple exon skipping, size difference: 174
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_931094

  • cd Peptidase_C19K 209aa 1e-54 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • Changed! cd Peptidase_C19K 197aa 1e-29 in ref transcript
  • pfam UCH 195aa 1e-27 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • Changed! pfam UCH 197aa 3e-20 in ref transcript
  • pfam zf-UBP 56aa 6e-07 in ref transcript
    • Zn-finger in ubiquitin-hydrolases and other protein.
  • Changed! COG UBP14 244aa 1e-15 in ref transcript
    • Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
  • COG UBP12 241aa 2e-13 in ref transcript
    • Ubiquitin C-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Peptidase_C19K 139aa 5e-21 in modified transcript
  • Changed! pfam UCH 168aa 5e-14 in modified transcript
  • Changed! COG UBP14 255aa 4e-16 in modified transcript

USP9X

  • refseq_USP9X.F1 refseq_USP9X.R1 123 171
  • NCBIGene 36.3 8239
  • Alternative 5-prime, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039590

  • cd peptidase_C19C 404aa 1e-119 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • pfam UCH 399aa 1e-70 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • COG COG5077 447aa 6e-45 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase [Posttranslational modification, protein turnover, chaperones].

VAPA

  • refseq_VAPA.F1 refseq_VAPA.R1 151 286
  • NCBIGene 36.3 9218
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003574

  • pfam Motile_Sperm 104aa 4e-29 in ref transcript
    • MSP (Major sperm protein) domain. Major sperm proteins are involved in sperm motility. These proteins oligomerise to form filaments. This family contains many other proteins.
  • COG SCS2 110aa 9e-20 in ref transcript
    • VAMP-associated protein involved in inositol metabolism [Intracellular trafficking and secretion].

VCL

  • refseq_VCL.F2 refseq_VCL.R2 196 400
  • NCBIGene 36.3 7414
  • Single exon skipping, size difference: 204
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014000

  • pfam Vinculin 358aa 1e-122 in ref transcript
    • Vinculin family.
  • Changed! pfam Vinculin 541aa 2e-95 in ref transcript
  • Changed! pfam Vinculin 241aa 4e-76 in ref transcript
  • Changed! pfam Vinculin 692aa 1e-168 in modified transcript

VKORC1

  • refseq_VKORC1.F1 refseq_VKORC1.R1 259 369
  • NCBIGene 36.3 79001
  • Single exon skipping, size difference: 110
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024006

  • Changed! smart VKc 149aa 3e-20 in ref transcript
    • Family of likely enzymes that includes the catalytic subunit of vitamin K epoxide reductase. Bacterial homologues are fused to members of the thioredoxin family of oxidoreductases.
  • Changed! smart VKc 54aa 2e-06 in modified transcript

VLDLR

  • refseq_VLDLR.F1 refseq_VLDLR.R1 152 236
  • NCBIGene 36.3 7436
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003383

  • cd LDLa 35aa 7e-08 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • cd LDLa 35aa 9e-08 in ref transcript
  • cd LDLa 35aa 1e-07 in ref transcript
  • cd LDLa 32aa 1e-06 in ref transcript
  • cd LDLa 35aa 3e-06 in ref transcript
  • cd LDLa 37aa 4e-04 in ref transcript
  • cd LDLa 31aa 0.001 in ref transcript
  • cd EGF_CA 31aa 0.008 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • pfam Ldl_recept_b 41aa 6e-12 in ref transcript
    • Low-density lipoprotein receptor repeat class B. This domain is also known as the YWTD motif after the most conserved region of the repeat. The YWTD repeat is found in multiple tandem repeats and has been predicted to form a beta-propeller structure.
  • pfam Ldl_recept_a 37aa 5e-10 in ref transcript
    • Low-density lipoprotein receptor domain class A.
  • pfam Ldl_recept_a 36aa 1e-09 in ref transcript
  • pfam Ldl_recept_a 36aa 5e-09 in ref transcript
  • smart LY 43aa 5e-09 in ref transcript
    • Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
  • pfam Ldl_recept_a 37aa 1e-08 in ref transcript
  • pfam Ldl_recept_a 34aa 6e-08 in ref transcript
  • smart LY 43aa 5e-07 in ref transcript
  • pfam Ldl_recept_a 39aa 2e-06 in ref transcript
  • pfam Ldl_recept_a 31aa 7e-06 in ref transcript
  • pfam Ldl_recept_b 40aa 6e-05 in ref transcript
  • pfam Ldl_recept_b 41aa 9e-05 in ref transcript
  • smart EGF_CA 31aa 2e-04 in ref transcript
    • Calcium-binding EGF-like domain.

VNN3

  • refseq_VNN3.F2 refseq_VNN3.R2 195 241
  • NCBIGene 36.3 55350
  • Alternative 3-prime, size difference: 46
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_018399

  • Changed! pfam CN_hydrolase 161aa 7e-17 in ref transcript
    • Carbon-nitrogen hydrolase. This family contains hydrolases that break carbon-nitrogen bonds. The family includes: Nitrilase EC:3.5.5.1, Aliphatic amidase EC:3.5.1.4, Biotidinase EC:3.5.1.12, Beta-ureidopropionase EC:3.5.1.6. Nitrilase-related proteins generally have a conserved E-K-C catalytic triad, and are multimeric alpha-beta-beta-alpha sandwich proteins.
  • Changed! COG COG0388 160aa 3e-12 in ref transcript
    • Predicted amidohydrolase [General function prediction only].
  • Changed! TIGR agmatine_aguB 27aa 0.008 in modified transcript
    • Members of this family are N-carbamoylputrescine amidase (3.5.1.53). Bacterial genes are designated AguB. The AguAB pathway replaces SpeB for conversion of agmatine to putrescine in two steps rather than one.

VPS13A

  • refseq_VPS13A.F1 refseq_VPS13A.R1 115 232
  • NCBIGene 36.3 23230
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033305

  • pfam DUF1162 279aa 2e-82 in ref transcript
    • Protein of unknown function (DUF1162). This family represents a conserved region within several hypothetical eukaryotic proteins. Family members might be vacuolar protein sorting related-proteins.
  • Changed! COG MRS6 2101aa 2e-80 in ref transcript
    • Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
  • COG MRS6 422aa 7e-33 in ref transcript
  • Changed! COG MRS6 2062aa 1e-78 in modified transcript

VPS13C

  • refseq_VPS13C.F1 refseq_VPS13C.R1 271 400
  • NCBIGene 36.3 54832
  • Multiple exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020821

  • pfam DUF1162 284aa 1e-88 in ref transcript
    • Protein of unknown function (DUF1162). This family represents a conserved region within several hypothetical eukaryotic proteins. Family members might be vacuolar protein sorting related-proteins.
  • Changed! COG MRS6 1049aa 1e-82 in ref transcript
    • Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
  • COG MRS6 479aa 1e-32 in ref transcript
  • COG MRS6 888aa 7e-09 in ref transcript
  • Changed! COG MRS6 1006aa 1e-84 in modified transcript

VPS13D

  • refseq_VPS13D.F1 refseq_VPS13D.R1 121 196
  • NCBIGene 36.3 55187
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015378

  • cd UBA 36aa 0.001 in ref transcript
    • Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
  • pfam DUF1162 284aa 6e-52 in ref transcript
    • Protein of unknown function (DUF1162). This family represents a conserved region within several hypothetical eukaryotic proteins. Family members might be vacuolar protein sorting related-proteins.
  • pfam UBA 37aa 2e-04 in ref transcript
    • UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to ubiquitin and the ubiquitination pathway. The structure of the UBA domain consists of a compact three helix bundle. This domain is found at the N terminus of EF-TS hence the name TS-N. The structure of EF-TS is known and this domain is implicated in its interaction with EF-TU. The domain has been found in non EF-TS proteins such as alpha-NAC and MJ0280.
  • COG MRS6 1269aa 1e-53 in ref transcript
    • Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
  • COG MRS6 463aa 3e-35 in ref transcript
  • COG MRS6 167aa 0.004 in ref transcript

VPS24

  • refseq_VPS24.F2 refseq_VPS24.R2 307 368
  • NCBIGene 36.3 51652
  • Single exon skipping, size difference: 61
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016079

  • Changed! pfam Snf7 171aa 9e-29 in ref transcript
    • Snf7. This family of proteins are involved in protein sorting and transport from the endosome to the vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes play an important role in the degradation of both lipids and cellular proteins. In order to perform this degradative function, vacuoles/lysosomes contain numerous hydrolases which have been transported in the form of inactive precursors via the biosynthetic pathway and are proteolytically activated upon delivery to the vacuole/lysosome. The delivery of transmembrane proteins, such as activated cell surface receptors to the lumen of the vacuole/lysosome, either for degradation/downregulation, or in the case of hydrolases, for proper localisation, requires the formation of multivesicular bodies (MVBs). These late endosomal structures are formed by invaginating and budding of the limiting membrane into the lumen of the compartment. During this process, a subset of the endosomal membrane proteins is sorted into the forming vesicles. Mature MVBs fuse with the vacuole/lysosome, thereby releasing cargo containing vesicles into its hydrolytic lumen for degradation. Endosomal proteins that are not sorted into the intralumenal MVB vesicles are either recycled back to the plasma membrane or Golgi complex, or remain in the limiting membrane of the MVB and are thereby transported to the limiting membrane of the vacuole/lysosome as a consequence of fusion. Therefore, the MVB sorting pathway plays a critical role in the decision between recycling and degradation of membrane proteins. A few archaeal sequences are also present within this family.
  • Changed! COG VPS24 124aa 0.001 in ref transcript
    • Conserved protein implicated in secretion [Cell motility and secretion].

VPS41

  • refseq_VPS41.F2 refseq_VPS41.R2 231 306
  • NCBIGene 36.3 27072
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014396

  • smart CLH 138aa 4e-25 in ref transcript
    • Clathrin heavy chain repeat homology.

VPS54

  • refseq_VPS54.F1 refseq_VPS54.R1 110 146
  • NCBIGene 36.3 51542
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016516

  • pfam Vps54 135aa 2e-49 in ref transcript
    • Vps54-like protein. This family contains various proteins that are homologs of the yeast Vps54 protein, such as the rat homolog, the human homolog, and the mouse homolog. In yeast, Vps54 associates with Vps52 and Vps53 proteins to form a trimolecular complex that is involved in protein transport between Golgi, endosomal, and vacuolar compartments. All Vps54 homologs contain a coiled coil region (not found in the region featured in this family) and multiple dileucine motifs.

VRK3

  • refseq_VRK3.F2 refseq_VRK3.R2 250 400
  • NCBIGene 36.3 51231
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016440

  • cd S_TKc 205aa 8e-12 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 187aa 2e-09 in ref transcript
    • Protein kinase domain.
  • COG SPS1 207aa 9e-07 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

VWA1

  • refseq_VWA1.F2 refseq_VWA1.R2 100 495
  • NCBIGene 36.3 64856
  • Alternative 3-prime, size difference: 395
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_022834

  • Changed! cd vWA_collagen 141aa 3e-29 in ref transcript
    • von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.
  • Changed! cd FN3 74aa 4e-06 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • Changed! pfam VWA 141aa 6e-28 in ref transcript
    • von Willebrand factor type A domain.
  • Changed! pfam fn3 69aa 2e-07 in ref transcript
    • Fibronectin type III domain.
  • Changed! pfam fn3 67aa 5e-05 in ref transcript

WAC

  • refseq_WAC.F1 refseq_WAC.R1 103 412
  • NCBIGene 36.3 51322
  • Single exon skipping, size difference: 309
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016628

  • cd WW 30aa 3e-05 in ref transcript
    • Two conserved tryptophans domain; also known as the WWP or rsp5 domain; around 40 amino acids; functions as an interaction module in a diverse set of signalling proteins; binds specific proline-rich sequences but at low affinities compared to other peptide recognition proteins such as antibodies and receptors; WW domains have a single groove formed by a conserved Trp and Tyr which recognizes a pair of residues of the sequence X-Pro; variable loops and neighboring domains confer specificity in this domain; there are five distinct groups based on binding: 1) PPXY motifs 2) the PPLP motif; 3) PGM motifs; 4) PSP or PTP motifs; 5) PR motifs.
  • smart WW 28aa 7e-05 in ref transcript
    • Domain with 2 conserved Trp (W) residues. Also known as the WWP or rsp5 domain. Binds proline-rich polypeptides.
  • COG PRP40 40aa 0.007 in ref transcript
    • Splicing factor [RNA processing and modification].
  • PRK PRK10856 86aa 0.008 in ref transcript
    • hypothetical protein; Provisional.

WARS

  • refseq_WARS.F1 refseq_WARS.R1 145 317
  • NCBIGene 36.3 7453
  • Single exon skipping, size difference: 172
  • Exclusion of the protein initiation site
  • Reference transcript: NM_173701

  • cd TrpRS_core 284aa 3e-74 in ref transcript
    • Tryptophanyl-tRNA synthetase (TrpRS) catalytic core domain. TrpRS is a homodimer, which attaches Tyr to the appropriate tRNA. TrpRS is a class I tRNA synthetases, so it aminoacylates the 2'-OH of the nucleotide at the 3' end of the tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains class I characteristic HIGH and KMSKS motifs, which are involved in ATP binding.
  • Changed! cd WEPRS_RNA 50aa 1e-14 in ref transcript
    • WEPRS_RNA binding domain. This short RNA-binding domain is found in several higher eukaryote aminoacyl-tRNA synthetases (aaRSs). It is found in multiple copies in eukaryotic bifunctional glutamyl-prolyl-tRNA synthetases (EPRS) in a region that separates the N-terminal glutamyl-tRNA synthetase (GluRS) from the C-terminal prolyl-tRNA synthetase (ProRS). It is also found at the N-terminus of vertebrate tryptophanyl-tRNA synthetases (TrpRS). This domain consists of a helix-turn-helix structure, which is similar to other RNA-binding proteins. It is involved in both protein-RNA interactions by binding tRNA and protein-protein interactions, which are important for the formation of aaRSs into multienzyme complexes.
  • TIGR trpS 318aa 1e-105 in ref transcript
    • This model represents tryptophanyl-tRNA synthetase. Some members of the family have a pfam00458 domain amino-terminal to the region described by this HMM.
  • Changed! pfam WHEP-TRS 54aa 3e-17 in ref transcript
    • WHEP-TRS domain.
  • PRK PRK12285 386aa 1e-110 in ref transcript
    • tryptophanyl-tRNA synthetase; Reviewed.

WARS

  • refseq_WARS.F3 refseq_WARS.R3 235 407
  • NCBIGene 36.3 7453
  • Single exon skipping, size difference: 172
  • Exclusion of the protein initiation site
  • Reference transcript: NM_004184

  • cd TrpRS_core 284aa 3e-74 in ref transcript
    • Tryptophanyl-tRNA synthetase (TrpRS) catalytic core domain. TrpRS is a homodimer, which attaches Tyr to the appropriate tRNA. TrpRS is a class I tRNA synthetases, so it aminoacylates the 2'-OH of the nucleotide at the 3' end of the tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains class I characteristic HIGH and KMSKS motifs, which are involved in ATP binding.
  • Changed! cd WEPRS_RNA 50aa 1e-14 in ref transcript
    • WEPRS_RNA binding domain. This short RNA-binding domain is found in several higher eukaryote aminoacyl-tRNA synthetases (aaRSs). It is found in multiple copies in eukaryotic bifunctional glutamyl-prolyl-tRNA synthetases (EPRS) in a region that separates the N-terminal glutamyl-tRNA synthetase (GluRS) from the C-terminal prolyl-tRNA synthetase (ProRS). It is also found at the N-terminus of vertebrate tryptophanyl-tRNA synthetases (TrpRS). This domain consists of a helix-turn-helix structure, which is similar to other RNA-binding proteins. It is involved in both protein-RNA interactions by binding tRNA and protein-protein interactions, which are important for the formation of aaRSs into multienzyme complexes.
  • TIGR trpS 318aa 1e-105 in ref transcript
    • This model represents tryptophanyl-tRNA synthetase. Some members of the family have a pfam00458 domain amino-terminal to the region described by this HMM.
  • Changed! pfam WHEP-TRS 54aa 3e-17 in ref transcript
    • WHEP-TRS domain.
  • PRK PRK12285 386aa 1e-110 in ref transcript
    • tryptophanyl-tRNA synthetase; Reviewed.

WARS2

  • refseq_WARS2.F2 refseq_WARS2.R2 132 161
  • NCBIGene 36.3 10352
  • Alternative 3-prime, size difference: 29
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015836

  • Changed! cd TrpRS_core 278aa 3e-95 in ref transcript
    • Tryptophanyl-tRNA synthetase (TrpRS) catalytic core domain. TrpRS is a homodimer, which attaches Tyr to the appropriate tRNA. TrpRS is a class I tRNA synthetases, so it aminoacylates the 2'-OH of the nucleotide at the 3' end of the tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains class I characteristic HIGH and KMSKS motifs, which are involved in ATP binding.
  • Changed! TIGR trpS 327aa 7e-76 in ref transcript
    • This model represents tryptophanyl-tRNA synthetase. Some members of the family have a pfam00458 domain amino-terminal to the region described by this HMM.
  • Changed! PRK PRK00927 325aa 1e-123 in ref transcript
    • tryptophanyl-tRNA synthetase; Reviewed.
  • Changed! cd TrpRS_core 178aa 1e-70 in modified transcript
  • Changed! TIGR trpS 180aa 3e-60 in modified transcript
  • Changed! PRK PRK00927 178aa 6e-89 in modified transcript

WASF1

  • refseq_WASF1.F2 refseq_WASF1.R2 128 226
  • NCBIGene 36.3 8936
  • Single exon skipping, size difference: 98
  • Exclusion in 5'UTR
  • Reference transcript: NM_003931

WASF1

  • refseq_WASF1.F4 refseq_WASF1.R4 129 274
  • NCBIGene 36.3 8936
  • Single exon skipping, size difference: 145
  • Exclusion in 5'UTR
  • Reference transcript: NM_003931

WBP5

  • refseq_WBP5.F1 refseq_WBP5.R1 250 329
  • NCBIGene 36.3 51186
  • Single exon skipping, size difference: 79
  • Exclusion in 5'UTR
  • Reference transcript: NM_001006612

  • pfam TFA 63aa 6e-18 in ref transcript
    • Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.

EIF4H

  • refseq_WBSCR1.F1 refseq_WBSCR1.R1 144 204
  • NCBIGene 36.3 7458
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022170

  • cd RRM 70aa 2e-10 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM 69aa 6e-11 in ref transcript
    • RNA recognition motif.
  • Changed! COG COG0724 87aa 3e-07 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! COG COG0724 81aa 1e-06 in modified transcript

WDHD1

  • refseq_WDHD1.F1 refseq_WDHD1.R1 143 236
  • NCBIGene 36.3 11169
  • Single exon skipping, size difference: 93
  • Exclusion of the protein initiation site
  • Reference transcript: NM_007086

  • Changed! cd WD40 248aa 5e-27 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd HMG-box 56aa 2e-06 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • smart HMG 58aa 8e-08 in ref transcript
    • high mobility group.
  • pfam WD40 38aa 3e-06 in ref transcript
    • WD domain, G-beta repeat.
  • Changed! smart WD40 31aa 0.001 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • Changed! COG COG2319 289aa 5e-17 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG NHP6B 96aa 0.003 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
  • Changed! cd WD40 161aa 3e-18 in modified transcript
  • Changed! COG COG2319 175aa 4e-08 in modified transcript

WDR16

  • refseq_WDR16.F1 refseq_WDR16.R1 167 357
  • NCBIGene 36.2 146845
  • Single exon skipping, size difference: 190
  • Exclusion of the protein initiation site
  • Reference transcript: NM_145054

  • cd WD40 285aa 6e-34 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 343aa 2e-16 in ref transcript
  • smart WD40 40aa 0.004 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 42aa 0.007 in ref transcript
  • COG COG2319 286aa 2e-21 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG COG2319 353aa 6e-15 in ref transcript

WDR17

  • refseq_WDR17.F1 refseq_WDR17.R1 106 229
  • NCBIGene 36.3 116966
  • Single exon skipping, size difference: 123
  • Exclusion of the protein initiation site
  • Reference transcript: NM_170710

  • cd WD40 291aa 4e-48 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 348aa 2e-17 in ref transcript
  • smart WD40 41aa 4e-06 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 38aa 4e-04 in ref transcript
  • pfam WD40 40aa 0.002 in ref transcript
    • WD domain, G-beta repeat.
  • pfam WD40 32aa 0.003 in ref transcript
  • pfam WD40 40aa 0.004 in ref transcript
  • COG COG2319 328aa 7e-32 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG COG2319 401aa 2e-14 in ref transcript

WDR21A

  • refseq_WDR21A.F2 refseq_WDR21A.R2 197 297
  • NCBIGene 36.3 26094
  • Single exon skipping, size difference: 100
  • Exclusion of the protein initiation site
  • Reference transcript: NM_015604

  • cd WD40 187aa 2e-11 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • COG COG2319 181aa 4e-07 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

WDR21A

  • refseq_WDR21A.F4 refseq_WDR21A.R4 218 401
  • NCBIGene 36.3 26094
  • Multiple exon skipping, size difference: 183
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015604

  • cd WD40 187aa 2e-11 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • COG COG2319 181aa 4e-07 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

WDR21A

  • refseq_WDR21A.F5 refseq_WDR21A.R5 102 429
  • NCBIGene 36.3 26094
  • Multiple exon skipping, size difference: 327
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015604

  • Changed! cd WD40 187aa 2e-11 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! COG COG2319 181aa 4e-07 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! cd WD40 185aa 4e-11 in modified transcript
  • Changed! COG COG2319 202aa 4e-07 in modified transcript

WDR23

  • refseq_WDR23.F2 refseq_WDR23.R2 258 336
  • NCBIGene 36.3 80344
  • Alternative 5-prime and 3-prime, size difference: 78
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_025230

  • cd WD40 343aa 8e-31 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam WD40 38aa 2e-05 in ref transcript
    • WD domain, G-beta repeat.
  • pfam WD40 38aa 0.002 in ref transcript
  • COG COG2319 351aa 9e-19 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

WDR35

  • refseq_WDR35.F1 refseq_WDR35.R1 182 215
  • NCBIGene 36.3 57539
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001006657

  • cd WD40 301aa 2e-11 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • COG COG2319 186aa 7e-11 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

WDR7

  • refseq_WDR7.F2 refseq_WDR7.R2 109 208
  • NCBIGene 36.3 23335
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015285

  • cd WD40 188aa 2e-14 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 266aa 1e-08 in ref transcript
  • cd WD40 70aa 5e-08 in ref transcript
  • smart WD40 39aa 3e-04 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • COG COG2319 252aa 5e-10 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG COG2319 178aa 8e-06 in ref transcript
  • COG COG2319 120aa 2e-04 in ref transcript

WFDC10B

  • refseq_WFDC10B.F1 refseq_WFDC10B.R1 157 312
  • NCBIGene 36.3 280664
  • Single exon skipping, size difference: 155
  • Exclusion of the stop codon
  • Reference transcript: NM_172006

WFDC2

  • refseq_WFDC2.F1 refseq_WFDC2.R1 128 272
  • NCBIGene 36.2 10406
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006103

  • Changed! cd WAP 63aa 4e-04 in ref transcript
    • whey acidic protein-type four-disulfide core domains. Members of the family include whey acidic protein, elafin (elastase-specific inhibitor), caltrin-like protein (a calcium transport inhibitor) and other extracellular proteinase inhibitors. A group of proteins containing 8 characteristically-spaced cysteine residuesforming disulphide bonds, have been termed '4-disulphide core' proteins. Protease inhibition occurs by insertion of the inhibitory loop into the active site pocket and interference with the catalytic residues of the protease.
  • smart WAP 48aa 2e-07 in ref transcript
    • Four-disulfide core domains.
  • Changed! pfam WAP 42aa 0.010 in ref transcript
    • WAP-type (Whey Acidic Protein) 'four-disulfide core'.
  • Changed! cd WAP 50aa 0.001 in modified transcript

WFDC3

  • refseq_WFDC3.F1 refseq_WFDC3.R1 200 335
  • NCBIGene 36.2 140686
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080614

  • pfam WAP 42aa 0.002 in ref transcript
    • WAP-type (Whey Acidic Protein) 'four-disulfide core'.
  • pfam WAP 41aa 0.002 in ref transcript
  • pfam WAP 40aa 0.007 in ref transcript

WFDC3

  • refseq_WFDC3.F3 refseq_WFDC3.R3 100 189
  • NCBIGene 36.2 140686
  • Single exon skipping, size difference: 89
  • Exclusion of the protein initiation site
  • Reference transcript: NM_080614

  • Changed! pfam WAP 42aa 0.002 in ref transcript
    • WAP-type (Whey Acidic Protein) 'four-disulfide core'.
  • Changed! pfam WAP 41aa 0.002 in ref transcript
  • Changed! pfam WAP 40aa 0.007 in ref transcript

WFDC3

  • refseq_WFDC3.F5 refseq_WFDC3.R5 164 311
  • NCBIGene 36.2 140686
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080614

  • Changed! pfam WAP 42aa 0.002 in ref transcript
    • WAP-type (Whey Acidic Protein) 'four-disulfide core'.
  • pfam WAP 41aa 0.002 in ref transcript
  • pfam WAP 40aa 0.007 in ref transcript

WIPI2

  • refseq_WIPI2.F1 refseq_WIPI2.R1 269 323
  • NCBIGene 36.3 26100
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015610

  • Changed! cd WD40 194aa 2e-07 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 236aa 2e-05 in ref transcript
  • Changed! COG COG2319 164aa 7e-05 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! cd WD40 245aa 8e-09 in modified transcript
  • Changed! COG COG2319 242aa 3e-05 in modified transcript

WIPI2

  • refseq_WIPI2.F2 refseq_WIPI2.R2 169 202
  • NCBIGene 36.3 26100
  • Alternative 5-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015610

  • cd WD40 194aa 2e-07 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 236aa 2e-05 in ref transcript
  • COG COG2319 164aa 7e-05 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

WNK3

  • refseq_WNK3.F2 refseq_WNK3.R2 100 241
  • NCBIGene 36.3 65267
  • Alternative 5-prime, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020922

  • cd S_TKc 254aa 5e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 9e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 335aa 1e-27 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

WNK3

  • refseq_WNK3.F3 refseq_WNK3.R3 142 172
  • NCBIGene 36.3 65267
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020922

  • cd S_TKc 254aa 5e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 9e-61 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 335aa 1e-27 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

WRNIP1

  • refseq_WRNIP1.F1 refseq_WRNIP1.R1 180 255
  • NCBIGene 36.3 56897
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020135

  • Changed! cd AAA 131aa 2e-17 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! pfam AAA 137aa 2e-15 in ref transcript
    • ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes.
  • smart ZnF_Rad18 24aa 9e-07 in ref transcript
    • Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
  • Changed! PRK PRK13342 437aa 1e-165 in ref transcript
    • recombination factor protein RarA; Reviewed.
  • COG RAD18 25aa 0.007 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].
  • Changed! cd AAA 99aa 9e-13 in modified transcript
  • Changed! pfam AAA 76aa 6e-12 in modified transcript
  • Changed! TIGR dnaX_nterm 203aa 1e-06 in modified transcript
    • This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
  • Changed! PRK PRK13342 412aa 1e-144 in modified transcript

XRCC4

  • refseq_XRCC4.F2 refseq_XRCC4.R2 167 214
  • NCBIGene 36.3 7518
  • Alternative 5-prime, size difference: 47
  • Inclusion in 5'UTR
  • Reference transcript: NM_022406

  • pfam XRCC4 334aa 1e-167 in ref transcript
    • DNA double-strand break repair and V(D)J recombination protein XRCC4. This family consists of several mammalian specific DNA double-strand break repair and V(D)J recombination protein XRCC4 sequences. In the non-homologous end joining pathway of DNA double-strand break repair, the ligation step is catalysed by a complex of XRCC4 and DNA ligase IV. It is thought that XRCC4 and ligase IV are essential for alignment-based gap filling, as well as for final ligation of the breaks.

YAF2

  • refseq_YAF2.F2 refseq_YAF2.R2 125 297
  • NCBIGene 36.2 10138
  • Multiple exon skipping, size difference: 172
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_005748

  • smart ZnF_RBZ 21aa 7e-04 in ref transcript
    • Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.

YME1L1

  • refseq_YME1L1.F2 refseq_YME1L1.R2 176 347
  • NCBIGene 36.3 10730
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139312

  • cd AAA 123aa 8e-16 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • TIGR FtsH_fam 434aa 1e-162 in ref transcript
    • HflB(FtsH) is a pleiotropic protein required for correct cell division in bacteria. It has ATP-dependent zinc metalloprotease activity. It was formerly designated cell division protein FtsH.
  • COG HflB 476aa 1e-142 in ref transcript
    • ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones].

YTHDC1

  • refseq_YTHDC1.F1 refseq_YTHDC1.R1 303 357
  • NCBIGene 36.3 91746
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031732

  • pfam YTH 94aa 2e-29 in ref transcript
    • YT521-B-like family. A protein of the YTH family has been shown to selectively remove transcripts of meiosis-specific genes expressed in mitotic cells. It has been speculated that in higher eukaryotic YTH-family members may be involved in similar mechanaisms to supress gene regulation during gametogenesis or general silencing. The rat protein YT521-B is a tyrosine-phosphorylated nuclear protein, that interacts with the nuclear transcriptosomal component scaffold attachment factor B, and the 68-kDa Src substrate associated during mitosis, Sam68. In vivo splicing assays demonstrated that YT521-B modulates alternative splice site selection in a concentration-dependent manner. The domain is predicted to have four alpha helices and six beta strands.

YWHAB

  • refseq_YWHAB.F1 refseq_YWHAB.R1 307 402
  • NCBIGene 36.3 7529
  • Single exon skipping, size difference: 95
  • Exclusion in 5'UTR
  • Reference transcript: NM_003404

  • pfam 14-3-3 234aa 1e-118 in ref transcript
    • 14-3-3 protein.
  • COG BMH1 245aa 1e-81 in ref transcript
    • 14-3-3 family protein [Signal transduction mechanisms].

YY1AP1

  • refseq_YY1AP1.F1 refseq_YY1AP1.R1 340 400
  • NCBIGene 36.3 55249
  • Alternative 3-prime, size difference: 60
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_139118

YY1AP1

  • refseq_YY1AP1.F2 refseq_YY1AP1.R2 181 337
  • NCBIGene 36.2 55249
  • Single exon skipping, size difference: 156
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_139118

YY1AP1

  • refseq_YY1AP1.F3 refseq_YY1AP1.R3 104 235
  • NCBIGene 36.2 55249
  • Single exon skipping, size difference: 131
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_139118

YY1AP1

  • refseq_YY1AP1.F4 refseq_YY1AP1.R1 106 166
  • NCBIGene 36.3 55249
  • Alternative 3-prime, size difference: 60
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_139118

ZAN

  • refseq_ZAN.F1 refseq_ZAN.R1 127 400
  • NCBIGene 36.3 7455
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 273
  • Inclusion in 3'UTR, Exclusion in 3'UTR, Exclusion in 3'UTR
  • Reference transcript: NM_003386

  • cd MAM 162aa 8e-32 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd MAM 162aa 2e-28 in ref transcript
  • cd MAM 155aa 1e-18 in ref transcript
  • pfam VWD 153aa 1e-38 in ref transcript
    • von Willebrand factor type D domain. The Cypridina-type luciferase from Vargula hilgendorfii contains a vwd domain. Its function is unrelated but the similarity is very strong by several methods.
  • pfam MAM 162aa 1e-36 in ref transcript
    • MAM domain. An extracellular domain found in many receptors.
  • smart VWD 162aa 6e-35 in ref transcript
    • von Willebrand factor (vWF) type D domain. Von Willebrand factor contains several type D domains: D1 and D2 are present within the N-terminal propeptide whereas the remaining D domains are required for multimerisation.
  • pfam MAM 164aa 7e-31 in ref transcript
  • pfam MAM 158aa 8e-21 in ref transcript
  • smart C8 73aa 2e-17 in ref transcript
    • C8 domain. This domain contains 8 conserved cysteine residues, but this family only contains 7 of them to overlaps with other domains. It is found in disease-related proteins including von Willebrand factor, Alpha tectorin, Zonadhesin and Mucin.
  • smart C8 75aa 1e-15 in ref transcript
  • pfam TIL 54aa 4e-06 in ref transcript
    • Trypsin Inhibitor like cysteine rich domain. This family contains trypsin inhibitors as well as a domain found in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9.
  • pfam TIL 50aa 8e-06 in ref transcript

ZC3H14

  • refseq_ZC3H14.F2 refseq_ZC3H14.R2 101 494
  • NCBIGene 36.3 79882
  • Multiple exon skipping, size difference: 393
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024824

ZCCHC4

  • refseq_ZCCHC4.F2 refseq_ZCCHC4.R2 303 397
  • NCBIGene 36.2 29063
  • Alternative 5-prime, size difference: 94
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_376310

  • pfam zf-GRF 41aa 4e-05 in ref transcript
    • GRF zinc finger. This presumed zinc binding domain is found in a variety of DNA-binding proteins. It seems likely that this domain is involved in nucleic acid binding. It is named GRF after three conserved residues in the centre of the alignment of the domain. This zinc finger may be related to pfam01396.

ZDHHC13

  • refseq_ZDHHC13.F1 refseq_ZDHHC13.R1 129 275
  • NCBIGene 36.3 54503
  • Single exon skipping, size difference: 146
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_019028

  • Changed! cd ANK 114aa 2e-22 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd ANK 152aa 4e-22 in ref transcript
  • Changed! pfam zf-DHHC 53aa 5e-17 in ref transcript
    • DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
  • Changed! pfam Ank 29aa 1e-05 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • Changed! pfam Ank 32aa 3e-04 in ref transcript
  • Changed! COG COG5273 248aa 4e-23 in ref transcript
    • Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
  • Changed! COG Arp 146aa 1e-14 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! COG Arp 140aa 6e-14 in ref transcript

ZDHHC14

  • refseq_ZDHHC14.F1 refseq_ZDHHC14.R1 236 281
  • NCBIGene 36.3 79683
  • Alternative 3-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024630

  • pfam zf-DHHC 63aa 6e-24 in ref transcript
    • DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
  • COG COG5273 249aa 3e-31 in ref transcript
    • Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].

ZDHHC16

  • refseq_ZDHHC16.F2 refseq_ZDHHC16.R2 272 320
  • NCBIGene 36.3 84287
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032327

  • pfam zf-DHHC 54aa 2e-21 in ref transcript
    • DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
  • Changed! COG COG5273 88aa 1e-20 in ref transcript
    • Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
  • Changed! COG COG5273 222aa 8e-22 in modified transcript

ZDHHC16

  • refseq_ZDHHC16.F3 refseq_ZDHHC16.R3 220 400
  • NCBIGene 36.3 84287
  • Single exon skipping, size difference: 180
  • Exclusion in 5'UTR
  • Reference transcript: NM_198046

  • pfam zf-DHHC 54aa 2e-21 in ref transcript
    • DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
  • COG COG5273 88aa 1e-20 in ref transcript
    • Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].

ZDHHC16

  • refseq_ZDHHC16.F6 refseq_ZDHHC16.R6 123 240
  • NCBIGene 36.3 84287
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 117
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032327

  • Changed! pfam zf-DHHC 54aa 2e-21 in ref transcript
    • DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
  • Changed! COG COG5273 88aa 1e-20 in ref transcript
    • Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
  • Changed! COG COG5273 50aa 0.001 in modified transcript

ZDHHC16

  • refseq_ZDHHC16.F7 refseq_ZDHHC16.R7 127 322
  • NCBIGene 36.3 84287
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032327

  • pfam zf-DHHC 54aa 2e-21 in ref transcript
    • DHHC zinc finger domain. This domain is also known as NEW1. This domain is predicted to be a zinc binding domain. The function of this domain is unknown, but it has been predicted to be involved in protein-protein or protein-DNA interactions, and palmitoyltransferase activity.
  • Changed! COG COG5273 88aa 1e-20 in ref transcript
    • Uncharacterized protein containing DHHC-type Zn finger [General function prediction only].
  • Changed! COG COG5273 63aa 2e-20 in modified transcript

ZFP64

  • refseq_ZFP64.F1 refseq_ZFP64.R1 173 335
  • NCBIGene 36.3 55734
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018197

ZFYVE9

  • refseq_ZFYVE9.F2 refseq_ZFYVE9.R2 221 398
  • NCBIGene 36.3 9372
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004799

  • cd FYVE 54aa 1e-11 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • pfam FYVE 66aa 8e-19 in ref transcript
    • FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.

ZGPAT

  • refseq_ZGPAT.F1 refseq_ZGPAT.R1 185 255
  • NCBIGene 36.3 84619
  • Alternative 5-prime, size difference: 70
  • Exclusion in 5'UTR
  • Reference transcript: NM_032527

  • cd TUDOR 31aa 0.005 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • pfam G-patch 42aa 2e-08 in ref transcript
    • G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
  • smart ZnF_C3H1 22aa 6e-04 in ref transcript
    • zinc finger.
  • smart TUDOR 36aa 0.006 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).

ZHX1

  • refseq_ZHX1.F1 refseq_ZHX1.R1 121 216
  • NCBIGene 36.3 11244
  • Alternative 5-prime, size difference: 95
  • Exclusion in 5'UTR
  • Reference transcript: NM_001017926

  • cd homeodomain 56aa 4e-07 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • cd homeodomain 53aa 7e-07 in ref transcript
  • cd homeodomain 41aa 0.002 in ref transcript
  • cd homeodomain 48aa 0.002 in ref transcript
  • cd homeodomain 44aa 0.009 in ref transcript
  • smart HOX 52aa 5e-08 in ref transcript
    • Homeodomain. DNA-binding factors that are involved in the transcriptional regulation of key developmental processes.
  • smart HOX 53aa 1e-07 in ref transcript
  • smart HOX 37aa 2e-04 in ref transcript
  • smart HOX 41aa 3e-04 in ref transcript
  • smart HOX 44aa 0.003 in ref transcript

ZMAT1

  • refseq_ZMAT1.F2 refseq_ZMAT1.R2 152 273
  • NCBIGene 36.2 84460
  • Single exon skipping, size difference: 121
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001011657

  • smart ZnF_U1 35aa 0.001 in ref transcript
    • U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.

ZMAT3

  • refseq_ZMAT3.F1 refseq_ZMAT3.R1 154 260
  • NCBIGene 36.3 64393
  • Alternative 5-prime and 3-prime, size difference: 106
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_022470

  • smart ZnF_U1 35aa 2e-06 in ref transcript
    • U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
  • smart ZnF_U1 34aa 2e-05 in ref transcript
  • smart ZnF_U1 35aa 8e-05 in ref transcript

ZMAT5

  • refseq_ZMAT5.F1 refseq_ZMAT5.R1 194 309
  • NCBIGene 36.3 55954
  • Single exon skipping, size difference: 115
  • Exclusion in 5'UTR
  • Reference transcript: NM_019103

  • pfam zf-CCCH 23aa 6e-04 in ref transcript
    • Zinc finger C-x8-C-x5-C-x3-H type (and similar).
  • pfam zf-U1 37aa 0.003 in ref transcript
    • U1 zinc finger. This family consists of several U1 small nuclear ribonucleoprotein C (U1-C) proteins. The U1 small nuclear ribonucleoprotein (U1 snRNP) binds to the pre-mRNA 5' splice site (ss) at early stages of spliceosome assembly. Recruitment of U1 to a class of weak 5' ss is promoted by binding of the protein TIA-1 to uridine-rich sequences immediately downstream from the 5' ss. Binding of TIA-1 in the vicinity of a 5' ss helps to stabilise U1 snRNP recruitment, at least in part, via a direct interaction with U1-C, thus providing one molecular mechanism for the function of this splicing regulator. This domain is probably a zinc-binding. It is found in multiple copies in some members of the family.
  • COG COG5136 43aa 0.002 in ref transcript
    • U1 snRNP-specific protein C [RNA processing and modification].

ZMYM2

  • refseq_ZMYM2.F2 refseq_ZMYM2.R2 122 169
  • NCBIGene 36.3 7750
  • Single exon skipping, size difference: 47
  • Exclusion in 5'UTR
  • Reference transcript: NM_003453

  • pfam zf-FCS 41aa 6e-06 in ref transcript
    • MYM-type Zinc finger with FCS sequence motif. MYM-type zinc fingers were identified in MYM family proteins. Human zinc finger protein 261 is involved in a chromosomal translocation and may be responsible for X-linked retardation in XQ13.1. Human zinc finger protein 198 is also involved in disease. In myeloproliferative disorders it is fused to FGF receptor 1; in atypical myeloproliferative disorders it is rearranged. Members of the family generally are involved in development. This Zn-finger domain functions as a transcriptional trans-activator of late vaccinia viral genes, and orthologues are also found in all nucleocytoplasmic large DNA viruses, NCLDV. This domain is also found fused to the C termini of recombinases from certain prokaryotic transposons.
  • pfam zf-FCS 41aa 1e-05 in ref transcript
  • pfam zf-FCS 44aa 8e-05 in ref transcript
  • pfam zf-FCS 39aa 0.002 in ref transcript
  • pfam zf-FCS 40aa 0.002 in ref transcript

ZNF16

  • refseq_ZNF16.F1 refseq_ZNF16.R1 264 350
  • NCBIGene 36.3 7564
  • Single exon skipping, size difference: 86
  • Exclusion in 5'UTR
  • Reference transcript: NM_001029976

  • COG COG5048 349aa 2e-09 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 210aa 6e-04 in ref transcript

ZNF160

  • refseq_ZNF160.F1 refseq_ZNF160.R1 105 190
  • NCBIGene 36.3 90338
  • Single exon skipping, size difference: 85
  • Exclusion in 5'UTR
  • Reference transcript: NM_198893

  • smart KRAB 61aa 1e-24 in ref transcript
    • krueppel associated box.
  • COG COG5048 458aa 8e-12 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG SFP1 117aa 0.007 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

ZNF182

  • refseq_ZNF182.F2 refseq_ZNF182.R2 183 277
  • NCBIGene 36.3 7569
  • Single exon skipping, size difference: 94
  • Exclusion of the protein initiation site
  • Reference transcript: NM_006962

  • smart KRAB 59aa 1e-23 in ref transcript
    • krueppel associated box.
  • COG COG5048 399aa 7e-11 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF189

  • refseq_ZNF189.F1 refseq_ZNF189.R1 190 284
  • NCBIGene 36.3 7743
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 94
  • Inclusion in the protein causing a new stop codon, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003452

  • Changed! pfam KRAB 41aa 2e-19 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • Changed! COG COG5048 368aa 5e-09 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF2

  • refseq_ZNF2.F2 refseq_ZNF2.R2 312 384
  • NCBIGene 36.3 7549
  • Single exon skipping, size difference: 72
  • Exclusion of the protein initiation site
  • Reference transcript: NM_021088

  • Changed! smart KRAB 61aa 2e-23 in ref transcript
    • krueppel associated box.
  • COG COG5048 158aa 2e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! smart KRAB 32aa 2e-08 in modified transcript

ZNF200

  • refseq_ZNF200.F2 refseq_ZNF200.R2 107 458
  • NCBIGene 36.3 7752
  • Alternative 5-prime, size difference: 351
  • Exclusion in 5'UTR
  • Reference transcript: NM_003454

  • pfam zf-C2H2 23aa 0.008 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.

ZNF207

  • refseq_ZNF207.F1 refseq_ZNF207.R1 291 384
  • NCBIGene 36.3 7756
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098507

ZNF207

  • refseq_ZNF207.F4 refseq_ZNF207.R4 205 253
  • NCBIGene 36.3 7756
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098507

ZNF211

  • refseq_ZNF211.F1 refseq_ZNF211.R1 217 256
  • NCBIGene 36.3 10520
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006385

  • pfam KRAB 41aa 4e-16 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • COG SFP1 79aa 0.004 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

ZNF266

  • refseq_ZNF266.F2 refseq_ZNF266.R2 124 404
  • NCBIGene 36.3 10781
  • Multiple exon skipping, size difference: 280
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_006631

  • smart KRAB 28aa 2e-06 in ref transcript
    • krueppel associated box.
  • COG COG5048 340aa 1e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF274

  • refseq_ZNF274.F1 refseq_ZNF274.R1 278 374
  • NCBIGene 36.3 10782
  • Single exon skipping, size difference: 96
  • Exclusion in 5'UTR
  • Reference transcript: NM_133502

  • pfam KRAB 41aa 1e-18 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • pfam zf-C2H2 23aa 8e-04 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.

PATZ1

  • refseq_ZNF278.F1 refseq_ZNF278.R1 193 258
  • NCBIGene 36.3 23598
  • Alternative 3-prime, size difference: 65
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014323

  • pfam BTB 130aa 8e-22 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • Changed! COG SFP1 58aa 0.004 in modified transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

ZNF302

  • refseq_ZNF302.F1 refseq_ZNF302.R1 125 148
  • NCBIGene 36.3 55900
  • Alternative 5-prime, size difference: 23
  • Exclusion in 5'UTR
  • Reference transcript: NM_018443

  • smart KRAB 61aa 1e-23 in ref transcript
    • krueppel associated box.

ZNF37B

  • refseq_ZNF37B.F2 refseq_ZNF37B.R2 296 350
  • NCBIGene 36.2 256112
  • Alternative 3-prime, size difference: 54
  • Inclusion in 5'UTR
  • Reference transcript: XM_001127459

ZNF384

  • refseq_ZNF384.F2 refseq_ZNF384.R2 203 251
  • NCBIGene 36.3 171017
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039917

ZNF451

  • refseq_ZNF451.F2 refseq_ZNF451.R2 178 322
  • NCBIGene 36.3 26036
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031623

ZNF473

  • refseq_ZNF473.F1 refseq_ZNF473.R1 232 354
  • NCBIGene 36.3 25888
  • Alternative 3-prime, size difference: 122
  • Inclusion in 5'UTR
  • Reference transcript: NM_001006656

  • pfam KRAB 41aa 1e-08 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • COG COG5048 405aa 4e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 341aa 6e-04 in ref transcript

ZNF484

  • refseq_ZNF484.F1 refseq_ZNF484.R1 133 178
  • NCBIGene 36.3 83744
  • Single exon skipping, size difference: 45
  • Exclusion of the protein initiation site
  • Reference transcript: NM_031486

  • Changed! smart KRAB 60aa 1e-24 in ref transcript
    • krueppel associated box.
  • COG COG5048 402aa 8e-10 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! smart KRAB 31aa 4e-07 in modified transcript

ZNF107

  • refseq_ZNF588.F2 refseq_ZNF588.R2 280 407
  • NCBIGene 36.3 51427
  • Single exon skipping, size difference: 127
  • Exclusion in 5'UTR
  • Reference transcript: NM_016220

  • COG COG5048 313aa 1e-07 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 390aa 3e-07 in ref transcript

ZNF107

  • refseq_ZNF588.F3 refseq_ZNF588.R3 109 251
  • NCBIGene 36.3 51427
  • Single exon skipping, size difference: 142
  • Exclusion in 5'UTR
  • Reference transcript: NM_016220

  • COG COG5048 313aa 1e-07 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 390aa 3e-07 in ref transcript

ZNF599

  • refseq_ZNF599.F1 refseq_ZNF599.R1 184 322
  • NCBIGene 36.2 148103
  • Single exon skipping, size difference: 138
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001007248

  • Changed! smart KRAB 61aa 7e-23 in ref transcript
    • krueppel associated box.
  • Changed! COG COG5048 313aa 3e-13 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF655

  • refseq_ZNF655.F1 refseq_ZNF655.R1 192 417
  • NCBIGene 36.2 79027
  • Multiple exon skipping, size difference: 225
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_138494

  • Changed! COG COG5048 106aa 6e-04 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! PRK truB 85aa 0.007 in ref transcript
    • tRNA pseudouridine synthase B; Provisional.

IKZF3

  • refseq_ZNFN1A3.F2 refseq_ZNFN1A3.R2 190 307
  • NCBIGene 36.3 22806
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012481

IKZF3

  • refseq_ZNFN1A3.F3 refseq_ZNFN1A3.R3 231 399
  • NCBIGene 36.3 22806
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012481

IKZF3

  • refseq_ZNFN1A3.F5 refseq_ZNFN1A3.R5 146 263
  • NCBIGene 36.3 22806
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012481

ZPBP2

  • refseq_ZPBP2.F1 refseq_ZPBP2.R1 184 250
  • NCBIGene 36.3 124626
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199321

  • pfam Sp38 272aa 1e-142 in ref transcript
    • Zona-pellucida-binding protein (Sp38). This family contains a number of zona-pellucida-binding proteins that seem to be restricted to mammals. These are sperm proteins that bind to the 90-kDa family of zona pellucida glycoproteins in a calcium-dependent manner. These represent some of the specific molecules that mediate the first steps of gamete interaction, allowing fertilisation to occur.

ZRANB2

  • refseq_ZRANB2.F1 refseq_ZRANB2.R1 209 284
  • NCBIGene 36.3 9406
  • Single exon skipping, size difference: 75
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_203350

  • smart ZnF_RBZ 25aa 6e-04 in ref transcript
    • Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.
  • smart ZnF_RBZ 25aa 0.009 in ref transcript

ZWINT

  • refseq_ZWINT.F2 refseq_ZWINT.R2 234 375
  • NCBIGene 36.3 11130
  • Alternative 5-prime and 3-prime, size difference: 141
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007057

  • Changed! PRK mukB 138aa 0.003 in ref transcript
    • cell division protein MukB; Provisional.

A26A1

  • rs.A26A1.F1 rs.A26A1.R1 141 279
  • NCBIGene 36.3 340441
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005365

  • cd ANK 126aa 2e-26 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd ANK 125aa 3e-24 in ref transcript
  • Changed! TIGR trp 175aa 6e-07 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • COG Arp 173aa 1e-11 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! TIGR trp 121aa 2e-05 in modified transcript

AADACL3

  • rs.AADACL3.F1 rs.AADACL3.R1 160 377
  • NCBIGene 36.3 126767
  • Single exon skipping, size difference: 217
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001103170

  • Changed! pfam Abhydrolase_3 264aa 2e-43 in ref transcript
    • alpha/beta hydrolase fold. This catalytic domain is found in a very wide range of enzymes.
  • Changed! COG Aes 342aa 2e-33 in ref transcript
    • Esterase/lipase [Lipid metabolism].
  • Changed! pfam Abhydrolase_3 250aa 3e-35 in modified transcript
  • Changed! COG Aes 277aa 7e-24 in modified transcript

ABCC1

  • rs.ABCC1.F1 rs.ABCC1.R1 229 424
  • NCBIGene 36.3 4363
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004996

  • Changed! cd ABCC_MRP_domain2 221aa 9e-96 in ref transcript
    • Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • cd ABCC_MRP_domain1 202aa 6e-89 in ref transcript
    • Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • Changed! TIGR MRP_assoc_pro 1524aa 0.0 in ref transcript
    • This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
  • Changed! PTZ PTZ00243 869aa 1e-161 in ref transcript
    • ABC transporter; Provisional.
  • COG MdlB 558aa 2e-56 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! cd ABCC_MRP_domain2 142aa 2e-50 in modified transcript
  • Changed! TIGR MRP_assoc_pro 1426aa 0.0 in modified transcript
  • Changed! TIGR MRP_assoc_pro 35aa 5e-13 in modified transcript
  • Changed! PTZ PTZ00243 772aa 1e-131 in modified transcript

ABCC1

  • rs.ABCC1.F2 rs.ABCC1.R2 119 296
  • NCBIGene 36.3 4363
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004996

  • cd ABCC_MRP_domain2 221aa 9e-96 in ref transcript
    • Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • Changed! cd ABCC_MRP_domain1 202aa 6e-89 in ref transcript
    • Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • Changed! TIGR MRP_assoc_pro 1524aa 0.0 in ref transcript
    • This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
  • Changed! PTZ PTZ00243 869aa 1e-161 in ref transcript
    • ABC transporter; Provisional.
  • Changed! COG MdlB 558aa 2e-56 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! cd ABCC_MRP_domain1 97aa 3e-36 in modified transcript
  • Changed! cd ABCC_MRP_domain1 63aa 1e-15 in modified transcript
  • Changed! TIGR MRP_assoc_pro 1465aa 0.0 in modified transcript
  • Changed! PTZ PTZ00243 766aa 1e-133 in modified transcript
  • Changed! PTZ PTZ00243 326aa 8e-39 in modified transcript
  • Changed! PRK PRK11160 68aa 4e-10 in modified transcript
    • cysteine/glutathione ABC transporter membrane/ATP-binding component; Reviewed.
  • Changed! COG COG1123 146aa 2e-07 in modified transcript
    • ATPase components of various ABC-type transport systems, contain duplicated ATPase [General function prediction only].

ABCC1

  • rs.ABCC1.F3 rs.ABCC1.R3 172 340
  • NCBIGene 36.3 4363
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004996

  • cd ABCC_MRP_domain2 221aa 9e-96 in ref transcript
    • Domain 2 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminus, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resistance lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • Changed! cd ABCC_MRP_domain1 202aa 6e-89 in ref transcript
    • Domain 1 of the ABC subfamily C. This family is also known as MRP (mulrtidrug resisitance-associated protein). Some of the MRP members have five additional transmembrane segments in their N-terminas, but the function of these additional membrane-spanning domains is not clear. The MRP was found in the multidrug-resisting lung cancer cell in which p-glycoprotein was not overexpressed. MRP exports glutathione by drug stimulation, as well as, certain substrates in conjugated forms with anions, such as glutathione, glucuronate, and sulfate.
  • Changed! TIGR MRP_assoc_pro 1524aa 0.0 in ref transcript
    • This model describes multi drug resistance-associated protein (MRP) in eukaryotes. The multidrug resistance-associated protein is an integral membrane protein that causes multidrug resistance when overexpressed in mammalian cells. It belongs to ABC transporter superfamily. The protein topology and function was experimentally demonstrated by epitope tagging and immunofluorescence. Insertion of tags in the critical regions associated with drug efflux, abrogated its function. The C-terminal domain seem to highly conserved.
  • Changed! PTZ PTZ00243 869aa 1e-161 in ref transcript
    • ABC transporter; Provisional.
  • Changed! COG MdlB 558aa 2e-56 in ref transcript
    • ABC-type multidrug transport system, ATPase and permease components [Defense mechanisms].
  • Changed! cd ABCC_MRP_domain1 146aa 2e-47 in modified transcript
  • Changed! TIGR MRP_assoc_pro 758aa 0.0 in modified transcript
  • Changed! TIGR MRP_assoc_pro 711aa 0.0 in modified transcript
  • Changed! PTZ PTZ00243 813aa 1e-126 in modified transcript
  • Changed! PTZ PTZ00243 326aa 9e-39 in modified transcript
  • Changed! COG CydC 305aa 2e-19 in modified transcript
    • ABC-type transport system involved in cytochrome bd biosynthesis, fused ATPase and permease components [Energy production and conversion / Posttranslational modification, protein turnover, chaperones].

ABI1

  • rs.ABI1.F1 rs.ABI1.R1 100 115
  • NCBIGene 36.3 10006
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005470

  • cd SH3 52aa 3e-15 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! pfam Abi_HHR 79aa 4e-32 in ref transcript
    • Abl-interactor HHR. The region featured in this family is found towards the N-terminus of a number of adaptor proteins that interact with Abl-family tyrosine kinases. More specifically, it is termed the homeo-domain homologous region (HHR), as it is similar to the DNA-binding region of homeo-domain proteins. Other homeo-domain proteins have been implicated in specifying positional information during embryonic development, and in the regulation of the expression of cell-type specific genes. The Abl-interactor proteins are thought to coordinate the cytoplasmic and nuclear functions of the Abl-family kinases, and seem to be involved in cytoskeletal reorganisation, but their precise role remains unclear.
  • smart SH3 56aa 2e-17 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! pfam Abi_HHR 74aa 6e-30 in modified transcript

ABL2

  • rs.ABL2.F1 rs.ABL2.R1 204 267
  • NCBIGene 36.3 27
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100108

  • cd PTKc_Abl 263aa 1e-158 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Abelson kinase. Protein Tyrosine Kinase (PTK) family; Abelson (Abl) kinase; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Abl (or c-Abl) is a ubiquitously-expressed cytoplasmic (or nonreceptor) tyr kinase that contains SH3, SH2, and tyr kinase domains in its N-terminal region, as well as nuclear localization motifs, a putative DNA-binding domain, and F- and G-actin binding domains in its C-terminal tail. It also contains a short autoinhibitory cap region in its N-terminus. Abl is normally inactive and requires phosphorylation and myristoylation for activation. Abl function depends on its subcellular localization. In the cytoplasm, Abl plays a role in cell proliferation and survival. In response to DNA damage or oxidative stress, Abl is transported to the nucleus where it induces apoptosis. In chronic myelogenous leukemia (CML) patients, an aberrant translocation results in the replacement of the first exon of Abl with the BCR (breakpoint cluster region) gene. The resulting BCR-Abl fusion protein is constitutively active and associates into tetramers, resulting in a hyperactive kinase sending a continuous signal. This leads to uncontrolled proliferation, morphological transformation and anti-apoptotic effects. BCR-Abl is the target of selective inhibitors, such as imatinib (Gleevec), used in the treatment of CML. Abl2, also known as ARG (Abelson-related gene), is thought to play a cooperative role with Abl in the proper development of the nervous system. The Tel-ARG fusion protein, resulting from reciprocal translocation between chromosomes 1 and 12, is associated with acute myeloid leukemia (AML). The TEL gene is a frequent fusion partner of other tyr kinase oncogenes, including Tel/Abl, Tel/PDGFRbeta, and Tel/Jak2, found in patients with leukemia and myeloproliferative disorders.
  • cd SH2 90aa 5e-24 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • cd SH3 53aa 2e-09 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam Pkinase_Tyr 252aa 1e-112 in ref transcript
    • Protein tyrosine kinase.
  • pfam F_actin_bind 163aa 1e-48 in ref transcript
    • F-actin binding. The F-actin binding domain forms a compact bundle of four antiparallel alpha-helices, which are arranged in a left-handed topology. Binding of F-actin to the F-actin binding domain may result in cytoplasmic retention and subcellular distribution of the protein, as well as possible inhibition of protein function.
  • smart SH2 84aa 2e-27 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
  • pfam SH3_1 54aa 1e-12 in ref transcript
    • SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
  • COG SPS1 331aa 1e-20 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

ACD

  • rs.ACD.F1 rs.ACD.R1 173 212
  • NCBIGene 36.3 65057
  • Alternative 3-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001082486

ACOX3

  • rs.ACOX3.F1 rs.ACOX3.R1 367 435
  • NCBIGene 36.3 8310
  • Single exon skipping, size difference: 68
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003501

  • Changed! cd AXO 654aa 0.0 in ref transcript
    • Peroxisomal acyl-CoA oxidases (AXO) catalyze the first set in the peroxisomal fatty acid beta-oxidation, the alpha,beta dehydrogenation of the corresponding trans-enoyl-CoA by FAD, which becomes reduced. In a second oxidative half-reaction, the reduced FAD is reoxidized by molecular oxygen. AXO is generally a homodimer, but it has been reported to form a different type of oligomer in yeast. There are several subtypes of AXO's, based on substrate specificity. Palmitoyl-CoA oxidase acts on straight-chain fatty acids and prostanoids; whereas, the closely related Trihydroxycoprostanoly-CoA oxidase has the greatest activity for 2-methyl branched side chains of bile precursors. Pristanoyl-CoA oxidase, acts on 2-methyl branched fatty acids. AXO has an additional domain, C-terminal to the region with similarity to acyl-CoA dehydrogenases, which is included in this alignment.
  • Changed! pfam ACOX 190aa 1e-51 in ref transcript
    • Acyl-CoA oxidase. This is a family of Acyl-CoA oxidases EC:1.3.3.6. Acyl-coA oxidase converts acyl-CoA into trans-2- enoyl-CoA.
  • pfam Acyl-CoA_dh_M 58aa 2e-09 in ref transcript
    • Acyl-CoA dehydrogenase, middle domain. Central domain of Acyl-CoA dehydrogenase has a beta-barrel fold.
  • pfam Acyl-CoA_dh_1 162aa 1e-05 in ref transcript
    • Acyl-CoA dehydrogenase, C-terminal domain. C-terminal domain of Acyl-CoA dehydrogenase is an all-alpha, four helical up-and-down bundle.
  • COG CaiA 373aa 6e-24 in ref transcript
    • Acyl-CoA dehydrogenases [Lipid metabolism].
  • Changed! cd AXO 591aa 0.0 in modified transcript
  • Changed! pfam ACOX 103aa 8e-24 in modified transcript

ACP1

  • rs.ACP1.F1 rs.ACP1.R1 233 388
  • NCBIGene 36.3 52
  • Alternative 3-prime, size difference: 155
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_007099

  • Changed! cd LMWPc 149aa 4e-43 in ref transcript
    • Low molecular weight phosphatase family;.
  • Changed! pfam LMWPc 150aa 3e-41 in ref transcript
    • Low molecular weight phosphotyrosine protein phosphatase.
  • Changed! COG Wzb 152aa 8e-31 in ref transcript
    • Protein-tyrosine-phosphatase [Signal transduction mechanisms].
  • Changed! cd LMWPc 78aa 4e-23 in modified transcript
  • Changed! pfam LMWPc 78aa 2e-21 in modified transcript
  • Changed! COG Wzb 80aa 6e-19 in modified transcript

ACPL2

  • rs.ACPL2.F1 rs.ACPL2.R1 206 380
  • NCBIGene 36.3 92370
  • Multiple exon skipping, size difference: 174
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_152282

  • cd HP_HAP_like 81aa 3e-14 in ref transcript
    • Histidine phosphatase domain found in histidine acid phosphatases and phytases; contains a His residue which is phosphorylated during the reaction. Catalytic domain of HAP (histidine acid phosphatases) and phytases (myo-inositol hexakisphosphate phosphohydrolases). The conserved catalytic core of this domain contains a His residue which is phosphorylated in the reaction. Functions in this subgroup include roles in metabolism, signaling, or regulation, for example Escherichia coli glucose-1-phosphatase functions to scavenge glucose from glucose-1-phosphate and the signaling molecules inositol 1,3,4,5,6-pentakisphosphate (InsP5) and inositol hexakisphosphate (InsP6) are in vivo substrates for eukaryotic multiple inositol polyphosphate phosphatase 1 (Minpp1). Phytases scavenge phosphate from extracellular sources and are added to animal feed while prostatic acid phosphatase (PAP) has been used for many years as a serum marker for prostate cancer. Recently PAP has been shown in mouse models to suppress pain by functioning as an ecto-5prime-nucleotidase. In vivo it dephosphorylates extracellular adenosine monophosphate (AMP) generating adenosine,and leading to the activation of A1-adenosine receptors in dorsal spinal cord.
  • cd HP_HAP_like 75aa 8e-13 in ref transcript
  • pfam Acid_phosphat_A 338aa 3e-45 in ref transcript
    • Histidine acid phosphatase.

ACSM2B

  • rs.ACSM2B.F1 rs.ACSM2B.R1 102 137
  • NCBIGene 36.3 348158
  • Single exon skipping, size difference: 35
  • Exclusion in 5'UTR
  • Reference transcript: NM_182617

  • pfam AMP-binding 411aa 2e-81 in ref transcript
    • AMP-binding enzyme.
  • COG Acs 526aa 1e-109 in ref transcript
    • Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases [Lipid metabolism].

ACTR2

  • rs.ACTR2.F1 rs.ACTR2.R1 102 117
  • NCBIGene 36.3 10097
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005386

  • Changed! cd ACTIN 386aa 1e-122 in ref transcript
    • Actin; An ubiquitous protein involved in the formation of filaments that are a major component of the cytoskeleton. Interaction with myosin provides the basis of muscular contraction and many aspects of cell motility. Each actin protomer binds one molecule of ATP and either calcium or magnesium ions. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Polymerization is regulated by so-called capping proteins. The ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins.
  • Changed! smart ACTIN 388aa 1e-137 in ref transcript
    • Actin. ACTIN subfamily of ACTIN/mreB/sugarkinase/Hsp70 superfamily.
  • Changed! PTZ PTZ00004 393aa 1e-106 in ref transcript
    • actin; Provisional.
  • Changed! cd ACTIN 381aa 1e-122 in modified transcript
  • Changed! smart ACTIN 383aa 1e-139 in modified transcript
  • Changed! PTZ PTZ00004 388aa 1e-107 in modified transcript

ADAM15

  • rs.ADAM15.F1 rs.ADAM15.R1 100 172
  • NCBIGene 36.3 8751
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207197

  • cd ZnMc_adamalysin_II_like 200aa 3e-63 in ref transcript
    • Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
  • pfam Reprolysin 199aa 2e-62 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • smart ACR 142aa 1e-38 in ref transcript
    • ADAM Cysteine-Rich Domain.
  • pfam Pep_M12B_propep 143aa 3e-29 in ref transcript
    • Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
  • pfam Disintegrin 75aa 4e-29 in ref transcript
    • Disintegrin.

ADAM9

  • rs.ADAM9.F1 rs.ADAM9.R1 106 212
  • NCBIGene 36.3 8754
  • Single exon skipping, size difference: 106
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003816

  • cd ZnMc_adamalysin_II_like 193aa 3e-68 in ref transcript
    • Zinc-dependent metalloprotease; adamalysin_II_like subfamily. Adamalysin II is a snake venom zinc endopeptidase. This subfamily contains other snake venom metalloproteinases, as well as membrane-anchored metalloproteases belonging to the ADAM family. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions.
  • pfam Reprolysin 195aa 2e-74 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • pfam Pep_M12B_propep 132aa 3e-52 in ref transcript
    • Reprolysin family propeptide. This region is the propeptide for members of peptidase family M12B. The propeptide contains a sequence motif similar to the "cysteine switch" of the matrixins. This motif is found at the C terminus of the alignment but is not well aligned.
  • smart ACR 136aa 1e-50 in ref transcript
    • ADAM Cysteine-Rich Domain.
  • pfam Disintegrin 77aa 4e-29 in ref transcript
    • Disintegrin.

ADAMTS13

  • rs.ADAMTS13.F1 rs.ADAMTS13.R1 145 238
  • NCBIGene 36.3 11093
  • Alternative 3-prime, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139025

  • Changed! cd ZnMc_ADAMTS_like 204aa 8e-68 in ref transcript
    • Zinc-dependent metalloprotease, ADAMTS_like subgroup. ADAMs (A Disintegrin And Metalloprotease) are glycoproteins, which play roles in cell signaling, cell fusion, and cell-cell interactions. This particular subfamily represents domain architectures that combine ADAM-like metalloproteinases with thrombospondin type-1 repeats. ADAMTS (a disintegrin and metalloproteinase with thrombospondin motifs) proteinases are inhibited by TIMPs (tissue inhibitors of metalloproteinases), and they play roles in coagulation, angiogenesis, development and progression of arthritis. They hydrolyze the von Willebrand factor precursor and various components of the extracellular matrix.
  • Changed! pfam Reprolysin 206aa 1e-11 in ref transcript
    • Reprolysin (M12B) family zinc metalloprotease. The members of this family are enzymes that cleave peptides. These proteases require zinc for catalysis. Members of this family are also known as adamalysins. Most members of this family are snake venom endopeptidases, but there are also some mammalian proteins and fertilin. Fertilin and closely related proteins appear to not have some active site residues and may not be active enzymes.
  • smart TSP1 53aa 3e-11 in ref transcript
    • Thrombospondin type 1 repeats. Type 1 repeats in thrombospondin-1 bind and activate TGF-beta.
  • Changed! cd ZnMc_ADAMTS_like 198aa 4e-65 in modified transcript
  • Changed! pfam Reprolysin 200aa 3e-10 in modified transcript

ADORA1

  • rs.ADORA1.F1 rs.ADORA1.R1 138 293
  • NCBIGene 36.3 134
  • Single exon skipping, size difference: 155
  • Exclusion in 5'UTR
  • Reference transcript: NM_000674

  • pfam 7tm_1 257aa 1e-20 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

AFAP1L2

  • rs.AFAP1L2.F1 rs.AFAP1L2.R1 100 112
  • NCBIGene 36.3 84632
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001936

  • cd PH 89aa 8e-07 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • cd PH 77aa 9e-07 in ref transcript
  • pfam PH 75aa 1e-08 in ref transcript
    • PH domain. PH stands for pleckstrin homology.
  • smart PH 88aa 6e-06 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

AFF3

  • rs.AFF3.F1 rs.AFF3.R1 113 188
  • NCBIGene 36.3 3899
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025108

  • pfam AF-4 1204aa 0.0 in ref transcript
    • AF-4 proto-oncoprotein. This family consists of AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental retardation syndrome) nuclear proteins. These proteins have been linked to human diseases such as acute lymphoblastic leukaemia and mental retardation. The family also contains a Drosophila AF4 protein homologue Lilliputian which contains an AT-hook domain. Lilliputian represents a novel pair-rule gene that acts in cytoskeleton regulation, segmentation and morphogenesis in Drosophila.

AGPAT2

  • rs.AGPAT2.F1 rs.AGPAT2.R1 251 347
  • NCBIGene 36.3 10555
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006412

  • Changed! pfam Acyltransferase 115aa 3e-28 in ref transcript
    • Acyltransferase. This family contains acyltransferases involved in phospholipid biosynthesis and other proteins of unknown function. This family also includes tafazzin, the Barth syndrome gene.
  • Changed! COG PlsC 228aa 3e-19 in ref transcript
    • 1-acyl-sn-glycerol-3-phosphate acyltransferase [Lipid metabolism].
  • Changed! pfam Acyltransferase 88aa 3e-15 in modified transcript
  • Changed! COG PlsC 129aa 5e-10 in modified transcript

AGTR1

  • rs.AGTR1.F1 rs.AGTR1.R1 200 357
  • NCBIGene 36.3 185
  • Single exon skipping, size difference: 157
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_031850

  • Changed! pfam 7tm_1 252aa 2e-42 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

AGTR1

  • rs.AGTR1.F2 rs.AGTR1.R2 223 281
  • NCBIGene 36.3 185
  • Single exon skipping, size difference: 58
  • Exclusion of the protein initiation site
  • Reference transcript: NM_031850

  • pfam 7tm_1 252aa 2e-42 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

AGTR1

  • rs.AGTR1.F3 rs.AGTR1.R3 291 375
  • NCBIGene 36.3 185
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_031850

  • pfam 7tm_1 252aa 2e-42 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

AKAP13

  • rs.AKAP13.F1 rs.AKAP13.R1 106 118
  • NCBIGene 36.3 11214
  • Mutually exclusive exon skipping, size difference: 12
  • Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006738

  • cd RhoGEF 195aa 4e-37 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • cd PH 91aa 2e-05 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • smart RhoGEF 192aa 5e-40 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • smart PH 102aa 5e-07 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • COG ROM1 305aa 5e-06 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].

AKAP9

  • rs.AKAP9.F1 rs.AKAP9.R1 129 153
  • NCBIGene 36.3 10142
  • Alternative 5-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_147171

  • pfam PACT_coil_coil 83aa 2e-26 in ref transcript
    • Pericentrin-AKAP-450 domain of centrosomal targeting protein. This domain is a coiled-coil region close to the C-terminus of centrosomal proteins that is directly responsible for recruiting AKAP-450 and pericentrin to the centrosome. Hence the suggested name for this region is a PACT domain (pericentrin-AKAP-450 centrosomal targeting). This domain is also present at the C-terminus of coiled-coil proteins from Drosophila and S. pombe, and that from the Drosophila protein is sufficient for targeting to the centrosome in mammalian cells. The function of these proteins is unknown but they seem good candidates for having a centrosomal or spindle pole body location. The final 22 residues of this domain in AKAP-450 appear specifically to be a calmodulin-binding domain indicating that this member at least is likely to contribute to centrosome assembly.
  • Changed! TIGR SMC_prok_B 404aa 6e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • pfam SMC_N 259aa 6e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • TIGR SMC_prok_A 252aa 0.001 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_B 709aa 0.002 in ref transcript
  • COG Smc 264aa 3e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG Smc 317aa 7e-04 in ref transcript
  • PRK PRK05771 168aa 0.009 in ref transcript
    • V-type ATP synthase subunit I; Validated.
  • Changed! TIGR SMC_prok_B 386aa 2e-07 in modified transcript
  • Changed! COG Smc 308aa 4e-04 in modified transcript

AKR1C2

  • rs.AKR1C2.F1 rs.AKR1C2.R1 245 345
  • NCBIGene 36.3 1646
  • Single exon skipping, size difference: 100
  • Exclusion in 5'UTR
  • Reference transcript: NM_001354

  • cd Aldo_ket_red 289aa 1e-71 in ref transcript
    • Aldo-keto reductases (AKRs) are a superfamily of soluble NAD(P)(H) oxidoreductases whose chief purpose is to reduce aldehydes and ketones to primary and secondary alcohols. AKRs are present in all phyla and are of importance to both health and industrial applications. Members have very distinct functions and include the prokaryotic 2,5-diketo-D-gluconic acid reductases and beta-keto ester reductases, the eukaryotic aldose reductases, aldehyde reductases, hydroxysteroid dehydrogenases, steroid 5beta-reductases, potassium channel beta-subunits and aflatoxin aldehyde reductases, among others.
  • pfam Aldo_ket_red 292aa 1e-106 in ref transcript
    • Aldo/keto reductase family. This family includes a number of K+ ion channel beta chain regulatory domains - these are reported to have oxidoreductase activity.
  • COG ARA1 298aa 9e-81 in ref transcript
    • Aldo/keto reductases, related to diketogulonate reductase [General function prediction only].

AKT1

  • rs.AKT1.F1 rs.AKT1.R1 239 323
  • NCBIGene 36.3 207
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_001014432

  • cd STKc_PKB_alpha 324aa 0.0 in ref transcript
    • STKc_PKB_alpha: Serine/Threonine Kinases (STKs), Protein Kinase B (PKB) or Akt subfamily, alpha (or Akt1) isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PKB subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. There are three PKB isoforms from different genes, PKB-alpha (or Akt1), PKB-beta (or Akt2), and PKB-gamma (or Akt3). PKB contains an N-terminal pleckstrin homology (PH) domain and a C-terminal catalytic domain. PKB-alpha is predominantly expressed in endothelial cells. It is critical for the regulation of angiogenesis and the maintenance of vascular integrity. It also plays a role in adipocyte differentiation. Mice deficient in PKB-alpha exhibit perinatal morbidity, growth retardation, reduction in body weight accompanied by reduced sizes of multiple organs, and enhanced apoptosis in some cell types. PKB-alpha activity has been reported to be frequently elevated in breast and prostate cancers. In some cancer cells, PKB-alpha may act as a suppressor of metastasis.
  • cd PH_Akt 102aa 3e-52 in ref transcript
    • Akt pleckstrin homology (PH) domain. Akt (Protein Kinase B (PKB)) is a phosphatidylinositol 3'-kinase (PI3K)-dependent Ser/Thr kinase. The PH domain recruits Akt to the plasma membrane by binding to phosphoinositides (PtdIns-3,4-P2) and is required for activation. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • smart S_TKc 249aa 2e-86 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart S_TK_X 68aa 7e-15 in ref transcript
    • Extension to Ser/Thr-type protein kinases.
  • pfam PH 99aa 1e-11 in ref transcript
    • PH domain. PH stands for pleckstrin homology.
  • PTZ PTZ00263 309aa 4e-80 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

AKT1S1

  • rs.AKT1S1.F1 rs.AKT1S1.R1 111 533
  • NCBIGene 36.3 84335
  • Alternative 5-prime, size difference: 422
  • Exclusion of the protein initiation site
  • Reference transcript: NM_032375

ALG9

  • rs.ALG9.F1 rs.ALG9.R1 117 138
  • NCBIGene 36.3 79796
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024740

  • Changed! pfam Glyco_transf_22 318aa 2e-57 in ref transcript
    • Alg9-like mannosyltransferase family. Members of this family are mannosyltransferase enzymes. At least some members are localised in endoplasmic reticulum and involved in GPI anchor biosynthesis.
  • Changed! pfam Glyco_transf_22 311aa 8e-59 in modified transcript

ALPK1

  • rs.ALPK1.F1 rs.ALPK1.R1 209 261
  • NCBIGene 36.3 80216
  • Alternative 3-prime, size difference: 52
  • Inclusion in 5'UTR
  • Reference transcript: NM_001102406

  • pfam Alpha_kinase 205aa 4e-57 in ref transcript
    • Alpha-kinase family. This family is a novel family of eukaryotic protein kinase catalytic domains, which have no detectable similarity to conventional kinases. The family contains myosin heavy chain kinases and Elongation Factor-2 kinase and a bifunctional ion channel. This family is known as the alpha-kinase family. The structure of the kinase domain revealed unexpected similarity to eukaryotic protein kinases in the catalytic core as well as to metabolic enzymes with ATP-grasp domains.

ALS2CR8

  • rs.ALS2CR8.F1 rs.ALS2CR8.R1 207 326
  • NCBIGene 36.3 79800
  • Single exon skipping, size difference: 119
  • Exclusion in 5'UTR
  • Reference transcript: NM_024744

AMELX

  • rs.AMELX.F1 rs.AMELX.R1 245 287
  • NCBIGene 36.3 265
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182680

  • Changed! pfam Amelogenin 66aa 4e-16 in ref transcript
    • Amelogenin. Amelogenins play a role in biomineralisation. They seem to regulate the formation of crystallites during the secretory stage of tooth enamel development. thought to play a major role in the structural organisation and mineralisation of developing enamel. They are found in the extracellular matrix. Mutations in X-chromosomal amelogenin can cause Amelogenesis imperfecta.
  • Changed! pfam Amelogenin 52aa 1e-17 in modified transcript

AMPH

  • rs.AMPH.F1 rs.AMPH.R1 297 423
  • NCBIGene 36.3 273
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001635

  • cd SH3 68aa 3e-06 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • pfam BAR 220aa 3e-55 in ref transcript
    • BAR domain. BAR domains are dimerisation, lipid binding and curvature sensing modules found in many different protein families. A BAR domain with an additional N-terminal amphipathic helix (an N-BAR) can drive membrane curvature. These N-BAR domains are found in amphiphysin, endophilin, BRAP and Nadrin. BAR domains are also frequently found alongside domains that determine lipid specificity, like pfam00169 and pfam00787 domains in beta centaurins and sorting nexins respectively.
  • smart SH3 70aa 7e-07 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! PRK rne 148aa 0.009 in ref transcript
    • ribonuclease E; Reviewed.
  • Changed! PRK PRK05648 145aa 0.005 in modified transcript
    • DNA polymerase III subunits gamma and tau; Reviewed.

ANK1

  • rs.ANK1.F1 rs.ANK1.R1 191 392
  • NCBIGene 36.3 286
  • Multiple exon skipping, size difference: 201
  • Exclusion in the protein (no frameshift), Exclusion of the stop codon
  • Reference transcript: NM_020478

ANKHD1

  • rs.ANKHD1.F1 rs.ANKHD1.R1 143 176
  • NCBIGene 36.3 54882
  • Alternative 3-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017747

  • cd ANK 125aa 5e-28 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 127aa 2e-26 in ref transcript
  • cd ANK 127aa 3e-26 in ref transcript
  • cd ANK 119aa 2e-25 in ref transcript
  • cd ANK 128aa 4e-25 in ref transcript
  • cd ANK 123aa 5e-23 in ref transcript
  • cd KH-I 63aa 2e-11 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • cd ANK 65aa 5e-08 in ref transcript
  • pfam KH_1 63aa 2e-11 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • TIGR trp 239aa 1e-06 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • TIGR trp 222aa 2e-05 in ref transcript
  • pfam Ank 33aa 3e-05 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 29aa 9e-05 in ref transcript
  • pfam Ank 31aa 4e-04 in ref transcript
  • pfam Ank 29aa 0.001 in ref transcript
  • pfam Ank 27aa 0.004 in ref transcript
  • COG Arp 183aa 7e-16 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 196aa 1e-15 in ref transcript
  • COG Arp 201aa 9e-15 in ref transcript
  • COG Arp 228aa 2e-11 in ref transcript
  • COG Arp 133aa 3e-11 in ref transcript

ANKRD12

  • rs.ANKRD12.F1 rs.ANKRD12.R1 250 319
  • NCBIGene 36.3 23253
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015208

  • cd ANK 113aa 4e-28 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Ank 31aa 6e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 28aa 4e-05 in ref transcript
  • pfam Ank 33aa 1e-04 in ref transcript
  • COG Arp 172aa 2e-12 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

ANKRD16

  • rs.ANKRD16.F1 rs.ANKRD16.R1 102 207
  • NCBIGene 36.3 54522
  • Alternative 5-prime and 3-prime, size difference: 105
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_019046

  • cd ANK 121aa 5e-22 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd ANK 156aa 5e-20 in ref transcript
  • cd ANK 128aa 3e-19 in ref transcript
  • TIGR trp 142aa 4e-05 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! TIGR trp 223aa 3e-04 in ref transcript
  • Changed! COG Arp 184aa 7e-12 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 122aa 1e-10 in ref transcript
  • Changed! cd ANK 121aa 1e-23 in modified transcript
  • Changed! TIGR trp 177aa 0.003 in modified transcript
  • Changed! COG Arp 169aa 3e-12 in modified transcript

ANKRD7

  • rs.ANKRD7.F1 rs.ANKRD7.R1 177 237
  • NCBIGene 36.3 56311
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077708

  • Changed! cd ANK 123aa 1e-26 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 65aa 7e-10 in ref transcript
  • pfam Ank 33aa 7e-07 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • TIGR trp 102aa 1e-05 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • Changed! pfam Ank 30aa 4e-05 in ref transcript
  • Changed! COG Arp 134aa 1e-10 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! cd ANK 126aa 2e-27 in modified transcript
  • Changed! pfam Ank 30aa 5e-05 in modified transcript
  • Changed! COG Arp 161aa 2e-11 in modified transcript

ANXA11

  • rs.ANXA11.F1 rs.ANXA11.R1 220 416
  • NCBIGene 36.3 311
  • Single exon skipping, size difference: 196
  • Exclusion in 5'UTR
  • Reference transcript: NM_145869

  • pfam Annexin 66aa 8e-22 in ref transcript
    • Annexin. This family of annexins also includes giardin that has been shown to function as an annexin.
  • pfam Annexin 66aa 1e-21 in ref transcript
  • pfam Annexin 66aa 4e-19 in ref transcript
  • pfam Annexin 67aa 2e-17 in ref transcript

AP1S1

  • rs.AP1S1.F1 rs.AP1S1.R1 139 416
  • NCBIGene 36.3 1174
  • Alternative 5-prime and 3-prime, size difference: 277
  • Exclusion in the protein (no frameshift), Exclusion of the stop codon
  • Reference transcript: NM_001283

  • Changed! pfam Clat_adaptor_s 142aa 9e-58 in ref transcript
    • Clathrin adaptor complex small chain.
  • Changed! COG APS2 150aa 7e-47 in ref transcript
    • Clathrin adaptor complex, small subunit [Intracellular trafficking and secretion].
  • Changed! pfam Clat_adaptor_s 127aa 4e-54 in modified transcript
  • Changed! COG APS2 129aa 2e-43 in modified transcript

AP3D1

  • rs.AP3D1.F1 rs.AP3D1.R1 199 349
  • NCBIGene 36.3 8943
  • Multiple exon skipping, size difference: 150
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_003938

  • Changed! pfam BLVR 260aa 1e-141 in ref transcript
    • Bovine leukaemia virus receptor (BLVR). This family consists of several bovine specific leukaemia virus receptors which are thought to function as transmembrane proteins, although their exact function is unknown.
  • pfam Adaptin_N 550aa 1e-132 in ref transcript
    • Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
  • pfam BLVR 194aa 1e-79 in ref transcript
  • COG COG5096 407aa 1e-12 in ref transcript
    • Vesicle coat complex, various subunits [Intracellular trafficking and secretion].
  • Changed! pfam BLVR 253aa 1e-139 in modified transcript

AP3D1

  • rs.AP3D1.F2 rs.AP3D1.R2 101 374
  • NCBIGene 36.3 8943
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 273
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003938

  • pfam BLVR 260aa 1e-141 in ref transcript
    • Bovine leukaemia virus receptor (BLVR). This family consists of several bovine specific leukaemia virus receptors which are thought to function as transmembrane proteins, although their exact function is unknown.
  • Changed! pfam Adaptin_N 550aa 1e-132 in ref transcript
    • Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
  • pfam BLVR 194aa 1e-79 in ref transcript
  • Changed! COG COG5096 407aa 1e-12 in ref transcript
    • Vesicle coat complex, various subunits [Intracellular trafficking and secretion].
  • Changed! pfam Adaptin_N 333aa 3e-61 in modified transcript
  • Changed! COG COG5096 212aa 1e-06 in modified transcript

APAF1

  • rs.APAF1.F1 rs.APAF1.R1 193 322
  • NCBIGene 36.3 317
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181861

  • Changed! cd WD40 304aa 4e-60 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 274aa 5e-47 in ref transcript
  • pfam NB-ARC 286aa 5e-73 in ref transcript
    • NB-ARC domain.
  • pfam CARD 85aa 4e-14 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • smart WD40 40aa 8e-08 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 40aa 2e-06 in ref transcript
  • smart WD40 40aa 8e-05 in ref transcript
  • smart WD40 42aa 1e-04 in ref transcript
  • smart WD40 37aa 1e-04 in ref transcript
  • Changed! pfam eIF2A 114aa 0.003 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • smart WD40 28aa 0.006 in ref transcript
  • smart WD40 34aa 0.008 in ref transcript
  • Changed! COG COG2319 398aa 4e-33 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG COG2319 408aa 5e-25 in ref transcript
  • Changed! cd WD40 264aa 1e-55 in modified transcript
  • Changed! COG COG2319 390aa 1e-33 in modified transcript

APAF1

  • rs.APAF1.F2 rs.APAF1.R2 185 218
  • NCBIGene 36.3 317
  • Alternative 5-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181861

  • cd WD40 304aa 4e-60 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 274aa 5e-47 in ref transcript
  • pfam NB-ARC 286aa 5e-73 in ref transcript
    • NB-ARC domain.
  • pfam CARD 85aa 4e-14 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • smart WD40 40aa 8e-08 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 40aa 2e-06 in ref transcript
  • smart WD40 40aa 8e-05 in ref transcript
  • smart WD40 42aa 1e-04 in ref transcript
  • smart WD40 37aa 1e-04 in ref transcript
  • pfam eIF2A 114aa 0.003 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • smart WD40 28aa 0.006 in ref transcript
  • smart WD40 34aa 0.008 in ref transcript
  • COG COG2319 398aa 4e-33 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG COG2319 408aa 5e-25 in ref transcript

APP

  • rs.APP.F1 rs.APP.R1 245 302
  • NCBIGene 36.3 351
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000484

  • cd KU 52aa 9e-18 in ref transcript
    • BPTI/Kunitz family of serine protease inhibitors; Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure.
  • pfam A4_EXTRA 165aa 8e-89 in ref transcript
    • Amyloid A4 extracellular domain.
  • pfam Kunitz_BPTI 52aa 6e-19 in ref transcript
    • Kunitz/Bovine pancreatic trypsin inhibitor domain. Indicative of a protease inhibitor, usually a serine protease inhibitor. Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure. Certain family members are similar to the tick anticoagulant peptide (TAP). This is a highly selective inhibitor of factor Xa in the blood coagulation pathways. TAP molecules are highly dipolar, and are arranged to form a twisted two- stranded antiparallel beta-sheet followed by an alpha helix.
  • pfam APP_amyloid 43aa 4e-18 in ref transcript
    • beta-amyloid precursor protein C-terminus. This is the amyloid, C-terminal, protein of the beta-Amyloid precursor protein (APP) which is a conserved and ubiquitous transmembrane glycoprotein strongly implicated in the pathogenesis of Alzheimer's disease but whose normal biological function is unknown. The C-terminal 100 residues are released and aggregate into amyloid deposits which are strongly implicated in the pathology of Alzheimer's disease plaque-formation. The domain is associated with family A4_EXTRA, pfam02177, further towards the N-terminus.
  • pfam Beta-APP 34aa 9e-07 in ref transcript
    • Beta-amyloid peptide (beta-APP).
  • pfam OmpH 81aa 0.005 in ref transcript
    • Outer membrane protein (OmpH-like). This family includes outer membrane proteins such as OmpH among others. Skp (OmpH) has been characterised as a molecular chaperone that interacts with unfolded proteins as they emerge in the periplasm from the Sec translocation machinery.
  • Changed! COG SbcC 197aa 0.006 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! PRK xseA 173aa 0.001 in modified transcript
    • exodeoxyribonuclease VII large subunit; Reviewed.

ARFGAP1

  • rs.ARFGAP1.F1 rs.ARFGAP1.R1 169 193
  • NCBIGene 36.3 55738
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_175609

  • pfam ArfGap 118aa 3e-48 in ref transcript
    • Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • COG COG5347 159aa 4e-31 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].

ARHGAP9

  • rs.ARHGAP9.F1 rs.ARHGAP9.R1 295 401
  • NCBIGene 36.3 64333
  • Single exon skipping, size difference: 106
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032496

  • Changed! cd RhoGAP_ARHGAP27_15_12_9 205aa 6e-82 in ref transcript
    • RhoGAP_ARHGAP27_15_12_9: GTPase-activator protein (GAP) domain for Rho-like GTPases found in ARHGAP27 (also called CAMGAP1), ARHGAP15, 12 and 9-like proteins; This subgroup of ARHGAPs are multidomain proteins that contain RhoGAP, PH, SH3 and WW domains. Most members that are studied show GAP activity towards Rac1, some additionally show activity towards Cdc42. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • cd PH 109aa 2e-09 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • cd SH3 54aa 0.002 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! pfam RhoGAP 171aa 7e-43 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
  • pfam PH 112aa 3e-08 in ref transcript
    • PH domain. PH stands for pleckstrin homology.
  • pfam SH3_1 54aa 3e-05 in ref transcript
    • SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
  • Changed! cd RhoGAP_ARHGAP27_15_12_9 119aa 2e-38 in modified transcript
  • Changed! pfam RhoGAP 103aa 6e-16 in modified transcript

ARID4A

  • rs.ARID4A.F1 rs.ARID4A.R1 146 308
  • NCBIGene 36.3 5926
  • Alternative 5-prime, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002892

  • cd CHROMO 38aa 0.003 in ref transcript
    • Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
  • pfam ARID 96aa 1e-31 in ref transcript
    • ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
  • pfam RBB1NT 93aa 8e-25 in ref transcript
    • RBB1NT (NUC162) domain. This domain is found N terminal to the ARID/BRIGHT domain in DNA-binding proteins of the Retinoblastoma-binding protein 1 family.
  • smart TUDOR 56aa 1e-05 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
  • smart CHROMO 42aa 1e-04 in ref transcript
    • Chromatin organization modifier domain.

ARL13B

  • rs.ARL13B.F1 rs.ARL13B.R1 136 457
  • NCBIGene 36.3 200894
  • Multiple exon skipping, size difference: 321
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_182896

  • Changed! cd Arl2l1_Arl13_like 167aa 3e-79 in ref transcript
    • Arl2l1/Arl13 subfamily. Arl2l1 (Arl2-like protein 1) and Arl13 form a subfamily of the Arf family of small GTPases. Arl2l1 was identified in human cells during a search for the gene(s) responsible for Bardet-Biedl syndrome (BBS). Like Arl6, the identified BBS gene, Arl2l1 is proposed to have cilia-specific functions. Arl13 is found on the X chromosome, but its expression has not been confirmed; it may be a pseudogene.
  • Changed! pfam Arf 171aa 4e-36 in ref transcript
    • ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
  • Changed! pfam DUF1682 43aa 0.007 in ref transcript
    • Protein of unknown function (DUF1682). The members of this family are all hypothetical eukaryotic proteins of unknown function. One member is described as being an adipocyte-specific protein, but no evidence of this was found.
  • TIGR major_cap_HK97 94aa 0.009 in ref transcript
    • This model family represents the major capsid protein component of the heads (capsids) of bacteriophage HK97, phi-105, P27, and related phage. This model represents one of several analogous families lacking detectable sequence similarity. The gene encoding this component is typically located in an operon encoding the small and large terminase subunits, the portal protein and the prohead or maturation protease.
  • Changed! PTZ PTZ00133 167aa 2e-23 in ref transcript
    • ADP-ribosylation factor; Provisional.
  • PRK rne 46aa 0.002 in ref transcript
    • ribonuclease E; Reviewed.
  • Changed! cd Arl2l1_Arl13_like 62aa 4e-25 in modified transcript
  • Changed! pfam Arf 63aa 6e-05 in modified transcript

ARL5A

  • rs.ARL5A.F1 rs.ARL5A.R1 100 358
  • NCBIGene 36.3 26225
  • Alternative 5-prime, size difference: 258
  • Exclusion of the protein initiation site
  • Reference transcript: NM_012097

  • Changed! cd Arl5_Arl8 173aa 1e-94 in ref transcript
    • Arl5/Arl8 subfamily. Arl5 (Arf-like 5) and Arl8, like Arl4 and Arl7, are localized to the nucleus and nucleolus. Arl5 is developmentally regulated during embryogenesis in mice. Human Arl5 interacts with the heterochromatin protein 1-alpha (HP1alpha), a nonhistone chromosomal protein that is associated with heterochromatin and telomeres, and prevents telomere fusion. Arl5 may also play a role in embryonic nuclear dynamics and/or signaling cascades. Arl8 was identified from a fetal cartilage cDNA library. It is found in brain, heart, lung, cartilage, and kidney. No function has been assigned for Arl8 to date.
  • Changed! pfam Arf 158aa 2e-61 in ref transcript
    • ADP-ribosylation factor family. Pfam combines a number of different Prosite families together.
  • Changed! PTZ PTZ00133 179aa 5e-55 in ref transcript
    • ADP-ribosylation factor; Provisional.
  • Changed! cd Arl5_Arl8 138aa 6e-76 in modified transcript
  • Changed! pfam Arf 135aa 3e-50 in modified transcript
  • Changed! PTZ PTZ00133 142aa 4e-43 in modified transcript

ARL6IP4

  • rs.ARL6IP4.F1 rs.ARL6IP4.R1 101 125
  • NCBIGene 36.3 51329
  • Alternative 3-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018694

  • pfam SR-25 73aa 2e-19 in ref transcript
    • Nuclear RNA-splicing-associated protein. SR-25, otherwise known as ADP-ribosylation factor-like factor 6-interacting protein 4, is expressed in virtually all tissues. At the N-terminus there is a repeat of serine-arginine (SR repeat), and towards the middle of the protein there are clusters of both serines and of basic amino acids. The presence of many nuclear localisation signals strongly implies that this is a nuclear protein that may contribute to RNA splicing. SR-25 is also implicated, along with heat-shock-protein-27, as a mediator in the Rac1 (GTPase ras-related C3 botulinum toxin substrate 1) signalling pathway.

ARPC4

  • rs.ARPC4.F1 rs.ARPC4.R1 103 522
  • NCBIGene 36.3 10093
  • Alternative 5-prime, size difference: 419
  • Exclusion of the protein initiation site
  • Reference transcript: NM_005718

  • Changed! pfam ARPC4 168aa 1e-88 in ref transcript
    • ARP2/3 complex 20 kDa subunit (ARPC4). This family consists of several eukaryotic ARP2/3 complex 20 kDa subunit (P20-ARC) proteins. The Arp2/3 protein complex has been implicated in the control of actin polymerisation in cells. The human complex consists of seven subunits which include the actin related proteins Arp2 and Arp3 it has been suggested that the complex promotes actin assembly in lamellipodia and may participate in lamellipodial protrusion.
  • Changed! PTZ PTZ00278 168aa 2e-59 in ref transcript
    • ARP2/3 complex subunit; Provisional.

ARTN

  • rs.ARTN.F1 rs.ARTN.R1 150 174
  • NCBIGene 36.3 9048
  • Alternative 3-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_057090

  • pfam TGF_beta 96aa 8e-06 in ref transcript
    • Transforming growth factor beta like domain.

ASGR2

  • rs.ASGR2.F1 rs.ASGR2.R1 222 279
  • NCBIGene 36.3 433
  • Alternative 5-prime, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080912

  • cd CLECT_DC-SIGN_like 126aa 1e-40 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • Changed! pfam Lectin_N 143aa 5e-55 in ref transcript
    • Hepatic lectin, N-terminal domain.
  • pfam Lectin_C 109aa 4e-32 in ref transcript
    • Lectin C-type domain. This family includes both long and short form C-type.
  • Changed! pfam Lectin_N 143aa 4e-51 in modified transcript

ASGR2

  • rs.ASGR2.F2 rs.ASGR2.R2 112 127
  • NCBIGene 36.3 433
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080912

  • cd CLECT_DC-SIGN_like 126aa 1e-40 in ref transcript
    • CLECT_DC-SIGN_like: C-type lectin-like domain (CTLD) of the type found in human dendritic cell (DC)-specific intercellular adhesion molecule 3-grabbing non-integrin (DC-SIGN) and the related receptor, DC-SIGN receptor (DC-SIGNR). This group also contains proteins similar to hepatic asialoglycoprotein receptor (ASGP-R) and langerin in human. These proteins are type II membrane proteins with a CTLD ectodomain. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. DC-SIGN is thought to mediate the initial contact between dendritic cells and resting T cells, and may also mediate the rolling of DCs on epithelium. DC-SIGN and DC-SIGNR bind to oligosaccharides present on human tissues, as well as, on pathogens including parasites, bacteria, and viruses. DC-SIGN and DC-SIGNR bind to HIV enhancing viral infection of T cells. DC-SIGN and DC-SIGNR are homotetrameric, and contain four CTLDs stabilized by a coiled coil of alpha helices. The hepatic ASGP-R is an endocytic recycling receptor which binds and internalizes desialylated glycoproteins having a terminal galactose or N-acetylgalactosamine residues on their N-linked carbohydrate chains, via the clathrin-coated pit mediated endocytic pathway, and delivers them to lysosomes for degradation. It has been proposed that glycoproteins bearing terminal Sia (sialic acid) alpha2, 6GalNAc and Sia alpha2, 6Gal are endogenous ligands for ASGP-R and that ASGP-R participates in regulating the relative concentration of serum glycoproteins bearing alpha 2,6-linked Sia. The human ASGP-R is a hetero-oligomer composed of two subunits, both of which are found within this group. Langerin is expressed in a subset of dendritic leukocytes, the Langerhans cells (LC). Langerin induces the formation of Birbeck Granules (BGs) and associates with these BGs following internalization. Langerin binds, in a calcium-dependent manner, to glyco-conjugates containing mannose and related sugars mediating their uptake and degradation. Langerin molecules oligomerize as trimers with three CTLDs held together by a coiled-coil of alpha helices.
  • Changed! pfam Lectin_N 143aa 5e-55 in ref transcript
    • Hepatic lectin, N-terminal domain.
  • pfam Lectin_C 109aa 4e-32 in ref transcript
    • Lectin C-type domain. This family includes both long and short form C-type.
  • Changed! pfam Lectin_N 138aa 1e-56 in modified transcript

ASRGL1

  • rs.ASRGL1.F1 rs.ASRGL1.R1 213 290
  • NCBIGene 36.3 80150
  • Alternative 3-prime, size difference: 77
  • Inclusion in 5'UTR
  • Reference transcript: NM_025080

  • cd ASRGL1_like 295aa 1e-100 in ref transcript
    • ASRGL1_like domains, a subfamily of the L-Asparaginase type 2-like enzymes. The wider family includes Glycosylasparaginase, Taspase 1 and L-Asparaginase type 2 enzymes. The proenzymes undergo autoproteolytic cleavage before a threonine to generate alpha and beta subunits. The threonine becomes the N-terminal residue of the beta subunit and is the catalytic residue. ASRGL1, or asparaginase-like 1, has been cloned from mammalian testis cDNA libraries. It has been identified as a sperm antigen that may induce the production of autoantibodies following obstruction of the male reproductive tract, e.g. vasectomy.
  • pfam Asparaginase_2 250aa 5e-56 in ref transcript
    • Asparaginase.
  • COG COG1446 292aa 1e-63 in ref transcript
    • Asparaginase [Amino acid transport and metabolism].

ASS1

  • rs.ASS1.F1 rs.ASS1.R1 202 264
  • NCBIGene 36.3 445
  • Single exon skipping, size difference: 62
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000050

  • Changed! cd Argininosuccinate_Synthase 392aa 1e-161 in ref transcript
    • Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.
  • Changed! pfam Arginosuc_synth 398aa 1e-180 in ref transcript
    • Arginosuccinate synthase. This family contains a PP-loop motif.
  • Changed! PRK PRK00509 402aa 1e-170 in ref transcript
    • argininosuccinate synthase; Provisional.

ASTN2

  • rs.ASTN2.F1 rs.ASTN2.R1 100 112
  • NCBIGene 36.3 23245
  • Alternative 3-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198187

  • smart MACPF 184aa 1e-36 in ref transcript
    • membrane-attack complex / perforin.

ATG4B

  • rs.ATG4B.F1 rs.ATG4B.R1 101 121
  • NCBIGene 36.3 23192
  • Alternative 5-prime, size difference: 20
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_013325

  • pfam Peptidase_C54 301aa 1e-101 in ref transcript
    • Peptidase family C54.

ATG9A

  • rs.ATG9A.F1 rs.ATG9A.R1 125 177
  • NCBIGene 36.3 79065
  • Single exon skipping, size difference: 52
  • Exclusion in 5'UTR
  • Reference transcript: NM_001077198

  • pfam APG9 359aa 1e-143 in ref transcript
    • Autophagy protein Apg9. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. Autophagy is a bulk degradation process induced by starvation in eukaryotic cells. Apg9 plays a direct role in the formation of the cytoplasm to vacuole targeting and autophagic vesicles, possibly serving as a marker for a specialised compartment essential for these vesicle-mediated alternative targeting pathways.

ATP11A

  • rs.ATP11A.F1 rs.ATP11A.R1 142 169
  • NCBIGene 36.3 23250
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 27
  • Inclusion in the protein causing a new stop codon, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032189

  • TIGR ATPase-Plipid 1065aa 0.0 in ref transcript
    • This model describes the P-type ATPase responsible for transporting phospholipids from one leaflet of bilayer membranes to the other. These ATPases are found only in eukaryotes.
  • COG MgtA 1061aa 1e-106 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].

ATP2A3

  • rs.ATP2A3.F1 rs.ATP2A3.R1 104 117
  • NCBIGene 36.3 489
  • Alternative 5-prime, size difference: 13
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_174953

  • cd HAD_like 118aa 0.002 in ref transcript
    • Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
  • TIGR ATPase-IIA1_Ca 937aa 0.0 in ref transcript
    • The calcium P-type ATPases have been characterized as Type IIA based on a phylogenetic analysis which distinguishes this group from the Type IIB PMCA calcium pump modelled by TIGR01517. A separate analysis divides Type IIA into sub-types, SERCA and PMR1 the latter of which is modelled by TIGR01522.
  • pfam Cation_ATPase_N 57aa 1e-12 in ref transcript
    • Cation transporter/ATPase, N-terminus. Members of this families are involved in Na+/K+, H+/K+, Ca++ and Mg++ transport.
  • COG MgtA 980aa 0.0 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].

ATP2B1

  • rs.ATP2B1.F1 rs.ATP2B1.R1 232 386
  • NCBIGene 36.3 490
  • Single exon skipping, size difference: 154
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001682

  • TIGR ATPase-IIB_Ca 707aa 0.0 in ref transcript
    • The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
  • TIGR ATPase-IIB_Ca 281aa 1e-100 in ref transcript
  • TIGR ATPase-IID_K-Na 297aa 7e-18 in ref transcript
    • The Leishmania sequence (GP|3192903), which falls between trusted and noise in this model, may very well turn out to be an active potassium pump.
  • COG MgtA 703aa 1e-127 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].
  • COG MgtA 237aa 2e-26 in ref transcript

ATP2B2

  • rs.ATP2B2.F1 rs.ATP2B2.R1 176 311
  • NCBIGene 36.3 491
  • Multiple exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001331

  • Changed! TIGR ATPase-IIB_Ca 706aa 0.0 in ref transcript
    • The calcium P-type ATPases have been characterized as Type IIB based on a phylogenetic analysis which distinguishes this group from the Type IIA SERCA calcium pump. A separate analysis divides Type IIA into sub-types (SERCA and PMR1) which are modelled by two corresponding HMMs (TIGR01116 and TIGR01522). This model is well separated from those.
  • Changed! TIGR ATPase-IIB_Ca 278aa 2e-78 in ref transcript
  • Changed! COG MgtA 698aa 1e-126 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].
  • Changed! COG MgtA 238aa 2e-24 in ref transcript
  • Changed! TIGR ATPase-IIB_Ca 1031aa 0.0 in modified transcript
  • Changed! COG MgtA 981aa 1e-151 in modified transcript

ATP2C1

  • rs.ATP2C1.F1 rs.ATP2C1.R1 169 199
  • NCBIGene 36.3 27032
  • Alternative 5-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001486

  • cd HAD_like 111aa 2e-04 in ref transcript
    • Haloacid dehalogenase-like hydrolases. The haloacid dehalogenase-like (HAD) superfamily includes L-2-haloacid dehalogenase, epoxide hydrolase, phosphoserine phosphatase, phosphomannomutase, phosphoglycolate phosphatase, P-type ATPase, and many others, all of which use a nucleophilic aspartate in their phosphoryl transfer reaction. All members possess a highly conserved alpha/beta core domain, and many also possess a small cap domain, the fold and function of which is variable. Members of this superfamily are sometimes referred to as belonging to the DDDD superfamily of phosphohydrolases.
  • TIGR ATPase-IIA2_Ca 881aa 0.0 in ref transcript
    • The calcium P-type ATPases have been characterized as Type IIA based on a phylogenetic analysis which distinguishes this group from the Type IIB PMCA calcium pump modelled by TIGR01517. A separate analysis divides Type IIA into sub-types, SERCA and PMR1 the former of which is modelled by TIGR01116.
  • COG MgtA 865aa 0.0 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].

ATP6V0E2

  • rs.ATP6V0E2.F1 rs.ATP6V0E2.R1 155 268
  • NCBIGene 36.3 155066
  • Single exon skipping, size difference: 113
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001100592

  • Changed! pfam ATP_synt_H 43aa 8e-14 in ref transcript
    • ATP synthase subunit H. ATP synthase subunit H is an extremely hydrophobic of approximately 9 kDa. This subunit may be required for assembly of vacuolar ATPase.
  • Changed! pfam ATP_synt_H 65aa 2e-22 in modified transcript

ATP6V1C2

  • rs.ATP6V1C2.F1 rs.ATP6V1C2.R1 139 277
  • NCBIGene 36.3 245973
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039362

  • Changed! pfam V-ATPase_C 414aa 1e-129 in ref transcript
    • V-ATPase subunit C.
  • Changed! COG COG5127 380aa 1e-54 in ref transcript
    • Vacuolar H+-ATPase V1 sector, subunit C [Energy production and conversion].
  • Changed! pfam V-ATPase_C 368aa 1e-135 in modified transcript
  • Changed! COG COG5127 334aa 3e-59 in modified transcript

ATP8A1

  • rs.ATP8A1.F1 rs.ATP8A1.R1 100 145
  • NCBIGene 36.3 10396
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006095

  • Changed! TIGR ATPase-Plipid 1038aa 0.0 in ref transcript
    • This model describes the P-type ATPase responsible for transporting phospholipids from one leaflet of bilayer membranes to the other. These ATPases are found only in eukaryotes.
  • Changed! COG MgtA 1036aa 1e-121 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].
  • Changed! TIGR ATPase-Plipid 1023aa 0.0 in modified transcript
  • Changed! COG MgtA 1021aa 1e-122 in modified transcript

ATPAF1

  • rs.ATPAF1.F1 rs.ATPAF1.R1 136 340
  • NCBIGene 36.3 64756
  • Multiple exon skipping, size difference: 204
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022745

  • Changed! pfam ATP11 244aa 1e-102 in ref transcript
    • ATP11 protein. This family consists of several eukaryotic ATP11 proteins. In Saccharomyces cerevisiae, expression of functional F1-ATPase requires two proteins encoded by the ATP11 and ATP12 genes. Atp11p is a molecular chaperone of the mitochondrial matrix that participates in the biogenesis pathway to form F1, the catalytic unit of the ATP synthase.
  • Changed! pfam ATP11 176aa 3e-49 in modified transcript

ATRX

  • rs.ATRX.F1 rs.ATRX.R1 101 215
  • NCBIGene 36.3 546
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000489

  • cd HELICc 147aa 4e-16 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • cd DEXDc 163aa 4e-11 in ref transcript
    • DEAD-like helicases superfamily. A diverse family of proteins involved in ATP-dependent RNA or DNA unwinding. This domain contains the ATP-binding region.
  • pfam SNF2_N 317aa 9e-84 in ref transcript
    • SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
  • pfam Helicase_C 77aa 4e-16 in ref transcript
    • Helicase conserved C-terminal domain. The Prosite family is restricted to DEAD/H helicases, whereas this domain family is found in a wide variety of helicases and helicase related proteins. It may be that this is not an autonomously folding unit, but an integral part of the helicase.
  • COG HepA 321aa 1e-27 in ref transcript
    • Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
  • COG HepA 194aa 6e-22 in ref transcript

ATXN3

  • rs.ATXN3.F1 rs.ATXN3.R1 108 273
  • NCBIGene 36.3 4287
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004993

  • Changed! pfam Josephin 161aa 2e-69 in ref transcript
    • Josephin.
  • Changed! pfam Josephin 105aa 2e-45 in modified transcript

ATXN7L3

  • rs.ATXN7L3.F1 rs.ATXN7L3.R1 107 128
  • NCBIGene 36.3 56970
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020218

  • pfam Sgf11 45aa 1e-05 in ref transcript
    • Sgf11 (transcriptional regulation protein). The Sgf11 family is a SAGA complex subunit in Saccharomyces cerevisiae. The SAGA complex is a multisubunit protein complex involved in transcriptional regulation. SAGA combines proteins involved in interactions with DNA-bound activators and TATA-binding protein (TBP), as well as enzymes for histone acetylation and deubiquitylation.
  • pfam SCA7 30aa 0.006 in ref transcript
    • SCA7. This domain is found in the protein Sgf73/Sca7 which is a component of the multihistone acetyltransferase complexes SAGA and SILK. This domain is also found in Ataxin-7, a human protein which in its polyglutamine expanded pathological form, is responsible for the neurodegenerative disease spinocerebellar ataxia 7 (SCA7).

AURKA

  • rs.AURKA.F1 rs.AURKA.R1 119 217
  • NCBIGene 36.3 6790
  • Single exon skipping, size difference: 98
  • Exclusion in 5'UTR
  • Reference transcript: NM_198433

  • cd S_TKc 252aa 1e-80 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 240aa 2e-80 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00263 244aa 4e-53 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

AXIN1

  • rs.AXIN1.F1 rs.AXIN1.R1 297 405
  • NCBIGene 36.3 8312
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003502

  • pfam RGS 123aa 2e-31 in ref transcript
    • Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
  • smart DAX 83aa 4e-27 in ref transcript
    • Domain present in Dishevelled and axin. Domain of unknown function.
  • pfam Axin_b-cat_bind 33aa 2e-08 in ref transcript
    • Axin beta-catenin binding domain. This domain is found on the scaffolding protein Axin which is a component of the beta-catenin destruction complex. It competes with the tumour suppressor adenomatous polyposis coli protein (APC) for binding to beta-catenin.

B3GALT5

  • rs.B3GALT5.F1 rs.B3GALT5.R1 237 397
  • NCBIGene 36.3 10317
  • Single exon skipping, size difference: 160
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_033172

  • Changed! pfam Galactosyl_T 191aa 2e-44 in ref transcript
    • Galactosyltransferase. This family includes the galactosyltransferases UDP-galactose:2-acetamido-2-deoxy-D-glucose3beta- galactosyltrans ferase and UDP-Gal:beta-GlcNAc beta 1,3-galactosyltranferase. Specific galactosyltransferases transfer galactose to GlcNAc terminal chains in the synthesis of the lacto-series oligosaccharides types 1 and 2.

BACE2

  • rs.BACE2.F1 rs.BACE2.R1 114 264
  • NCBIGene 36.3 25825
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012105

  • Changed! cd beta_secretase_like 362aa 0.0 in ref transcript
    • Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
  • Changed! pfam Asp 339aa 3e-34 in ref transcript
    • Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
  • Changed! PTZ PTZ00147 338aa 6e-20 in ref transcript
    • histoaspartic protease (plasmepsin I or II); Provisional.
  • Changed! cd beta_secretase_like 312aa 1e-165 in modified transcript
  • Changed! pfam Asp 289aa 8e-33 in modified transcript
  • Changed! PTZ PTZ00147 288aa 1e-17 in modified transcript

BACE2

  • rs.BACE2.F2 rs.BACE2.R2 136 305
  • NCBIGene 36.3 25825
  • Single exon skipping, size difference: 169
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_012105

  • Changed! cd beta_secretase_like 362aa 0.0 in ref transcript
    • Beta-secretase, aspartic-acid protease important in the pathogenesis of Alzheimer's disease. Beta-secretase also called BACE (beta-site of APP cleaving enzyme) or memapsin-2. Beta-secretase is an aspartic-acid protease important in the pathogenesis of Alzheimer's disease, and in the formation of myelin sheaths in peripheral nerve cells. It cleaves amyloid precursor protein (APP) to reveal the N-terminus of the beta-amyloid peptides. The beta-amyloid peptides are the major components of the amyloid plaques formed in the brain of patients with Alzheimer's disease (AD). Since BACE mediates one of the cleavages responsible for generation of AD, it is regarded as a potential target for pharmacological intervention in AD. Beta-secretase is a member of pepsin family of aspartic proteases. Same as other aspartic proteases, beta-secretase is a bilobal enzyme, each lobe contributing a catalytic Asp residue, with an extended active site cleft localized between the two lobes of the molecule. The N- and C-terminal domains, although structurally related by a 2-fold axis, have only limited sequence homology except the vicinity of the active site. This suggests that the enzymes evolved by an ancient duplication event. The enzymes specifically cleave bonds in peptides which have at least six residues in length with hydrophobic residues in both the P1 and P1' positions. The active site is located at the groove formed by the two lobes, with an extended loop projecting over the cleft to form an 11-residue flap, which encloses substrates and inhibitors in the active site. Specificity is determined by nearest-neighbor hydrophobic residues surrounding the catalytic aspartates, and by three residues in the flap. The enzymes are mostly secreted from cells as inactive proenzymes that activate autocatalytically at acidic pH. This family of aspartate proteases is classified by MEROPS as the peptidase family A1 (pepsin A, clan AA).
  • Changed! pfam Asp 339aa 3e-34 in ref transcript
    • Eukaryotic aspartyl protease. Aspartyl (acid) proteases include pepsins, cathepsins, and renins. Two-domain structure, probably arising from ancestral duplication. This family does not include the retroviral nor retrotransposon proteases (pfam00077), which are much smaller and appear to be homologous to a single domain of the eukaryotic asp proteases.
  • Changed! PTZ PTZ00147 338aa 6e-20 in ref transcript
    • histoaspartic protease (plasmepsin I or II); Provisional.
  • Changed! cd beta_secretase_like 305aa 1e-163 in modified transcript
  • Changed! pfam Asp 232aa 5e-29 in modified transcript
  • Changed! PTZ PTZ00147 230aa 4e-16 in modified transcript

BAG5

  • rs.BAG5.F1 rs.BAG5.R1 191 378
  • NCBIGene 36.3 9529
  • Alternative 5-prime, size difference: 187
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001015049

  • smart BAG 78aa 1e-14 in ref transcript
    • BAG domains, present in regulator of Hsp70 proteins. BAG domains, present in Bcl-2-associated athanogene 1 and silencer of death domains.
  • smart BAG 79aa 5e-14 in ref transcript
  • smart BAG 76aa 1e-10 in ref transcript
  • smart BAG 78aa 6e-09 in ref transcript

BANP

  • rs.BANP.F1 rs.BANP.R1 114 180
  • NCBIGene 36.3 54971
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_079837

  • pfam BEN 74aa 1e-13 in ref transcript
    • BEN domain. The BEN domain is found in diverse animal proteins such as BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C, in the chordopoxvirus virosomal protein E5R and in several proteins of polydnaviruses. Computational analysis suggests that the BEN domain mediates protein-DNA and protein-protein interactions during chromatin organisation and transcription.

BBS9

  • rs.BBS9.F1 rs.BBS9.R1 100 115
  • NCBIGene 36.3 27241
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198428

  • COG COG1361 158aa 0.001 in ref transcript
    • S-layer domain [Cell envelope biogenesis, outer membrane].

BCAS3

  • rs.BCAS3.F1 rs.BCAS3.R1 151 196
  • NCBIGene 36.3 54828
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001099432

BCAS4

  • rs.BCAS4.F1 rs.BCAS4.R1 264 361
  • NCBIGene 36.3 55653
  • Single exon skipping, size difference: 97
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017843

BCL2L1

  • rs.BCL2L1.F1 rs.BCL2L1.R1 198 387
  • NCBIGene 36.3 598
  • Alternative 5-prime, size difference: 189
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138578

  • Changed! cd Bcl-2_like 114aa 2e-40 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! TIGR bcl-2 233aa 1e-89 in ref transcript
    • in artificial membranes at acidic pH, proapoptotic Bcl-2 family proteins (including Bax and Bak) probably induce the mitochondrial permeability transition and cytochrome c release by interacting with permeability transition pores, the most important component for pore fomation of which is VDAC.
  • Changed! cd Bcl-2_like 45aa 1e-08 in modified transcript
  • Changed! TIGR bcl-2 125aa 1e-35 in modified transcript
  • Changed! TIGR bcl-2 45aa 6e-09 in modified transcript

BCLAF1

  • rs.BCLAF1.F1 rs.BCLAF1.R1 161 308
  • NCBIGene 36.3 9774
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014739

BCS1L

  • rs.BCS1L.F1 rs.BCS1L.R1 116 325
  • NCBIGene 36.3 617
  • Single exon skipping, size difference: 209
  • Exclusion in 5'UTR
  • Reference transcript: NM_004328

  • cd AAA 157aa 3e-06 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam BCS1_N 168aa 2e-50 in ref transcript
    • BCS1 N terminal. This domain is found at the N terminal of the mitochondrial ATPase BSC1. It encodes the import and intramitochondrial sorting for the protein.
  • pfam AAA 175aa 2e-19 in ref transcript
    • ATPase family associated with various cellular activities (AAA). AAA family proteins often perform chaperone-like functions that assist in the assembly, operation, or disassembly of protein complexes.
  • COG HflB 405aa 1e-18 in ref transcript
    • ATP-dependent Zn proteases [Posttranslational modification, protein turnover, chaperones].

BEST3

  • rs.BEST3.F1 rs.BEST3.R1 323 476
  • NCBIGene 36.3 144453
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032735

  • Changed! pfam Bestrophin 369aa 1e-161 in ref transcript
    • Bestrophin. Bestrophin is a 68-kDa basolateral plasma membrane protein expressed in retinal pigment epithelial cells (RPE). It is encoded by the VMD2 gene, which is mutated in Best macular dystrophy, a disease characterised by a depressed light peak in the electrooculogram. VMD2 encodes a 585-amino acid protein with an approximate mass of 68 kDa which has been designated bestrophin. Bestrophin shares homology with the Caenorhabditis elegans RFP gene family, named for the presence of a conserved arginine (R), phenylalanine (F), proline (P), amino acid sequence motif. Bestrophin is a plasma membrane protein, localised to the basolateral surface of RPE cells consistent with a role for bestrophin in the generation or regulation of the EOG light peak. Bestrophin and other RFP family members represent a new class of chloride channels, indicating a direct role for bestrophin in generating the light peak. The VMD2 gene underlying Best disease was shown to represent the first human member of the RFP-TM protein family. More than 97% of the disease-causing mutations are located in the N-terminal RFP-TM domain implying important functional properties.
  • Changed! pfam Bestrophin 318aa 1e-128 in modified transcript

BID

  • rs.BID.F1 rs.BID.R1 180 250
  • NCBIGene 36.3 637
  • Single exon skipping, size difference: 70
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001196

  • Changed! pfam BID 192aa 1e-76 in ref transcript
    • BH3 interacting domain (BID). BID is a member of the BCL-2 superfamily of proteins are key regulators of programmed cell death, hence this family is related to pfam00452. BID is a pro-apoptotic member of the Bcl-2 superfamily and as such posses the ability to target intracellular membranes and contains the BH3 death domain. The activity of BID is regulated by a Caspase 8-mediated cleavage event, exposing the BH3 domain and significantly changing the surface charge and hydrophobicity, which causes a change of cellular localisation.
  • Changed! pfam BID 96aa 1e-33 in modified transcript

BIRC5

  • rs.BIRC5.F1 rs.BIRC5.R1 375 444
  • NCBIGene 36.3 332
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012271

  • Changed! cd BIR 95aa 3e-14 in ref transcript
    • Baculoviral inhibition of apoptosis protein repeat domain; Found in inhibitors of apoptosis proteins (IAPs) and other proteins. In higher eukaryotes, BIR domains inhibit apoptosis by acting as direct inhibitors of the caspase family of protease enzymes. In yeast, BIR domains are involved in regulating cytokinesis. This novel fold is stabilized by zinc tetrahedrally coordinated by one histidine and three cysteine residues and resembles a classical zinc finger.
  • Changed! smart BIR 97aa 4e-18 in ref transcript
    • Baculoviral inhibition of apoptosis protein repeat. Domain found in inhibitor of apoptosis proteins (IAPs) and other proteins. Acts as a direct inhibitor of caspase enzymes.
  • Changed! cd BIR 72aa 1e-17 in modified transcript
  • Changed! smart BIR 74aa 2e-21 in modified transcript

BIRC7

  • rs.BIRC7.F1 rs.BIRC7.R1 400 454
  • NCBIGene 36.3 79444
  • Alternative 3-prime, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_139317

  • cd BIR 68aa 5e-23 in ref transcript
    • Baculoviral inhibition of apoptosis protein repeat domain; Found in inhibitors of apoptosis proteins (IAPs) and other proteins. In higher eukaryotes, BIR domains inhibit apoptosis by acting as direct inhibitors of the caspase family of protease enzymes. In yeast, BIR domains are involved in regulating cytokinesis. This novel fold is stabilized by zinc tetrahedrally coordinated by one histidine and three cysteine residues and resembles a classical zinc finger.
  • Changed! cd RING 39aa 0.007 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • smart BIR 71aa 4e-23 in ref transcript
    • Baculoviral inhibition of apoptosis protein repeat. Domain found in inhibitor of apoptosis proteins (IAPs) and other proteins. Acts as a direct inhibitor of caspase enzymes.
  • smart RING 34aa 0.003 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).

BMP4

  • rs.BMP4.F1 rs.BMP4.R1 132 341
  • NCBIGene 36.3 652
  • Alternative 5-prime, size difference: 209
  • Exclusion in 5'UTR
  • Reference transcript: NM_001202

  • pfam TGFb_propeptide 236aa 9e-62 in ref transcript
    • TGF-beta propeptide. This propeptide is known as latency associated peptide (LAP) in TGF-beta. LAP is a homodimer which is disulfide linked to TGF-beta binding protein.
  • smart TGFB 101aa 9e-48 in ref transcript
    • Transforming growth factor-beta (TGF-beta) family. Family members are active as disulphide-linked homo- or heterodimers. TGFB is a multifunctional peptide that controls proliferation, differentiation, and other functions in many cell types.

BPTF

  • rs.BPTF.F1 rs.BPTF.R1 103 532
  • NCBIGene 36.3 2186
  • Alternative 5-prime, size difference: 429
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182641

  • cd Bromo_gcn5_like 101aa 1e-46 in ref transcript
    • Bromodomain; Gcn5_like subfamily. Gcn5p is a histone acetyltransferase (HAT) which mediates acetylation of histones at lysine residues; such acetylation is generally correlated with the activation of transcription. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd BAH_plant_2 27aa 8e-04 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • smart BROMO 102aa 1e-27 in ref transcript
    • bromo domain.
  • pfam DDT 61aa 1e-20 in ref transcript
    • DDT domain. This domain is approximately 60 residues in length, and is predicted to be a DNA binding domain. The DDT domain is named after (DNA binding homeobox and Different Transcription factors). It is exclusively associated with nuclear domains, and is thought to be arranged into three alpha helices.
  • pfam PHD 45aa 6e-10 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • smart PHD 47aa 5e-08 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
  • COG COG5076 105aa 8e-14 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].

BRCA1

  • rs.BRCA1.F1 rs.BRCA1.R1 188 311
  • NCBIGene 36.3 672
  • Multiple exon skipping, size difference: 123
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007296

  • cd RING 46aa 1e-07 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • cd BRCT 80aa 2e-04 in ref transcript
    • Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
  • smart BRCT 84aa 6e-09 in ref transcript
    • breast cancer carboxy-terminal domain.
  • smart RING 41aa 9e-09 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG PEX10 43aa 1e-04 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
  • COG RAD18 119aa 0.002 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Signal transduction mechanisms].

BTBD9

  • rs.BTBD9.F1 rs.BTBD9.R1 211 411
  • NCBIGene 36.3 114781
  • Single exon skipping, size difference: 200
  • Exclusion in 5'UTR
  • Reference transcript: NM_052893

  • pfam BTB 105aa 7e-23 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • pfam BACK 99aa 2e-20 in ref transcript
    • BTB And C-terminal Kelch. This domain is found associated with pfam00651 and pfam01344. The BACK domain is found juxtaposed to the BTB domain; they are separated by as little as two residues. This family appears to be closely related to the BTB domain (Finn RD, personal observation).
  • pfam F5_F8_type_C 115aa 4e-06 in ref transcript
    • F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
  • pfam F5_F8_type_C 102aa 1e-04 in ref transcript

BTLA

  • rs.BTLA.F1 rs.BTLA.R1 108 252
  • NCBIGene 36.3 151888
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181780

  • cd IG 75aa 7e-04 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • smart IG_like 92aa 7e-06 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.

BTN2A2

  • rs.BTN2A2.F1 rs.BTN2A2.R1 100 448
  • NCBIGene 36.3 10385
  • Single exon skipping, size difference: 348
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006995

  • smart SPRY 119aa 3e-27 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart PRY 52aa 4e-21 in ref transcript
    • associated with SPRY domains.
  • Changed! pfam V-set 97aa 3e-10 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • pfam C2-set_2 82aa 0.007 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

BTNL8

  • rs.BTNL8.F1 rs.BTNL8.R1 133 263
  • NCBIGene 36.3 79908
  • Alternative 5-prime, size difference: 130
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001040462

  • Changed! smart SPRY 123aa 3e-16 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • Changed! smart PRY 50aa 5e-11 in ref transcript
    • associated with SPRY domains.
  • pfam V-set 106aa 6e-07 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

C10orf125

  • rs.C10orf125.F1 rs.C10orf125.R1 397 466
  • NCBIGene 36.3 282969
  • Alternative 3-prime, size difference: 69
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001098483

  • Changed! pfam RbsD_FucU 139aa 9e-28 in ref transcript
    • RbsD / FucU transport protein family. The Escherichia coli high-affinity ribose-transport system consists of six proteins encoded by the rbs operon (rbsD, rbsA, rbsC, rbsB, rbsK and rbsR). Of the six components, RbsD is the only one whose function is unknown although it is thought that it somehow plays a critical role in PtsG-mediated ribose transport. This family also includes FucU a protein from the fucose biosynthesis operon that is presumably also involved in fucose transport by similarity to RbsD.
  • Changed! COG FucU 145aa 5e-40 in ref transcript
    • Fucose dissimilation pathway protein FucU [Carbohydrate transport and metabolism].
  • Changed! pfam RbsD_FucU 125aa 2e-21 in modified transcript
  • Changed! COG FucU 130aa 3e-33 in modified transcript

C10orf30

  • rs.C10orf30.F1 rs.C10orf30.R1 219 389
  • NCBIGene 36.3 222389
  • Single exon skipping, size difference: 170
  • Exclusion in 5'UTR
  • Reference transcript: NM_152751

  • pfam BEN 84aa 2e-09 in ref transcript
    • BEN domain. The BEN domain is found in diverse animal proteins such as BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C, in the chordopoxvirus virosomal protein E5R and in several proteins of polydnaviruses. Computational analysis suggests that the BEN domain mediates protein-DNA and protein-protein interactions during chromatin organisation and transcription.

C10orf30

  • rs.C10orf30.F1 rs.C10orf30.R2 411 542
  • NCBIGene 36.3 222389
  • Single exon skipping, size difference: 170
  • Exclusion in 5'UTR
  • Reference transcript: NM_152751

  • pfam BEN 84aa 2e-09 in ref transcript
    • BEN domain. The BEN domain is found in diverse animal proteins such as BANP/SMAR1, NAC1 and the Drosophila mod(mdg4) isoform C, in the chordopoxvirus virosomal protein E5R and in several proteins of polydnaviruses. Computational analysis suggests that the BEN domain mediates protein-DNA and protein-protein interactions during chromatin organisation and transcription.

C11orf56

  • rs.C11orf56.F1 rs.C11orf56.R1 151 193
  • NCBIGene 36.3 84067
  • Alternative 5-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032127

  • pfam RAI16-like 331aa 1e-88 in ref transcript
    • Retinoic acid induced 16-like protein. This is the conserved N-terminal 450 residues of a family of proteins described as retinoic acid-induced protein 16-like proteins. The exact function is not known. The proteins are found from worms to humans.

C12orf42

  • rs.C12orf42.F1 rs.C12orf42.R1 380 515
  • NCBIGene 36.3 374470
  • Alternative 5-prime, size difference: 135
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099336

C12orf44

  • rs.C12orf44.F1 rs.C12orf44.R1 164 335
  • NCBIGene 36.3 60673
  • Alternative 5-prime, size difference: 171
  • Exclusion in 5'UTR
  • Reference transcript: NM_021934

  • pfam DUF1649 167aa 1e-56 in ref transcript
    • Protein of unknown function (DUF1649). This family is made up of sequences derived from hypothetical eukaryotic proteins of unknown function.

C14orf100

  • rs.C14orf100.F1 rs.C14orf100.R1 131 149
  • NCBIGene 36.3 51528
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016475

  • pfam DUF766 293aa 1e-113 in ref transcript
    • Protein of unknown function (DUF766). This family consists of several eukaryotic proteins of unknown function.

C14orf104

  • rs.C14orf104.F1 rs.C14orf104.R1 239 383
  • NCBIGene 36.3 55172
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018139

  • pfam PIH1 299aa 3e-84 in ref transcript
    • pre-RNA processing PIH1/Nop17. This domain is involved in pre-rRNA processing. It has has been shown to be required either for nucleolar retention or correct assembly of the box C/D snoRNP in Saccharomyces cerevisiae.

C14orf118

  • rs.C14orf118.F1 rs.C14orf118.R1 128 143
  • NCBIGene 36.3 55668
  • Mutually exclusive exon skipping, size difference: 15
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017926

C14orf130

  • rs.C14orf130.F1 rs.C14orf130.R1 313 447
  • NCBIGene 36.3 55148
  • Single exon skipping, size difference: 134
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_175748

  • Changed! pfam zf-UBR 68aa 0.001 in ref transcript
    • Putative zinc finger in N-recognin (UBR box). This region is found in E3 ubiquitin ligases that recognise N-recognins.

C14orf138

  • rs.C14orf138.F1 rs.C14orf138.R1 210 315
  • NCBIGene 36.3 79609
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024558

  • Changed! pfam Methyltransf_16 172aa 9e-52 in ref transcript
    • Putative methyltransferase.
  • COG COG3897 126aa 8e-04 in ref transcript
    • Predicted methyltransferase [General function prediction only].
  • Changed! pfam Methyltransf_16 170aa 2e-51 in modified transcript

C14orf159

  • rs.C14orf159.F1 rs.C14orf159.R1 100 115
  • NCBIGene 36.3 80017
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001102368

  • Changed! pfam DUF1445 143aa 2e-57 in ref transcript
    • Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
  • Changed! PRK PRK05463 263aa 7e-46 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! pfam DUF1445 138aa 1e-54 in modified transcript
  • Changed! PRK PRK05463 258aa 3e-44 in modified transcript

C14orf159

  • rs.C14orf159.F2 rs.C14orf159.R2 124 202
  • NCBIGene 36.3 80017
  • Single exon skipping, size difference: 78
  • Exclusion in 5'UTR
  • Reference transcript: NM_001102366

  • pfam DUF1445 138aa 1e-54 in ref transcript
    • Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
  • PRK PRK05463 258aa 3e-44 in ref transcript
    • hypothetical protein; Provisional.

C14orf159

  • rs.C14orf159.F3 rs.C14orf159.R3 167 287
  • NCBIGene 36.3 80017
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001102368

  • pfam DUF1445 143aa 2e-57 in ref transcript
    • Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
  • PRK PRK05463 263aa 7e-46 in ref transcript
    • hypothetical protein; Provisional.

C14orf159

  • rs.C14orf159.F4 rs.C14orf159.R4 211 382
  • NCBIGene 36.3 80017
  • Multiple exon skipping, size difference: 171
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001102366

  • pfam DUF1445 138aa 1e-54 in ref transcript
    • Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
  • PRK PRK05463 258aa 3e-44 in ref transcript
    • hypothetical protein; Provisional.

C14orf159

  • rs.C14orf159.F5 rs.C14orf159.R5 109 145
  • NCBIGene 36.3 80017
  • Alternative 5-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001102368

  • Changed! pfam DUF1445 143aa 2e-57 in ref transcript
    • Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
  • Changed! PRK PRK05463 263aa 7e-46 in ref transcript
    • hypothetical protein; Provisional.
  • Changed! pfam DUF1445 131aa 5e-48 in modified transcript
  • Changed! PRK PRK05463 251aa 2e-39 in modified transcript

C14orf159

  • rs.C14orf159.F6 rs.C14orf159.R6 131 376
  • NCBIGene 36.3 80017
  • Alternative 5-prime, size difference: 245
  • Exclusion in 5'UTR
  • Reference transcript: NM_001102366

  • pfam DUF1445 138aa 1e-54 in ref transcript
    • Protein of unknown function (DUF1445). This family represents a conserved region approximately 150 residues long within a number of hypothetical bacterial and eukaryotic proteins of unknown function.
  • PRK PRK05463 258aa 3e-44 in ref transcript
    • hypothetical protein; Provisional.

C14orf179

  • rs.C14orf179.F1 rs.C14orf179.R1 132 147
  • NCBIGene 36.3 112752
  • Mutually exclusive exon skipping, size difference: 15
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_052873

C16orf13

  • rs.C16orf13.F1 rs.C16orf13.R1 102 170
  • NCBIGene 36.3 84326
  • Single exon skipping, size difference: 68
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001040160

  • Changed! pfam DUF938 137aa 6e-48 in ref transcript
    • Protein of unknown function (DUF938). This family consists of several hypothetical proteins from both prokaryotes and eukaryotes. The function of this family is unknown.
  • Changed! pfam DUF938 200aa 6e-81 in modified transcript

C16orf35

  • rs.C16orf35.F1 rs.C16orf35.R1 169 374
  • NCBIGene 36.3 8131
  • Multiple exon skipping, size difference: 205
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077350

  • Changed! pfam UPF0171 471aa 1e-161 in ref transcript
    • Uncharacterised protein family (UPF0171).
  • Changed! pfam UPF0171 63aa 1e-16 in modified transcript

C16orf46

  • rs.C16orf46.F1 rs.C16orf46.R1 318 407
  • NCBIGene 36.3 123775
  • Single exon skipping, size difference: 89
  • Exclusion in 5'UTR
  • Reference transcript: NM_152337

C17orf62

  • rs.C17orf62.F1 rs.C17orf62.R1 114 164
  • NCBIGene 36.3 79415
  • Single exon skipping, size difference: 50
  • Exclusion in 5'UTR
  • Reference transcript: NM_001100407

C17orf62

  • rs.C17orf62.F2 rs.C17orf62.R2 113 155
  • NCBIGene 36.3 79415
  • Single exon skipping, size difference: 42
  • Exclusion in 5'UTR
  • Reference transcript: NM_001033046

C17orf63

  • rs.C17orf63.F1 rs.C17orf63.R1 175 209
  • NCBIGene 36.3 55731
  • Single exon skipping, size difference: 34
  • Exclusion in 5'UTR
  • Reference transcript: NM_018182

C17orf70

  • rs.C17orf70.F1 rs.C17orf70.R1 109 305
  • NCBIGene 36.3 80233
  • Alternative 5-prime, size difference: 196
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001109760

C17orf72

  • rs.C17orf72.F1 rs.C17orf72.R1 100 121
  • NCBIGene 36.3 92340
  • Alternative 3-prime, size difference: 21
  • Inclusion in 5'UTR
  • Reference transcript: XM_001726843

C17orf80

  • rs.C17orf80.F1 rs.C17orf80.R1 275 383
  • NCBIGene 36.3 55028
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017941

C18orf19

  • rs.C18orf19.F1 rs.C18orf19.R1 223 363
  • NCBIGene 36.3 125228
  • Single exon skipping, size difference: 140
  • Exclusion in 5'UTR
  • Reference transcript: NM_001098801

  • pfam DUF1279 88aa 2e-32 in ref transcript
    • Protein of unknown function (DUF1279). This family represents the C-terminus (approx. 120 residues) of a number of eukaryotic proteins of unknown function.

C18orf25

  • rs.C18orf25.F1 rs.C18orf25.R1 202 385
  • NCBIGene 36.3 147339
  • Single exon skipping, size difference: 183
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145055

C18orf34

  • rs.C18orf34.F1 rs.C18orf34.R1 171 285
  • NCBIGene 36.3 374864
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001105528

  • COG Smc 233aa 0.005 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

C18orf37

  • rs.C18orf37.F1 rs.C18orf37.R1 333 441
  • NCBIGene 36.3 125476
  • Multiple exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098817

  • pfam YL1_C 30aa 1e-06 in ref transcript
    • YL1 nuclear protein C-terminal domain. This domain is found in proteins of the YL1 family. These proteins have been shown to be DNA-binding and may be a transcription factor. This domain is found in proteins that are not YL1 proteins.
  • COG COG5195 88aa 7e-14 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

C19orf28

  • rs.C19orf28.F1 rs.C19orf28.R1 193 395
  • NCBIGene 36.3 126321
  • Single exon skipping, size difference: 202
  • Exclusion of the stop codon
  • Reference transcript: NM_021731

  • TIGR gph 331aa 5e-08 in ref transcript
    • GPH:cation symporters catalyze uptake of sugars in symport with a monovalent cation (H+ or Na+). Members of this family includes transporters for melibiose, lactose, raffinose, glucuronides, pentosides and isoprimeverose. Mutants of two groups of these symporters (the melibiose permeases of enteric bacteria, and the lactose permease of Streptococcus thermophilus) have been isolated in which altered cation specificity is observed or in which sugar transport is uncoupled from cation symport (i.e., uniport is catalyzed). The various members of the family can use Na+, H+ or Li, Na+ or Li+, H+ or Li+, or only H+ as the symported cation. All of these proteins possess twelve putative transmembrane a-helical spanners.
  • COG MelB 411aa 1e-15 in ref transcript
    • Na+/melibiose symporter and related transporters [Carbohydrate transport and metabolism].

C19orf60

  • rs.C19orf60.F1 rs.C19orf60.R1 181 247
  • NCBIGene 36.3 55049
  • Alternative 3-prime, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100418

C19orf62

  • rs.C19orf62.F1 rs.C19orf62.R1 140 178
  • NCBIGene 36.3 29086
  • Alternative 3-prime, size difference: 38
  • Inclusion in 5'UTR
  • Reference transcript: NM_014173

  • TIGR acidobact_VWFA 74aa 0.004 in ref transcript
    • Members of this family are bacterial domains that include a region related to the von Willebrand factor type A (VWFA) domain (pfam00092). These domains are restricted to, and have undergone a large paralogous family expansion in, the Acidobacteria, including Solibacter usitatus and Acidobacterium capsulatum ATCC 51196.

C1D

  • rs.C1D.F1 rs.C1D.R1 100 113
  • NCBIGene 36.3 10438
  • Alternative 5-prime, size difference: 13
  • Exclusion in 5'UTR
  • Reference transcript: NM_006333

  • pfam Sas10_Utp3 80aa 4e-14 in ref transcript
    • Sas10/Utp3/C1D family. This family contains Utp3 and LCP5 which are components of the U3 ribonucleoprotein complex. It also includes the human C1D protein and Saccharomyces cerevisiae YHR081W (rrp47), an exosome-associated protein required for the 3' processing of stable RNAs, and Sas10 which has been identified as a regulator of chromatin silencing. This family also includes the human protein Neuroguidin an initiation factor 4E (eIF4E) binding protein.

C1orf116

  • rs.C1orf116.F1 rs.C1orf116.R1 129 315
  • NCBIGene 36.3 79098
  • Single exon skipping, size difference: 186
  • Exclusion of the protein initiation site
  • Reference transcript: NM_023938

C1orf43

  • rs.C1orf43.F1 rs.C1orf43.R1 291 393
  • NCBIGene 36.3 25912
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098616

  • Changed! pfam NICE-3 189aa 1e-85 in ref transcript
    • NICE-3 protein. This family consists of several eukaryotic NICE-3 and related proteins. The gene coding for NICE-3 is part of the epidermal differentiation complex (EDC) which comprises a large number of genes that are of crucial importance for the maturation of the human epidermis. The function of NICE-3 is unknown.
  • Changed! pfam NICE-3 155aa 6e-59 in modified transcript

C20orf71

  • rs.C20orf71.F1 rs.C20orf71.R1 309 417
  • NCBIGene 36.3 128861
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_178466

  • Changed! pfam LBP_BPI_CETP 156aa 1e-15 in ref transcript
    • LBP / BPI / CETP family, N-terminal domain. The N and C terminal domains of the LBP/BPI/CETP family are structurally similar.
  • Changed! pfam LBP_BPI_CETP 79aa 2e-10 in modified transcript

C21orf33

  • rs.C21orf33.F1 rs.C21orf33.R1 205 298
  • NCBIGene 36.3 8209
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004649

  • Changed! cd GATase1_ES1 222aa 6e-89 in ref transcript
    • Type 1 glutamine amidotransferase (GATase1)-like domain found in zebrafish ES1. This group includes, proteins similar to ES1, Escherichia coli enhancing lycopene biosynthesis protein 2, Azospirillum brasilense iaaC and, human HES1. The catalytic triad typical of GATase1domains is not conserved in this GATase1-like domain. However, in common with GATase1domains a reactive cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. Zebrafish ES1 is expressed specifically in adult photoreceptor cells and appears to be a cytoplasmic protein. A. brasilense iaaC is involved in controlling IAA biosynthesis.
  • Changed! pfam DJ-1_PfpI 168aa 2e-14 in ref transcript
    • DJ-1/PfpI family. The family includes the protease PfpI. This domain is also found in transcriptional regulators.
  • Changed! PRK PRK11780 224aa 1e-83 in ref transcript
    • isoprenoid biosynthesis protein with amidotransferase-like domain; Provisional.
  • Changed! cd GATase1_ES1 191aa 4e-69 in modified transcript
  • Changed! pfam DJ-1_PfpI 137aa 7e-06 in modified transcript
  • Changed! PRK PRK11780 193aa 6e-64 in modified transcript

C21orf34

  • rs.C21orf34.F1 rs.C21orf34.R1 240 283
  • NCBIGene 36.3 388815
  • Single exon skipping, size difference: 115
  • Inclusion in 3'UTR
  • Reference transcript: NM_001005734

C21orf91

  • rs.C21orf91.F1 rs.C21orf91.R1 209 272
  • NCBIGene 36.3 54149
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100420

  • Changed! pfam EURL 291aa 1e-145 in ref transcript
    • EURL protein. This family consists of several animal EURL proteins. EURL is preferentially expressed in chick retinal precursor cells as well as in the anterior epithelial cells of the lens at early stages of development. EURL transcripts are found primarily in the peripheral dorsal retina, i.e., the most undifferentiated part of the dorsal retina. EURL transcripts are also detected in the lens at stage 18 and remain abundant in the proliferating epithelial cells of the lens until at least day 11. The distribution pattern of EURL in the developing retina and lens suggest a role before the events leading to cell determination and differentiation.
  • Changed! pfam EURL 221aa 1e-108 in modified transcript

C2orf56

  • rs.C2orf56.F1 rs.C2orf56.R1 349 430
  • NCBIGene 36.3 55471
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144736

  • pfam DUF185 247aa 5e-33 in ref transcript
    • Uncharacterized ACR, COG1565. This family contains several uncharacterised proteins. One member has been described as an ATP synthase beta subunit transcription termination factor rho protein.
  • Changed! COG COG1565 357aa 4e-61 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! COG COG1565 330aa 1e-44 in modified transcript

C5orf5

  • rs.C5orf5.F1 rs.C5orf5.R1 225 291
  • NCBIGene 36.3 51306
  • Single exon skipping, size difference: 66
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_016603

  • cd RhoGAP_FAM13A1a 185aa 3e-78 in ref transcript
    • RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude.
  • pfam RhoGAP 152aa 3e-37 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.

C5orf5

  • rs.C5orf5.F2 rs.C5orf5.R2 302 494
  • NCBIGene 36.3 51306
  • Single exon skipping, size difference: 192
  • Exclusion of the protein initiation site
  • Reference transcript: NM_016603

  • Changed! cd RhoGAP_FAM13A1a 185aa 3e-78 in ref transcript
    • RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude.
  • Changed! pfam RhoGAP 152aa 3e-37 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.
  • Changed! cd RhoGAP_FAM13A1a 85aa 4e-32 in modified transcript
  • Changed! pfam RhoGAP 67aa 2e-12 in modified transcript

C5orf5

  • rs.C5orf5.F3 rs.C5orf5.R3 293 377
  • NCBIGene 36.3 51306
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016603

  • cd RhoGAP_FAM13A1a 185aa 3e-78 in ref transcript
    • RhoGAP_FAM13A1a: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of FAM13A1, isoform a-like proteins. The function of FAM13A1a is unknown. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by up several orders of magnitude.
  • pfam RhoGAP 152aa 3e-37 in ref transcript
    • RhoGAP domain. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases.

C6orf162

  • rs.C6orf162.F1 rs.C6orf162.R1 101 122
  • NCBIGene 36.3 57150
  • Single exon skipping, size difference: 21
  • Exclusion in 5'UTR
  • Reference transcript: NM_001042493

C6orf25

  • rs.C6orf25.F1 rs.C6orf25.R1 107 126
  • NCBIGene 36.3 80739
  • Alternative 3-prime, size difference: 19
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_138272

C6orf60

  • rs.C6orf60.F1 rs.C6orf60.R1 174 321
  • NCBIGene 36.3 79632
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024581

  • TIGR SMC_prok_A 303aa 7e-06 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! TIGR SMC_prok_B 226aa 2e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 356aa 3e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

C8orf59

  • rs.C8orf59.F1 rs.C8orf59.R1 136 223
  • NCBIGene 36.3 401466
  • Alternative 5-prime, size difference: 87
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099670

C8orf79

  • rs.C8orf79.F1 rs.C8orf79.R1 278 342
  • NCBIGene 36.3 57604
  • Alternative 5-prime, size difference: 64
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_020844

  • Changed! cd AdoMet_MTases 92aa 6e-10 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! pfam Methyltransf_11 90aa 1e-18 in ref transcript
    • Methyltransferase domain. Members of this family are SAM dependent methyltransferases.
  • Changed! COG UbiE 139aa 3e-19 in ref transcript
    • Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].
  • Changed! pfam Methyltransf_11 38aa 0.006 in modified transcript
  • Changed! COG UbiE 95aa 1e-06 in modified transcript

C9orf116

  • rs.C9orf116.F1 rs.C9orf116.R1 184 230
  • NCBIGene 36.3 138162
  • Alternative 3-prime, size difference: 46
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001048265

C9orf127

  • rs.C9orf127.F1 rs.C9orf127.R1 237 309
  • NCBIGene 36.3 51754
  • Alternative 5-prime, size difference: 72
  • Inclusion in 5'UTR
  • Reference transcript: NM_001042589

C9orf131

  • rs.C9orf131.F1 rs.C9orf131.R1 249 324
  • NCBIGene 36.3 138724
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040412

C9orf5

  • rs.C9orf5.F1 rs.C9orf5.R1 147 171
  • NCBIGene 36.3 23731
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032012

  • COG PerM 163aa 0.009 in ref transcript
    • Predicted permease [General function prediction only].

CADM1

  • rs.CADM1.F1 rs.CADM1.R1 222 306
  • NCBIGene 36.3 23705
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014333

  • cd IGcam 71aa 1e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IG 72aa 0.004 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • smart IG_like 80aa 3e-10 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam C2-set_2 70aa 3e-08 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.
  • smart IG_like 92aa 9e-08 in ref transcript

CADPS2

  • rs.CADPS2.F1 rs.CADPS2.R1 406 535
  • NCBIGene 36.3 93664
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017954

  • cd PH_CADPS 110aa 2e-52 in ref transcript
    • CADPS (Ca2+-dependent activator protein) Pleckstrin homology (PH) domain. CADPS is a calcium-dependent activator involved in secretion. It contains a central PH domain that binds to phosphoinositide 4,5 bisphosphate containing liposomes. However, membrane association may also be mediated by binding to phosphatidlyserine via general electrostatic interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam DUF1041 95aa 1e-13 in ref transcript
    • Domain of Unknown Function (DUF1041). This family consists of several eukaryotic domains of unknown function. Members of this family are often found in tandem repeats and co-occur with pfam00168, pfam00130 and pfam00169 domains.
  • smart PH 102aa 2e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

CALCA

  • rs.CALCA.F1 rs.CALCA.R1 104 128
  • NCBIGene 36.3 796
  • Alternative 5-prime, size difference: 24
  • Exclusion in 5'UTR
  • Reference transcript: NM_001033952

  • smart CALCITONIN 38aa 8e-14 in ref transcript
    • calcitonin. This family is formed by calcitonin, the calcitonin gene-related peptide, and amylin. They are short polypeptide hormones.

CAMK2A

  • rs.CAMK2A.F1 rs.CAMK2A.R1 114 147
  • NCBIGene 36.3 815
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015981

  • cd S_TKc 260aa 1e-81 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 2e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 8e-62 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 261aa 3e-41 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMK2B

  • rs.CAMK2B.F1 rs.CAMK2B.R1 117 162
  • NCBIGene 36.3 816
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001220

  • cd S_TKc 260aa 2e-82 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 1e-85 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 1e-58 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 6e-40 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAMK2G

  • rs.CAMK2G.F1 rs.CAMK2G.R1 111 153
  • NCBIGene 36.3 818
  • Alternative 5-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172171

  • cd S_TKc 260aa 5e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 249aa 2e-86 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CaMKII_AD 128aa 5e-59 in ref transcript
    • Calcium/calmodulin dependent protein kinase II Association. This domain is found at the C-terminus of the Calcium/calmodulin dependent protein kinases II (CaMKII). These proteins also have a Ser/Thr protein kinase domain (pfam00069) at their N-terminus. The function of the CaMKII association domain is the assembly of the single proteins into large (8 to 14 subunits) multimers.
  • PTZ PTZ00263 249aa 3e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

CAP1

  • rs.CAP1.F1 rs.CAP1.R1 104 122
  • NCBIGene 36.3 10487
  • Alternative 5-prime, size difference: 18
  • Exclusion in 5'UTR
  • Reference transcript: NM_006367

  • pfam CAP_N 302aa 1e-109 in ref transcript
    • Adenylate cyclase associated (CAP) N terminal.
  • pfam CAP_C 156aa 5e-71 in ref transcript
    • DE Adenylate cyclase associated (CAP) C terminal.

CAPN3

  • rs.CAPN3.F1 rs.CAPN3.R1 104 122
  • NCBIGene 36.3 825
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000070

  • cd CysPc 353aa 1e-106 in ref transcript
    • Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
  • cd Calpain_III 158aa 1e-54 in ref transcript
    • Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
  • cd EFh 55aa 2e-06 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 68aa 0.004 in ref transcript
  • cd EFh 56aa 0.008 in ref transcript
  • pfam Peptidase_C2 343aa 1e-158 in ref transcript
    • Calpain family cysteine protease.
  • pfam Calpain_III 155aa 8e-65 in ref transcript
    • Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
  • COG FRQ1 99aa 6e-06 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

CAPN3

  • rs.CAPN3.F2 rs.CAPN3.R2 123 401
  • NCBIGene 36.3 825
  • Single exon skipping, size difference: 278
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000070

  • Changed! cd CysPc 353aa 1e-106 in ref transcript
    • Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
  • Changed! cd Calpain_III 158aa 1e-54 in ref transcript
    • Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
  • Changed! cd EFh 55aa 2e-06 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! cd EFh 68aa 0.004 in ref transcript
  • Changed! cd EFh 56aa 0.008 in ref transcript
  • Changed! pfam Peptidase_C2 343aa 1e-158 in ref transcript
    • Calpain family cysteine protease.
  • Changed! pfam Calpain_III 155aa 8e-65 in ref transcript
    • Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
  • Changed! COG FRQ1 99aa 6e-06 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! cd CysPc 99aa 4e-15 in modified transcript
  • Changed! smart CysPc 101aa 3e-27 in modified transcript
    • Calpain-like thiol protease family. Calpain-like thiol protease family (peptidase family C2). Calcium activated neutral protease (large subunit).

CAPN3

  • rs.CAPN3.F3 rs.CAPN3.R3 103 121
  • NCBIGene 36.3 825
  • Single exon skipping, size difference: 18
  • Exclusion in 5'UTR
  • Reference transcript: NM_173090

  • cd EFh 55aa 6e-06 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • COG FRQ1 91aa 2e-06 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

CAPN3

  • rs.CAPN3.F4 rs.CAPN3.R4 188 332
  • NCBIGene 36.3 825
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000070

  • Changed! cd CysPc 353aa 1e-106 in ref transcript
    • Calpains, domains IIa, IIb; calcium-dependent cytoplasmic cysteine proteinases, papain-like. Functions in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction.
  • cd Calpain_III 158aa 1e-54 in ref transcript
    • Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
  • cd EFh 55aa 2e-06 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 68aa 0.004 in ref transcript
  • cd EFh 56aa 0.008 in ref transcript
  • Changed! pfam Peptidase_C2 343aa 1e-158 in ref transcript
    • Calpain family cysteine protease.
  • pfam Calpain_III 155aa 8e-65 in ref transcript
    • Calpain large subunit, domain III. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions.
  • COG FRQ1 99aa 6e-06 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! cd CysPc 305aa 1e-111 in modified transcript
  • Changed! pfam Peptidase_C2 295aa 1e-165 in modified transcript

CASC1

  • rs.CASC1.F1 rs.CASC1.R1 286 348
  • NCBIGene 36.3 55259
  • Alternative 3-prime, size difference: 62
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_018272

CASC4

  • rs.CASC4.F1 rs.CASC4.R1 369 537
  • NCBIGene 36.3 113201
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138423

  • TIGR SMC_prok_B 159aa 3e-06 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 157aa 9e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG FlaD 98aa 0.003 in ref transcript
    • Putative archaeal flagellar protein D/E [Cell motility and secretion].

CASP1

  • rs.CASP1.F1 rs.CASP1.R1 333 477
  • NCBIGene 36.3 834
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033292

  • Changed! cd CASc 250aa 1e-72 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 250aa 5e-94 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam CARD 88aa 5e-20 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • Changed! cd CASc 202aa 7e-58 in modified transcript
  • Changed! smart CASc 202aa 6e-76 in modified transcript

CASP10

  • rs.CASP10.F1 rs.CASP10.R1 165 294
  • NCBIGene 36.3 843
  • Multiple exon skipping, size difference: 129
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032977

  • cd CASc 240aa 1e-82 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • cd DED 79aa 2e-12 in ref transcript
    • Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
  • cd DED 74aa 1e-11 in ref transcript
  • smart CASc 239aa 4e-97 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam DED 83aa 7e-17 in ref transcript
    • Death effector domain.
  • pfam DED 77aa 1e-15 in ref transcript

CASP2

  • rs.CASP2.F1 rs.CASP2.R1 296 514
  • NCBIGene 36.3 835
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 218
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032982

  • Changed! cd CASc 256aa 3e-74 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 256aa 9e-86 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • Changed! smart CARD 89aa 9e-16 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
  • Changed! smart CARD 75aa 6e-13 in modified transcript

CASP3

  • rs.CASP3.F1 rs.CASP3.R1 119 286
  • NCBIGene 36.3 836
  • Single exon skipping, size difference: 167
  • Exclusion in 5'UTR
  • Reference transcript: NM_004346

  • cd CASc 239aa 3e-86 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • smart CASc 241aa 3e-97 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.

CASP4

  • rs.CASP4.F1 rs.CASP4.R1 110 209
  • NCBIGene 36.3 837
  • Single exon skipping, size difference: 99
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001225

  • Changed! cd CASc 250aa 3e-69 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 250aa 2e-93 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • Changed! smart CARD 89aa 2e-10 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.

CASP7

  • rs.CASP7.F1 rs.CASP7.R1 111 144
  • NCBIGene 36.3 840
  • Alternative 3-prime, size difference: 33
  • Exclusion of the protein initiation site
  • Reference transcript: NM_033338

  • cd CASc 242aa 4e-84 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • smart CASc 243aa 2e-90 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.

CASP8

  • rs.CASP8.F1 rs.CASP8.R1 149 175
  • NCBIGene 36.3 841
  • Alternative 3-prime, size difference: 26
  • Exclusion in 5'UTR
  • Reference transcript: NM_001228

  • cd CASc 253aa 2e-84 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • cd DED 75aa 6e-17 in ref transcript
    • Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
  • cd DED 75aa 2e-11 in ref transcript
  • smart CASc 252aa 4e-89 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam DED 84aa 8e-23 in ref transcript
    • Death effector domain.
  • pfam DED 81aa 1e-16 in ref transcript

CASP8

  • rs.CASP8.F2 rs.CASP8.R2 107 175
  • NCBIGene 36.3 841
  • Alternative 5-prime, size difference: 68
  • Exclusion in 5'UTR
  • Reference transcript: NM_001228

  • cd CASc 253aa 2e-84 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • cd DED 75aa 6e-17 in ref transcript
    • Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
  • cd DED 75aa 2e-11 in ref transcript
  • smart CASc 252aa 4e-89 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam DED 84aa 8e-23 in ref transcript
    • Death effector domain.
  • pfam DED 81aa 1e-16 in ref transcript

CASP8

  • rs.CASP8.F3 rs.CASP8.R3 147 212
  • NCBIGene 36.3 841
  • Single exon skipping, size difference: 65
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001080125

  • Changed! cd CASc 253aa 3e-85 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • cd DED 75aa 3e-17 in ref transcript
    • Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
  • cd DED 78aa 7e-13 in ref transcript
  • Changed! smart CASc 252aa 4e-90 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam DED 84aa 5e-23 in ref transcript
    • Death effector domain.
  • pfam DED 82aa 2e-17 in ref transcript

CASP8

  • rs.CASP8.F4 rs.CASP8.R4 288 333
  • NCBIGene 36.3 841
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001080125

  • cd CASc 253aa 3e-85 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • cd DED 75aa 3e-17 in ref transcript
    • Death effector domain. DED is part of a superfamily of death domains which also includes death-domain (DD) and caspase recruitment domain (CARD). Protein-protein interactions involving these domains occur through homotypic interactions, such as DED-DED. Caspases are the primary executioners of apoptosis via proteolytic cascades, and upstream caspases such as caspase-8 and caspase-9 are activated by signaling complexes such as the death inducing signaling complex (DISC) and the apoptosome. Binding of caspases to specific adaptor molecules via DED or CARD domains leads to autoactivation of caspases.
  • cd DED 78aa 7e-13 in ref transcript
  • smart CASc 252aa 4e-90 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • pfam DED 84aa 5e-23 in ref transcript
    • Death effector domain.
  • pfam DED 82aa 2e-17 in ref transcript

CASP9

  • rs.CASP9.F1 rs.CASP9.R1 100 550
  • NCBIGene 36.3 842
  • Multiple exon skipping, size difference: 450
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001229

  • Changed! cd CASc 263aa 5e-78 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues; Cysteine-dependent aspartate-directed proteases that mediate programmed cell death (apoptosis). Caspases are synthesized as inactive zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologs.
  • Changed! smart CASc 261aa 6e-81 in ref transcript
    • Caspase, interleukin-1 beta converting enzyme (ICE) homologues. Cysteine aspartases that mediate programmed cell death (apoptosis). Caspases are synthesised as zymogens and activated by proteolysis of the peptide backbone adjacent to an aspartate. The resulting two subunits associate to form an (alpha)2(beta)2-tetramer which is the active enzyme. Activation of caspases can be mediated by other caspase homologues.
  • smart CARD 91aa 7e-16 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signalling. Mediates homodimerisation. Structure consists of six antiparallel helices arranged in a topology homologue to the DEATH and the DED domain.
  • Changed! cd CASc 125aa 2e-26 in modified transcript
  • Changed! smart CASc 124aa 3e-26 in modified transcript

CAST

  • rs.CAST.F1 rs.CAST.R1 135 192
  • NCBIGene 36.3 831
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001750

  • pfam Calpain_inhib 127aa 5e-19 in ref transcript
    • Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
  • pfam Calpain_inhib 118aa 3e-16 in ref transcript
  • pfam Calpain_inhib 99aa 2e-13 in ref transcript
  • pfam Calpain_inhib 126aa 3e-11 in ref transcript

CAST

  • rs.CAST.F2 rs.CAST.R2 102 141
  • NCBIGene 36.3 831
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001750

  • pfam Calpain_inhib 127aa 5e-19 in ref transcript
    • Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
  • pfam Calpain_inhib 118aa 3e-16 in ref transcript
  • pfam Calpain_inhib 99aa 2e-13 in ref transcript
  • Changed! pfam Calpain_inhib 126aa 3e-11 in ref transcript
  • Changed! pfam Calpain_inhib 113aa 2e-09 in modified transcript

CAST

  • rs.CAST.F3 rs.CAST.R3 101 167
  • NCBIGene 36.3 831
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001750

  • pfam Calpain_inhib 127aa 5e-19 in ref transcript
    • Calpain inhibitor. This region is found multiple times in calpain inhibitor proteins.
  • pfam Calpain_inhib 118aa 3e-16 in ref transcript
  • pfam Calpain_inhib 99aa 2e-13 in ref transcript
  • pfam Calpain_inhib 126aa 3e-11 in ref transcript

CBFA2T2

  • rs.CBFA2T2.F1 rs.CBFA2T2.R1 127 415
  • NCBIGene 36.3 9139
  • Single exon skipping, size difference: 288
  • Exclusion of the protein initiation site
  • Reference transcript: NM_005093

  • pfam TAFH 96aa 3e-38 in ref transcript
    • NHR1 homology to TAF. This corresponds to the region NHR1 that is conserved between the product of the nervy gene in Drosophila and the human mtg8b protein, which is hypothesised to be a transcription factor.
  • pfam NHR2 67aa 2e-32 in ref transcript
    • NHR2 domain like. The NHR2 (Nervy homology 2) domain is found in the ETO protein where it mediates oligomerisation and protein-protein interactions. It forms an alpha-helical tetramer.
  • pfam zf-MYND 37aa 5e-11 in ref transcript
    • MYND finger.

CBFA2T3

  • rs.CBFA2T3.F1 rs.CBFA2T3.R1 348 423
  • NCBIGene 36.3 863
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005187

  • pfam TAFH 96aa 5e-38 in ref transcript
    • NHR1 homology to TAF. This corresponds to the region NHR1 that is conserved between the product of the nervy gene in Drosophila and the human mtg8b protein, which is hypothesised to be a transcription factor.
  • pfam NHR2 67aa 3e-24 in ref transcript
    • NHR2 domain like. The NHR2 (Nervy homology 2) domain is found in the ETO protein where it mediates oligomerisation and protein-protein interactions. It forms an alpha-helical tetramer.
  • pfam zf-MYND 37aa 5e-11 in ref transcript
    • MYND finger.
  • COG NtpE 60aa 0.003 in ref transcript
    • Archaeal/vacuolar-type H+-ATPase subunit E [Energy production and conversion].

CCDC32

  • rs.CCDC32.F1 rs.CCDC32.R1 121 146
  • NCBIGene 36.3 90416
  • Alternative 5-prime, size difference: 25
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001080791

CCDC33

  • rs.CCDC33.F1 rs.CCDC33.R1 111 213
  • NCBIGene 36.3 80125
  • Single exon skipping, size difference: 102
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_025055

CCDC41

  • rs.CCDC41.F1 rs.CCDC41.R1 236 289
  • NCBIGene 36.3 51134
  • Single exon skipping, size difference: 53
  • Exclusion in 5'UTR
  • Reference transcript: NM_016122

  • TIGR SMC_prok_B 327aa 1e-11 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 324aa 5e-06 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

CCDC43

  • rs.CCDC43.F1 rs.CCDC43.R1 270 329
  • NCBIGene 36.3 124808
  • Single exon skipping, size difference: 59
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_144609

CCHCR1

  • rs.CCHCR1.F1 rs.CCHCR1.R1 148 256
  • NCBIGene 36.3 54535
  • Alternative 3-prime, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001105564

  • Changed! pfam HCR 756aa 0.0 in ref transcript
    • Alpha helical coiled-coil rod protein (HCR). This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation.
  • Changed! COG Smc 377aa 0.003 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! pfam HCR 750aa 0.0 in modified transcript

CCNB1IP1

  • rs.CCNB1IP1.F1 rs.CCNB1IP1.R1 246 446
  • NCBIGene 36.3 57820
  • Single exon skipping, size difference: 200
  • Exclusion in 5'UTR
  • Reference transcript: NM_021178

CCNC

  • rs.CCNC.F1 rs.CCNC.R1 140 362
  • NCBIGene 36.3 892
  • Alternative 5-prime, size difference: 222
  • Exclusion of the protein initiation site
  • Reference transcript: NM_005190

  • Changed! cd CYCLIN 52aa 4e-06 in ref transcript
    • Cyclin box fold. Protein binding domain functioning in cell-cycle and transcription control. Present in cyclins, TFIIB and Retinoblastoma (RB).The cyclins consist of 8 classes of cell cycle regulators that regulate cyclin dependent kinases (CDKs). TFIIB is a transcription factor that binds the TATA box. Cyclins, TFIIB and RB contain 2 copies of the domain.
  • cd CYCLIN 84aa 0.008 in ref transcript
  • Changed! TIGR ccl1 250aa 7e-21 in ref transcript
    • University).
  • Changed! COG CCL1 243aa 4e-30 in ref transcript
    • Cdk activating kinase (CAK)/RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH/TFIIK, cyclin H subunit [Cell division and chromosome partitioning / Transcription / DNA replication, recombination, and repair].
  • Changed! TIGR ccl1 169aa 4e-12 in modified transcript
  • Changed! COG CCL1 157aa 2e-18 in modified transcript

CCR5

  • rs.CCR5.F1 rs.CCR5.R1 100 335
  • NCBIGene 36.3 1234
  • Alternative 3-prime, size difference: 235
  • Inclusion in 5'UTR
  • Reference transcript: NM_001100168

  • pfam 7tm_1 239aa 8e-30 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

CD1E

  • rs.CD1E.F1 rs.CD1E.R1 129 233
  • NCBIGene 36.3 913
  • Alternative 5-prime, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_030893

  • Changed! cd IGc 93aa 3e-11 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! smart IGc1 69aa 7e-16 in ref transcript
    • Immunoglobulin C-Type.
  • Changed! cd IGc 59aa 1e-06 in modified transcript
  • Changed! smart IGc1 45aa 8e-10 in modified transcript

CD1E

  • rs.CD1E.F2 rs.CD1E.R2 193 229
  • NCBIGene 36.3 913
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_030893

  • cd IGc 93aa 3e-11 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • smart IGc1 69aa 7e-16 in ref transcript
    • Immunoglobulin C-Type.

CD33

  • rs.CD33.F1 rs.CD33.R1 125 506
  • NCBIGene 36.3 945
  • Single exon skipping, size difference: 381
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001772

  • Changed! pfam V-set 114aa 7e-09 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • pfam C2-set_2 74aa 4e-04 in ref transcript
    • CD80-like C2-set immunoglobulin domain. These domains belong to the immunoglobulin superfamily.

CD3D

  • rs.CD3D.F1 rs.CD3D.R1 306 438
  • NCBIGene 36.3 915
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000732

  • smart ITAM 21aa 0.001 in ref transcript
    • Immunoreceptor tyrosine-based activation motif. Motif that may be dually phosphorylated on tyrosine that links antigen receptors to downstream signalling machinery.

CD40

  • rs.CD40.F1 rs.CD40.R1 142 204
  • NCBIGene 36.3 958
  • Single exon skipping, size difference: 62
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001250

  • cd TNFR 98aa 8e-16 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! cd TNFR 63aa 3e-04 in ref transcript
  • smart TNFR 39aa 0.004 in ref transcript
    • Tumor necrosis factor receptor / nerve growth factor receptor repeats. Repeats in growth factor receptors that are involved in growth factor binding. TNF/TNFR.

CD44

  • rs.CD44.F1 rs.CD44.R1 225 354
  • NCBIGene 36.3 960
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000610

  • cd Link_domain_CD44_like 145aa 3e-50 in ref transcript
    • This domain is a hyaluronan (HA)-binding domain. It is found in CD44 receptor and mediates adhesive interactions during inflammatory leukocyte homing and tumor metastasis. It also plays an important role in arteriogenesis. The functional HA-binding domain of CD44 is an extended domain comprised of a single link module flanked with N-and C- extensions. These extensions are essential for folding and for functional activity. This group also contains the cell surface retention sequence (CRS) binding protein-1 (CRSBP-1) and lymph vessel endothelial receptor-1 (LYVE-1). CRSBP-1 is a cell surface binding protein for the CRS motif of PDGF-BB (platelet-derived growth factor-BB) and is responsible for the cell surface retention of PDGF-BB in SSV-transformed cells. CRSBP-1 may play a role in autocrine regulation of cell growth mediated by CRS containing growth regulators. LYVE-1 is preferentially expressed on the lymphatic endothelium and is used as a molecular marker for the detection and characterization of lymphatic vessels in tumors.
  • smart LINK 91aa 1e-32 in ref transcript
    • Link (Hyaluronan-binding).

CD47

  • rs.CD47.F1 rs.CD47.R1 168 201
  • NCBIGene 36.3 961
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001777

  • pfam CD47 158aa 2e-40 in ref transcript
    • CD47 transmembrane region. This family represents the transmembrane region of CD47 leukocyte antigen.
  • pfam V-set_CD47 135aa 2e-38 in ref transcript
    • CD47 immunoglobulin-like domain. This family represents the CD47 leukocyte antigen V-set like Ig domain.

CD59

  • rs.CD59.F1 rs.CD59.R1 105 221
  • NCBIGene 36.3 966
  • Single exon skipping, size difference: 116
  • Exclusion in 5'UTR
  • Reference transcript: NM_203330

  • cd LU 72aa 2e-08 in ref transcript
    • Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
  • pfam UPAR_LY6 68aa 7e-12 in ref transcript
    • u-PAR/Ly-6 domain. This extracellular disulphide bond rich domain is related to pfam00087.

CD82

  • rs.CD82.F1 rs.CD82.R1 475 550
  • NCBIGene 36.3 3732
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002231

  • Changed! cd CD37_CD82_like_LEL 122aa 5e-39 in ref transcript
    • Tetraspanin, extracellular domain or large extracellular loop (LEL), CD37_CD82_Like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". CD37 is a leukocyte-specific protein, and its restricted expression pattern suggests a role in the immune system. A regulatory role in T-cell proliferation has been suggested. CD82 is a metastasis suppressor implicated in biological processes ranging from fusion, adhesion, and migration to apoptosis and alterations of cell morphology.
  • Changed! pfam Tetraspannin 250aa 4e-28 in ref transcript
    • Tetraspanin family.
  • Changed! cd CD37_CD82_like_LEL 117aa 1e-36 in modified transcript
  • Changed! pfam Tetraspannin 225aa 6e-24 in modified transcript

CDC2

  • rs.CDC2.F1 rs.CDC2.R1 111 282
  • NCBIGene 36.3 983
  • Single exon skipping, size difference: 171
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001786

  • Changed! cd S_TKc 285aa 6e-76 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! pfam Pkinase 284aa 4e-82 in ref transcript
    • Protein kinase domain.
  • Changed! PTZ PTZ00024 278aa 3e-51 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 228aa 7e-41 in modified transcript
  • Changed! pfam Pkinase 227aa 1e-48 in modified transcript
  • Changed! PTZ PTZ00024 141aa 2e-27 in modified transcript
  • Changed! PTZ PTZ00266 82aa 6e-09 in modified transcript
    • NIMA-related protein kinase; Provisional.

CDC2L1

  • rs.CDC2L1.F1 rs.CDC2L1.R1 107 154
  • NCBIGene 36.3 984
  • Single exon skipping, size difference: 47
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_033486

  • Changed! cd S_TKc 287aa 3e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 276aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 295aa 4e-55 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

CDC42BPA

  • rs.CDC42BPA.F1 rs.CDC42BPA.R1 172 415
  • NCBIGene 36.3 8476
  • Single exon skipping, size difference: 243
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003607

  • cd STKc_MRCK_alpha 332aa 0.0 in ref transcript
    • STKc_MRCK_alpha: Serine/Threonine Kinases (STKs), DMPK-like subfamily, DMPK-related cell division control protein 42 (Cdc42) binding kinase (MRCK) alpha isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The DMPK-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MRCK is activated via interaction with the small GTPase Cdc42. MRCK/Cdc42 signaling mediates myosin-dependent cell motility. MRCKalpha is expressed ubiquitously in many tissues. It plays a role in the regulation of peripheral actin reorganization and neurite outgrowth. It may also play a role in the transferrin iron uptake pathway.
  • cd PH_MRCK 122aa 2e-61 in ref transcript
    • MRCK (myotonic dystrophy-related Cdc42-binding kinase) pleckstrin homology (PH) domain. MRCK consists of a serine/threonine kinase domain, a cysteine rich (C1) region, a PH domain and a p21 binding motif. It has been shown to promote cytoskeletal reorganization, which affects many biological processes. The MRCK PH domain is responsible for its targeting to cell to cell junctions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • cd C1 50aa 1e-09 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • cd CRIB 36aa 3e-04 in ref transcript
    • PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. CRIB-containing effector proteins are functionally diverse and include serine/threonine kinases, tyrosine kinases, actin-binding proteins, and adapter molecules.
  • smart S_TKc 257aa 2e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam CNH 264aa 6e-64 in ref transcript
    • CNH domain. Domain found in NIK1-like kinase, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • pfam DMPK_coil 61aa 1e-17 in ref transcript
    • DMPK coiled coil domain like. This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation.
  • Changed! TIGR SMC_prok_B 306aa 4e-14 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart S_TK_X 60aa 1e-11 in ref transcript
    • Extension to Ser/Thr-type protein kinases.
  • pfam C1_1 51aa 8e-10 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
  • Changed! pfam SMC_N 338aa 1e-08 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • smart PBD 36aa 3e-05 in ref transcript
    • P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
  • smart PH 117aa 2e-04 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • PTZ PTZ00263 335aa 3e-63 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! COG SbcC 545aa 5e-17 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • COG ROM1 138aa 3e-07 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].
  • Changed! pfam SMC_N 329aa 6e-09 in modified transcript
  • Changed! COG SbcC 464aa 3e-11 in modified transcript

CDKN1A

  • rs.CDKN1A.F1 rs.CDKN1A.R1 243 384
  • NCBIGene 36.3 1026
  • Alternative 5-prime, size difference: 141
  • Exclusion in 5'UTR
  • Reference transcript: NM_078467

  • pfam CDI 50aa 8e-13 in ref transcript
    • Cyclin-dependent kinase inhibitor. Cell cycle progression is negatively controlled by cyclin-dependent kinases inhibitors (CDIs). CDIs are involved in cell cycle arrest at the G1 phase.

CDKN2A

  • rs.CDKN2A.F1 rs.CDKN2A.R1 127 401
  • NCBIGene 36.3 1029
  • Alternative 5-prime, size difference: 274
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000077

  • Changed! cd ANK 100aa 1e-12 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! PTZ PTZ00322 81aa 1e-05 in ref transcript
    • 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase; Provisional.

CEACAM20

  • rs.CEACAM20.F1 rs.CEACAM20.R1 102 138
  • NCBIGene 36.3 125931
  • Single exon skipping, size difference: 36
  • Exclusion in 3'UTR
  • Reference transcript: NM_001102597

  • cd IGcam 71aa 1e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 80aa 3e-04 in ref transcript
  • cd IGcam 73aa 6e-04 in ref transcript
  • pfam I-set 84aa 4e-06 in ref transcript
    • Immunoglobulin I-set domain.
  • smart IGc2 59aa 6e-06 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 74aa 7e-06 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.

CEACAM20

  • rs.CEACAM20.F2 rs.CEACAM20.R2 124 403
  • NCBIGene 36.3 125931
  • Single exon skipping, size difference: 279
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001102597

  • Changed! cd IGcam 71aa 1e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 80aa 3e-04 in ref transcript
  • cd IGcam 73aa 6e-04 in ref transcript
  • pfam I-set 84aa 4e-06 in ref transcript
    • Immunoglobulin I-set domain.
  • smart IGc2 59aa 6e-06 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! smart IG_like 74aa 7e-06 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.

CENPA

  • rs.CENPA.F1 rs.CENPA.R1 390 468
  • NCBIGene 36.3 1058
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001809

  • Changed! smart H3 100aa 6e-34 in ref transcript
    • Histone H3.
  • Changed! PTZ PTZ00018 87aa 2e-16 in ref transcript
    • histone 3; Provisional.
  • Changed! smart H3 74aa 3e-25 in modified transcript
  • Changed! PTZ PTZ00018 61aa 5e-14 in modified transcript

CEP170

  • rs.CEP170.F1 rs.CEP170.R1 250 358
  • NCBIGene 36.3 9859
  • Alternative 3-prime, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014812

  • cd FHA 89aa 4e-10 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • pfam FHA 68aa 3e-11 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • COG COG1716 79aa 1e-04 in ref transcript
    • FOG: FHA domain [Signal transduction mechanisms].

CEP170

  • rs.CEP170.F2 rs.CEP170.R2 148 178
  • NCBIGene 36.3 9859
  • Alternative 5-prime, size difference: 30
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_014812

  • cd FHA 89aa 4e-10 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • pfam FHA 68aa 3e-11 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • COG COG1716 79aa 1e-04 in ref transcript
    • FOG: FHA domain [Signal transduction mechanisms].

CEP170

  • rs.CEP170.F3 rs.CEP170.R3 234 528
  • NCBIGene 36.3 9859
  • Single exon skipping, size difference: 294
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014812

  • cd FHA 89aa 4e-10 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • pfam FHA 68aa 3e-11 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • COG COG1716 79aa 1e-04 in ref transcript
    • FOG: FHA domain [Signal transduction mechanisms].

CEP250

  • rs.CEP250.F1 rs.CEP250.R1 185 353
  • NCBIGene 36.3 11190
  • Single exon skipping, size difference: 168
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007186

  • TIGR SMC_prok_B 407aa 8e-12 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • TIGR SMC_prok_B 282aa 3e-07 in ref transcript
  • TIGR SMC_prok_B 209aa 2e-04 in ref transcript
  • pfam SMC_N 108aa 0.009 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! COG Smc 379aa 6e-10 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 261aa 5e-04 in ref transcript
  • Changed! COG Smc 372aa 6e-10 in modified transcript

CEP63

  • rs.CEP63.F1 rs.CEP63.R1 122 260
  • NCBIGene 36.3 80254
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025180

  • Changed! TIGR SMC_prok_B 289aa 9e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! TIGR SMC_prok_A 224aa 7e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! COG Smc 345aa 3e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_A 346aa 8e-09 in modified transcript
  • Changed! TIGR SMC_prok_B 219aa 1e-07 in modified transcript
  • Changed! COG Smc 299aa 2e-06 in modified transcript
  • Changed! COG Smc 222aa 0.001 in modified transcript

CEP78

  • rs.CEP78.F1 rs.CEP78.R1 108 156
  • NCBIGene 36.3 84131
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098802

  • cd LRR_RI 170aa 1e-09 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).

CHCHD4

  • rs.CHCHD4.F1 rs.CHCHD4.R1 193 363
  • NCBIGene 36.3 131474
  • Single exon skipping, size difference: 170
  • Exclusion of the protein initiation site
  • Reference transcript: NM_144636

  • pfam CHCH 44aa 5e-06 in ref transcript
    • CHCH domain. we have identified a conserved motif in the LOC118487 protein that we have called the CHCH motif. Alignment of this protein with related members showed the presence of three subgroups of proteins, which are called the S (Small), N (N-terminal extended) and C (C-terminal extended) subgroups. All three sub-groups of proteins have in common that they contain a predicted conserved [coiled coil 1]-[helix 1]-[coiled coil 2]-[helix 2] domain (CHCH domain). Within each helix of the CHCH domain, there are two cysteines present in a C-X9-C motif. The N-group contains an additional double helix domain, and each helix contains the C-X9-C motif. This family contains a number of characterised proteins: Cox19 protein - a nuclear gene of Saccharomyces cerevisiae, codes for an 11-kDa protein (Cox19p) required for expression of cytochrome oxidase. Because cox19 mutants are able to synthesise the mitochondrial and nuclear gene products of cytochrome oxidase, Cox19p probably functions post-translationally during assembly of the enzyme. Cox19p is present in the cytoplasm and mitochondria, where it exists as a soluble intermembrane protein. This dual location is similar to what was previously reported for Cox17p, a low molecular weight copper protein thought to be required for maturation of the CuA centre of subunit 2 of cytochrome oxidase. Cox19p have four conserved potential metal ligands, these are three cysteines and one histidine. Mrp10 - belongs to the class of yeast mitochondrial ribosomal proteins that are essential for translation. Eukaryotic NADH-ubiquinone oxidoreductase 19 kDa (NDUFA8) subunit. The CHCH domain was previously called DUF657.

CHEK2

  • rs.CHEK2.F1 rs.CHEK2.R1 231 360
  • NCBIGene 36.3 11200
  • Single exon skipping, size difference: 129
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005735

  • cd S_TKc 268aa 7e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! cd FHA 89aa 5e-07 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • smart S_TKc 257aa 4e-75 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam FHA 79aa 8e-08 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • PTZ PTZ00263 226aa 5e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! cd FHA 109aa 1e-07 in modified transcript

CHEK2

  • rs.CHEK2.F2 rs.CHEK2.R2 157 244
  • NCBIGene 36.3 11200
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005735

  • Changed! cd S_TKc 268aa 7e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd FHA 89aa 5e-07 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • Changed! smart S_TKc 257aa 4e-75 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam FHA 79aa 8e-08 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • Changed! PTZ PTZ00263 226aa 5e-43 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • Changed! cd S_TKc 239aa 4e-50 in modified transcript
  • Changed! smart S_TKc 228aa 6e-56 in modified transcript
  • Changed! PTZ PTZ00263 197aa 9e-29 in modified transcript

CHMP1A

  • rs.CHMP1A.F1 rs.CHMP1A.R1 100 120
  • NCBIGene 36.3 5119
  • Single exon skipping, size difference: 20
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001083314

  • Changed! pfam Snf7 140aa 3e-11 in modified transcript
    • Snf7. This family of proteins are involved in protein sorting and transport from the endosome to the vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes play an important role in the degradation of both lipids and cellular proteins. In order to perform this degradative function, vacuoles/lysosomes contain numerous hydrolases which have been transported in the form of inactive precursors via the biosynthetic pathway and are proteolytically activated upon delivery to the vacuole/lysosome. The delivery of transmembrane proteins, such as activated cell surface receptors to the lumen of the vacuole/lysosome, either for degradation/downregulation, or in the case of hydrolases, for proper localisation, requires the formation of multivesicular bodies (MVBs). These late endosomal structures are formed by invaginating and budding of the limiting membrane into the lumen of the compartment. During this process, a subset of the endosomal membrane proteins is sorted into the forming vesicles. Mature MVBs fuse with the vacuole/lysosome, thereby releasing cargo containing vesicles into its hydrolytic lumen for degradation. Endosomal proteins that are not sorted into the intralumenal MVB vesicles are either recycled back to the plasma membrane or Golgi complex, or remain in the limiting membrane of the MVB and are thereby transported to the limiting membrane of the vacuole/lysosome as a consequence of fusion. Therefore, the MVB sorting pathway plays a critical role in the decision between recycling and degradation of membrane proteins. A few archaeal sequences are also present within this family.
  • Changed! COG VPS24 148aa 1e-06 in modified transcript
    • Conserved protein implicated in secretion [Cell motility and secretion].

CHRFAM7A

  • rs.CHRFAM7A.F1 rs.CHRFAM7A.R1 328 392
  • NCBIGene 36.3 89832
  • Single exon skipping, size difference: 64
  • Exclusion of the protein initiation site
  • Reference transcript: NM_139320

  • Changed! TIGR LIC 373aa 1e-107 in ref transcript
    • selective while glycine receptors are anion selective).
  • Changed! TIGR LIC 307aa 3e-78 in modified transcript

CLCC1

  • rs.CLCC1.F1 rs.CLCC1.R1 90 100
  • NCBIGene 36.3 23155
  • Alternative 3-prime, size difference: 10
  • Inclusion in 5'UTR
  • Reference transcript: NM_001048210

  • pfam MCLC 538aa 0.0 in ref transcript
    • Mid-1-related chloride channel (MCLC). This family consists of several mid-1-related chloride channels. mid-1-related chloride channel (MCLC) proteins function as a chloride channel when incorporated in the planar lipid bilayer.

CLCC1

  • rs.CLCC1.F2 rs.CLCC1.R2 101 251
  • NCBIGene 36.3 23155
  • Alternative 3-prime, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001048210

  • Changed! pfam MCLC 538aa 0.0 in ref transcript
    • Mid-1-related chloride channel (MCLC). This family consists of several mid-1-related chloride channels. mid-1-related chloride channel (MCLC) proteins function as a chloride channel when incorporated in the planar lipid bilayer.
  • Changed! pfam MCLC 488aa 0.0 in modified transcript

CLCN6

  • rs.CLCN6.F1 rs.CLCN6.R1 90 100
  • NCBIGene 36.3 1185
  • Alternative 5-prime, size difference: 10
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001286

  • Changed! cd ClC_6_like 322aa 1e-105 in ref transcript
    • ClC-6-like chloride channel proteins. This CD includes ClC-6, ClC-7 and ClC-B, C, D in plants. Proteins in this family are ubiquitous in eukarotes and their functions are unclear. They are expressed in intracellular organelles membranes. This family belongs to the ClC superfamily of chloride ion channels, which share the unique double-barreled architecture and voltage-dependent gating mechanism. The gating is conferred by the permeating anion itself, acting as the gating charge. ClC chloride ion channel superfamily perform a variety of functions including cellular excitability regulation, cell volume regulation, membrane potential stabilization, acidification of intracellular organelles, signal transduction, and transepithelial transport in animals.
  • Changed! cd ClC_6_like 112aa 5e-39 in ref transcript
  • Changed! cd CBS_pair_EriC_assoc_euk_bac 54aa 2e-13 in ref transcript
    • This cd contains two tandem repeats of the cystathionine beta-synthase (CBS pair) domains in the EriC CIC-type chloride channels in eukaryotes and bacteria. These ion channels are proteins with a seemingly simple task of allowing the passive flow of chloride ions across biological membranes. CIC-type chloride channels come from all kingdoms of life, have several gene families, and can be gated by voltage. The members of the CIC-type chloride channel are double-barreled: two proteins forming homodimers at a broad interface formed by four helices from each protein. The two pores are not found at this interface, but are completely contained within each subunit, as deduced from the mutational analyses, unlike many other channels, in which four or five identical or structurally related subunits jointly form one pore. CBS is a small domain originally identified in cystathionine beta-synthase and subsequently found in a wide range of different proteins. CBS domains usually come in tandem repeats, which associate to form a so-called Bateman domain or a CBS pair which is reflected in this model. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain in CLC chloride channel family members have been associated with classic Bartter syndrome, Osteopetrosis, Dent's disease, idiopathic generalized epilepsy, and myotonia.
  • Changed! pfam Voltage_CLC 231aa 4e-28 in ref transcript
    • Voltage gated chloride channel. This family of ion channels contains 10 or 12 transmembrane helices. Each protein forms a single pore. It has been shown that some members of this family form homodimers. In terms of primary structure, they are unrelated to known cation channels or other types of anion channels. Three ClC subfamilies are found in animals. ClC-1 is involved in setting and restoring the resting membrane potential of skeletal muscle, while other channels play important parts in solute concentration mechanisms in the kidney. These proteins contain two pfam00571 domains.
  • Changed! pfam Voltage_CLC 129aa 2e-15 in ref transcript
  • Changed! pfam CBS 58aa 3e-08 in ref transcript
    • CBS domain pair. CBS domains are small intracellular modules that pair together to form a stable globular domain. This family represents a pair of CBS domains, that has been termed a Bateman domain. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet. CBS domains are found attached to a wide range of other protein domains suggesting that CBS domains may play a regulatory role making proteins sensitive to adenosyl carrying ligands. The region containing the CBS domains in Cystathionine-beta synthase is involved in regulation by S-AdoMet. CBS domain pairs from AMPK bind AMP or ATP. The CBS domains from IMPDH and the chloride channel CLC2 bind ATP.
  • Changed! smart CBS 48aa 0.009 in ref transcript
    • Domain in cystathionine beta-synthase and other proteins. Domain present in all 3 forms of cellular life. Present in two copies in inosine monophosphate dehydrogenase, of which one is disordered in the crystal structure [3]. A number of disease states are associated with CBS-containing proteins including homocystinuria, Becker's and Thomsen disease.
  • Changed! COG EriC 287aa 2e-13 in ref transcript
    • Chloride channel protein EriC [Inorganic ion transport and metabolism].
  • Changed! COG EriC 111aa 3e-09 in ref transcript
  • Changed! PRK PRK05567 58aa 3e-05 in ref transcript
    • inositol-5'-monophosphate dehydrogenase; Reviewed.
  • Changed! cd ClC_6_like 188aa 1e-59 in modified transcript
  • Changed! pfam Voltage_CLC 78aa 1e-14 in modified transcript
  • Changed! COG EriC 136aa 2e-07 in modified transcript

CLEC12A

  • rs.CLEC12A.F1 rs.CLEC12A.R1 211 310
  • NCBIGene 36.3 160364
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138337

  • cd CLECT_NK_receptors_like 118aa 8e-30 in ref transcript
    • CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
  • smart CLECT 117aa 4e-15 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

CLEC1B

  • rs.CLEC1B.F1 rs.CLEC1B.R1 268 367
  • NCBIGene 36.3 51266
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016509

  • cd CLECT_NK_receptors_like 117aa 8e-29 in ref transcript
    • CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis upon recognition of markers of healthy self cells. Most Lys49 receptors are inhibitory; some are stimulatory. OCIL inhibits NK cell function via binding to the receptor NKRP1D. Murine OCIL in addition to inhibiting NK cell function inhibits osteoclast differentiation. MAFA clusters with the type I Fc epsilon receptor (FcepsilonRI) and inhibits the mast cells secretory response to FcepsilonRI stimulus. CD72 is a negative regulator of B cell receptor signaling. NKG2D is an activating receptor for stress-induced antigens; human NKG2D ligands include the stress induced MHC-I homologs, MICA, MICB, and ULBP family of glycoproteins Several NKRs have a carbohydrate-binding capacity which is not mediated through calcium ions (e.g. OCIL binds a range of high molecular weight sulfated glycosaminoglycans including dextran sulfate, fucoidan, and gamma-carrageenan sugars). Dectin-1 binds fungal beta-glucans and in involved in the innate immune responses to fungal pathogens. MAFA binds saccharides having terminal alpha-D mannose residues in a calcium-dependent manner. LOX-1 is the major receptor for OxLDL in endothelial cells and thought to play a role in the pathology of atherosclerosis. Some NKRs exist as homodimers (e.g.Lys49, NKG2D, CD69, LOX-1) and some as heterodimers (e.g. CD94/NKG2A). Dectin-1 can function as a monomer in vitro.
  • smart CLECT 116aa 8e-19 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

CLIP1

  • rs.CLIP1.F1 rs.CLIP1.R1 280 385
  • NCBIGene 36.3 6249
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002956

  • pfam CAP_GLY 66aa 6e-27 in ref transcript
    • CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
  • pfam CAP_GLY 66aa 6e-26 in ref transcript
  • TIGR SMC_prok_A 315aa 9e-14 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! TIGR SMC_prok_B 178aa 8e-08 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! TIGR SMC_prok_B 349aa 2e-07 in ref transcript
  • COG NIP100 61aa 2e-11 in ref transcript
    • Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
  • COG Smc 306aa 4e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG NIP100 62aa 2e-10 in ref transcript
  • Changed! PRK PRK03918 578aa 5e-07 in ref transcript
    • chromosome segregation protein; Provisional.
  • Changed! TIGR SMC_prok_B 144aa 9e-06 in modified transcript
  • Changed! pfam SMC_N 392aa 1e-05 in modified transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! PRK PRK03918 543aa 3e-06 in modified transcript

CMTM1

  • rs.CMTM1.F1 rs.CMTM1.R1 109 460
  • NCBIGene 36.3 113540
  • Alternative 5-prime, size difference: 351
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_052999

  • Changed! pfam MARVEL 114aa 0.006 in ref transcript
    • Membrane-associating domain. MARVEL domain-containing proteins are often found in lipid-associating proteins - such as Occludin and MAL family proteins. It may be part of the machinery of membrane apposition events, such as transport vesicle biogenesis.

CNGA3

  • rs.CNGA3.F1 rs.CNGA3.R1 480 534
  • NCBIGene 36.3 1261
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001298

  • cd CAP_ED 117aa 8e-17 in ref transcript
    • effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
  • smart cNMP 120aa 1e-17 in ref transcript
    • Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
  • pfam Ion_trans 140aa 3e-09 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • COG Crp 123aa 1e-06 in ref transcript
    • cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].

CNTN5

  • rs.CNTN5.F1 rs.CNTN5.R1 176 398
  • NCBIGene 36.3 53942
  • Single exon skipping, size difference: 222
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014361

  • cd IGcam 77aa 2e-14 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 86aa 3e-13 in ref transcript
  • cd FN3 92aa 1e-11 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 87aa 4e-11 in ref transcript
  • cd IGcam 92aa 1e-10 in ref transcript
  • cd FN3 84aa 2e-09 in ref transcript
  • cd FN3 93aa 4e-05 in ref transcript
  • cd IGcam 84aa 5e-05 in ref transcript
  • cd FN3 86aa 2e-04 in ref transcript
  • cd IGcam 93aa 0.002 in ref transcript
  • pfam I-set 89aa 1e-16 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 86aa 2e-15 in ref transcript
  • smart IGc2 62aa 2e-13 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 85aa 4e-10 in ref transcript
    • Fibronectin type III domain.
  • pfam I-set 93aa 1e-09 in ref transcript
  • pfam fn3 78aa 7e-07 in ref transcript
  • smart IG_like 74aa 7e-06 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam I-set 91aa 1e-04 in ref transcript
  • pfam fn3 88aa 6e-04 in ref transcript
  • smart FN3 78aa 7e-04 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.

CNTNAP4

  • rs.CNTNAP4.F1 rs.CNTNAP4.R1 173 317
  • NCBIGene 36.3 85445
  • Single exon skipping, size difference: 144
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_138994

  • cd FA58C 123aa 2e-24 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • cd LamG 145aa 1e-21 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 151aa 1e-17 in ref transcript
  • Changed! cd LamG 96aa 3e-10 in ref transcript
  • cd LamG 140aa 1e-09 in ref transcript
  • cd FReD 48aa 0.006 in ref transcript
    • Fibrinogen-related domains (FReDs); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation.
  • smart LamG 127aa 6e-25 in ref transcript
    • Laminin G domain.
  • pfam F5_F8_type_C 121aa 1e-23 in ref transcript
    • F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.
  • pfam Laminin_G_2 124aa 1e-20 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • pfam Laminin_G_2 128aa 5e-13 in ref transcript
  • Changed! pfam Laminin_G_2 98aa 9e-12 in ref transcript
  • smart FBG 45aa 4e-04 in ref transcript
    • Fibrinogen-related domains (FReDs). Domain present at the C-termini of fibrinogen beta and gamma chains, and a variety of fibrinogen-related proteins, including tenascin and Drosophila scabrous.
  • smart COLFI 84aa 0.003 in ref transcript
    • Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
  • pfam EGF 33aa 0.005 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
  • Changed! cd LamG 146aa 7e-18 in modified transcript
  • Changed! pfam Laminin_G_2 127aa 7e-20 in modified transcript

COL11A2

  • rs.COL11A2.F1 rs.COL11A2.R1 193 271
  • NCBIGene 36.3 1302
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080680

  • cd LamG 152aa 1e-05 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • smart COLFI 196aa 6e-85 in ref transcript
    • Fibrillar collagens C-terminal domain. Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.
  • smart TSPN 183aa 7e-56 in ref transcript
    • Thrombospondin N-terminal -like domains. Heparin-binding and cell adhesion domain of thrombospondin.

COL13A1

  • rs.COL13A1.F1 rs.COL13A1.R1 123 165
  • NCBIGene 36.3 1305
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005203

COL13A1

  • rs.COL13A1.F2 rs.COL13A1.R2 101 146
  • NCBIGene 36.3 1305
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005203

COL13A1

  • rs.COL13A1.F3 rs.COL13A1.R3 134 170
  • NCBIGene 36.3 1305
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005203

COLQ

  • rs.COLQ.F1 rs.COLQ.R1 137 239
  • NCBIGene 36.3 8292
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005677

  • TIGR myxo_disulf_rpt 25aa 5e-04 in ref transcript
    • This model represents a sequence region shared between several proteins of Myxococcus xanthus DK 1622 and some eukaryotic proteins that include human pappalysin-1. The region of about 40 amino acids contains several conserved Cys residues presumed to form disulfide bonds. The region appears in up to 13 repeats in Myxococcus.
  • pfam Collagen 63aa 0.004 in ref transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.

COMMD5

  • rs.COMMD5.F1 rs.COMMD5.R1 256 410
  • NCBIGene 36.3 28991
  • Alternative 5-prime, size difference: 154
  • Exclusion in 5'UTR
  • Reference transcript: NM_014066

  • cd Commd5_HCaRG 110aa 3e-46 in ref transcript
    • COMM_Domain containing protein 5, also called HCaRG (hypertension-related, calcium-regulated gene). HCaRG is a nuclear protein that might be involved in cell proliferation; it is negatively regulated by extracellular calcium concentration, and its basal mRNA levels are higher in hypertensive animals. The COMM Domain is found at the C-terminus of a variety of proteins; presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB.
  • cd Commd10 147aa 3e-06 in ref transcript
    • COMM_Domain containing protein 10. The COMM Domain is found at the C-terminus of a variety of proteins; presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB.
  • pfam HCaRG 176aa 1e-43 in ref transcript
    • HCaRG protein. This family consists of several mammalian HCaRG(hypertension-related, calcium-regulated gene) proteins. HCaRG is negatively regulated by extracellular calcium concentration, and its basal mRNA levels are higher in hypertensive animals. HCaRG is a nuclear protein potentially involved in the control of cell proliferation.

COMMD9

  • rs.COMMD9.F1 rs.COMMD9.R1 165 291
  • NCBIGene 36.3 29099
  • Single exon skipping, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014186

  • cd Commd9 108aa 2e-42 in ref transcript
    • COMM_Domain containing protein 9. The COMM Domain is found at the C-terminus of a variety of proteins; presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB.
  • Changed! pfam HCaRG 183aa 9e-27 in ref transcript
    • HCaRG protein. This family consists of several mammalian HCaRG(hypertension-related, calcium-regulated gene) proteins. HCaRG is negatively regulated by extracellular calcium concentration, and its basal mRNA levels are higher in hypertensive animals. HCaRG is a nuclear protein potentially involved in the control of cell proliferation.
  • Changed! pfam HCaRG 125aa 4e-24 in modified transcript

COPA

  • rs.COPA.F1 rs.COPA.R1 119 146
  • NCBIGene 36.3 1314
  • Alternative 3-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098398

  • cd WD40 310aa 9e-60 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam COPI_C 410aa 0.0 in ref transcript
    • Coatomer (COPI) alpha subunit C-terminus. This family represents the C-terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit. Coatomer (COPI) is a large cytosolic protein complex which forms a coat around vesicles budding from the Golgi apparatus. Such coatomer-coated vesicles have been proposed to play a role in many distinct steps of intracellular transport. Note that many family members also contain the pfam04053 domain.
  • Changed! pfam Coatomer_WDAD 452aa 0.0 in ref transcript
    • Coatomer WD associated region.
  • smart WD40 40aa 3e-07 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • pfam WD40 34aa 6e-05 in ref transcript
    • WD domain, G-beta repeat.
  • pfam WD40 39aa 6e-05 in ref transcript
  • smart WD40 36aa 1e-04 in ref transcript
  • smart WD40 39aa 5e-04 in ref transcript
  • COG COG2319 316aa 4e-32 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! pfam Coatomer_WDAD 443aa 0.0 in modified transcript

COPE

  • rs.COPE.F1 rs.COPE.R1 101 254
  • NCBIGene 36.3 11316
  • Single exon skipping, size difference: 153
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007263

  • cd TPR 83aa 0.007 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • Changed! pfam Coatomer_E 291aa 1e-109 in ref transcript
    • Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex.
  • Changed! pfam Coatomer_E 240aa 4e-84 in modified transcript

CPEB1

  • rs.CPEB1.F1 rs.CPEB1.R1 100 115
  • NCBIGene 36.3 64506
  • Alternative 5-prime, size difference: 15
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_030594

  • Changed! cd RRM 68aa 5e-05 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 60aa 0.005 in ref transcript
  • Changed! smart RRM_2 63aa 0.001 in ref transcript
    • RNA recognition motif.
  • Changed! cd RRM 73aa 1e-04 in modified transcript
  • Changed! smart RRM_2 68aa 0.005 in modified transcript

CPSF4

  • rs.CPSF4.F1 rs.CPSF4.R1 112 187
  • NCBIGene 36.3 10898
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006693

  • smart ZnF_C3H1 26aa 0.002 in ref transcript
    • zinc finger.
  • COG YTH1 152aa 1e-21 in ref transcript
    • Cleavage and polyadenylation specificity factor (CPSF) Clipper subunit and related makorin family Zn-finger proteins [General function prediction only].

CPZ

  • rs.CPZ.F1 rs.CPZ.R1 132 395
  • NCBIGene 36.3 8532
  • Alternative 5-prime and 3-prime, size difference: 263
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001014447

  • Changed! cd M14_CPZ 395aa 0.0 in ref transcript
    • Peptidase M14-like domain of carboxypeptidase (CP) Z (CPZ), CPZ belongs to the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPZ is a secreted Zn-dependent enzyme whose biological function is largely unknown. Unlike other members of the N/E subfamily, CPZ has a bipartite structure, which consists of an N-terminal cysteine-rich domain (CRD) whose sequence is similar to Wnt-binding proteins, and a C-terminal CP catalytic domain that removes C-terminal Arg residues from substrates. CPZ is enriched in the extracellular matrix and is widely distributed during early embryogenesis. That the CRD of CPZ can bind to Wnt4 suggests that CPZ plays a role in Wnt signaling.
  • Changed! pfam Peptidase_M14 302aa 1e-62 in ref transcript
    • Zinc carboxypeptidase.
  • Changed! smart FRI 118aa 2e-36 in ref transcript
    • Frizzled. Drosophila melanogaster frizzled mediates signalling that polarises a precursor cell along the anteroposterior axis. Homologues of the N-terminal region of frizzled exist either as transmembrane or secreted molecules. Frizzled homologues are reported to be receptors for the Wnt growth factors. (Not yet in MEDLINE: the FRI domain occurs in several receptor tyrosine kinases [Xu, Y.K. and Nusse, Curr. Biol. 8 R405-R406 (1998); Masiakowski, P. and Yanopoulos, G.D., Curr. Biol. 8, R407 (1998)].
  • Changed! COG COG2866 185aa 2e-06 in ref transcript
    • Predicted carboxypeptidase [Amino acid transport and metabolism].

CREBBP

  • rs.CREBBP.F1 rs.CREBBP.R1 209 323
  • NCBIGene 36.3 1387
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004380

  • cd Bromo_cbp_like 108aa 1e-63 in ref transcript
    • Bromodomain, cbp_like subfamily. Cbp (CREB binding protein or CREBBP) is an acetyltransferase acting on histone, which gives a specific tag for transcriptional activation and also acetylates non-histone proteins. CREBBP binds specifically to phosphorylated CREB protein and augments the activity of phosphorylated CREB to activate transcription of cAMP-responsive genes. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
  • cd ZZ_CBP 41aa 2e-19 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in CBP/p300 and related proteins. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. CREB-binding protein (CBP) is a large multidomain protein that provides binding sites for transcriptional coactivators, the role of the ZZ domain in CBP/p300 is unclear.
  • pfam DUF906 234aa 1e-136 in ref transcript
    • Domain of Unknown Function (DUF906).
  • pfam KIX 81aa 4e-38 in ref transcript
    • KIX domain. CBP and P300 bind to the CREB via a domain known as KIX. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun.
  • Changed! pfam zf-TAZ 85aa 1e-35 in ref transcript
    • TAZ zinc finger. The TAZ2 domain of CBP binds to other transcription factors such as the p53 tumour suppressor protein, E1A oncoprotein, MyoD, and GATA-1. The zinc coordinating motif that is necessary for binding to target DNA sequences consists of HCCC.
  • pfam DUF902 61aa 6e-31 in ref transcript
    • Domain of Unknown Function (DUF902).
  • smart BROMO 109aa 5e-29 in ref transcript
    • bromo domain.
  • pfam zf-TAZ 80aa 5e-25 in ref transcript
  • pfam Creb_binding 73aa 6e-16 in ref transcript
    • Creb binding. The Creb binding domain assumes a structure comprising of three alpha-helices which pack in a bundle, exposing a hydrophobic groove between alpha-1 and alpha-3 within which complimentary domains found in the protein 'activator for thyroid hormone and retinoid receptors' (ACTR) can dock. Docking of these domains is required for the recruitment of RNA polymerase II and the basal transcription machinery.
  • smart ZnF_ZZ 43aa 1e-12 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
  • pfam PAT1 136aa 2e-04 in ref transcript
    • Topoisomerase II-associated protein PAT1. Members of this family are necessary for accurate chromosome transmission during cell division.
  • COG COG5076 109aa 1e-12 in ref transcript
    • Transcription factor involved in chromatin remodeling, contains bromodomain [Chromatin structure and dynamics / Transcription].
  • PRK PRK08853 109aa 0.001 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.
  • Changed! pfam zf-TAZ 58aa 5e-20 in modified transcript

CREM

  • rs.CREM.F1 rs.CREM.R1 210 246
  • NCBIGene 36.3 1390
  • Single exon skipping, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182717

  • pfam bZIP_1 53aa 4e-05 in ref transcript
    • bZIP transcription factor. The Pfam entry includes the basic region and the leucine zipper region.

CRKRS

  • rs.CRKRS.F1 rs.CRKRS.R1 129 156
  • NCBIGene 36.3 51755
  • Alternative 3-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016507

  • cd S_TKc 295aa 5e-73 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! pfam Pkinase 294aa 1e-74 in ref transcript
    • Protein kinase domain.
  • PTZ PTZ00024 288aa 1e-45 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! smart S_TKc 284aa 1e-74 in modified transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.

CRTC1

  • rs.CRTC1.F1 rs.CRTC1.R1 186 234
  • NCBIGene 36.3 23373
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098482

CSHL1

  • rs.CSHL1.F1 rs.CSHL1.R1 129 305
  • NCBIGene 36.3 1444
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 176
  • Inclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022579

  • Changed! pfam Hormone_1 190aa 1e-35 in ref transcript
    • Somatotropin hormone family.

CSHL1

  • rs.CSHL1.F2 rs.CSHL1.R2 187 256
  • NCBIGene 36.3 1444
  • Alternative 3-prime, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022579

  • Changed! pfam Hormone_1 190aa 1e-35 in ref transcript
    • Somatotropin hormone family.
  • Changed! pfam Hormone_1 167aa 5e-31 in modified transcript

CSN1S1

  • rs.CSN1S1.F1 rs.CSN1S1.R1 98 122
  • NCBIGene 36.3 1446
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001890

CSN1S1

  • rs.CSN1S1.F2 rs.CSN1S1.R2 243 270
  • NCBIGene 36.3 1446
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001890

CSPP1

  • rs.CSPP1.F1 rs.CSPP1.R1 163 268
  • NCBIGene 36.3 79848
  • Multiple exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077204

CTDP1

  • rs.CTDP1.F1 rs.CTDP1.R1 210 373
  • NCBIGene 36.3 9150
  • Single exon skipping, size difference: 163
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004715

  • cd BRCT 81aa 1e-06 in ref transcript
    • Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
  • cd biotinyl_domain 91aa 1e-05 in ref transcript
    • The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine.
  • Changed! pfam FCP1_C 246aa 2e-83 in ref transcript
    • FCP1, C-terminal. The C-terminal domain of FCP-1 is required for interaction with the carboxy terminal domain of RAP74. Interaction relies extensively on van der Waals contacts between hydrophobic residues situated within alpha-helices in both domains.
  • pfam NIF 167aa 2e-68 in ref transcript
    • NLI interacting factor-like phosphatase. This family contains a number of NLI interacting factor isoforms and also an N-terminal regions of RNA polymerase II CTC phosphatase and FCP1 serine phosphatase. This region has been identified as the minimal phosphatase domain.
  • smart BRCT 87aa 4e-07 in ref transcript
    • breast cancer carboxy-terminal domain.
  • COG FCP1 148aa 2e-13 in ref transcript
    • TFIIF-interacting CTD phosphatases, including NLI-interacting factor [Transcription].
  • PRK PRK09282 112aa 3e-05 in ref transcript
    • pyruvate carboxylase subunit B; Validated.
  • Changed! pfam FCP1_C 91aa 6e-37 in modified transcript

CTH

  • rs.CTH.F1 rs.CTH.R1 208 340
  • NCBIGene 36.3 1491
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001902

  • Changed! cd CGS_like 360aa 1e-155 in ref transcript
    • CGS_like: Cystathionine gamma-synthase is a PLP dependent enzyme and catalyzes the committed step of methionine biosynthesis. This pathway is unique to microorganisms and plants, rendering the enzyme an attractive target for the development of antimicrobials and herbicides. This subgroup also includes cystathionine gamma-lyases (CGL), O-acetylhomoserine sulfhydrylases and O-acetylhomoserine thiol lyases. CGL's are very similar to CGS's. Members of this group are widely distributed among all three forms of life.
  • Changed! pfam Cys_Met_Meta_PP 377aa 1e-165 in ref transcript
    • Cys/Met metabolism PLP-dependent enzyme. This family includes enzymes involved in cysteine and methionine metabolism. The following are members: Cystathionine gamma-lyase, Cystathionine gamma-synthase, Cystathionine beta-lyase, Methionine gamma-lyase, OAH/OAS sulfhydrylase, O-succinylhomoserine sulfhydrylase All of these members participate is slightly different reactions. All these enzymes use PLP (pyridoxal-5'-phosphate) as a cofactor.
  • Changed! COG MetC 386aa 1e-128 in ref transcript
    • Cystathionine beta-lyases/cystathionine gamma-synthases [Amino acid transport and metabolism].
  • Changed! cd CGS_like 316aa 1e-124 in modified transcript
  • Changed! pfam Cys_Met_Meta_PP 333aa 1e-134 in modified transcript
  • Changed! COG MetC 342aa 1e-102 in modified transcript

CTNND1

  • rs.CTNND1.F1 rs.CTNND1.R1 302 375
  • NCBIGene 36.3 1500
  • Alternative 5-prime, size difference: 73
  • Exclusion in 5'UTR
  • Reference transcript: NM_001085458

  • cd ARM 119aa 1e-19 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 103aa 9e-11 in ref transcript
  • cd ARM 142aa 5e-05 in ref transcript
  • smart ARM 42aa 5e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • smart ARM 41aa 1e-04 in ref transcript
  • smart ARM 36aa 6e-04 in ref transcript

CTNND1

  • rs.CTNND1.F2 rs.CTNND1.R2 120 183
  • NCBIGene 36.3 1500
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001085458

  • cd ARM 119aa 1e-19 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 103aa 9e-11 in ref transcript
  • cd ARM 142aa 5e-05 in ref transcript
  • smart ARM 42aa 5e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • smart ARM 41aa 1e-04 in ref transcript
  • smart ARM 36aa 6e-04 in ref transcript

CTNND1

  • rs.CTNND1.F3 rs.CTNND1.R3 102 120
  • NCBIGene 36.3 1500
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001085458

  • cd ARM 119aa 1e-19 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 103aa 9e-11 in ref transcript
  • cd ARM 142aa 5e-05 in ref transcript
  • smart ARM 42aa 5e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • smart ARM 41aa 1e-04 in ref transcript
  • smart ARM 36aa 6e-04 in ref transcript

CTNND1

  • rs.CTNND1.F4 rs.CTNND1.R4 235 322
  • NCBIGene 36.3 1500
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001085458

  • cd ARM 119aa 1e-19 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 103aa 9e-11 in ref transcript
  • cd ARM 142aa 5e-05 in ref transcript
  • smart ARM 42aa 5e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • smart ARM 41aa 1e-04 in ref transcript
  • smart ARM 36aa 6e-04 in ref transcript

CTNND1

  • rs.CTNND1.F5 rs.CTNND1.R5 110 518
  • NCBIGene 36.3 1500
  • Multiple exon skipping, size difference: 408
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_001085458

  • cd ARM 119aa 1e-19 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 103aa 9e-11 in ref transcript
  • cd ARM 142aa 5e-05 in ref transcript
  • smart ARM 42aa 5e-05 in ref transcript
    • Armadillo/beta-catenin-like repeats. Approx. 40 amino acid repeat. Tandem repeats form superhelix of helices that is proposed to mediate interaction of beta-catenin with its ligands. Involved in transducing the Wingless/Wnt signal. In plakoglobin arm repeats bind alpha-catenin and N-cadherin.
  • smart ARM 41aa 1e-04 in ref transcript
  • smart ARM 36aa 6e-04 in ref transcript

CTNS

  • rs.CTNS.F1 rs.CTNS.R1 179 302
  • NCBIGene 36.3 1497
  • Alternative 5-prime, size difference: 123
  • Inclusion in 5'UTR
  • Reference transcript: NM_001031681

  • TIGR 2A43 233aa 1e-77 in ref transcript

CTSB

  • rs.CTSB.F1 rs.CTSB.R1 190 264
  • NCBIGene 36.3 1508
  • Single exon skipping, size difference: 74
  • Exclusion in 5'UTR
  • Reference transcript: NM_147780

  • cd Peptidase_C1A_CathepsinB 248aa 1e-106 in ref transcript
    • Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues.
  • pfam Peptidase_C1 250aa 6e-69 in ref transcript
    • Papain family cysteine protease.
  • pfam Propeptide_C1 41aa 2e-15 in ref transcript
    • Peptidase family C1 propeptide. This motif is found at the N terminal of some members of the Peptidase_C1 family (pfam00112) and is involved in activation of this peptidase.
  • PTZ PTZ00200 232aa 4e-25 in ref transcript
    • cysteine proteinase precursor; Provisional.

CTSB

  • rs.CTSB.F2 rs.CTSB.R2 103 191
  • NCBIGene 36.3 1508
  • Single exon skipping, size difference: 88
  • Exclusion in 5'UTR
  • Reference transcript: NM_147780

  • cd Peptidase_C1A_CathepsinB 248aa 1e-106 in ref transcript
    • Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues.
  • pfam Peptidase_C1 250aa 6e-69 in ref transcript
    • Papain family cysteine protease.
  • pfam Propeptide_C1 41aa 2e-15 in ref transcript
    • Peptidase family C1 propeptide. This motif is found at the N terminal of some members of the Peptidase_C1 family (pfam00112) and is involved in activation of this peptidase.
  • PTZ PTZ00200 232aa 4e-25 in ref transcript
    • cysteine proteinase precursor; Provisional.

CTSB

  • rs.CTSB.F3 rs.CTSB.R3 399 444
  • NCBIGene 36.3 1508
  • Alternative 5-prime, size difference: 45
  • Exclusion in 5'UTR
  • Reference transcript: NM_147781

  • cd Peptidase_C1A_CathepsinB 248aa 1e-106 in ref transcript
    • Cathepsin B group; composed of cathepsin B and similar proteins, including tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin B is a lysosomal papain-like cysteine peptidase which is expressed in all tissues and functions primarily as an exopeptidase through its carboxydipeptidyl activity. Together with other cathepsins, it is involved in the degradation of proteins, proenzyme activation, Ag processing, metabolism and apoptosis. Cathepsin B has been implicated in a number of human diseases such as cancer, rheumatoid arthritis, osteoporosis and Alzheimer's disease. The unique carboxydipeptidyl activity of cathepsin B is attributed to the presence of an occluding loop in its active site which favors the binding of the C-termini of substrate proteins. Some members of this group do not possess the occluding loop. TIN-Ag is an extracellular matrix basement protein which was originally identified as a target Ag involved in anti-tubular basement membrane antibody-mediated interstitial nephritis. It plays a role in renal tubulogenesis and is defective in hereditary tubulointerstitial disorders. TIN-Ag is exclusively expressed in kidney tissues.
  • pfam Peptidase_C1 250aa 6e-69 in ref transcript
    • Papain family cysteine protease.
  • pfam Propeptide_C1 41aa 2e-15 in ref transcript
    • Peptidase family C1 propeptide. This motif is found at the N terminal of some members of the Peptidase_C1 family (pfam00112) and is involved in activation of this peptidase.
  • PTZ PTZ00200 232aa 4e-25 in ref transcript
    • cysteine proteinase precursor; Provisional.

CUGBP1

  • rs.CUGBP1.F1 rs.CUGBP1.R1 104 116
  • NCBIGene 36.3 10658
  • Alternative 3-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025596

  • cd RRM 77aa 6e-17 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 75aa 3e-16 in ref transcript
  • cd RRM 80aa 3e-14 in ref transcript
  • TIGR SF-CC1 181aa 4e-18 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • smart RRM 71aa 1e-15 in ref transcript
    • RNA recognition motif.
  • COG COG0724 93aa 3e-11 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 163aa 6e-09 in ref transcript

CXXC1

  • rs.CXXC1.F1 rs.CXXC1.R1 100 112
  • NCBIGene 36.3 30827
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001101654

  • pfam zf-CXXC 37aa 1e-08 in ref transcript
    • CXXC zinc finger domain. This domain contains eight conserved cysteine residues that bind to two zinc ions. The CXXC domain is found in a variety of chromatin-associated proteins. This domain binds to nonmethyl-CpG dinucleotides. The domain is characterised by two CGXCXXC repeats. The RecQ helicase has a single repeat that also binds to zinc, but this has not been included in this family. The DNA binding interface has been identified by NMR.
  • pfam PHD 48aa 5e-08 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • COG TNG2 75aa 2e-06 in ref transcript
    • Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics].

CYFIP2

  • rs.CYFIP2.F1 rs.CYFIP2.R1 115 504
  • NCBIGene 36.3 26999
  • Alternative 5-prime, size difference: 389
  • Exclusion in 5'UTR
  • Reference transcript: NM_001037332

  • pfam FragX_IP 1131aa 0.0 in ref transcript
    • Cytoplasmic Fragile-X interacting family. CYFIP1/2 (Cytoplasmic fragile X mental retardation interacting protein) like proteins for a highly conserved protein family. The function of CYFIPs is unclear, but CYFIP interaction with fragile X mental retardation interacting protein (FMRP) involves the domain of FMRP which also mediating homo- and heteromerization.

CYLD

  • rs.CYLD.F1 rs.CYLD.R1 243 380
  • NCBIGene 36.3 1540
  • Single exon skipping, size difference: 137
  • Exclusion in 5'UTR
  • Reference transcript: NM_015247

  • cd Peptidase_C19N 272aa 3e-56 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • pfam CAP_GLY 69aa 1e-13 in ref transcript
    • CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
  • pfam CAP_GLY 77aa 9e-12 in ref transcript
  • pfam CAP_GLY 72aa 7e-10 in ref transcript
  • COG NIP100 102aa 0.001 in ref transcript
    • Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].

DACH1

  • rs.DACH1.F1 rs.DACH1.R1 105 549
  • NCBIGene 36.3 1602
  • Multiple exon skipping, size difference: 444
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080759

  • pfam Ski_Sno 115aa 1e-44 in ref transcript
    • SKI/SNO/DAC family. This family contains a presumed domain that is about 100 amino acids long. All members of this family contain a conserved CLPQ motif. The c-ski proto-oncogene has been shown to influence proliferation, morphological transformation and myogenic differentiation. Sno, a Ski proto-oncogene homologue, is expressed in two isoforms and plays a role in the response to proliferation stimuli. Dachshund also contains this domain. It is involved in various aspects of development.

DACT1

  • rs.DACT1.F1 rs.DACT1.R1 355 466
  • NCBIGene 36.3 51339
  • Alternative 3-prime, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016651

DBI

  • rs.DBI.F1 rs.DBI.R1 172 412
  • NCBIGene 36.3 1622
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 240
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_020548

  • Changed! cd ACBP 84aa 7e-29 in ref transcript
    • Acyl CoA binding protein (ACBP) binds thiol esters of long fatty acids and coenzyme A in a one-to-one binding mode with high specificity and affinity. Acyl-CoAs are important intermediates in fatty lipid synthesis and fatty acid degradation and play a role in regulation of intermediary metabolism and gene regulation. The suggested role of ACBP is to act as a intracellular acyl-CoA transporter and pool former. ACBPs are present in a large group of eukaryotic species and several tissue-specific isoforms have been detected.
  • Changed! pfam ACBP 85aa 2e-21 in ref transcript
    • Acyl CoA binding protein.
  • Changed! COG ACB 81aa 3e-21 in ref transcript
    • Acyl-CoA-binding protein [Lipid metabolism].
  • Changed! cd ACBP 85aa 1e-27 in modified transcript
  • Changed! pfam ACBP 85aa 1e-20 in modified transcript
  • Changed! COG ACB 86aa 2e-20 in modified transcript

DBNDD2

  • rs.DBNDD2.F1 rs.DBNDD2.R1 378 476
  • NCBIGene 36.3 55861
  • Alternative 5-prime, size difference: 98
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001048221

  • Changed! pfam Dysbindin 86aa 3e-25 in ref transcript
    • Dysbindin (Dystrobrevin binding protein 1). Dysbindin is an evolutionary conserved 40-kDa coiled-coil-containing protein that binds to alpha- and beta-dystrobrevin in muscle and brain. Dystrophin and alpha-dystrobrevin are co-immunoprecipitated with dysbindin, indicating that dysbindin is DPC-associated in muscle. Dysbindin co-localises with alpha-dystrobrevin at the sarcolemma and is up-regulated in dystrophin-deficient muscle. In the brain, dysbindin is found primarily in axon bundles and especially in certain axon terminals, notably mossy fibre synaptic terminals in the cerebellum and hippocampus. Dysbindin may have implications for the molecular pathology of Duchenne muscular dystrophy and may provide an alternative route for anchoring dystrobrevin and the DPC to the muscle membrane. Genetic variation in the human dysbindin gene is also thought to be associated with Schizophrenia.
  • Changed! pfam Dysbindin 73aa 3e-23 in modified transcript

DCLRE1C

  • rs.DCLRE1C.F1 rs.DCLRE1C.R1 104 426
  • NCBIGene 36.3 64421
  • Single exon skipping, size difference: 322
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001033855

  • Changed! pfam DRMBL 99aa 2e-16 in ref transcript
    • DNA repair metallo-beta-lactamase. The metallo-beta-lactamase fold contains five sequence motifs. The first four motifs are found in pfam00753 and are common to all metallo-beta-lactamases. The fifth motif appears to be specific to function. This entry represents the fifth motif from metallo-beta-lactamases involved in DNA repair.
  • Changed! smart Lactamase_B 115aa 3e-08 in ref transcript
    • Metallo-beta-lactamase superfamily. Apart from the beta-lactamases a number of other proteins contain this domain PUBMED:7588620. These proteins include thiolesterases, members of the glyoxalase II family, that catalyse the hydrolysis of S-D-lactoyl-glutathione to form glutathione and D-lactic acid and a competence protein that is essential for natural transformation in Neisseria gonorrhoeae and could be a transporter involved in DNA uptake. Except for the competence protein these proteins bind two zinc ions per molecule as cofactor.
  • Changed! COG YSH1 175aa 1e-11 in ref transcript
    • Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis].
  • Changed! smart Lactamase_B 71aa 2e-04 in modified transcript

DCX

  • rs.DCX.F1 rs.DCX.R1 100 115
  • NCBIGene 36.3 1641
  • Alternative 5-prime, size difference: 15
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_000555

  • cd DCX 83aa 9e-25 in ref transcript
    • DCX The ubiquitin-like DCX domain is present in tandem within the N-terminal half of the doublecortin protein. Doublecortin is expressed in migrating neurons. Mutations in the gene encoding doublecortin cause lissencephaly in males and 'double-cortex syndrome' in females.
  • cd DCX 80aa 1e-19 in ref transcript
  • smart DCX 91aa 5e-30 in ref transcript
    • Domain in the Doublecortin (DCX) gene product. Tandemly-repeated domain in doublin, the Doublecortin gene product. Proposed to bind tubulin. Doublecortin (DCX) is mutated in human X-linked neuronal migration defects.
  • smart DCX 89aa 2e-25 in ref transcript

DDR1

  • rs.DDR1.F1 rs.DDR1.R1 100 118
  • NCBIGene 36.3 780
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013994

  • Changed! cd PTKc_DDR1 310aa 1e-174 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Discoidin Domain Receptor 1. Protein Tyrosine Kinase (PTK) family; mammalian Discoidin domain receptor 1 (DDR1) and homologs; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. DDR1 is a member of the DDR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular discoidin homology domain, a transmembrane segment, an extended juxtamembrane region, and an intracellular catalytic domain. The binding of the ligand, collagen, to DDRs results in a slow but sustained receptor activation. DDR1 binds to all collagens tested to date (types I-IV). It is widely expressed in many tissues. It is abundant in the brain and is also found in keratinocytes, colonic mucosa epithelium, lung epithelium, thyroid follicles, and the islets of Langerhans. During embryonic development, it is found in the developing neuroectoderm. DDR1 is a key regulator of cell morphogenesis, differentiation and proliferation. It is important in the development of the mammary gland, the vasculator and the kidney. DDR1 is also found in human leukocytes, where it facilitates cell adhesion, migration, maturation, and cytokine production.
  • cd FA58C 117aa 9e-28 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • Changed! smart TyrKc 302aa 8e-97 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • smart FA58C 156aa 5e-26 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • Changed! COG SPS1 297aa 9e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd PTKc_DDR1 304aa 1e-176 in modified transcript
  • Changed! smart TyrKc 296aa 9e-99 in modified transcript
  • Changed! COG SPS1 291aa 2e-23 in modified transcript

DDR1

  • rs.DDR1.F2 rs.DDR1.R2 230 341
  • NCBIGene 36.3 780
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013994

  • cd PTKc_DDR1 310aa 1e-174 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Discoidin Domain Receptor 1. Protein Tyrosine Kinase (PTK) family; mammalian Discoidin domain receptor 1 (DDR1) and homologs; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. DDR1 is a member of the DDR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular discoidin homology domain, a transmembrane segment, an extended juxtamembrane region, and an intracellular catalytic domain. The binding of the ligand, collagen, to DDRs results in a slow but sustained receptor activation. DDR1 binds to all collagens tested to date (types I-IV). It is widely expressed in many tissues. It is abundant in the brain and is also found in keratinocytes, colonic mucosa epithelium, lung epithelium, thyroid follicles, and the islets of Langerhans. During embryonic development, it is found in the developing neuroectoderm. DDR1 is a key regulator of cell morphogenesis, differentiation and proliferation. It is important in the development of the mammary gland, the vasculator and the kidney. DDR1 is also found in human leukocytes, where it facilitates cell adhesion, migration, maturation, and cytokine production.
  • cd FA58C 117aa 9e-28 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • smart TyrKc 302aa 8e-97 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • smart FA58C 156aa 5e-26 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • COG SPS1 297aa 9e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

DGKG

  • rs.DGKG.F1 rs.DGKG.R1 142 217
  • NCBIGene 36.3 1608
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001346

  • cd C1 50aa 7e-11 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • cd EFh 70aa 4e-08 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • pfam DAGK_acc 175aa 8e-87 in ref transcript
    • Diacylglycerol kinase accessory domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown.
  • Changed! smart DAGKc 122aa 2e-47 in ref transcript
    • Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain is presumed to be the catalytic domain. Bacterial homologues areknown.
  • pfam C1_1 50aa 1e-08 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
  • smart C1 41aa 4e-05 in ref transcript
    • Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
  • Changed! COG LCB5 327aa 4e-10 in ref transcript
    • Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
  • COG FRQ1 76aa 7e-04 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! smart DAGKc 97aa 1e-32 in modified transcript
  • Changed! COG LCB5 277aa 1e-07 in modified transcript

DGKG

  • rs.DGKG.F2 rs.DGKG.R2 100 217
  • NCBIGene 36.3 1608
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001346

  • cd C1 50aa 7e-11 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • cd EFh 70aa 4e-08 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • pfam DAGK_acc 175aa 8e-87 in ref transcript
    • Diacylglycerol kinase accessory domain. Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. This domain is assumed to be an accessory domain: its function is unknown.
  • smart DAGKc 122aa 2e-47 in ref transcript
    • Diacylglycerol kinase catalytic domain (presumed). Diacylglycerol (DAG) is a second messenger that acts as a protein kinase C activator. DAG can be produced from the hydrolysis of phosphatidylinositol 4,5-bisphosphate (PIP2) by a phosphoinositide-specific phospholipase C and by the degradation of phosphatidylcholine (PC) by a phospholipase C or the concerted actions of phospholipase D and phosphatidate phosphohydrolase. This domain is presumed to be the catalytic domain. Bacterial homologues areknown.
  • pfam C1_1 50aa 1e-08 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
  • Changed! smart C1 41aa 4e-05 in ref transcript
    • Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
  • COG LCB5 327aa 4e-10 in ref transcript
    • Sphingosine kinase and enzymes related to eukaryotic diacylglycerol kinase [Lipid metabolism / General function prediction only].
  • COG FRQ1 76aa 7e-04 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

DHRS2

  • rs.DHRS2.F1 rs.DHRS2.R1 99 110
  • NCBIGene 36.3 10202
  • Alternative 5-prime, size difference: 11
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_182908

  • cd NAD_bind_H4MPT_DH 53aa 2e-04 in ref transcript
    • NADP binding domain of methylene tetrahydromethanopterin dehydrogenase. Methylene Tetrahydromethanopterin Dehydrogenase (H4MPT DH) NADP binding domain. NADP-dependent H4MPT DH catalyzes the dehydrogenation of methylene- H4MPT and methylene-tetrahydrofolate (H4F) with NADP+ as cofactor. H4F and H4MPT are both cofactors that carry the one-carbon units between the formyl and methyl oxidation level. H4F and H4MPT are structurally analogous to each other with respect to the pterin moiety, but each has distinct side chain. H4MPT is present only in anaerobic methanogenic archaea and aerobic methylotrophic proteobacteria. H4MPT seems to have evolved independently from H4F and functions as a distinct carrier in C1 metabolism. Amino acid DH-like NAD(P)-binding domains are members of the Rossmann fold superfamily and include glutamate, leucine, and phenylalanine DHs, methylene tetrahydrofolate DH, methylene-tetrahydromethanopterin DH, methylene-tetrahydropholate DH/cyclohydrolase, Shikimate DH-like proteins, malate oxidoreductases, and glutamyl tRNA reductase. Amino acid DHs catalyze the deamination of amino acids to keto acids with NAD(P)+ as a cofactor. The NAD(P)-binding Rossmann fold superfamily includes a wide variety of protein families including NAD(P)- binding domains of alcohol DHs, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate DH, lactate/malate DHs, formate/glycerate DHs, siroheme synthases, 6-phosphogluconate DH, amino acid DHs, repressor rex, NAD-binding potassium channel domain, CoA-binding, and ornithine cyclodeaminase-like domains. These domains have an alpha-beta-alpha configuration. NAD binding involves numerous hydrogen and van der Waals contacts.
  • Changed! TIGR 3oxo_ACP_reduc 187aa 2e-30 in ref transcript
    • This model represents 3-oxoacyl-[ACP] reductase, also called 3-ketoacyl-acyl carrier protein reductase, an enzyme of fatty acid biosynthesis.
  • Changed! PRK fabG 205aa 6e-43 in ref transcript
    • 3-ketoacyl-(acyl-carrier-protein) reductase; Provisional.
  • Changed! TIGR 3oxo_ACP_reduc 237aa 1e-40 in modified transcript
  • Changed! PRK fabG 242aa 6e-52 in modified transcript

DIABLO

  • rs.DIABLO.F1 rs.DIABLO.R1 193 222
  • NCBIGene 36.3 56616
  • Mutually exclusive exon skipping, size difference: 29
  • Inclusion in the protein causing a new stop codon, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_019887

  • Changed! pfam Smac_DIABLO 234aa 1e-100 in ref transcript
    • Second Mitochondria-derived Activator of Caspases. Second Mitochondria-derived Activator of Caspases promotes apoptosis by activating caspases in the cytochrome c/Apaf-1/caspase-9 pathway, and by opposing the inhibitory activity of inhibitor of apoptosis proteins (XIAP-BIR3). The protein assumes an elongated three-helix bundle structure, and forms a dimer in solution.

DIAPH1

  • rs.DIAPH1.F1 rs.DIAPH1.R1 131 158
  • NCBIGene 36.3 1729
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005219

  • smart FH2 438aa 1e-111 in ref transcript
    • Formin Homology 2 Domain. FH proteins control rearrangements of the actin cytoskeleton, especially in the context of cytokinesis and cell polarisation. Members of this family have been found to interact with Rho-GTPases, profilin and other actin-assoziated proteins. These interactions are mediated by the proline-rich FH1 domain, usually located in front of FH2 (but not listed in SMART). Despite this cytosolic function, vertebrate formins have been assigned functions within the nucleus. A set of Formin-Binding Proteins (FBPs) has been shown to bind FH1 with their WW domain.
  • pfam Drf_GBD 186aa 3e-52 in ref transcript
    • Diaphanous GTPase-binding Domain. This domain is bound to by GTP-attached Rho proteins, leading to activation of the Drf protein.
  • pfam Drf_FH3 193aa 1e-49 in ref transcript
    • Diaphanous FH3 Domain. This region is found in the Formin-like and and diaphanous proteins.
  • pfam Drf_FH1 29aa 9e-08 in ref transcript
    • Formin Homology Region 1. This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.

DLG1

  • rs.DLG1.F1 rs.DLG1.R1 178 244
  • NCBIGene 36.3 1739
  • Mutually exclusive exon skipping, size difference: 66
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004087

  • cd GMPK 117aa 2e-24 in ref transcript
    • Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
  • cd PDZ_signaling 85aa 4e-14 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 87aa 1e-13 in ref transcript
  • cd PDZ_signaling 81aa 2e-11 in ref transcript
  • cd SH3 59aa 2e-06 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart GuKc 179aa 2e-51 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • pfam MAGUK_N_PEST 117aa 2e-33 in ref transcript
    • Polyubiquitination (PEST) N-terminal domain of MAGUK. The residues upstream of this domain are the probable palmitoylation sites, particularly two cysteines. The domain has a putative PEST site at the very start that seems to be responsible for poly-ubiquitination. PEST domains are polypeptide sequences enriched in proline (P), glutamic acid (E), serine (S) and threonine (T) that target proteins for rapid destruction. The whole domain, in conjunction with a C-terminal domain of the longer protein, is necessary for dimerisation of the whole protein.
  • pfam L27_1 64aa 2e-28 in ref transcript
    • L27_1. The L27 domain is a protein interaction module that exists in a large family of scaffold proteins, functioning as an organisation centre of large protein assemblies required for the establishment and maintenance of cell polarity. L27 domains form specific heterotetrameric complexes, in which each domain contains three alpha-helices.
  • pfam PDZ_assoc 62aa 3e-24 in ref transcript
    • PDZ-associated domain of NMDA receptors. This domain is found in higher eukaryotes between the second and third PDZ domains, pfam00595, of glutamate receptor like proteins. Its exact function is not known.
  • smart PDZ 88aa 8e-17 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 90aa 3e-16 in ref transcript
  • smart PDZ 85aa 5e-14 in ref transcript
  • smart SH3 59aa 3e-07 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • PRK gmk 159aa 1e-25 in ref transcript
    • guanylate kinase; Provisional.
  • COG Prc 103aa 8e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

DLG3

  • rs.DLG3.F1 rs.DLG3.R1 122 218
  • NCBIGene 36.3 1741
  • Mutually exclusive exon skipping, size difference: 96
  • Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_021120

  • cd GMPK 117aa 2e-25 in ref transcript
    • Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
  • cd PDZ_signaling 81aa 3e-17 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 87aa 3e-14 in ref transcript
  • cd PDZ_signaling 77aa 2e-11 in ref transcript
  • cd SH3 59aa 4e-07 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart GuKc 177aa 6e-55 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • pfam PDZ_assoc 75aa 1e-22 in ref transcript
    • PDZ-associated domain of NMDA receptors. This domain is found in higher eukaryotes between the second and third PDZ domains, pfam00595, of glutamate receptor like proteins. Its exact function is not known.
  • smart PDZ 85aa 2e-20 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 90aa 7e-17 in ref transcript
  • smart PDZ 77aa 6e-13 in ref transcript
  • smart SH3 59aa 1e-08 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • pfam MAGUK_N_PEST 74aa 0.001 in ref transcript
    • Polyubiquitination (PEST) N-terminal domain of MAGUK. The residues upstream of this domain are the probable palmitoylation sites, particularly two cysteines. The domain has a putative PEST site at the very start that seems to be responsible for poly-ubiquitination. PEST domains are polypeptide sequences enriched in proline (P), glutamic acid (E), serine (S) and threonine (T) that target proteins for rapid destruction. The whole domain, in conjunction with a C-terminal domain of the longer protein, is necessary for dimerisation of the whole protein.
  • PRK gmk 159aa 2e-23 in ref transcript
    • guanylate kinase; Provisional.
  • COG Prc 74aa 1e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

DMBX1

  • rs.DMBX1.F1 rs.DMBX1.R1 100 115
  • NCBIGene 36.3 127343
  • Alternative 5-prime and 3-prime, size difference: 15
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_147192

  • cd homeodomain 54aa 8e-15 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam Homeobox 55aa 2e-20 in ref transcript
    • Homeobox domain.
  • COG COG5576 65aa 2e-10 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

DMP1

  • rs.DMP1.F1 rs.DMP1.R1 255 303
  • NCBIGene 36.3 1758
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004407

  • Changed! pfam DMP1 375aa 5e-63 in ref transcript
    • Dentin matrix protein 1 (DMP1). This family consists of several mammalian dentin matrix protein 1 (DMP1) sequences. The dentin matrix acidic phosphoprotein 1 (DMP1) gene has been mapped to human chromosome 4q21. DMP1 is a bone and teeth specific protein initially identified from mineralised dentin. DMP1 is primarily localised in the nuclear compartment of undifferentiated osteoblasts. In the nucleus, DMP1 acts as a transcriptional component for activation of osteoblast-specific genes like osteocalcin. During the early phase of osteoblast maturation, Ca(2+) surges into the nucleus from the cytoplasm, triggering the phosphorylation of DMP1 by a nuclear isoform of casein kinase II. This phosphorylated DMP1 is then exported out into the extracellular matrix, where it regulates nucleation of hydroxyapatite. DMP1 is a unique molecule that initiates osteoblast differentiation by transcription in the nucleus and orchestrates mineralised matrix formation extracellularly, at later stages of osteoblast maturation. The DMP1 gene has been found to be ectopically expressed in lung cancer although the reason for this is unknown.
  • pfam DMP1 35aa 1e-04 in ref transcript
  • Changed! pfam DMP1 359aa 2e-61 in modified transcript

DMPK

  • rs.DMPK.F1 rs.DMPK.R1 107 122
  • NCBIGene 36.3 1760
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001081563

  • Changed! cd STKc_DMPK_like 338aa 1e-170 in ref transcript
    • STKc_DMPK_like: Serine/Threonine Kinases (STKs), Myotonic Dystrophy protein kinase (DMPK)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The DMPK-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. The DMPK-like subfamily is composed of DMPK and DMPK-related cell division control protein 42 (Cdc42) binding kinase (MRCK). Three isoforms of MRCK are known, named alpha, beta and gamma. The DMPK gene is implicated in myotonic dystrophy 1 (DM1), an inherited multisystemic disorder with symptoms that include muscle hyperexcitability, progressive muscle weakness and wasting, cataract development, testicular atrophy, and cardiac conduction defects. The genetic basis for DM1 is the mutational expansion of a CTG repeat in the 3'-UTR of DMPK. DMPK is expressed in skeletal and cardiac muscles, and in central nervous tissues. The functional role of DMPK is not fully understood. It may play a role in the signal transduction and homeostasis of calcium. MRCK is activated via interaction with the small GTPase Cdc42. MRCK/Cdc42 signaling mediates myosin-dependent cell motility. MRCKgamma is expressed in heart and skeletal muscles, unlike MRCKalpha and MRCKbeta, which are expressed ubiquitously.
  • smart S_TKc 259aa 8e-69 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam DMPK_coil 61aa 2e-22 in ref transcript
    • DMPK coiled coil domain like. This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation.
  • Changed! smart S_TK_X 67aa 5e-08 in ref transcript
    • Extension to Ser/Thr-type protein kinases.
  • PTZ PTZ00263 299aa 1e-56 in ref transcript
    • protein kinase A catalytic subunit; Provisional.
  • COG Smc 63aa 0.008 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! cd STKc_DMPK_like 333aa 1e-171 in modified transcript
  • Changed! smart S_TK_X 62aa 9e-09 in modified transcript

DNM1L

  • rs.DNM1L.F1 rs.DNM1L.R1 207 285
  • NCBIGene 36.3 10059
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012062

  • pfam Dynamin_M 292aa 1e-106 in ref transcript
    • Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169.
  • smart DYNc 255aa 1e-90 in ref transcript
    • Dynamin, GTPase. Large GTPases that mediate vesicle trafficking. Dynamin participates in the endocytic uptake of receptors, associated ligands, and plasma membrane following an exocytic event.
  • pfam GED 92aa 3e-23 in ref transcript
    • Dynamin GTPase effector domain.
  • Changed! COG COG0699 414aa 2e-32 in ref transcript
    • Predicted GTPases (dynamin-related) [General function prediction only].
  • Changed! COG COG0699 616aa 3e-36 in modified transcript

DNM2

  • rs.DNM2.F1 rs.DNM2.R1 103 115
  • NCBIGene 36.3 1785
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005361

  • cd PH_dynamin 110aa 2e-47 in ref transcript
    • Dynamin pleckstrin homology (PH) domain. Dynamin is a GTPase that regulates endocytic vesicle formation. It has an N-terminal GTPase domain, followed by a PH domain, a GTPase effector domain and a C-terminal proline arginine rich domain. Dynamin-like proteins, which are found in metazoa, plants and yeast have the same domain architecture as dynamin, but lack the PH domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • smart DYNc 240aa 1e-130 in ref transcript
    • Dynamin, GTPase. Large GTPases that mediate vesicle trafficking. Dynamin participates in the endocytic uptake of receptors, associated ligands, and plasma membrane following an exocytic event.
  • pfam Dynamin_M 294aa 1e-104 in ref transcript
    • Dynamin central region. This region lies between the GTPase domain, see pfam00350, and the pleckstrin homology (PH) domain, see pfam00169.
  • smart GED 92aa 2e-20 in ref transcript
    • Dynamin GTPase effector domain.
  • smart PH 104aa 3e-07 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • Changed! COG COG0699 424aa 5e-31 in ref transcript
    • Predicted GTPases (dynamin-related) [General function prediction only].
  • Changed! COG COG0699 455aa 3e-31 in modified transcript

DNMT3B

  • rs.DNMT3B.F1 rs.DNMT3B.R1 188 377
  • NCBIGene 36.3 1789
  • Multiple exon skipping, size difference: 189
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006892

  • cd Dnmt3b_related 87aa 2e-35 in ref transcript
    • The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • Changed! cd Cyt_C5_DNA_methylase 271aa 4e-13 in ref transcript
    • Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
  • pfam PWWP 73aa 8e-24 in ref transcript
    • PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
  • pfam DNA_methylase 123aa 8e-08 in ref transcript
    • C-5 cytosine-specific DNA methylase.
  • COG Dcm 160aa 5e-13 in ref transcript
    • Site-specific DNA methylase [DNA replication, recombination, and repair].
  • Changed! cd Cyt_C5_DNA_methylase 134aa 9e-09 in modified transcript

DNMT3B

  • rs.DNMT3B.F2 rs.DNMT3B.R2 135 195
  • NCBIGene 36.3 1789
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006892

  • cd Dnmt3b_related 87aa 2e-35 in ref transcript
    • The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • cd Cyt_C5_DNA_methylase 271aa 4e-13 in ref transcript
    • Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
  • pfam PWWP 73aa 8e-24 in ref transcript
    • PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.
  • pfam DNA_methylase 123aa 8e-08 in ref transcript
    • C-5 cytosine-specific DNA methylase.
  • COG Dcm 160aa 5e-13 in ref transcript
    • Site-specific DNA methylase [DNA replication, recombination, and repair].

DPH3

  • rs.DPH3.F1 rs.DPH3.R1 260 335
  • NCBIGene 36.3 285381
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206831

  • Changed! pfam zf-CSL 55aa 3e-18 in ref transcript
    • CSL zinc finger. This is a zinc binding motif which contains four cysteine residues which chelate zinc. This domain is often found associated with a pfam00226 domain. This domain is named after the conserved motif of the final cysteine.
  • Changed! COG COG5216 61aa 1e-16 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam zf-CSL 40aa 1e-07 in modified transcript
  • Changed! COG COG5216 42aa 1e-06 in modified transcript

DPH5

  • rs.DPH5.F1 rs.DPH5.R1 109 127
  • NCBIGene 36.3 51611
  • Alternative 5-prime, size difference: 18
  • Exclusion in 5'UTR
  • Reference transcript: NM_001077394

  • TIGR dph5 271aa 1e-80 in ref transcript
    • This protein participates in the modification of a specific His of elongation factor 2 of eukarotes and Archaea to diphthamide. The protein was characterized in Saccharomyces cerevisiae and designated DPH5.
  • PTZ PTZ00175 272aa 1e-122 in ref transcript
    • diphthine synthase; Provisional.

DSC3

  • rs.DSC3.F1 rs.DSC3.R1 233 276
  • NCBIGene 36.3 1825
  • Single exon skipping, size difference: 43
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001941

  • cd CA 218aa 9e-36 in ref transcript
    • Cadherin repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion; these domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium; plays a role in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-,CNR-,proto-,and FAT-family cadherin, desmocollin, and desmoglein, exists as monomers or dimers (hetero- and homo-); two copies of the repeat are present here.
  • cd CA 209aa 8e-31 in ref transcript
  • cd CA 210aa 4e-24 in ref transcript
  • Changed! cd CA 193aa 1e-19 in ref transcript
  • pfam Cadherin_pro 87aa 5e-18 in ref transcript
    • Cadherin prodomain like. Cadherins are a family of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This domain corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions.
  • pfam Cadherin 99aa 9e-17 in ref transcript
    • Cadherin domain.
  • pfam Cadherin 103aa 2e-11 in ref transcript
  • smart CA 77aa 4e-11 in ref transcript
    • Cadherin repeats. Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
  • pfam Cadherin 95aa 1e-10 in ref transcript
  • Changed! pfam Cadherin_C 63aa 1e-04 in ref transcript
    • Cadherin cytoplasmic region. Cadherins are vital in cell-cell adhesion during tissue differentiation. Cadherins are linked to the cytoskeleton by catenins. Catenins bind to the cytoplasmic tail of the cadherin. Cadherins cluster to form foci of homophilic binding units. A key determinant to the strength of the binding that it is mediated by cadherins is the juxtamembrane region of the cadherin. This region induces clustering and also binds to the protein p120ctn.
  • Changed! pfam Cadherin 74aa 0.009 in ref transcript
  • Changed! cd CA 192aa 2e-19 in modified transcript

DTNB

  • rs.DTNB.F1 rs.DTNB.R1 119 140
  • NCBIGene 36.3 1838
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021907

  • cd ZZ_dystrophin 49aa 4e-18 in ref transcript
    • Zinc finger, ZZ type. Zinc finger present in dystrophin and dystrobrevin. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Dystrophin attaches actin filaments to an integral membrane glycoprotein complex in muscle cells. The ZZ domain in dystrophin has been shown to be essential for binding to the membrane protein beta-dystroglycan.
  • pfam efhand_1 127aa 3e-47 in ref transcript
    • EF hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • pfam efhand_2 89aa 3e-37 in ref transcript
    • EF-hand. Members of this family adopt a helix-loop-helix motif, as per other EF hand domains. However, since they do not contain the canonical pattern of calcium binding residues found in many EF hand domains, they do not bind calcium ions. The main function of this domain is the provision of specificity in beta-dystroglycan recognition, though in dystrophin it serves an additional role: stabilisation of the WW domain (pfam00397), enhancing dystroglycan binding.
  • smart ZnF_ZZ 45aa 3e-11 in ref transcript
    • Zinc-binding domain, present in Dystrophin, CREB-binding protein. Putative zinc-binding domain present in dystrophin-like proteins, and CREB-binding protein/p300 homologues. The ZZ in dystrophin appears to bind calmodulin. A missense mutation of one of the conserved cysteines in dystrophin results in a patient with Duchenne muscular dystrophy [3].
  • Changed! TIGR SMC_prok_B 66aa 0.008 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! PRK PRK05431 65aa 0.003 in ref transcript
    • seryl-tRNA synthetase; Provisional.
  • Changed! TIGR SMC_prok_A 90aa 4e-04 in modified transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • Changed! PRK PRK05431 71aa 7e-05 in modified transcript

DTX2

  • rs.DTX2.F1 rs.DTX2.R1 153 273
  • NCBIGene 36.3 113878
  • Single exon skipping, size difference: 120
  • Exclusion in 5'UTR
  • Reference transcript: NM_020892

  • smart WWE 72aa 7e-20 in ref transcript
    • Domain in Deltex and TRIP12 homologues. Possibly involved in regulation of ubiquitin-mediated proteolysis.
  • smart WWE 85aa 7e-20 in ref transcript

DTX2

  • rs.DTX2.F2 rs.DTX2.R2 402 543
  • NCBIGene 36.3 113878
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001102595

  • smart WWE 72aa 7e-20 in ref transcript
    • Domain in Deltex and TRIP12 homologues. Possibly involved in regulation of ubiquitin-mediated proteolysis.
  • smart WWE 85aa 7e-20 in ref transcript

DUSP13

  • rs.DUSP13.F1 rs.DUSP13.R1 100 526
  • NCBIGene 36.3 51207
  • Multiple exon skipping, size difference: 426
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_001007271

  • cd DSPc 144aa 2e-32 in ref transcript
    • Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
  • smart DSPc 143aa 4e-26 in ref transcript
    • Dual specificity phosphatase, catalytic domain.
  • COG CDC14 140aa 1e-07 in ref transcript
    • Predicted protein-tyrosine phosphatase [Signal transduction mechanisms].

DUSP6

  • rs.DUSP6.F1 rs.DUSP6.R1 98 536
  • NCBIGene 36.3 1848
  • Single exon skipping, size difference: 438
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001946

  • Changed! cd DSPc 138aa 7e-50 in ref transcript
    • Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
  • Changed! cd DSP_MapKP 131aa 2e-35 in ref transcript
    • N-terminal regulatory rhodanese domain of dual specificity phosphatases (DSP), such as Mapk Phosphatase. This domain is believed to determine substrate specificity by binding the substrate, such as ERK2, and activating the C-terminal catalytic domain by inducing a conformational change. This domain has homology to the Rhodanese Homology Domain.
  • Changed! smart DSPc 140aa 1e-52 in ref transcript
    • Dual specificity phosphatase, catalytic domain.
  • Changed! smart RHOD 114aa 7e-09 in ref transcript
    • Rhodanese Homology Domain. An alpha beta fold found duplicated in the Rhodanese protein. The the Cysteine containing enzymatically active version of the domain is also found in the CDC25 class of protein phosphatases and a variety of proteins such as sulfide dehydrogenases and stress proteins such as Senesence specific protein 1 in plants, PspE and GlpE in bacteria and cyanide and arsenate resistance proteins. Inactive versions with a loss of the cysteine are also seen in Dual specificity phosphatases, ubiquitin hydrolases from yeast and in sulfuryltransferases. These are likely to play a role in protein interactions.
  • Changed! COG CDC14 169aa 2e-13 in ref transcript
    • Predicted protein-tyrosine phosphatase [Signal transduction mechanisms].
  • COG SseA 96aa 0.001 in ref transcript
    • Rhodanese-related sulfurtransferase [Inorganic ion transport and metabolism].
  • Changed! cd DSP_MapKP 118aa 2e-28 in modified transcript
  • Changed! cd DSPc 102aa 3e-26 in modified transcript
  • Changed! smart DSPc 67aa 4e-26 in modified transcript
  • Changed! smart RHOD 76aa 3e-08 in modified transcript
  • Changed! COG CDC14 77aa 6e-09 in modified transcript

DYNLL1

  • rs.DYNLL1.F1 rs.DYNLL1.R1 112 126
  • NCBIGene 36.3 8655
  • Alternative 3-prime, size difference: 14
  • Inclusion in 5'UTR
  • Reference transcript: NM_003746

  • pfam Dynein_light 89aa 2e-38 in ref transcript
    • Dynein light chain type 1.
  • PTZ PTZ00059 89aa 9e-47 in ref transcript
    • dynein light chain; Provisional.

E2F6

  • rs.E2F6.F1 rs.E2F6.R1 201 334
  • NCBIGene 36.3 1876
  • Single exon skipping, size difference: 133
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_198256

  • Changed! pfam E2F_TDP 66aa 5e-22 in ref transcript
    • E2F/DP family winged-helix DNA-binding domain. This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in association with TDP1/2, the heterodimer having increased binding efficiency. The crystal structure of an E2F4-DP2-DNA complex shows that the DNA-binding domains of the E2F and DP proteins both have a fold related to the winged-helix DNA-binding motif. Recognition of the central c/gGCGCg/c sequence of the consensus DNA-binding site is symmetric, and amino acids that contact these bases are conserved among all known E2F and DP proteins.

E2F6

  • rs.E2F6.F2 rs.E2F6.R2 128 199
  • NCBIGene 36.3 1876
  • Single exon skipping, size difference: 71
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_198256

  • Changed! pfam E2F_TDP 66aa 5e-22 in ref transcript
    • E2F/DP family winged-helix DNA-binding domain. This family contains the transcription factor E2F and its dimerisation partners TDP1 and TDP2, which stimulate E2F-dependent transcription. E2F binds to DNA as a homodimer or as a heterodimer in association with TDP1/2, the heterodimer having increased binding efficiency. The crystal structure of an E2F4-DP2-DNA complex shows that the DNA-binding domains of the E2F and DP proteins both have a fold related to the winged-helix DNA-binding motif. Recognition of the central c/gGCGCg/c sequence of the consensus DNA-binding site is symmetric, and amino acids that contact these bases are conserved among all known E2F and DP proteins.

ECE2

  • rs.ECE2.F1 rs.ECE2.R1 261 399
  • NCBIGene 36.3 9718
  • Alternative 5-prime, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100120

  • pfam Peptidase_M13_N 386aa 1e-124 in ref transcript
    • Peptidase family M13. M13 peptidases are well-studied proteases found in a wide range of organisms including mammals and bacteria. In mammals they participate in processes such as cardiovascular development, blood-pressure regulation, nervous control of respiration, and regulation of the function of neuropeptides in the central nervous system. In bacteria they may be used for digestion of milk.
  • pfam Peptidase_M13 204aa 2e-55 in ref transcript
    • Peptidase family M13. Mammalian enzymes are typically type-II membrane anchored enzymes which are known, or believed to activate or inactivate oligopeptide (pro)-hormones such as opioid peptides. The family also contains a bacterial member believed to be involved with milk protein cleavage.
  • COG PepO 654aa 1e-121 in ref transcript
    • Predicted metalloendopeptidase [Posttranslational modification, protein turnover, chaperones].

ECHDC1

  • rs.ECHDC1.F1 rs.ECHDC1.R1 150 203
  • NCBIGene 36.3 55862
  • Single exon skipping, size difference: 53
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001002030

  • Changed! cd crotonase-like 192aa 3e-42 in ref transcript
    • Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
  • Changed! pfam ECH 167aa 3e-22 in ref transcript
    • Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase.
  • Changed! COG CaiD 244aa 2e-38 in ref transcript
    • Enoyl-CoA hydratase/carnithine racemase [Lipid metabolism].
  • Changed! cd crotonase-like 71aa 8e-14 in modified transcript
  • Changed! pfam ECH 52aa 2e-05 in modified transcript
  • Changed! COG CaiD 71aa 6e-11 in modified transcript

ECHDC1

  • rs.ECHDC1.F2 rs.ECHDC1.R2 109 331
  • NCBIGene 36.3 55862
  • Single exon skipping, size difference: 222
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001002030

  • Changed! cd crotonase-like 192aa 3e-42 in ref transcript
    • Crotonase/Enoyl-Coenzyme A (CoA) hydratase superfamily. This superfamily contains a diverse set of enzymes including enoyl-CoA hydratase, napthoate synthase, methylmalonyl-CoA decarboxylase, 3-hydoxybutyryl-CoA dehydratase, and dienoyl-CoA isomerase. Many of these play important roles in fatty acid metabolism. In addition to a conserved structural core and the formation of trimers (or dimers of trimers), a common feature in this superfamily is the stabilization of an enolate anion intermediate derived from an acyl-CoA substrate. This is accomplished by two conserved backbone NH groups in active sites that form an oxyanion hole.
  • Changed! pfam ECH 167aa 3e-22 in ref transcript
    • Enoyl-CoA hydratase/isomerase family. This family contains a diverse set of enzymes including: Enoyl-CoA hydratase. Napthoate synthase. Carnitate racemase. 3-hydoxybutyryl-CoA dehydratase. Dodecanoyl-CoA delta-isomerase.
  • Changed! COG CaiD 244aa 2e-38 in ref transcript
    • Enoyl-CoA hydratase/carnithine racemase [Lipid metabolism].
  • Changed! cd crotonase-like 170aa 4e-33 in modified transcript
  • Changed! TIGR FadB 156aa 2e-18 in modified transcript
    • Members represent alpha subunit of multifunctional enzyme complex of the fatty acid degradation cycle. Activities include: enoyl-CoA hydratase (EC 4.2.1.17), dodecenoyl-CoA delta-isomerase activity (EC 5.3.3.8), 3-hydroxyacyl-CoA dehydrogenase (EC 1.1.1.35), 3-hydroxybutyryl-CoA epimerase (EC 5.1.2.3). A representative is E. coli FadB. This model excludes the FadJ family.
  • Changed! COG CaiD 219aa 7e-30 in modified transcript

EDA

  • rs.EDA.F1 rs.EDA.R1 168 219
  • NCBIGene 36.3 1896
  • Single exon skipping, size difference: 51
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001399

  • Changed! cd TNF 135aa 5e-13 in ref transcript
    • Tumor Necrosis Factor; TNF superfamily members include the cytokines: TNF (TNF-alpha), LT (lymphotoxin-alpha, TNF-beta), CD40 ligand, Apo2L (TRAIL), Fas ligand, and osteoprotegerin (OPG) ligand. These proteins generally have an intracellular N-terminal domain, a short transmembrane segment, an extracellular stalk, and a globular TNF-like extracellular domain of about 150 residues. They initiate apoptosis by binding to related receptors, some of which have intracellular death domains. They generally form homo- or hetero- trimeric complexes.TNF cytokines bind one elongated receptor molecule along each of three clefts formed by neighboring monomers of the trimer with ligand trimerization a requiste for receptor binding.
  • Changed! pfam TNF 114aa 2e-18 in ref transcript
    • TNF(Tumour Necrosis Factor) family.

EDG2

  • rs.EDG2.F1 rs.EDG2.R1 313 391
  • NCBIGene 36.3 1902
  • Single exon skipping, size difference: 78
  • Exclusion in 5'UTR
  • Reference transcript: NM_057159

  • pfam 7tm_1 240aa 2e-28 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

EDN3

  • rs.EDN3.F1 rs.EDN3.R1 121 156
  • NCBIGene 36.3 1908
  • Alternative 5-prime, size difference: 35
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_000114

  • pfam Endothelin 30aa 4e-11 in ref transcript
    • Endothelin family.
  • smart END 22aa 1e-04 in ref transcript
    • Endothelin.

EFCAB6

  • rs.EFCAB6.F1 rs.EFCAB6.R1 115 473
  • NCBIGene 36.3 64800
  • Multiple exon skipping, size difference: 358
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_022785

  • cd EFh 60aa 8e-04 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 61aa 0.001 in ref transcript
  • cd EFh 59aa 0.007 in ref transcript
  • pfam DUF1880 118aa 1e-60 in ref transcript
    • Domain of unknown function (DUF1880). This domain is found predominantly in DJ binding protein. It has no known function.
  • Changed! PTZ PTZ00184 162aa 3e-06 in ref transcript
    • calmodulin; Provisional.
  • COG FRQ1 61aa 8e-04 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • COG FRQ1 176aa 0.001 in ref transcript

EFTUD1

  • rs.EFTUD1.F1 rs.EFTUD1.R1 251 404
  • NCBIGene 36.3 79631
  • Multiple exon skipping, size difference: 153
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024580

  • Changed! cd EF2 234aa 1e-108 in ref transcript
    • EF2 (for archaea and eukarya). Translocation requires hydrolysis of a molecule of GTP and is mediated by EF-G in bacteria and by eEF2 in eukaryotes. The eukaryotic elongation factor eEF2 is a GTPase involved in the translocation of the peptidyl-tRNA from the A site to the P site on the ribosome. The 95-kDa protein is highly conserved, with 60% amino acid sequence identity between the human and yeast proteins. Two major mechanisms are known to regulate protein elongation and both involve eEF2. First, eEF2 can be modulated by reversible phosphorylation. Increased levels of phosphorylated eEF2 reduce elongation rates presumably because phosphorylated eEF2 fails to bind the ribosomes. Treatment of mammalian cells with agents that raise the cytoplasmic Ca2+ and cAMP levels reduce elongation rates by activating the kinase responsible for phosphorylating eEF2. In contrast, treatment of cells with insulin increases elongation rates by promoting eEF2 dephosphorylation. Second, the protein can be post-translationally modified by ADP-ribosylation. Various bacterial toxins perform this reaction after modification of a specific histidine residue to diphthamide, but there is evidence for endogenous ADP ribosylase activity. Similar to the bacterial toxins, it is presumed that modification by the endogenous enzyme also inhibits eEF2 activity.
  • cd aeEF2_snRNP_like_IV 300aa 5e-37 in ref transcript
    • This family represents domain IV of archaeal and eukaryotic elongation factor 2 (aeEF-2) and of an evolutionarily conserved U5 snRNP-specific protein. U5 snRNP is a GTP-binding factor closely related to the ribosomal translocase EF-2. In complex with GTP, EF-2 promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site of the small subunit of ribosome and the mRNA is shifted one codon relative to the ribosome. It has been shown that EF-2_IV domain mimics the shape of anticodon arm of the tRNA in the structurally homologous ternary complex of Phe-tRNA, EF-1 (another transcriptional elongation factor) and GTP analog. The tip portion of this domain is found in a position that overlaps the anticodon arm of the A-site tRNA, implying that EF-2 displaces the A-site tRNA to the P-site by physical interaction with the anticodon arm.
  • cd eEF2_snRNP_like_C 79aa 7e-31 in ref transcript
    • eEF2_snRNP_like_C: this family represents a C-terminal domain of eukaryotic elongation factor 2 (eEF-2) and a homologous domain of the spliceosomal human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and, its yeast counterpart Snu114p. Yeast Snu114p is essential for cell viability and for splicing in vivo. U5-116 kD binds GTP. Experiments suggest that GTP binding and probably GTP hydrolysis is important for the function of the U5-116 kD/Snu114p. In complex with GTP, EF-2 promotes the translocation step of translation. During translocation the peptidyl-tRNA is moved from the A site to the P site, the uncharged tRNA from the P site to the E-site and, the mRNA is shifted one codon relative to the ribosome.
  • cd eEF2_snRNP_like_II 115aa 5e-20 in ref transcript
    • EF2_snRNP_like_II: this subfamily represents domain II of elongation factor (EF) EF-2 found eukaryotes and archaea and, the C-terminal portion of the spliceosomal human 116kD U5 small nuclear ribonucleoprotein (snRNP) protein (U5-116 kD) and, its yeast counterpart Snu114p. During the process of peptide synthesis and tRNA site changes, the ribosome is moved along the mRNA a distance equal to one codon with the addition of each amino acid. This translocation step is catalyzed by EF-2_GTP, which is hydrolyzed to provide the required energy. Thus, this action releases the uncharged tRNA from the P site and transfers the newly formed peptidyl-tRNA from the A site to the P site. Yeast Snu114p is essential for cell viability and for splicing in vivo. U5-116 kD binds GTP. Experiments suggest that GTP binding and probably GTP hydrolysis is important for the function of the U5-116 kD/Snu114p.
  • Changed! pfam GTP_EFTU 149aa 2e-50 in ref transcript
    • Elongation factor Tu GTP binding domain. This domain contains a P-loop motif, also found in several other families such as pfam00071, pfam00025 and pfam00063. Elongation factor Tu consists of three structural domains, this plus two C-terminal beta barrel domains.
  • TIGR aEF-2 263aa 7e-19 in ref transcript
    • This model represents archaeal elongation factor 2, a protein more similar to eukaryotic EF-2 than to bacterial EF-G, both in sequence similarity and in sharing with eukaryotes the property of having a diphthamide (modified His) residue at a conserved position. The diphthamide can be ADP-ribosylated by diphtheria toxin in the presence of NAD.
  • pfam EFG_C 87aa 2e-16 in ref transcript
    • Elongation factor G C-terminus. This domain includes the carboxyl terminal regions of Elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopt a ferredoxin-like fold.
  • pfam EFG_IV 72aa 0.002 in ref transcript
    • Elongation factor G, domain IV. This domain is found in elongation factor G, elongation factor 2 and some tetracycline resistance proteins and adopts a ribosomal protein S5 domain 2-like fold.
  • Changed! PRK PRK07560 277aa 6e-65 in ref transcript
    • elongation factor EF-2; Reviewed.
  • COG FusA 206aa 6e-34 in ref transcript
    • Translation elongation factors (GTPases) [Translation, ribosomal structure and biogenesis].
  • COG FusA 142aa 3e-23 in ref transcript
  • Changed! cd EF2 172aa 3e-68 in modified transcript
  • Changed! TIGR aEF-2 111aa 4e-30 in modified transcript
  • Changed! PRK PRK07560 226aa 6e-40 in modified transcript

EIF4G2

  • rs.EIF4G2.F1 rs.EIF4G2.R1 151 265
  • NCBIGene 36.3 1982
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001418

  • smart MA3 101aa 4e-21 in ref transcript
    • Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press.
  • pfam MIF4G 102aa 1e-20 in ref transcript
    • MIF4G domain. MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.
  • smart eIF5C 76aa 4e-17 in ref transcript
    • Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5.

ELN

  • rs.ELN.F1 rs.ELN.R1 125 179
  • NCBIGene 36.3 2006
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000501

ELN

  • rs.ELN.F2 rs.ELN.R2 124 181
  • NCBIGene 36.3 2006
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000501

ELN

  • rs.ELN.F3 rs.ELN.R3 100 115
  • NCBIGene 36.3 2006
  • Alternative 3-prime, size difference: 15
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_000501

ELN

  • rs.ELN.F4 rs.ELN.R4 254 284
  • NCBIGene 36.3 2006
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000501

ELOVL7

  • rs.ELOVL7.F1 rs.ELOVL7.R1 123 174
  • NCBIGene 36.3 79993
  • Single exon skipping, size difference: 51
  • Exclusion in 5'UTR
  • Reference transcript: NM_024930

  • pfam ELO 240aa 9e-53 in ref transcript
    • GNS1/SUR4 family. Members of this family are involved in long chain fatty acid elongation systems that produce the 26-carbon precursors for ceramide and sphingolipid synthesis. Predicted to be integral membrane proteins, in eukaryotes they are probably located on the endoplasmic reticulum. Yeast ELO3 affects plasma membrane H+-ATPase activity, and may act on a glucose-signaling pathway that controls the expression of several genes that are transcriptionally regulated by glucose such as PMA1.
  • PTZ PTZ00251 100aa 1e-04 in ref transcript
    • fatty acid elongase; Provisional.

ENPP2

  • rs.ENPP2.F1 rs.ENPP2.R1 148 304
  • NCBIGene 36.3 5168
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006209

  • cd NUC 259aa 6e-47 in ref transcript
    • DNA/RNA non-specific endonuclease; prokaryotic and eukaryotic double- and single-stranded DNA and RNA endonucleases also present in phosphodiesterases. They exists as monomers and homodimers.
  • Changed! pfam Phosphodiest 365aa 2e-83 in ref transcript
    • Type I phosphodiesterase / nucleotide pyrophosphatase. This family consists of phosphodiesterases, including human plasma-cell membrane glycoprotein PC-1 / alkaline phosphodiesterase i / nucleotide pyrophosphatase (nppase). These enzymes catalyse the cleavage of phosphodiester and phosphosulfate bonds in NAD, deoxynucleotides and nucleotide sugars. Also in this family is ATX an autotaxin, tumour cell motility-stimulating protein which exhibits type I phosphodiesterases activity. The alignment encompasses the active site. Also present with in this family is 60-kDa Ca2+-ATPase form F. odoratum.
  • smart NUC 231aa 4e-58 in ref transcript
    • DNA/RNA non-specific endonuclease. prokaryotic and eukaryotic double- and single-stranded DNA and RNA endonucleases also present in phosphodiesterases.
  • pfam Somatomedin_B 44aa 3e-13 in ref transcript
    • Somatomedin B domain.
  • pfam Somatomedin_B 45aa 1e-12 in ref transcript
  • Changed! COG COG1524 425aa 2e-28 in ref transcript
    • Uncharacterized proteins of the AP superfamily [General function prediction only].
  • COG NUC1 231aa 1e-05 in ref transcript
    • DNA/RNA endonuclease G, NUC1 [Nucleotide transport and metabolism].
  • Changed! pfam Phosphodiest 313aa 8e-91 in modified transcript
  • Changed! COG COG1524 373aa 5e-34 in modified transcript

ENTPD8

  • rs.ENTPD8.F1 rs.ENTPD8.R1 248 359
  • NCBIGene 36.3 377841
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001033113

  • Changed! pfam GDA1_CD39 421aa 3e-78 in ref transcript
    • GDA1/CD39 (nucleoside phosphatase) family.
  • Changed! COG COG5371 376aa 1e-12 in ref transcript
    • Golgi nucleoside diphosphatase [Carbohydrate transport and metabolism / Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam GDA1_CD39 384aa 2e-66 in modified transcript
  • Changed! COG COG5371 304aa 6e-10 in modified transcript

EPB41

  • rs.EPB41.F1 rs.EPB41.R1 355 475
  • NCBIGene 36.3 2035
  • Multiple exon skipping, size difference: 120
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_203343

  • cd FERM_C 94aa 1e-32 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • smart B41 157aa 1e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam 4_1_CTD 115aa 3e-26 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • pfam FERM_C 85aa 2e-20 in ref transcript
    • FERM C-terminal PH-like domain.
  • Changed! pfam SAB 47aa 5e-14 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
  • pfam FA 45aa 5e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.
  • Changed! pfam SAB 49aa 7e-15 in modified transcript

EPB41

  • rs.EPB41.F2 rs.EPB41.R2 184 241
  • NCBIGene 36.3 2035
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203342

  • cd FERM_C 94aa 9e-32 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • smart B41 192aa 1e-55 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam 4_1_CTD 115aa 1e-27 in ref transcript
    • 4.1 protein C-terminal domain (CTD). At the C-terminus of all known 4.1 proteins is a sequence domain unique to these proteins, known as the C-terminal domain (CTD). Mammalian CTDs are associated with a growing number of protein-protein interactions, although such activities have yet to be associated with invertebrate CTDs. Mammalian CTDs are generally defined by sequence alignment as encoded by exons 18-21. Comparison of known vertebrate 4.1 proteins with invertebrate 4.1 proteins indicates that mammalian 4.1 exon 19 represents a vertebrate adaptation that extends the sequence of the CTD with a Ser/Thr-rich sequence. The CTD was first described as a 22/24-kDa domain by chymotryptic digestion of erythrocyte 4.1 (4.1R). CTD is thought to represent an independent folding structure which has gained function since the divergence of vertebrates from invertebrates.
  • pfam FERM_C 85aa 4e-20 in ref transcript
    • FERM C-terminal PH-like domain.
  • pfam SAB 49aa 3e-16 in ref transcript
    • SAB domain. This presumed domain is found in proteins containing FERM domains pfam00373. This domain is found to bind to both spectrin and actin, hence the name SAB (Spectrin and Actin Binding) domain.
  • pfam FA 45aa 3e-13 in ref transcript
    • FERM adjacent (FA). This region is found adjacent to Band 4.1 / FERM domains (pfam00373) in a subset of FERM containing protein. The region has been hypothesised to play a role in regulatory adaptation, based on similarity to other protein kinase substrates.

EPHA6

  • rs.EPHA6.F1 rs.EPHA6.R1 137 201
  • NCBIGene 36.3 285220
  • Single exon skipping, size difference: 64
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001080448

  • Changed! cd PTKc_EphR_A 309aa 1e-160 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinases, Class EphA Ephrin Receptors. Protein Tyrosine Kinase (PTK) family; Ephrin Receptor (EphR) subfamily; most class EphA receptors including EphA3, EphA4, EphA5, and EphA7, but excluding EphA1, EphA2 and EphA10; catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. EphRs comprise the largest subfamily of receptor tyr kinases (RTKs). In general, class EphA receptors bind GPI-anchored ephrin-A ligands. There are ten vertebrate EphA receptors (EphA1-10), which display promiscuous interactions with six ephrin-A ligands. One exception is EphA4, which also binds ephrins-B2/B3. EphRs contain an ephrin-binding domain and two fibronectin repeats extracellularly, a transmembrane segment, and a cytoplasmic tyr kinase domain. Binding of the ephrin ligand to EphR requires cell-cell contact since both are anchored to the plasma membrane. The resulting downstream signals occur bidirectionally in both EphR-expressing cells (forward signaling) and ephrin-expressing cells (reverse signaling). Ephrin/EphR interaction mainly results in cell-cell repulsion or adhesion, making it important in neural development and plasticity, cell morphogenesis, cell-fate determination, embryonic development, tissue patterning, and angiogenesis. EphARs and ephrin-A ligands are expressed in multiple areas of the developing brain, especially in the retina and tectum. They are part of a system controlling retinotectal mapping.
  • cd FN3 92aa 1e-11 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • Changed! cd SAM 59aa 1e-07 in ref transcript
    • Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization.
  • cd FN3 101aa 0.002 in ref transcript
  • Changed! pfam Pkinase_Tyr 300aa 1e-111 in ref transcript
    • Protein tyrosine kinase.
  • pfam Ephrin_lbd 173aa 7e-98 in ref transcript
    • Ephrin receptor ligand binding domain. The Eph receptors, which bind to ephrins pfam00812 are a large family of receptor tyrosine kinases. This family represents the amino terminal domain which binds the ephrin ligand.
  • Changed! smart SAM 68aa 4e-12 in ref transcript
    • Sterile alpha motif. Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerisation.
  • pfam fn3 87aa 2e-10 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 96aa 3e-06 in ref transcript
  • pfam GCC2_GCC3 37aa 0.002 in ref transcript
    • GCC2 and GCC3.
  • Changed! COG SPS1 317aa 9e-18 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

EPSTI1

  • rs.EPSTI1.F1 rs.EPSTI1.R1 143 176
  • NCBIGene 36.3 94240
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001002264

  • Changed! pfam Borrelia_P83 133aa 0.007 in modified transcript
    • Borrelia P83/100 protein. This family consists of several Borrelia P83/P100 antigen proteins.
  • Changed! COG TolA 169aa 0.009 in modified transcript
    • Membrane protein involved in colicin uptake [Cell envelope biogenesis, outer membrane].

ERBB4

  • rs.ERBB4.F1 rs.ERBB4.R1 114 162
  • NCBIGene 36.3 2066
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005235

  • cd PTKc_HER4 297aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, HER4. Protein Tyrosine Kinase (PTK) family; HER4 (ErbB4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. HER4 is a member of the EGFR (HER, ErbB) subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular EGF-related ligand-binding region, a transmembrane helix, and a cytoplasmic region with a tyr kinase domain and a regulatory C-terminal tail. Unlike other tyr kinases, phosphorylation of the activation loop of EGFR proteins is not critical to their activation. Instead, they are activated by ligand-induced dimerization, leading to the phosphorylation of tyr residues in the C-terminal tail, which serve as binding sites for downstream signaling molecules. Ligands that bind HER4 fall into two groups, the neuregulins (or heregulins) and some EGFR (HER1) ligands including betacellulin, HBEGF, and epiregulin. All four neuregulins (NRG1-4) interact with HER4. Upon ligand binding, HER4 forms homo- or heterodimers with other HER proteins. HER4 is essential in embryonic development. It is implicated in mammary gland, cardiac, and neural development. As a postsynaptic receptor of NRG1, HER4 plays an important role in synaptic plasticity and maturation. The impairment of NRG1/HER4 signaling may contribute to schizophrenia.
  • cd FU 46aa 4e-05 in ref transcript
    • Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
  • cd FU 43aa 1e-04 in ref transcript
  • pfam Pkinase_Tyr 257aa 1e-112 in ref transcript
    • Protein tyrosine kinase.
  • pfam Furin-like 149aa 2e-41 in ref transcript
    • Furin-like cysteine rich region.
  • pfam Recep_L_domain 113aa 7e-30 in ref transcript
    • Receptor L domain. The L domains from these receptors make up the bilobal ligand binding site. Each L domain consists of a single-stranded right hand beta-helix. This Pfam entry is missing the first 50 amino acid residues of the domain.
  • pfam Recep_L_domain 121aa 4e-22 in ref transcript
  • smart FU 48aa 1e-06 in ref transcript
    • Furin-like repeats.
  • pfam Furin-like 101aa 2e-06 in ref transcript
  • COG SPS1 230aa 4e-21 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

ERC1

  • rs.ERC1.F1 rs.ERC1.R1 108 120
  • NCBIGene 36.3 23085
  • Single exon skipping, size difference: 12
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_178040

  • Changed! pfam Cast 829aa 1e-148 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • pfam RBD-FIP 41aa 1e-07 in ref transcript
    • FIP domain. The FIP domain is the Rab11-binding domain (RBD) at the C-terminus of a family of Rab11-interacting proteins (FIPs). The Rab proteins constitute the largest family of small GTPases (>60 members in mammals). Among them Rab11 is a well characterised regulator of endocytic and recycling pathways. Rab11 associates with a broad range of post-Golgi organelles, including recycling endosomes.
  • COG Smc 340aa 3e-09 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG Smc 318aa 3e-08 in ref transcript
  • COG Smc 298aa 5e-06 in ref transcript
  • Changed! pfam Cast 833aa 1e-149 in modified transcript
  • Changed! COG Smc 264aa 6e-07 in modified transcript

ERCC8

  • rs.ERCC8.F1 rs.ERCC8.R1 129 347
  • NCBIGene 36.3 1161
  • Single exon skipping, size difference: 218
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_000082

  • Changed! cd WD40 322aa 3e-30 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! smart WD40 41aa 2e-05 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • Changed! smart WD40 32aa 0.007 in ref transcript
  • Changed! COG COG2319 358aa 2e-16 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

ERGIC3

  • rs.ERGIC3.F1 rs.ERGIC3.R1 100 115
  • NCBIGene 36.3 51614
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198398

  • Changed! pfam DUF1692 188aa 1e-70 in ref transcript
    • Protein of unknown function (DUF1692). This family contains many hypothetical proteins. It also contains proteins which are involved in transport between the ER and Golgi complex protein transport from ER to Golgi.
  • Changed! pfam DUF1692 183aa 1e-72 in modified transcript

EVI1

  • rs.EVI1.F1 rs.EVI1.R1 114 176
  • NCBIGene 36.3 2122
  • Single exon skipping, size difference: 62
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001105077

  • Changed! smart SET 57aa 1e-04 in ref transcript
    • SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.
  • pfam zf-C2H2 23aa 0.007 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.

EXOC1

  • rs.EXOC1.F1 rs.EXOC1.R1 254 299
  • NCBIGene 36.3 55763
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001024924

  • Changed! pfam Sec3 703aa 0.0 in ref transcript
    • Exocyst complex component Sec3. This entry is the conserved middle and C-terminus of the Sec3 protein. Sec3 binds to the C-terminal cytoplasmic domain of GLYT1 (glycine transporter protein 1). Sec3 is the exocyst component that is closest to the plasma membrane docking site and it serves as a spatial landmark in the plasma membrane for incoming secretory vesicles. Sec3 is recruited to the sites of polarised membrane growth through its interaction with Rho1p, a small GTP-binding protein.
  • Changed! pfam Sec3 688aa 0.0 in modified transcript

EYA1

  • rs.EYA1.F1 rs.EYA1.R1 100 115
  • NCBIGene 36.3 2138
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000503

  • TIGR EYA-cons_domain 271aa 1e-126 in ref transcript
    • This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.

EYA2

  • rs.EYA2.F1 rs.EYA2.R1 125 197
  • NCBIGene 36.3 2139
  • Alternative 5-prime and 3-prime, size difference: 72
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172113

  • TIGR EYA-cons_domain 272aa 1e-118 in ref transcript
    • This domain is common to all eyes absent (EYA) homologs. Metazoan EYA's also contain a variable N-terminal domain consisting largely of low-complexity sequences.

EZH2

  • rs.EZH2.F1 rs.EZH2.R1 100 115
  • NCBIGene 36.3 2146
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004456

  • pfam SET 124aa 3e-36 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • COG COG2940 208aa 4e-15 in ref transcript
    • Proteins containing SET domain [General function prediction only].

F7

  • rs.F7.F1 rs.F7.R1 139 205
  • NCBIGene 36.3 2155
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000131

  • cd Tryp_SPc 237aa 3e-57 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • cd EGF_CA 37aa 9e-06 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart Tryp_SPc 235aa 1e-57 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! smart GLA 63aa 1e-21 in ref transcript
    • Domain containing Gla (gamma-carboxyglutamate) residues. A hyaluronan-binding domain found in proteins associated with the extracellular matrix, cell adhesion and cell migration.
  • smart EGF_CA 37aa 2e-04 in ref transcript
    • Calcium-binding EGF-like domain.
  • COG COG5640 252aa 1e-14 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! smart GLA 62aa 3e-21 in modified transcript

FAM104A

  • rs.FAM104A.F1 rs.FAM104A.R1 219 282
  • NCBIGene 36.3 84923
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098832

FAM107A

  • rs.FAM107A.F1 rs.FAM107A.R1 294 392
  • NCBIGene 36.3 11170
  • Single exon skipping, size difference: 98
  • Exclusion in 5'UTR
  • Reference transcript: NM_007177

  • pfam DUF1151 118aa 1e-24 in ref transcript
    • Protein of unknown function (DUF1151). This family consists of several hypothetical eukaryotic proteins of unknown function.

FAM123C

  • rs.FAM123C.F1 rs.FAM123C.R1 239 370
  • NCBIGene 36.3 205147
  • Alternative 5-prime, size difference: 131
  • Exclusion in 5'UTR
  • Reference transcript: NM_001105193

  • pfam WTX 371aa 1e-103 in ref transcript
    • WTX protein. The WTX protein is found to be inactivated in one third of Wilms tumours. The WTX protein is functionally uncharacterised.

FAM135A

  • rs.FAM135A.F1 rs.FAM135A.R1 116 245
  • NCBIGene 36.3 57579
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 129
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001105531

  • pfam DUF676 197aa 2e-59 in ref transcript
    • Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold.

FAM135A

  • rs.FAM135A.F2 rs.FAM135A.R2 101 179
  • NCBIGene 36.3 57579
  • Single exon skipping, size difference: 78
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001105531

  • pfam DUF676 197aa 2e-59 in ref transcript
    • Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold.

FAM135A

  • rs.FAM135A.F3 rs.FAM135A.R3 171 257
  • NCBIGene 36.3 57579
  • Single exon skipping, size difference: 86
  • Exclusion in 5'UTR
  • Reference transcript: NM_001105531

  • pfam DUF676 197aa 2e-59 in ref transcript
    • Putative serine esterase (DUF676). This family of proteins are probably serine esterase type enzymes with an alpha/beta hydrolase fold.

FAM33A

  • rs.FAM33A.F1 rs.FAM33A.R1 222 399
  • NCBIGene 36.3 348235
  • Single exon skipping, size difference: 177
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182620

FAM33A

  • rs.FAM33A.F2 rs.FAM33A.R2 202 377
  • NCBIGene 36.3 348235
  • Alternative 5-prime, size difference: 175
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_182620

FAM48A

  • rs.FAM48A.F1 rs.FAM48A.R1 350 432
  • NCBIGene 36.3 55578
  • Alternative 3-prime, size difference: 3
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001014286

FAM54A

  • rs.FAM54A.F1 rs.FAM54A.R1 197 485
  • NCBIGene 36.3 113115
  • Alternative 5-prime, size difference: 288
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099286

  • pfam DUF729 266aa 1e-74 in ref transcript
    • Protein of unknown function (DUF729). This family consists of several uncharacterised eukaryotic proteins of unknown function.

FAM54B

  • rs.FAM54B.F1 rs.FAM54B.R1 181 290
  • NCBIGene 36.3 56181
  • Alternative 3-prime, size difference: 109
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001099625

  • Changed! pfam DUF729 248aa 9e-51 in ref transcript
    • Protein of unknown function (DUF729). This family consists of several uncharacterised eukaryotic proteins of unknown function.
  • Changed! pfam DUF729 148aa 6e-39 in modified transcript

FAM70A

  • rs.FAM70A.F1 rs.FAM70A.R1 217 289
  • NCBIGene 36.3 55026
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017938

FAM72A

  • rs.FAM72A.F1 rs.FAM72A.R1 237 357
  • NCBIGene 36.3 729533
  • Alternative 5-prime, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001133365

FAM89B

  • rs.FAM89B.F1 rs.FAM89B.R1 90 100
  • NCBIGene 36.3 23625
  • Alternative 3-prime, size difference: 10
  • Inclusion in 5'UTR
  • Reference transcript: NM_001098785

FAS

  • rs.FAS.F1 rs.FAS.R1 187 296
  • NCBIGene 36.3 355
  • Single exon skipping, size difference: 109
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000043

  • Changed! cd TNFR 95aa 5e-16 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! smart DEATH 88aa 2e-16 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • Changed! pfam TNFR_c6 37aa 4e-05 in ref transcript
    • TNFR/NGFR cysteine-rich region.
  • Changed! cd TNFR 59aa 2e-06 in modified transcript

FAS

  • rs.FAS.F2 rs.FAS.R2 104 167
  • NCBIGene 36.3 355
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000043

  • cd TNFR 95aa 5e-16 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • smart DEATH 88aa 2e-16 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • pfam TNFR_c6 37aa 4e-05 in ref transcript
    • TNFR/NGFR cysteine-rich region.

FAS

  • rs.FAS.F3 rs.FAS.R3 100 125
  • NCBIGene 36.3 355
  • Single exon skipping, size difference: 25
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000043

  • cd TNFR 95aa 5e-16 in ref transcript
    • Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
  • Changed! smart DEATH 88aa 2e-16 in ref transcript
    • DEATH domain, found in proteins involved in cell death (apoptosis). Alpha-helical domain present in a variety of proteins with apoptotic functions. Some (but not all) of these domains form homotypic and heterotypic dimers.
  • pfam TNFR_c6 37aa 4e-05 in ref transcript
    • TNFR/NGFR cysteine-rich region.

FASTK

  • rs.FASTK.F1 rs.FASTK.R1 125 548
  • NCBIGene 36.3 10922
  • Single exon skipping, size difference: 423
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006712

  • pfam FAST_1 67aa 1e-13 in ref transcript
    • FAST kinase-like protein, subdomain 1. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This region is often found immediately N-terminal to the FAST kinase-like protein, subdomain 2.
  • pfam FAST_2 92aa 8e-13 in ref transcript
    • FAST kinase-like protein, subdomain 2. This family represents a conserved region of eukaryotic Fas-activated serine/threonine (FAST) kinases (EC:2.7.1.-) that contains several conserved leucine residues. FAST kinase is rapidly activated during Fas-mediated apoptosis, when it phosphorylates TIA-1, a nuclear RNA-binding protein that has been implicated as an effector of apoptosis. Note that many family members are hypothetical proteins. This subdomain is often found associated with the FAST kinase-like protein, subdomain 2.
  • pfam RAP 58aa 5e-12 in ref transcript
    • RAP domain. This domain is found in various eukaryotic species, particularly in apicomplexans such as Plasmodium falciparum, where it is found in proteins that are important in various parasite-host cell interactions. It is thought to be an RNA-binding domain.

FBXO43

  • rs.FBXO43.F1 rs.FBXO43.R1 209 253
  • NCBIGene 36.3 286151
  • Alternative 5-prime, size difference: 44
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001029860

  • Changed! pfam IBR 43aa 0.002 in ref transcript
    • IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.

FCGR1B

  • rs.FCGR1B.F1 rs.FCGR1B.R1 108 384
  • NCBIGene 36.3 2210
  • Multiple exon skipping, size difference: 276
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001017986

  • Changed! cd IGcam 64aa 8e-04 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • smart IGc2 59aa 4e-04 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! smart IG_like 74aa 6e-04 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.

FCGR2C

  • rs.FCGR2C.F1 rs.FCGR2C.R1 103 117
  • NCBIGene 36.3 9103
  • Alternative 3-prime, size difference: 14
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_201563

FERMT3

  • rs.FERMT3.F1 rs.FERMT3.R1 100 112
  • NCBIGene 36.3 83706
  • Alternative 3-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_178443

  • Changed! cd Unc112 110aa 3e-45 in ref transcript
    • Unc-112 pleckstrin homology (PH) domain. Unc-112 and related proteins contain two FERM domains with a PH domain between them. Both the PH and FERM domains have a PH-like fold. The FERM domains are likely responsible for the role of Unc-112 in organizing beta-integrin. The specific role of the Unc-112 PH domain is not known, but it is predicted to be involved in mediating membrane interactions. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • smart B41 90aa 2e-14 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam FERM_M 49aa 2e-10 in ref transcript
    • FERM central domain. This domain is the central structural domain of the FERM domain.
  • Changed! pfam PH 95aa 2e-09 in ref transcript
    • PH domain. PH stands for pleckstrin homology.
  • Changed! cd Unc112 106aa 5e-47 in modified transcript
  • Changed! pfam PH 96aa 1e-09 in modified transcript

FEZ2

  • rs.FEZ2.F1 rs.FEZ2.R1 241 322
  • NCBIGene 36.3 9637
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042548

  • pfam FEZ 216aa 1e-70 in ref transcript
    • FEZ-like protein. This is a family of eukaryotic proteins thought to be involved in axonal outgrowth and fasciculation. The N-terminal regions of these sequences are less conserved than the C-terminal regions, and are highly acidic. The Caenorhabditis elegans homolog, UNC-76, may play structural and signalling roles in the control of axonal extension and adhesion (particularly in the presence of adjacent neuronal cells) and these roles have also been postulated for other FEZ family proteins. Certain homologs have been definitively found to interact with the N-terminal variable region (V1) of PKC-zeta, and this interaction causes cytoplasmic translocation of the FEZ family protein in mammalian neuronal cells. The C-terminal region probably participates in the association with the regulatory domain of PKC-zeta. The members of this family are predicted to form coiled-coil structures, which may interact with members of the RhoA family of signalling proteins, but are not thought to contain other characteristic protein motifs. Certain members of this family are expressed almost exclusively in the brain, whereas others (such as FEZ2) are expressed in other tissues, and are thought to perform similar but unknown functions in these tissues.

FGF8

  • rs.FGF8.F2 rs.FGF8.R2 123 156
  • NCBIGene 36.3 2253
  • Alternative 3-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033163

  • cd FGF 125aa 2e-28 in ref transcript
    • Acidic and basic fibroblast growth factor family; FGFs are mitogens, which stimulate growth or differentiation of cells of mesodermal or neuroectodermal origin. The family plays essential roles in patterning and differentiation during vertebrate embryogenesis, and has neurotrophic activities. FGFs have a high affinity for heparan sulfate proteoglycans and require heparan sulfate to activate one of four cell surface FGF receptors. Upon binding to FGF, the receptors dimerize and their intracellular tyrosine kinase domains become active. FGFs have internal pseudo-threefold symmetry (beta-trefoil topology).
  • pfam FGF 123aa 2e-38 in ref transcript
    • Fibroblast growth factor. Fibroblast growth factors are a family of proteins involved in growth and differentiation in a wide range of contexts. They are found in a wide range of organisms, from nematodes to humans. Most share an internal core region of high similarity, conserved residues in which are involved in binding with their receptors. On binding, they cause dimerisation of their tyrosine kinase receptors leading to intracellular signalling. There are currently four known tyrosine kinase receptors for fibroblast growth factors. These receptors can each bind several different members of this family. Members of this family have a beta trefoil structure. Most have N-terminal signal peptides and are secreted. A few lack signal sequences but are secreted anyway; still others also lack the signal peptide but are found on the cell surface and within the extracellular matrix. A third group remain intracellular. They have central roles in development, regulating cell proliferation, migration and differentiation. On the other hand, they are important in tissue repair following injury in adult organisms.

FGFR1

  • rs.FGFR1.F1 rs.FGFR1.R1 107 218
  • NCBIGene 36.3 2260
  • Alternative 5-prime and 3-prime, size difference: 111
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_023110

  • Changed! cd PTKc_FGFR1 307aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 1. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 1 (FGFR1); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR1 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Alternative splicing of FGFR1 transcripts produces a variety of isoforms, which are differentially expressed in cells. FGFR1 binds the ligands, FGF1 and FGF2, with high affinity and has also been reported to bind FGF4, FGF6, and FGF9. FGFR1 signaling is critical in the control of cell migration during embryo development. It promotes cell proliferation in fibroblasts. Nuclear FGFR1 plays a role in the regulation of transcription. Mutations, insertions or deletions of FGFR1 have been identified in patients with Kallman's syndrome (KS), an inherited disorder characterized by hypogonadotropic hypogonadism and loss of olfaction. Aberrant FGFR1 expression has been found in some human cancers including 8P11 myeloproliferative syndrome (EMS), breast cancer, and pancreatic adenocarcinoma.
  • cd IGcam 93aa 6e-15 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 103aa 2e-09 in ref transcript
  • cd IGcam 55aa 0.004 in ref transcript
  • Changed! pfam Pkinase_Tyr 277aa 1e-116 in ref transcript
    • Protein tyrosine kinase.
  • pfam I-set 88aa 2e-13 in ref transcript
    • Immunoglobulin I-set domain.
  • smart IG_like 98aa 4e-13 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 48aa 4e-07 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! COG SPS1 343aa 2e-21 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd PTKc_FGFR1 270aa 1e-158 in modified transcript
  • Changed! pfam Pkinase_Tyr 236aa 2e-95 in modified transcript
  • Changed! COG SPS1 238aa 4e-21 in modified transcript

FGFR1

  • rs.FGFR1.F2 FGFR1.R42 228 495
  • NCBIGene 36.3 2260
  • Single exon skipping, size difference: 267
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_023110

  • cd PTKc_FGFR1 307aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 1. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 1 (FGFR1); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR1 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Alternative splicing of FGFR1 transcripts produces a variety of isoforms, which are differentially expressed in cells. FGFR1 binds the ligands, FGF1 and FGF2, with high affinity and has also been reported to bind FGF4, FGF6, and FGF9. FGFR1 signaling is critical in the control of cell migration during embryo development. It promotes cell proliferation in fibroblasts. Nuclear FGFR1 plays a role in the regulation of transcription. Mutations, insertions or deletions of FGFR1 have been identified in patients with Kallman's syndrome (KS), an inherited disorder characterized by hypogonadotropic hypogonadism and loss of olfaction. Aberrant FGFR1 expression has been found in some human cancers including 8P11 myeloproliferative syndrome (EMS), breast cancer, and pancreatic adenocarcinoma.
  • cd IGcam 93aa 6e-15 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 103aa 2e-09 in ref transcript
  • Changed! cd IGcam 55aa 0.004 in ref transcript
  • pfam Pkinase_Tyr 277aa 1e-116 in ref transcript
    • Protein tyrosine kinase.
  • pfam I-set 88aa 2e-13 in ref transcript
    • Immunoglobulin I-set domain.
  • smart IG_like 98aa 4e-13 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • Changed! smart IGc2 48aa 4e-07 in ref transcript
    • Immunoglobulin C-2 Type.
  • COG SPS1 343aa 2e-21 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

FGFR4

  • rs.FGFR4.F1 FGFR4.R4 261 381
  • NCBIGene 36.3 2264
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 120
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_213647

  • cd PTKc_FGFR4 311aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 4. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 4 (FGFR4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR4 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Unlike other FGFRs, there is only one splice form of FGFR4. It binds FGF1, FGF2, FGF6, FGF19, and FGF23. FGF19 is a selective ligand for FGFR4. Although disruption of FGFR4 in mice causes no obvious phenotype, in vivo inhibition of FGFR4 in cultured skeletal muscle cells resulted in an arrest of muscle progenitor differentiation. FGF6 and FGFR4 are uniquely expressed in myofibers and satellite cells. FGF6/FGFR4 signaling appears to play a key role in the regulation of muscle regeneration. A polymorphism in FGFR4 is found in head and neck squamous cell carcinoma.
  • cd IGcam 93aa 7e-17 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 101aa 2e-07 in ref transcript
  • cd IGcam 67aa 0.003 in ref transcript
  • pfam Pkinase_Tyr 277aa 1e-114 in ref transcript
    • Protein tyrosine kinase.
  • smart IGc2 67aa 3e-15 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 96aa 5e-12 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 58aa 7e-09 in ref transcript
  • COG SPS1 251aa 8e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

FGFR4

  • rs.FGFR4.F2 rs.FGFR4.R2 167 211
  • NCBIGene 36.3 2264
  • Alternative 5-prime, size difference: 44
  • Exclusion in 5'UTR
  • Reference transcript: NM_213647

  • cd PTKc_FGFR4 311aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Fibroblast Growth Factor Receptor 4. Protein Tyrosine Kinase (PTK) family; Fibroblast Growth Factor Receptor 4 (FGFR4); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. FGFR4 is part of the FGFR subfamily, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with three immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of FGFRs to their ligands, the FGFs, results in receptor dimerization and activation, and intracellular signaling. The binding of FGFs to FGFRs is promiscuous, in that a receptor may be activated by several ligands and a ligand may bind to more that one type of receptor. Unlike other FGFRs, there is only one splice form of FGFR4. It binds FGF1, FGF2, FGF6, FGF19, and FGF23. FGF19 is a selective ligand for FGFR4. Although disruption of FGFR4 in mice causes no obvious phenotype, in vivo inhibition of FGFR4 in cultured skeletal muscle cells resulted in an arrest of muscle progenitor differentiation. FGF6 and FGFR4 are uniquely expressed in myofibers and satellite cells. FGF6/FGFR4 signaling appears to play a key role in the regulation of muscle regeneration. A polymorphism in FGFR4 is found in head and neck squamous cell carcinoma.
  • cd IGcam 93aa 7e-17 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 101aa 2e-07 in ref transcript
  • cd IGcam 67aa 0.003 in ref transcript
  • pfam Pkinase_Tyr 277aa 1e-114 in ref transcript
    • Protein tyrosine kinase.
  • smart IGc2 67aa 3e-15 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 96aa 5e-12 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 58aa 7e-09 in ref transcript
  • COG SPS1 251aa 8e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

FKTN

  • rs.FKTN.F1 rs.FKTN.R1 173 265
  • NCBIGene 36.3 2218
  • Single exon skipping, size difference: 92
  • Exclusion in 5'UTR
  • Reference transcript: NM_001079802

  • pfam LicD 136aa 1e-08 in ref transcript
    • LICD Protein Family. The LICD family of proteins show high sequence similarity and are involved in phosphorylcholine metabolism. There is evidence to show that LicD2 mutants have a reduced ability to take up choline, have decreased ability to adhere to host cells and are less virulent.

FLJ11151

  • rs.FLJ11151.F1 rs.FLJ11151.R1 114 540
  • NCBIGene 36.3 55313
  • Single exon skipping, size difference: 426
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018340

  • Changed! pfam Metallophos 172aa 3e-04 in ref transcript
    • Calcineurin-like phosphoesterase. This family includes a diverse range of phosphoesterases, including protein phosphoserine phosphatases, nucleotidases, sphingomyelin phosphodiesterases and 2'-3' cAMP phosphodiesterases as well as nucleases such as bacterial SbcD or yeast MRE11. The most conserved regions in this superfamily centre around the metal chelating residues.
  • Changed! COG Icc 203aa 1e-09 in ref transcript
    • Predicted phosphohydrolases [General function prediction only].

FLJ11171

  • rs.FLJ11171.F1 rs.FLJ11171.R1 181 231
  • NCBIGene 36.3 55783
  • Alternative 5-prime, size difference: 50
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099642

  • pfam FtsJ 209aa 2e-18 in ref transcript
    • FtsJ-like methyltransferase. This family consists of FtsJ from various bacterial and archaeal sources FtsJ is a methyltransferase, but actually has no effect on cell division. FtsJ's substrate is the 23S rRNA. The 1.5 A crystal structure of FtsJ in complex with its cofactor S-adenosylmethionine revealed that FtsJ has a methyltransferase fold. This family also includes the N terminus of flaviviral NS5 protein. It has been hypothesised that the N-terminal domain of NS5 is a methyltransferase involved in viral RNA capping.
  • COG FtsJ 117aa 2e-08 in ref transcript
    • 23S rRNA methylase [Translation, ribosomal structure and biogenesis].

FLJ13611

  • rs.FLJ13611.F1 rs.FLJ13611.R1 110 128
  • NCBIGene 36.3 80006
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001093755

  • Changed! pfam DUF974 238aa 1e-113 in ref transcript
    • Protein of unknown function (DUF974). Family of uncharacterised eukaryotic proteins.
  • Changed! pfam DUF974 232aa 1e-115 in modified transcript

FLJ14154

  • rs.FLJ14154.F1 rs.FLJ14154.R1 100 122
  • NCBIGene 36.3 79903
  • Alternative 5-prime, size difference: 22
  • Exclusion in 3'UTR
  • Reference transcript: NM_024845

  • cd GNAT 103aa 2e-07 in ref transcript
    • GCN5-related N-acetyltransferases (GNAT) represent a large superfamily of functionally diverse enzymes that catalyze the transfer of an acetyl group from acetyl-Coenzyme A to the primary amine of a wide range of acceptor substrates. Members of this superfamily include aminoglycoside N-acetyltransferases, serotonin N-acetyltransferase, glucosamine-6-phosphate N-acetyltransferase, the histone acetyltransferases, mycothiol synthase, and the Fem family of amino acyl transferases.
  • pfam Acetyltransf_1 78aa 2e-09 in ref transcript
    • Acetyltransferase (GNAT) family. This family contains proteins with N-acetyltransferase functions such as Elp3-related proteins.
  • COG RimI 71aa 4e-08 in ref transcript
    • Acetyltransferases [General function prediction only].

FLJ20254

  • rs.FLJ20254.F1 rs.FLJ20254.R1 175 310
  • NCBIGene 36.3 54867
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017727

  • pfam DUF2359 469aa 0.0 in ref transcript
    • Uncharacterised conserved protein (DUF2359). This is a 450 amino acid region of a family of proteins conserved from insects to humans. The mouse protein, Q8BM55, is annotated as being a putative Vitamin K-dependent carboxylation gamma-carboxyglutamic (GLA) domain containing protein, but this could not be confirmed. The function is not known.

FLJ22167

  • rs.FLJ22167.F1 rs.FLJ22167.R1 100 223
  • NCBIGene 36.3 79583
  • Alternative 3-prime, size difference: 123
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001077416

  • Changed! pfam NAcGluc_Transf 261aa 1e-100 in ref transcript
    • Carbohydrate (N-acetylglucosamine 6-O) sulfotransferase 5. Members of this family are involved in the biosynthesis of 6-Sulfo Sialyl Lewis X molecules, by catalysing the transfer of sulphate from 3'-phosphoadenosine 5'-phosphosulfate to position 6 of a non-reducing N-acetylglucosamine (GlcNAc) residue.
  • Changed! pfam NAcGluc_Transf 71aa 8e-19 in modified transcript

FLJ25404

  • rs.FLJ25404.F1 rs.FLJ25404.R1 200 280
  • NCBIGene 36.3 146378
  • Alternative 3-prime, size difference: 80
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001109660

FLJ30428

  • rs.FLJ30428.F1 rs.FLJ30428.R1 306 529
  • NCBIGene 36.3 150519
  • Multiple exon skipping, size difference: 223
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: XM_001723916

  • pfam DUF1886 59aa 0.001 in ref transcript
    • Domain of unknown function (DUF1886). This domain is predominantly found in the Archaeal protein N-glycosylase/DNA lyase.

FLJ42562

  • rs.FLJ42562.F1 rs.FLJ42562.R1 101 113
  • NCBIGene 36.3 400954
  • Alternative 5-prime, size difference: 12
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: XM_001725001

  • cd WD40 306aa 8e-30 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 297aa 3e-29 in ref transcript
  • cd WD40 351aa 5e-19 in ref transcript
  • Changed! cd WD40 274aa 1e-17 in ref transcript
  • cd WD40 308aa 2e-17 in ref transcript
  • cd WD40 274aa 3e-17 in ref transcript
  • pfam HELP 47aa 7e-08 in ref transcript
    • HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localise to membranes. A preliminary study has shown that the N terminus of Sea urchin EMAP containing HELP is sufficient for microtubule binding in vitro (Eichenmuller et al In press).
  • pfam HELP 29aa 3e-06 in ref transcript
  • pfam HELP 41aa 2e-05 in ref transcript
  • pfam WD40 34aa 0.010 in ref transcript
    • WD domain, G-beta repeat.
  • Changed! COG COG2319 403aa 1e-26 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG COG2319 372aa 3e-21 in ref transcript
  • COG COG2319 399aa 5e-19 in ref transcript
  • COG COG2319 280aa 2e-14 in ref transcript
  • COG COG2319 324aa 2e-10 in ref transcript
  • Changed! cd WD40 278aa 6e-16 in modified transcript
  • Changed! COG COG2319 407aa 8e-26 in modified transcript

FLJ42562

  • rs.FLJ42562.F2 rs.FLJ42562.R2 249 345
  • NCBIGene 36.3 400954
  • Alternative 5-prime, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001725001

  • cd WD40 306aa 8e-30 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 297aa 3e-29 in ref transcript
  • Changed! cd WD40 351aa 5e-19 in ref transcript
  • cd WD40 274aa 1e-17 in ref transcript
  • Changed! cd WD40 308aa 2e-17 in ref transcript
  • cd WD40 274aa 3e-17 in ref transcript
  • pfam HELP 47aa 7e-08 in ref transcript
    • HELP motif. The HELP (Hydrophobic ELP) motif is found in EMAP and EMAP-like proteins (ELPs). The HELP motif contains a predicted transmembrane helix so probably does not form a globular domain. It is also not clear if these proteins localise to membranes. A preliminary study has shown that the N terminus of Sea urchin EMAP containing HELP is sufficient for microtubule binding in vitro (Eichenmuller et al In press).
  • pfam HELP 29aa 3e-06 in ref transcript
  • pfam HELP 41aa 2e-05 in ref transcript
  • pfam WD40 34aa 0.010 in ref transcript
    • WD domain, G-beta repeat.
  • COG COG2319 403aa 1e-26 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG COG2319 372aa 3e-21 in ref transcript
  • Changed! COG COG2319 399aa 5e-19 in ref transcript
  • COG COG2319 280aa 2e-14 in ref transcript
  • COG COG2319 324aa 2e-10 in ref transcript
  • Changed! cd WD40 306aa 8e-18 in modified transcript
  • Changed! cd WD40 300aa 4e-17 in modified transcript
  • Changed! COG COG2319 361aa 1e-18 in modified transcript

FLJ46266

  • rs.FLJ46266.F1 rs.FLJ46266.R1 144 225
  • NCBIGene 36.3 399949
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_207430

FLJ77644

  • rs.FLJ77644.F1 rs.FLJ77644.R1 258 399
  • NCBIGene 36.3 728772
  • Single exon skipping, size difference: 141
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001133074

  • Changed! pfam DUF1356 241aa 1e-109 in ref transcript
    • Protein of unknown function (DUF1356). This family consists of several hypothetical mammalian proteins of around 250 residues in length. The function of this family is unknown.
  • Changed! pfam DUF1356 194aa 8e-81 in modified transcript

FLJ77644

  • rs.FLJ77644.F2 rs.FLJ77644.R2 134 302
  • NCBIGene 36.3 728772
  • Single exon skipping, size difference: 168
  • Exclusion in 5'UTR
  • Reference transcript: XM_001133074

  • pfam DUF1356 241aa 1e-109 in ref transcript
    • Protein of unknown function (DUF1356). This family consists of several hypothetical mammalian proteins of around 250 residues in length. The function of this family is unknown.

FN1

  • rs.FN1.F1 rs.FN1.R1 99 369
  • NCBIGene 36.3 2335
  • Single exon skipping, size difference: 270
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212482

  • cd FN2 48aa 2e-16 in ref transcript
    • Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
  • cd FN2 48aa 3e-16 in ref transcript
  • cd FN3 83aa 4e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN1 44aa 4e-09 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • cd FN3 81aa 8e-09 in ref transcript
  • cd FN3 80aa 1e-08 in ref transcript
  • cd FN1 40aa 4e-08 in ref transcript
  • cd FN1 42aa 5e-08 in ref transcript
  • cd FN1 45aa 8e-08 in ref transcript
  • cd FN3 88aa 1e-07 in ref transcript
  • cd FN3 88aa 2e-07 in ref transcript
  • cd FN1 44aa 2e-07 in ref transcript
  • cd FN3 87aa 3e-07 in ref transcript
  • cd FN1 38aa 4e-07 in ref transcript
  • Changed! cd FN3 81aa 9e-07 in ref transcript
  • cd FN3 82aa 1e-06 in ref transcript
  • Changed! cd FN3 88aa 1e-06 in ref transcript
  • cd FN1 41aa 1e-06 in ref transcript
  • cd FN3 93aa 2e-06 in ref transcript
  • cd FN1 45aa 2e-06 in ref transcript
  • cd FN1 42aa 2e-06 in ref transcript
  • cd FN3 81aa 6e-06 in ref transcript
  • cd FN3 90aa 8e-06 in ref transcript
  • cd FN3 73aa 9e-06 in ref transcript
  • cd FN1 44aa 1e-05 in ref transcript
  • cd FN1 43aa 3e-05 in ref transcript
  • cd FN1 39aa 0.003 in ref transcript
  • cd FN3 73aa 0.006 in ref transcript
  • smart FN2 49aa 9e-20 in ref transcript
    • Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
  • smart FN2 49aa 4e-19 in ref transcript
  • pfam fn3 82aa 8e-17 in ref transcript
    • Fibronectin type III domain.
  • smart FN1 45aa 6e-13 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • Changed! pfam fn1 39aa 1e-12 in ref transcript
    • Fibronectin type I domain.
  • pfam fn3 81aa 3e-12 in ref transcript
  • pfam fn3 81aa 3e-12 in ref transcript
  • smart FN1 45aa 3e-12 in ref transcript
  • smart FN1 42aa 3e-12 in ref transcript
  • smart FN1 45aa 5e-12 in ref transcript
  • smart FN1 39aa 2e-11 in ref transcript
  • pfam fn3 81aa 4e-11 in ref transcript
  • pfam fn3 83aa 5e-11 in ref transcript
  • pfam fn3 67aa 8e-11 in ref transcript
  • pfam fn1 39aa 8e-11 in ref transcript
  • pfam fn3 81aa 1e-10 in ref transcript
  • pfam fn3 80aa 1e-10 in ref transcript
  • smart FN1 43aa 2e-10 in ref transcript
  • pfam fn3 80aa 5e-10 in ref transcript
  • smart FN1 44aa 7e-10 in ref transcript
  • pfam fn3 81aa 8e-09 in ref transcript
  • pfam fn3 85aa 9e-09 in ref transcript
  • pfam fn3 70aa 1e-08 in ref transcript
  • pfam fn3 65aa 2e-08 in ref transcript
  • smart FN1 41aa 3e-08 in ref transcript
  • Changed! pfam fn3 81aa 3e-07 in ref transcript
  • smart FN1 41aa 5e-07 in ref transcript
  • pfam fn3 54aa 5e-05 in ref transcript
  • pfam fn1 31aa 9e-05 in ref transcript
  • Changed! cd FN3 89aa 3e-07 in modified transcript
  • Changed! smart FN1 44aa 2e-12 in modified transcript

FN1

  • rs.FN1.F2 rs.FN1.R2 156 231
  • NCBIGene 36.3 2335
  • Alternative 3-prime, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_212482

  • cd FN2 48aa 2e-16 in ref transcript
    • Fibronectin Type II domain: FN2 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. Fibronectin is composed of 3 types of modules, FN1,FN2 and FN3. The collagen binding domain contains four FN1 and two FN2 repeats.
  • cd FN2 48aa 3e-16 in ref transcript
  • cd FN3 83aa 4e-10 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN1 44aa 4e-09 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • cd FN3 81aa 8e-09 in ref transcript
  • cd FN3 80aa 1e-08 in ref transcript
  • cd FN1 40aa 4e-08 in ref transcript
  • cd FN1 42aa 5e-08 in ref transcript
  • cd FN1 45aa 8e-08 in ref transcript
  • cd FN3 88aa 1e-07 in ref transcript
  • cd FN3 88aa 2e-07 in ref transcript
  • cd FN1 44aa 2e-07 in ref transcript
  • cd FN3 87aa 3e-07 in ref transcript
  • cd FN1 38aa 4e-07 in ref transcript
  • cd FN3 81aa 9e-07 in ref transcript
  • cd FN3 82aa 1e-06 in ref transcript
  • cd FN3 88aa 1e-06 in ref transcript
  • cd FN1 41aa 1e-06 in ref transcript
  • cd FN3 93aa 2e-06 in ref transcript
  • cd FN1 45aa 2e-06 in ref transcript
  • cd FN1 42aa 2e-06 in ref transcript
  • cd FN3 81aa 6e-06 in ref transcript
  • cd FN3 90aa 8e-06 in ref transcript
  • cd FN3 73aa 9e-06 in ref transcript
  • cd FN1 44aa 1e-05 in ref transcript
  • cd FN1 43aa 3e-05 in ref transcript
  • cd FN1 39aa 0.003 in ref transcript
  • cd FN3 73aa 0.006 in ref transcript
  • smart FN2 49aa 9e-20 in ref transcript
    • Fibronectin type 2 domain. One of three types of internal repeat within the plasma protein, fibronectin. Also occurs in coagulation factor XII, 2 type IV collagenases, PDC-109, and cation-independent mannose-6-phosphate and secretory phospholipase A2 receptors. In fibronectin, PDC-109, and the collagenases, this domain contributes to collagen-binding function.
  • smart FN2 49aa 4e-19 in ref transcript
  • pfam fn3 82aa 8e-17 in ref transcript
    • Fibronectin type III domain.
  • smart FN1 45aa 6e-13 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • pfam fn1 39aa 1e-12 in ref transcript
    • Fibronectin type I domain.
  • pfam fn3 81aa 3e-12 in ref transcript
  • pfam fn3 81aa 3e-12 in ref transcript
  • smart FN1 45aa 3e-12 in ref transcript
  • smart FN1 42aa 3e-12 in ref transcript
  • smart FN1 45aa 5e-12 in ref transcript
  • smart FN1 39aa 2e-11 in ref transcript
  • pfam fn3 81aa 4e-11 in ref transcript
  • pfam fn3 83aa 5e-11 in ref transcript
  • pfam fn3 67aa 8e-11 in ref transcript
  • pfam fn1 39aa 8e-11 in ref transcript
  • pfam fn3 81aa 1e-10 in ref transcript
  • pfam fn3 80aa 1e-10 in ref transcript
  • smart FN1 43aa 2e-10 in ref transcript
  • pfam fn3 80aa 5e-10 in ref transcript
  • smart FN1 44aa 7e-10 in ref transcript
  • pfam fn3 81aa 8e-09 in ref transcript
  • pfam fn3 85aa 9e-09 in ref transcript
  • pfam fn3 70aa 1e-08 in ref transcript
  • pfam fn3 65aa 2e-08 in ref transcript
  • smart FN1 41aa 3e-08 in ref transcript
  • pfam fn3 81aa 3e-07 in ref transcript
  • smart FN1 41aa 5e-07 in ref transcript
  • pfam fn3 54aa 5e-05 in ref transcript
  • pfam fn1 31aa 9e-05 in ref transcript

FNTA

  • rs.FNTA.F1 rs.FNTA.R1 297 383
  • NCBIGene 36.3 2339
  • Single exon skipping, size difference: 86
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002027

  • Changed! pfam PPTA 30aa 1e-05 in ref transcript
    • Protein prenyltransferase alpha subunit repeat. Both farnesyltransferase (FT) and geranylgeranyltransferase 1 (GGT1) recognise a CaaX motif on their substrates where 'a' stands for preferably aliphatic residues, whereas GGT2 recognises a completely different motif. Important substrates for FT include, amongst others, many members of the Ras superfamily. GGT1 substrates include some of the other small GTPases and GGT2 substrates include the Rab family.
  • Changed! pfam PPTA 30aa 2e-05 in ref transcript
  • Changed! pfam PPTA 30aa 0.002 in ref transcript
  • Changed! pfam PPTA 29aa 0.003 in ref transcript
  • Changed! COG BET4 211aa 1e-31 in ref transcript
    • Protein prenyltransferase, alpha subunit [Posttranslational modification, protein turnover, chaperones].

FOXN3

  • rs.FOXN3.F1 rs.FOXN3.R1 129 195
  • NCBIGene 36.3 1112
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001085471

  • cd FH 79aa 1e-31 in ref transcript
    • Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
  • smart FH 88aa 4e-34 in ref transcript
    • FORKHEAD. FORKHEAD, also known as a "winged helix".
  • COG COG5025 106aa 5e-15 in ref transcript
    • Transcription factor of the Forkhead/HNF3 family [Transcription].

FOXP4

  • rs.FOXP4.F1 rs.FOXP4.R1 175 211
  • NCBIGene 36.3 116113
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012426

  • cd FH 73aa 8e-26 in ref transcript
    • Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
  • smart FH 73aa 5e-29 in ref transcript
    • FORKHEAD. FORKHEAD, also known as a "winged helix".
  • Changed! COG COG5025 197aa 5e-12 in ref transcript
    • Transcription factor of the Forkhead/HNF3 family [Transcription].
  • Changed! COG COG5025 98aa 2e-10 in modified transcript
  • Changed! COG COG5025 234aa 2e-08 in modified transcript

FOXRED2

  • rs.FOXRED2.F1 rs.FOXRED2.R1 255 352
  • NCBIGene 36.3 80020
  • Alternative 5-prime, size difference: 97
  • Exclusion in 5'UTR
  • Reference transcript: NM_024955

  • pfam HI0933_like 154aa 4e-07 in ref transcript
    • HI0933-like protein.
  • TIGR TRX_reduct 86aa 0.005 in ref transcript
    • This model describes thioredoxin-disulfide reductase, a member of the pyridine nucleotide-disulphide oxidoreductases (pfam00070).
  • COG TrkA 219aa 6e-18 in ref transcript
    • Predicted flavoprotein involved in K+ transport [Inorganic ion transport and metabolism].

FRS2

  • rs.FRS2.F1 rs.FRS2.R1 117 185
  • NCBIGene 36.3 10818
  • Single exon skipping, size difference: 68
  • Exclusion in 5'UTR
  • Reference transcript: NM_006654

  • cd FRS2 97aa 4e-43 in ref transcript
    • Fibroblast growth factor receptor substrate 2 (FRS2/SNT1) Phosphotyrosine-binding domain (IRS1-like). FRS2 mediates signaling downstream of the FGF receptor. It has an N-terminal PTBi domain, which has a PH-like fold and is similiar to the PTB domain that is found in insulin receptor substrate molecules. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs.
  • smart PTBI 91aa 1e-24 in ref transcript
    • Phosphotyrosine-binding domain (IRS1-like).

FRS2

  • rs.FRS2.F2 rs.FRS2.R2 226 287
  • NCBIGene 36.3 10818
  • Single exon skipping, size difference: 61
  • Exclusion in 5'UTR
  • Reference transcript: NM_001042555

  • cd FRS2 97aa 4e-43 in ref transcript
    • Fibroblast growth factor receptor substrate 2 (FRS2/SNT1) Phosphotyrosine-binding domain (IRS1-like). FRS2 mediates signaling downstream of the FGF receptor. It has an N-terminal PTBi domain, which has a PH-like fold and is similiar to the PTB domain that is found in insulin receptor substrate molecules. This PTBi domain is shorter than the PTB domain which is found in SHC, Numb and other proteins. The PTBi domain binds to phosphotyrosines which are in NPXpY motifs.
  • smart PTBI 91aa 1e-24 in ref transcript
    • Phosphotyrosine-binding domain (IRS1-like).

FSCN2

  • rs.FSCN2.F1 rs.FSCN2.R1 141 213
  • NCBIGene 36.3 25794
  • Alternative 3-prime, size difference: 72
  • Inclusion in 3'UTR
  • Reference transcript: NM_012418

  • cd Fascin 121aa 2e-24 in ref transcript
    • Fascin-like domain; members include actin-bundling/crosslinking proteins facsin, histoactophilin and singed; identified in sea urchin, Drosophila, Xenopus, rodents, and humans; The fascin-like domain adopts a beta-trefoil topology and contains an internal threefold repeat; the fascin subgroup contains four copies of the domain; Structurally similar to fibroblast growth factor (FGF).
  • cd Fascin 117aa 3e-20 in ref transcript
  • pfam Fascin 113aa 4e-30 in ref transcript
    • Fascin domain. This family consists of several eukaryotic fascin or singed proteins. The fascins are a structurally unique and evolutionarily conserved group of actin cross-linking proteins. Fascins function in the organisation of two major forms of actin-based structures: dynamic, cortical cell protrusions and cytoplasmic microfilament bundles. The cortical structures, which include filopodia, spikes, lamellipodial ribs, oocyte microvilli and the dendrites of dendritic cells, have roles in cell-matrix adhesion, cell interactions and cell migration, whereas the cytoplasmic actin bundles appear to participate in cell architecture. Dictyostelium hisactophilin, another actin-binding protein, is a submembranous pH sensor that signals slight changes of the H+ concentration to actin by inducing actin polymerisation and binding to microfilaments only at pH values below seven. Members of this family are histidine rich, typically contain the repeated motif of HHXH.
  • pfam Fascin 114aa 8e-22 in ref transcript

FSHR

  • rs.FSHR.F1 rs.FSHR.R1 210 396
  • NCBIGene 36.3 2492
  • Single exon skipping, size difference: 186
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000145

  • pfam 7tm_1 242aa 1e-30 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • pfam LRRNT 26aa 0.001 in ref transcript
    • Leucine rich repeat N-terminal domain. Leucine Rich Repeats pfam00560 are short sequence motifs present in a number of proteins with diverse functions and cellular locations. Leucine Rich Repeats are often flanked by cysteine rich domains. This domain is often found at the N-terminus of tandem leucine rich repeats.
  • Changed! COG COG4886 86aa 0.009 in modified transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

FUT2

  • rs.FUT2.F1 rs.FUT2.R1 90 100
  • NCBIGene 36.3 2524
  • Alternative 5-prime, size difference: 10
  • Exclusion in 5'UTR
  • Reference transcript: NM_000511

  • pfam Glyco_transf_11 314aa 1e-133 in ref transcript
    • Glycosyl transferase family 11. This family contains several fucosyl transferase enzymes.

FUT3

  • rs.FUT3.F1 rs.FUT3.R1 112 460
  • NCBIGene 36.3 2525
  • Alternative 3-prime, size difference: 348
  • Inclusion in 5'UTR
  • Reference transcript: NM_001097639

  • pfam Glyco_transf_10 291aa 1e-102 in ref transcript
    • Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10.

FUT6

  • rs.FUT6.F1 rs.FUT6.R1 187 315
  • NCBIGene 36.3 2528
  • Single exon skipping, size difference: 128
  • Exclusion in 5'UTR
  • Reference transcript: NM_000150

  • pfam Glyco_transf_10 354aa 1e-113 in ref transcript
    • Glycosyltransferase family 10 (fucosyltransferase). This family of Fucosyltransferases are the enzymes transferring fucose from GDP-Fucose to GlcNAc in an alpha1,3 linkage. This family is know as glycosyltransferase family 10.

FUT8

  • rs.FUT8.F1 rs.FUT8.R1 120 544
  • NCBIGene 36.3 2530
  • Multiple exon skipping, size difference: 424
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_178155

  • Changed! cd SH3 53aa 5e-05 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! smart SH3 54aa 1e-06 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

FUT8

  • rs.FUT8.F2 rs.FUT8.R2 116 546
  • NCBIGene 36.3 2530
  • Single exon skipping, size difference: 430
  • Exclusion of the protein initiation site
  • Reference transcript: NM_178155

  • cd SH3 53aa 5e-05 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart SH3 54aa 1e-06 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

G3BP2

  • rs.G3BP2.F1 rs.G3BP2.R1 149 248
  • NCBIGene 36.3 9908
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_203505

  • cd NTF2 129aa 4e-31 in ref transcript
    • Nuclear transport factor 2 (NTF2) domain plays an important role in the trafficking of macromolecules, ions and small molecules between the cytoplasm and nucleus. This bi-directional transport of macromolecules across the nuclear envelope requires many soluble factors that includes GDP-binding protein Ran (RanGDP). RanGDP is required for both import and export of proteins and poly(A) RNA. RanGDP also has been implicated in cell cycle control, specifically in mitotic spindle assembly. In interphase cells, RanGDP is predominately nuclear and thought to be GTP bound, but it is also present in the cytoplasm, probably in the GDP-bound state. NTF2 mediates the nuclear import of RanGDP. NTF2 binds to both RanGDP and FxFG repeat-containing nucleoporins.
  • cd RRM 73aa 3e-10 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • pfam NTF2 123aa 1e-23 in ref transcript
    • Nuclear transport factor 2 (NTF2) domain. This family includes the NTF2-like Delta-5-3-ketosteroid isomerase proteins.
  • pfam RRM_1 66aa 4e-10 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
  • COG COG0724 71aa 4e-06 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

G6PC2

  • rs.G6PC2.F1 rs.G6PC2.R1 276 392
  • NCBIGene 36.3 57818
  • Single exon skipping, size difference: 116
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_021176

  • Changed! cd PAP2_glucose_6_phosphatase 241aa 5e-91 in ref transcript
    • PAP2_like proteins, glucose-6-phosphatase subfamily. Glucose-6-phosphatase converts glucose-6-phosphate into free glucose and is active in the lumen of the endoplasmic reticulum, where it is bound to the membrane. The generation of free glucose is an important control point in metabolism, and stands at the end of gluconeogenesis and the release of glucose from glycogen. Deficiency of glucose-6-phosphatase leads to von Gierke's disease.
  • Changed! smart acidPPc 135aa 4e-14 in ref transcript
    • Acid phosphatase homologues.
  • Changed! cd PAP2_glucose_6_phosphatase 103aa 8e-40 in modified transcript
  • Changed! smart acidPPc 70aa 2e-08 in modified transcript

GAGE1

  • rs.GAGE1.F1 rs.GAGE1.R1 255 398
  • NCBIGene 36.3 2543
  • Single exon skipping, size difference: 143
  • Exclusion of the stop codon
  • Reference transcript: NM_001468

  • Changed! pfam GAGE 109aa 4e-09 in ref transcript
    • GAGE protein. This family consists of several GAGE and XAGE proteins which are found exclusively in humans. The function of this family is unknown although they have been implicated in human cancers.
  • Changed! pfam GAGE 116aa 3e-08 in modified transcript

GARNL4

  • rs.GARNL4.F1 rs.GARNL4.R1 218 263
  • NCBIGene 36.3 23108
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015085

  • pfam Rap_GAP 188aa 3e-80 in ref transcript
    • Rap/ran-GAP.

GCC2

  • rs.GCC2.F1 rs.GCC2.R1 198 283
  • NCBIGene 36.3 9648
  • Single exon skipping, size difference: 85
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_181453

  • Changed! pfam GRIP 45aa 1e-08 in ref transcript
    • GRIP domain. The GRIP (golgin-97, RanBP2alpha,Imh1p and p230/golgin-245) domain is found in many large coiled-coil proteins. It has been shown to be sufficient for targeting to the Golgi. The GRIP domain contains a completely conserved tyrosine residue. At least some of these domains have been shown to bind to GTPase Arl1, see structures.
  • Changed! TIGR SMC_prok_B 793aa 2e-07 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! TIGR SMC_prok_B 252aa 0.001 in ref transcript
  • Changed! TIGR SMC_prok_B 305aa 0.001 in ref transcript
  • Changed! COG Smc 299aa 5e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! COG Smc 609aa 9e-05 in ref transcript
  • Changed! COG Smc 295aa 0.006 in ref transcript

GCK

  • rs.GCK.F1 rs.GCK.R1 169 293
  • NCBIGene 36.3 2645
  • Single exon skipping, size difference: 124
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_033507

  • Changed! pfam Hexokinase_2 238aa 1e-97 in ref transcript
    • Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam00349. Some members of the family have two copies of each of these domains.
  • Changed! pfam Hexokinase_1 203aa 5e-75 in ref transcript
    • Hexokinase. Hexokinase (EC:2.7.1.1) contains two structurally similar domains represented by this family and pfam03727. Some members of the family have two copies of each of these domains.
  • Changed! COG COG5026 455aa 1e-75 in ref transcript
    • Hexokinase [Carbohydrate transport and metabolism].

GCNT1

  • rs.GCNT1.F1 rs.GCNT1.R1 229 347
  • NCBIGene 36.3 2650
  • Single exon skipping, size difference: 118
  • Exclusion in 5'UTR
  • Reference transcript: NM_001490

  • pfam Branch 210aa 1e-69 in ref transcript
    • Core-2/I-Branching enzyme. This is a family of two different beta-1,6-N-acetylglucosaminyltransferase enzymes, I-branching enzyme and core-2 branching enzyme. I-branching enzyme is responsible for the production of the blood group I-antigen during embryonic development. Core-2 branching enzyme forms crucial side-chain branches in O-glycans.

GDAP1

  • rs.GDAP1.F1 rs.GDAP1.R1 116 246
  • NCBIGene 36.3 54332
  • Alternative 5-prime and 3-prime, size difference: 130
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018972

  • Changed! cd GST_C_GDAP1 111aa 9e-48 in ref transcript
    • GST_C family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal thioredoxin-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates.
  • Changed! cd GST_N_GDAP1 73aa 1e-36 in ref transcript
    • GST_N family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal TRX-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates.
  • Changed! TIGR maiA 118aa 7e-09 in ref transcript
    • Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (EC 5.2.1.2) is described as active also on maleylpyruvate, and the example from a Ralstonia sp. catabolic plasmid is described as a maleylpyruvate isomerase involved in gentisate catabolism.
  • Changed! pfam GST_C 84aa 0.001 in ref transcript
    • Glutathione S-transferase, C-terminal domain. GST conjugates reduced glutathione to a variety of targets including S-crystallin from squid, the eukaryotic elongation factor 1-gamma, the HSP26 family of stress-related proteins and auxin-regulated proteins in plants. Stringent starvation proteins in Escherichia coli are also included in the alignment but are not known to have GST activity. The glutathione molecule binds in a cleft between N and C-terminal domains. The catalytically important residues are proposed to reside in the N-terminal domain. In plants, GSTs are encoded by a large gene family (48 GST genes in Arabidopsis) and can be divided into the phi, tau, theta, zeta, and lambda classes.
  • Changed! COG Gst 267aa 6e-15 in ref transcript
    • Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones].

GEMIN8

  • rs.GEMIN8.F1 rs.GEMIN8.R1 116 204
  • NCBIGene 36.3 54960
  • Alternative 5-prime, size difference: 88
  • Exclusion in 5'UTR
  • Reference transcript: NM_017856

GFRA1

  • rs.GFRA1.F1 rs.GFRA1.R1 100 115
  • NCBIGene 36.3 2674
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005264

  • pfam GDNF 95aa 3e-16 in ref transcript
    • GDNF/GAS1 domain. This cysteine rich domain is found in multiple copies in GNDF and GAS1 proteins. GDNF and neurturin (NTN) receptors are potent survival factors for sympathetic, sensory and central nervous system neurons. GDNF and neurturin promote neuronal survival by signaling through similar multicomponent receptors that consist of a common receptor tyrosine kinase and a member of a GPI-linked family of receptors that determines ligand specificity.
  • pfam GDNF 83aa 2e-11 in ref transcript
  • pfam GDNF 80aa 9e-10 in ref transcript

GGNBP1

  • rs.GGNBP1.F1 rs.GGNBP1.R1 107 170
  • NCBIGene 36.3 449520
  • Alternative 3-prime, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001724455

GGT2

  • rs.GGT2.F1 rs.GGT2.R1 108 123
  • NCBIGene 36.3 728441
  • Alternative 5-prime and 3-prime, size difference: 15
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001129377

  • Changed! pfam G_glu_transpept 513aa 1e-162 in ref transcript
    • Gamma-glutamyltranspeptidase.
  • Changed! COG Ggt 548aa 1e-116 in ref transcript
    • Gamma-glutamyltransferase [Amino acid transport and metabolism].
  • Changed! pfam G_glu_transpept 508aa 1e-162 in modified transcript
  • Changed! COG Ggt 543aa 1e-117 in modified transcript

GGTLA1

  • rs.GGTLA1.F1 rs.GGTLA1.R1 205 301
  • NCBIGene 36.3 2687
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001099781

  • Changed! pfam G_glu_transpept 523aa 1e-143 in ref transcript
    • Gamma-glutamyltranspeptidase.
  • Changed! COG Ggt 546aa 3e-86 in ref transcript
    • Gamma-glutamyltransferase [Amino acid transport and metabolism].
  • Changed! pfam G_glu_transpept 491aa 1e-130 in modified transcript
  • Changed! COG Ggt 514aa 2e-78 in modified transcript

GH1

  • rs.GH1.F1 rs.GH1.R1 167 212
  • NCBIGene 36.3 2688
  • Alternative 3-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000515

  • Changed! pfam Hormone_1 207aa 1e-48 in ref transcript
    • Somatotropin hormone family.
  • Changed! pfam Hormone_1 192aa 2e-44 in modified transcript

GIPC1

  • rs.GIPC1.F1 rs.GIPC1.R1 165 221
  • NCBIGene 36.3 10755
  • Single exon skipping, size difference: 56
  • Exclusion in 5'UTR
  • Reference transcript: NM_005716

  • cd PDZ_signaling 79aa 8e-08 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 82aa 5e-09 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.

GIT1

  • rs.GIT1.F1 rs.GIT1.R1 92 119
  • NCBIGene 36.3 28964
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001085454

  • cd ANK 98aa 6e-13 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • smart ArfGap 119aa 3e-36 in ref transcript
    • Putative GTP-ase activating proteins for the small GTPase, ARF. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • pfam GIT_SHD 31aa 3e-07 in ref transcript
    • Spa2 homology domain (SHD) of GIT. GIT proteins are signaling integrators with GTPase-activating function which may be involved in the organisation of the cytoskeletal matrix assembled at active zones (CAZ). The function of the CAZ might be to define sites of neurotransmitter release. Mutations in the Spa2 homology domain (SHD) domain of GIT1 described here interfere with the association of GIT1 with Piccolo, beta-PIX, and focal adhesion kinase.
  • pfam GIT_SHD 22aa 0.001 in ref transcript
  • COG COG5347 115aa 3e-14 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].
  • COG Arp 185aa 1e-06 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].

GLRA3

  • rs.GLRA3.F1 rs.GLRA3.R1 119 164
  • NCBIGene 36.3 8001
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006529

  • Changed! TIGR LIC 440aa 1e-144 in ref transcript
    • selective while glycine receptors are anion selective).
  • Changed! TIGR LIC 425aa 1e-145 in modified transcript

GLT8D3

  • rs.GLT8D3.F1 rs.GLT8D3.R1 231 324
  • NCBIGene 36.3 283464
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_173601

  • cd GT8_like_2 302aa 1e-178 in ref transcript
    • GT8_like_2 represents a subfamily of GT8 with unknown function. A subfamily of glycosyltransferase family 8 with unknown function: Glycosyltransferase family 8 comprises enzymes with a number of known activities; lipopolysaccharide galactosyltransferase lipopolysaccharide glucosyltransferase 1, glycogenin glucosyltransferase and inositol 1-alpha-galactosyltransferase. It is classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed.
  • pfam Glyco_transf_8 165aa 2e-11 in ref transcript
    • Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
  • COG RfaJ 244aa 1e-09 in ref transcript
    • Lipopolysaccharide biosynthesis proteins, LPS:glycosyltransferases [Cell envelope biogenesis, outer membrane].

GMEB1

  • rs.GMEB1.F1 rs.GMEB1.R1 245 278
  • NCBIGene 36.3 10691
  • Alternative 3-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006582

  • pfam SAND 81aa 2e-30 in ref transcript
    • SAND domain. The DNA binding activity of two proteins has been mapped to the SAND domain. The conserved KDWK motif is necessary for DNA binding, and it appears to be important for dimerisation.

GNG4

  • rs.GNG4.F1 rs.GNG4.R1 100 212
  • NCBIGene 36.3 2786
  • Single exon skipping, size difference: 112
  • Exclusion in 5'UTR
  • Reference transcript: NM_001098721

GOLSYN

  • rs.GOLSYN.F1 rs.GOLSYN.R1 138 259
  • NCBIGene 36.3 55638
  • Single exon skipping, size difference: 121
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099750

GOLSYN

  • rs.GOLSYN.F2 rs.GOLSYN.R2 264 550
  • NCBIGene 36.3 55638
  • Multiple exon skipping, size difference: 286
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001099745

GOSR1

  • rs.GOSR1.F1 rs.GOSR1.R1 100 122
  • NCBIGene 36.3 9527
  • Alternative 5-prime, size difference: 22
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_004871

  • Changed! pfam V-SNARE 156aa 7e-32 in ref transcript
    • Vesicle transport v-SNARE protein. V-SNARE proteins are required for protein traffic between eukaryotic organelles. The v-SNAREs on transport vesicles interact with t-SNAREs on target membranes in order to facilitate this.

GPR172B

  • rs.GPR172B.F1 rs.GPR172B.R1 148 182
  • NCBIGene 36.3 55065
  • Alternative 3-prime, size difference: 34
  • Inclusion in 5'UTR
  • Reference transcript: NM_017986

  • pfam DUF1011 95aa 1e-29 in ref transcript
    • Protein of unknown function (DUF1011). Family of uncharacterised eukaryotic proteins.

GPR18

  • rs.GPR18.F1 rs.GPR18.R1 100 292
  • NCBIGene 36.3 2841
  • Single exon skipping, size difference: 192
  • Exclusion in 5'UTR
  • Reference transcript: NM_005292

  • pfam 7tm_1 242aa 7e-25 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

GPR34

  • rs.GPR34.F1 rs.GPR34.R1 256 332
  • NCBIGene 36.3 2857
  • Alternative 3-prime, size difference: 3
  • Inclusion in 5'UTR
  • Reference transcript: NM_005300

  • pfam 7tm_1 251aa 6e-29 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

GPR64

  • rs.GPR64.F1 rs.GPR64.R1 134 176
  • NCBIGene 36.3 10149
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001079858

  • pfam 7tm_2 262aa 3e-62 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • smart GPS 51aa 1e-10 in ref transcript
    • G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.

GPR89A

  • rs.GPR89A.F1 rs.GPR89A.R1 104 124
  • NCBIGene 36.3 653519
  • Alternative 3-prime, size difference: 20
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001097612

GPR98

  • rs.GPR98.F1 rs.GPR98.R1 132 215
  • NCBIGene 36.3 84059
  • Alternative 5-prime, size difference: 83
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_032119

  • cd PTX 156aa 5e-06 in ref transcript
    • Pentraxins are plasma proteins characterized by their pentameric discoid assembly and their Ca2+ dependent ligand binding, such as Serum amyloid P component (SAP) and C-reactive Protein (CRP), which are cytokine-inducible acute-phase proteins implicated in innate immunity. CRP binds to ligands containing phosphocholine, SAP binds to amyloid fibrils, DNA, chromatin, fibronectin, C4-binding proteins and glycosaminoglycans. "Long" pentraxins have N-terminal extensions to the common pentraxin domain; one group, the neuronal pentraxins, may be involved in synapse formation and remodeling, and they may also be able to form heteromultimers.
  • Changed! TIGR caca 203aa 1e-17 in ref transcript
    • This Hmm is specific for the eukaryotic sodium ion/calcium ion exchangers of the Caca family.
  • TIGR caca 219aa 2e-15 in ref transcript
  • TIGR caca 183aa 3e-13 in ref transcript
  • TIGR caca 208aa 1e-12 in ref transcript
  • Changed! TIGR caca 213aa 2e-12 in ref transcript
  • TIGR caca 207aa 1e-11 in ref transcript
  • TIGR caca 184aa 1e-11 in ref transcript
  • TIGR caca 167aa 2e-11 in ref transcript
  • Changed! TIGR caca 161aa 3e-11 in ref transcript
  • Changed! TIGR caca 184aa 7e-11 in ref transcript
  • Changed! TIGR caca 162aa 1e-10 in ref transcript
  • Changed! TIGR caca 142aa 1e-10 in ref transcript
  • Changed! pfam 7tm_2 245aa 2e-10 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • Changed! TIGR caca 213aa 5e-10 in ref transcript
  • TIGR caca 234aa 2e-07 in ref transcript
  • TIGR caca 191aa 2e-07 in ref transcript
  • Changed! TIGR caca 137aa 2e-05 in ref transcript
  • Changed! pfam Calx-beta 94aa 5e-05 in ref transcript
    • Calx-beta domain.
  • Changed! TIGR caca 184aa 1e-04 in ref transcript
  • Changed! TIGR caca 169aa 2e-04 in ref transcript
  • Changed! TIGR caca 95aa 2e-04 in ref transcript
  • TIGR caca 158aa 2e-04 in ref transcript
  • smart LamGL 137aa 7e-04 in ref transcript
    • LamG-like jellyroll fold domain.
  • Changed! pfam EPTP 44aa 9e-04 in ref transcript
    • EPTP domain. Mutations in the LGI/Epitempin gene can result in a special form of epilepsy, autosomal dominant lateral temporal epilepsy. The Epitempin protein contains a large repeat in its C terminal section. The architecture and structural features of this repeat make it a likely member 7-bladed beta-propeller fold.
  • Changed! pfam GPS 43aa 0.002 in ref transcript
    • Latrophilin/CL-1-like GPS domain. Domain present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • TIGR caca 223aa 0.004 in ref transcript
  • Changed! TIGR caca 145aa 3e-09 in modified transcript

GPRASP1

  • rs.GPRASP1.F1 rs.GPRASP1.R1 162 253
  • NCBIGene 36.3 9737
  • Single exon skipping, size difference: 91
  • Exclusion in 5'UTR
  • Reference transcript: NM_014710

  • pfam DUF634 224aa 1e-58 in ref transcript
    • Protein of unknown function (DUF634). Mammalian protein of unknown function.

GPS1

  • rs.GPS1.F1 rs.GPS1.R1 100 112
  • NCBIGene 36.3 2873
  • Alternative 3-prime, size difference: 12
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_212492

  • pfam RPN7 183aa 2e-54 in ref transcript
    • 26S proteasome subunit RPN7. RPN7 (known as the non ATPase regulatory subunit 6 in higher eukaryotes) is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae to be required for structural integrity. The 26S proteasome is is involved in the ATP-dependent degradation of ubiquitinated proteins.
  • pfam PCI 105aa 7e-19 in ref transcript
    • PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15).
  • COG RPN7 300aa 5e-16 in ref transcript
    • 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones].

GRIA2

  • rs.GRIA2.F1 rs.GRIA2.R1 100 505
  • NCBIGene 36.3 2891
  • Alternative 5-prime, size difference: 405
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001083619

  • Changed! cd PBP1_iGluR_AMPA_GluR2 370aa 0.0 in ref transcript
    • N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR2 subunit of the AMPA (alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptor. The AMPA receptor is a member of the glutamate-receptor ion channels (iGluRs) which are the major mediators of excitatory synaptic transmission in the central nervous system. AMPA receptors are composed of four types of subunits (GluR1, GluR2, GluR3, and GluR4) which combine to form a tetramer and play an important role in mediating the rapid excitatory synaptic current. Furthermore, this N-terminal domain of the iGluRs has homology with LIVBP, a bacterial periplasmic biding protein, as well as with the structurally related glutamate-binding domain of the G-protein-coupled metabotropic receptors (mGluRs).
  • cd PBPb 112aa 3e-13 in ref transcript
    • Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
  • cd PBPb 137aa 1e-04 in ref transcript
  • pfam Lig_chan 275aa 3e-98 in ref transcript
    • Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
  • pfam ANF_receptor 324aa 2e-39 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • pfam Lig_chan-Glu_bd 66aa 6e-25 in ref transcript
    • Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
  • COG HisJ 96aa 5e-07 in ref transcript
    • ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
  • PRK glnH 45aa 7e-04 in ref transcript
    • glutamine ABC transporter periplasmic protein; Reviewed.
  • COG LivK 306aa 0.002 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
  • Changed! cd PBP1_iGluR_AMPA_GluR2 350aa 0.0 in modified transcript

GRIA4

  • rs.GRIA4.F1 rs.GRIA4.R1 357 470
  • NCBIGene 36.3 2893
  • Alternative 5-prime, size difference: 113
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_000829

  • cd PBP1_iGluR_AMPA_GluR4 371aa 0.0 in ref transcript
    • N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the GluR4 subunit of the AMPA (alpha-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid) receptor. The AMPA receptor is a member of the glutamate-receptor ion channels (iGluRs) which are the major mediators of excitatory synaptic transmission in the central nervous system. AMPA receptors are composed of four types of subunits (GluR1, GluR2, GluR3, and GluR4) which combine to form a tetramer and play an important role in mediating the rapid excitatory synaptic current. Furthermore, this N-terminal domain of the iGluRs has homology with LIVBP, a bacterial periplasmic biding protein, as well as with the structurally related glutamate-binding domain of the G-protein-coupled metabotropic receptors (mGluRs).
  • cd PBPb 118aa 6e-13 in ref transcript
    • Bacterial periplasmic transport systems use membrane-bound complexes and substrate-bound, membrane-associated, periplasmic binding proteins (PBPs) to transport a wide variety of substrates, such as, amino acids, peptides, sugars, vitamins and inorganic ions. PBPs have two cell-membrane translocation functions: bind substrate, and interact with the membrane bound complex. A diverse group of periplasmic transport receptors for lysine/arginine/ornithine (LAO), glutamine, histidine, sulfate, phosphate, molybdate, and methanol are included in the PBPb CD.
  • pfam Lig_chan 275aa 1e-99 in ref transcript
    • Ligand-gated ion channel. This family includes the four transmembrane regions of the ionotropic glutamate receptors and NMDA receptors.
  • pfam ANF_receptor 338aa 5e-50 in ref transcript
    • Receptor family ligand binding region. This family includes extracellular ligand binding domains of a wide range of receptors. This family also includes the bacterial amino acid binding proteins of known structure.
  • pfam Lig_chan-Glu_bd 66aa 1e-24 in ref transcript
    • Ligated ion channel L-glutamate- and glycine-binding site. This region, sometimes called the S1 domain, is the luminal domain just upstream of the first, M1, transmembrane region of transmembrane ion-channel proteins, and it binds L-glutamate and glycine. It is found in association with Lig_chan, pfam00060.
  • COG HisJ 126aa 6e-08 in ref transcript
    • ABC-type amino acid transport/signal transduction systems, periplasmic component/domain [Amino acid transport and metabolism / Signal transduction mechanisms].
  • COG LivK 329aa 6e-07 in ref transcript
    • ABC-type branched-chain amino acid transport systems, periplasmic component [Amino acid transport and metabolism].
  • PRK glnH 45aa 0.001 in ref transcript
    • glutamine ABC transporter periplasmic protein; Reviewed.

GRSF1

  • rs.GRSF1.F1 rs.GRSF1.R1 112 355
  • NCBIGene 36.3 2926
  • Alternative 5-prime, size difference: 243
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_002092

  • cd RRM 73aa 6e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 69aa 1e-07 in ref transcript
  • cd RRM 71aa 2e-05 in ref transcript
  • smart RRM 68aa 6e-08 in ref transcript
    • RNA recognition motif.
  • smart RRM 69aa 6e-06 in ref transcript
  • smart RRM_2 69aa 7e-06 in ref transcript
    • RNA recognition motif.
  • TIGR PABP-1234 120aa 6e-04 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.

GSDML

  • rs.GSDML.F1 rs.GSDML.R1 115 142
  • NCBIGene 36.3 55876
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042471

  • Changed! pfam Gasdermin 369aa 1e-101 in ref transcript
    • Gasdermin family. The precise function of this protein is unknown. A deletion/insertion mutation is associated with an autosomal dominant non-syndromic hearing impairment form. In addition, this protein has also been found to contribute to acquired etoposide resistance in melanoma cells. This family also includes the gasdermin protein.
  • Changed! pfam Gasdermin 360aa 1e-101 in modified transcript

GSTCD

  • rs.GSTCD.F1 rs.GSTCD.R1 164 425
  • NCBIGene 36.3 79807
  • Alternative 5-prime, size difference: 261
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031720

  • cd GST_C_family 72aa 4e-05 in ref transcript
    • Glutathione S-transferase (GST) family, C-terminal alpha helical domain; a large, diverse group of cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. In addition, GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. This family, also referred to as soluble GSTs, is the largest family of GSH transferases and is only distantly related to the mitochondrial GSTs (GSTK). Soluble GSTs bear no structural similarity to microsomal GSTs (MAPEG family) and display additional activities unique to their group, such as catalyzing thiolysis, reduction and isomerization of certain compounds. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Based on sequence similarity, different classes of GSTs have been identified, which display varying tissue distribution, substrate specificities and additional specific activities. In humans, GSTs display polymorphisms which may influence individual susceptibility to diseases such as cancer, arthritis, allergy and sclerosis. Some GST family members with non-GST functions include glutaredoxin 2, the CLIC subfamily of anion channels, prion protein Ure2p, crystallins, metaxins and stringent starvation protein A.
  • TIGR RF_mod_HemK 69aa 6e-05 in ref transcript
    • Members of this protein family are HemK, a protein once thought to be involved in heme biosynthesis but now recognized to be a protein-glutamine methyltransferase that modifies the peptide chain release factors. All members of the seed alignment are encoded next to the release factor 1 gene (prfA) and confirmed by phylogenetic analysis. However, the family is diverse enough that even many members of the seed alignment do not score above the seed alignment, which was set high enough to exclude all instances of PrmB.
  • COG UbiE 65aa 9e-06 in ref transcript
    • Methylase involved in ubiquinone/menaquinone biosynthesis [Coenzyme metabolism].

GSTM1

  • rs.GSTM1.F1 rs.GSTM1.R1 243 354
  • NCBIGene 36.3 2944
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000561

  • Changed! cd GST_C_Mu 121aa 3e-41 in ref transcript
    • GST_C family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the most abundant GSTs in human liver, skeletal muscle and brain, and are believed to provide protection against diseases including cancer and neurodegenerative disorders. Some isoenzymes have additional specific functions. Human GST M1-1 acts as an endogenous inhibitor of ASK1 (apoptosis signal-regulating kinase 1) thereby suppressing ASK1-mediated cell death. Human GSTM2-2 and 3-3 have been identified as prostaglandin E2 synthases in the brain and may play crucial roles in temperature and sleep-wake regulation.
  • cd GST_N_Mu 82aa 3e-38 in ref transcript
    • GST_N family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the most abundant GSTs in human liver, skeletal muscle and brain, and are believed to provide protection against diseases including cancer and neurodegenerative disorders. Some isoenzymes have additional specific functions. Human GST M1-1 acts as an endogenous inhibitor of ASK1 (apoptosis signal-regulating kinase 1), thereby suppressing ASK1-mediated cell death. Human GSTM2-2 and 3-3 have been identified as prostaglandin E2 synthases in the brain and may play crucial roles in temperature and sleep-wake regulation.
  • pfam GST_N 80aa 1e-17 in ref transcript
    • Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation factors 1-gamma. Not known to have GST activity; similarity not previously recognised. * HSP26 family of stress-related proteins. including auxin-regulated proteins in plants and stringent starvation proteins in Escherichia coli. Not known to have GST activity. Similarity not previously recognised. The glutathione molecule binds in a cleft between N and C-terminal domains - the catalytically important residues are proposed to reside in the N-terminal domain.
  • Changed! pfam GST_C 51aa 5e-10 in ref transcript
    • Glutathione S-transferase, C-terminal domain. GST conjugates reduced glutathione to a variety of targets including S-crystallin from squid, the eukaryotic elongation factor 1-gamma, the HSP26 family of stress-related proteins and auxin-regulated proteins in plants. Stringent starvation proteins in Escherichia coli are also included in the alignment but are not known to have GST activity. The glutathione molecule binds in a cleft between N and C-terminal domains. The catalytically important residues are proposed to reside in the N-terminal domain. In plants, GSTs are encoded by a large gene family (48 GST genes in Arabidopsis) and can be divided into the phi, tau, theta, zeta, and lambda classes.
  • Changed! PTZ PTZ00057 201aa 3e-09 in ref transcript
    • glutathione s-transferase; Provisional.
  • Changed! cd GST_C_Mu 84aa 4e-16 in modified transcript
  • Changed! PTZ PTZ00057 109aa 8e-07 in modified transcript

GTF2I

  • rs.GTF2I.F1 rs.GTF2I.R1 110 170
  • NCBIGene 36.3 2969
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032999

  • pfam GTF2I 73aa 6e-31 in ref transcript
    • GTF2I-like repeat. This region of sequence similarity is found up to six times in a variety of proteins including GTF2I. It has been suggested that this may be a DNA binding domain.
  • pfam GTF2I 73aa 1e-30 in ref transcript
  • pfam GTF2I 73aa 4e-30 in ref transcript
  • pfam GTF2I 73aa 1e-29 in ref transcript
  • pfam GTF2I 73aa 3e-26 in ref transcript
  • pfam GTF2I 73aa 3e-26 in ref transcript

GTPBP10

  • rs.GTPBP10.F1 rs.GTPBP10.R1 127 364
  • NCBIGene 36.3 85865
  • Multiple exon skipping, size difference: 237
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_033107

  • Changed! cd Obg 195aa 2e-58 in ref transcript
    • Obg subfamily. The Obg nucleotide binding protein subfamily has been implicated in stress response, chromosome partitioning, replication initiation, mycelium development, and sporulation. Obg proteins are among a large group of GTP binding proteins conserved from bacteria to humans. The E. coli homolog, ObgE is believed to function in ribosomal biogenesis. Members of the subfamily contain two equally and highly conserved domains, a C-terminal GTP binding domain and an N-terminal glycine-rich domain.
  • Changed! TIGR Obg_CgtA 330aa 5e-73 in ref transcript
    • This model describes a univeral, mostly one-gene-per-genome GTP-binding protein that associates with ribosomal subunits and appears to play a role in ribosomal RNA maturation. This GTPase, related to the nucleolar protein Obg, is designated CgtA in bacteria. Mutations in this gene are pleiotropic, but it appears that effects on cellular functions such as chromosome partition may be secondary to the effect on ribosome structure. Recent work done in Vibrio cholerae shows an essential role in the stringent response, in which RelA-dependent ability to synthesize the alarmone ppGpp is required for deletion of this GTPase to be lethal.
  • Changed! PRK obgE 335aa 1e-72 in ref transcript
    • GTPase ObgE; Reviewed.
  • Changed! cd Obg 188aa 3e-54 in modified transcript
  • Changed! TIGR Obg_CgtA 242aa 3e-46 in modified transcript
  • Changed! PRK obgE 247aa 1e-47 in modified transcript

GYG2

  • rs.GYG2.F1 rs.GYG2.R1 307 400
  • NCBIGene 36.3 8908
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003918

  • cd GT8_Glycogenin 252aa 2e-87 in ref transcript
    • Glycogenin belongs the GT 8 family and initiates the biosynthesis of glycogen. Glycogenin initiates the biosynthesis of glycogen by incorporating glucose residues through a self-glucosylation reaction at a Tyr residue, and then acts as substrate for chain elongation by glycogen synthase and branching enzyme. It contains a conserved DxD motif and an N-terminal beta-alpha-beta Rossmann-like fold that are common to the nucleotide-binding domains of most glycosyltransferases. The DxD motif is essential for coordination of the catalytic divalent cation, most commonly Mn2+. Glycogenin can be classified as a retaining glycosyltransferase, based on the relative anomeric stereochemistry of the substrate and product in the reaction catalyzed. It is placed in glycosyltransferase family 8 which includes lipopolysaccharide glucose and galactose transferases and galactinol synthases.
  • pfam Glyco_transf_8 217aa 2e-51 in ref transcript
    • Glycosyl transferase family 8. This family includes enzymes that transfer sugar residues to donor molecules. Members of this family are involved in lipopolysaccharide biosynthesis and glycogen synthesis. This family includes Lipopolysaccharide galactosyltransferase, lipopolysaccharide glucosyltransferase 1, and glycogenin glucosyltransferase.
  • COG COG5597 175aa 1e-11 in ref transcript
    • Alpha-N-acetylglucosamine transferase [Cell envelope biogenesis, outer membrane].

Gcom1

  • rs.Gcom1.F1 rs.Gcom1.R1 135 340
  • NCBIGene 36.3 145781
  • Single exon skipping, size difference: 205
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001018090

  • TIGR SMC_prok_B 250aa 2e-08 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 266aa 4e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

HAP1

  • rs.HAP1.F1 rs.HAP1.R1 102 126
  • NCBIGene 36.3 9001
  • Alternative 5-prime, size difference: 24
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_003949

  • Changed! pfam HAP1_N 354aa 1e-71 in ref transcript
    • HAP1 N-terminal conserved region. This family represents an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntington's disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34.
  • COG Smc 148aa 4e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! pfam HAP1_N 362aa 3e-71 in modified transcript

HEY1

  • rs.HEY1.F1 rs.HEY1.R1 103 115
  • NCBIGene 36.3 23462
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040708

  • Changed! cd HLH 46aa 9e-06 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • smart ORANGE 47aa 5e-09 in ref transcript
    • Orange domain. This domain confers specificity among members of the Hairy/E(SPL) family.
  • Changed! pfam HLH 44aa 6e-07 in ref transcript
    • Helix-loop-helix DNA-binding domain.
  • Changed! cd HLH 42aa 3e-06 in modified transcript
  • Changed! pfam HLH 40aa 2e-07 in modified transcript

HGF

  • rs.HGF.F1 rs.HGF.R1 110 125
  • NCBIGene 36.3 3082
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000601

  • cd Tryp_SPc 225aa 2e-51 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! cd KR 82aa 6e-28 in ref transcript
    • Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
  • cd KR 82aa 9e-28 in ref transcript
  • cd KR 83aa 9e-27 in ref transcript
  • cd KR 83aa 5e-25 in ref transcript
  • cd PAN_APPLE 84aa 8e-20 in ref transcript
    • PAN/APPLE-like domain; present in N-terminal (N) domains of plasminogen/ hepatocyte growth factor proteins, plasma prekallikrein/coagulation factor XI and microneme antigen proteins, plant receptor-like protein kinases, and various nematode and leech anti-platelet proteins. Common structural features include two disulfide bonds that link the alpha-helix to the central region of the protein. PAN domains have significant functional versatility, fulfilling diverse biological functions by mediating protein-protein or protein-carbohydrate interactions.
  • smart Tryp_SPc 223aa 3e-56 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • pfam Kringle 78aa 2e-32 in ref transcript
    • Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.
  • pfam Kringle 79aa 6e-31 in ref transcript
  • Changed! smart KR 83aa 1e-30 in ref transcript
    • Kringle domain. Named after a Danish pastry. Found in several serine proteases and in ROR-like receptors. Can occur in up to 38 copies (in apolipoprotein(a)). Plasminogen-like kringles possess affinity for free lysine and lysine- containing peptides.
  • pfam Kringle 79aa 2e-26 in ref transcript
  • smart PAN_AP 86aa 4e-07 in ref transcript
    • divergent subfamily of APPLE domains. Apple-like domains present in Plasminogen, C. elegans hypothetical ORFs and the extracellular portion of plant receptor-like protein kinases. Predicted to possess protein- and/or carbohydrate-binding functions.
  • COG COG5640 234aa 3e-10 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd KR 77aa 7e-25 in modified transcript
  • Changed! smart KR 78aa 2e-27 in modified transcript

HIF1A

  • rs.HIF1A.F1 rs.HIF1A.R1 262 389
  • NCBIGene 36.3 3091
  • Single exon skipping, size difference: 127
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001530

  • cd PAS 100aa 1e-12 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd PAS 55aa 4e-07 in ref transcript
  • pfam PAS_3 86aa 1e-15 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • Changed! pfam HIF-1a_CTAD 40aa 1e-14 in ref transcript
    • HIF-1 alpha C terminal transactivation domain. Hypoxia inducible factor-1 alpha (HIF-1 alpha) is the regulatory subunit of the heterodimeric transcription factor HIF-1. It plays a key role in cellular response to low oxygen tension. This region corresponds to the C terminal transactivation domain.
  • smart PAS 56aa 8e-08 in ref transcript
    • PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).

HIG2

  • rs.HIG2.F1 rs.HIG2.R1 252 366
  • NCBIGene 36.3 29923
  • Alternative 5-prime, size difference: 114
  • Exclusion in 5'UTR
  • Reference transcript: NM_013332

HIPK3

  • rs.HIPK3.F1 rs.HIPK3.R1 180 243
  • NCBIGene 36.3 10114
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005734

  • cd S_TKc 224aa 3e-51 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd S_TKc 63aa 0.007 in ref transcript
  • smart S_TKc 202aa 3e-52 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • smart S_TKc 30aa 0.007 in ref transcript
  • PTZ PTZ00284 310aa 8e-34 in ref transcript
    • protein kinase; Provisional.

HLA-F

  • rs.HLA-F.F1 rs.HLA-F.R1 137 413
  • NCBIGene 36.3 3134
  • Single exon skipping, size difference: 276
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098479

  • Changed! cd IGc 92aa 2e-19 in ref transcript
    • Immunoglobulin domain constant region subfamily; members of the IGc subfamily are components of immunoglobulins, T-cell receptors, CD1 cell surface glycoproteins, secretory glycoproteins A/C, and Major Histocompatibility Complex (MHC) class I/II molecules. In immunoglobulins, each chain is composed of one variable domain (IGv) and one or more constant domains (IGc); these names reflect the fact that the variability in sequences is higher in the variable domain than in the constant domain. T-cell receptors form heterodimers, pairing two chains (alpha/beta or gamma/delta), each with a IGv and IGc domain. MHCs form heterodimers pairing two chains (alpha/beta or delta/epsilon), each with a MHC and IGc domain. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam MHC_I 178aa 9e-84 in ref transcript
    • Class I Histocompatibility antigen, domains alpha 1 and 2.
  • Changed! pfam C1-set 81aa 3e-22 in ref transcript
    • Immunoglobulin C1-set domain.

HMGA1

  • rs.HMGA1.F1 rs.HMGA1.R1 145 178
  • NCBIGene 36.3 3159
  • Alternative 5-prime, size difference: 33
  • Exclusion in 5'UTR
  • Reference transcript: NM_145899

HMGA1

  • rs.HMGA1.F2 rs.HMGA1.R2 194 308
  • NCBIGene 36.3 3159
  • Single exon skipping, size difference: 114
  • Exclusion in 5'UTR
  • Reference transcript: NM_002131

HMGCLL1

  • rs.HMGCLL1.F1 rs.HMGCLL1.R1 319 409
  • NCBIGene 36.3 54511
  • Single exon skipping, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_019036

  • pfam HMGL-like 271aa 3e-58 in ref transcript
    • HMGL-like. This family contains a diverse set of enzymes. These include various aldolases and a region of pyruvate carboxylase.
  • PRK PRK05692 286aa 1e-132 in ref transcript
    • hydroxymethylglutaryl-CoA lyase; Provisional.

HMGCS1

  • rs.HMGCS1.F1 rs.HMGCS1.R1 328 387
  • NCBIGene 36.3 3157
  • Single exon skipping, size difference: 59
  • Exclusion in 5'UTR
  • Reference transcript: NM_001098272

  • cd init_cond_enzymes 371aa 9e-63 in ref transcript
    • "initiating" condensing enzymes are a subclass of decarboxylating condensing enzymes, including beta-ketoacyl [ACP] synthase, type III and polyketide synthases, type III, which include chalcone synthase and related enzymes. They are characterized by the utlization of CoA substrate primers, as well as the nature of their active site residues.
  • TIGR HMG-CoA-S_euk 457aa 0.0 in ref transcript
    • Hydroxymethylglutaryl(HMG)-CoA synthase is the first step of isopentenyl pyrophosphate (IPP) biosynthesis via the mevalonate pathway. This pathway is found mainly in eukaryotes, but also in archaea and some bacteria. This model is specific for eukaryotes.
  • COG PksG 452aa 2e-75 in ref transcript
    • 3-hydroxy-3-methylglutaryl CoA synthase [Lipid metabolism].

HMMR

  • rs.HMMR.F1 rs.HMMR.R1 138 183
  • NCBIGene 36.3 3161
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012484

  • pfam SMC_N 239aa 3e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! TIGR SMC_prok_A 213aa 9e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • COG Smc 236aa 4e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • PRK PRK05771 196aa 0.002 in ref transcript
    • V-type ATP synthase subunit I; Validated.
  • Changed! TIGR SMC_prok_A 223aa 3e-04 in modified transcript

HN1

  • rs.HN1.F1 rs.HN1.R1 124 143
  • NCBIGene 36.3 51155
  • Single exon skipping, size difference: 19
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001002032

HNF4A

  • rs.HNF4A.F1 rs.HNF4A.R1 123 153
  • NCBIGene 36.3 3172
  • Alternative 5-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000457

  • cd NR_LBD_HNF4_like 223aa 1e-120 in ref transcript
    • The ligand binding domain of heptocyte nuclear factor 4, which is explosively expanded in nematodes. The ligand binding domain of hepatocyte nuclear factor 4 (HNF4) like proteins: HNF4 is a member of the nuclear receptor superfamily. HNF4 plays a key role in establishing and maintenance of hepatocyte differentiation in the liver. It is also expressed in gut, kidney, and pancreatic beta cells. HNF4 was originally classified as an orphan receptor, but later it is found that HNF4 binds with very high affinity to a variety of fatty acids. However, unlike other nuclear receptors, the ligands do not act as a molecular switch for HNF4. They seem to constantly bind to the receptor, which is constitutively active as a transcription activator. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, HNF4 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD). The LBD domain is also responsible for recruiting co-activator proteins. More than 280 nuclear receptors are found in C. ele gans, most of which are originated from an explosive burst of duplications of HNF4.
  • pfam Hormone_recep 182aa 2e-41 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • pfam zf-C4 75aa 2e-38 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.

HNRNPC

  • rs.HNRNPC.F1 rs.HNRNPC.R1 168 207
  • NCBIGene 36.3 3183
  • Alternative 5-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_031314

  • cd RRM 68aa 1e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 67aa 5e-10 in ref transcript
    • RNA recognition motif.

HNRNPC

  • rs.HNRNPC.F2 rs.HNRNPC.R2 144 170
  • NCBIGene 36.3 3183
  • Single exon skipping, size difference: 26
  • Exclusion in 5'UTR
  • Reference transcript: NM_031314

  • cd RRM 68aa 1e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 67aa 5e-10 in ref transcript
    • RNA recognition motif.

HNRNPR

  • rs.HNRNPR.F1 rs.HNRNPR.R1 147 313
  • NCBIGene 36.3 10236
  • Single exon skipping, size difference: 166
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001102398

  • cd RRM 73aa 1e-11 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 56aa 1e-11 in ref transcript
  • cd RRM 82aa 2e-08 in ref transcript
  • TIGR hnRNP-R-Q 313aa 1e-156 in ref transcript
    • Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
  • TIGR hnRNP-R-Q 30aa 3e-05 in ref transcript
  • COG COG0724 65aa 3e-06 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 138aa 7e-06 in ref transcript

HNRPF

  • rs.HNRPF.F1 rs.HNRPF.R1 232 273
  • NCBIGene 36.3 3185
  • Alternative 5-prime, size difference: 41
  • Exclusion in 5'UTR
  • Reference transcript: NM_004966

  • cd RRM 73aa 3e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 72aa 1e-08 in ref transcript
  • cd RRM 76aa 3e-06 in ref transcript
  • pfam zf-RNPHF 36aa 7e-14 in ref transcript
    • RNPHF zinc finger. This domain is a putative zinc-binding domain (CHHC motif) in RNP H and F. The domain is often associated with pfam00076.
  • smart RRM_2 70aa 7e-10 in ref transcript
    • RNA recognition motif.
  • smart RRM_2 71aa 9e-10 in ref transcript
  • smart RRM_2 73aa 6e-04 in ref transcript
  • TIGR SF-CC1 121aa 0.007 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • COG COG0724 159aa 5e-04 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 57aa 0.002 in ref transcript

HNRPH2

  • rs.HNRPH2.F1 rs.HNRPH2.R1 101 113
  • NCBIGene 36.3 3188
  • Alternative 5-prime, size difference: 12
  • Exclusion in 5'UTR
  • Reference transcript: NM_019597

  • cd RRM 73aa 4e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 72aa 5e-08 in ref transcript
  • pfam zf-RNPHF 34aa 4e-13 in ref transcript
    • RNPHF zinc finger. This domain is a putative zinc-binding domain (CHHC motif) in RNP H and F. The domain is often associated with pfam00076.
  • smart RRM_2 71aa 7e-10 in ref transcript
    • RNA recognition motif.
  • smart RRM_2 70aa 3e-09 in ref transcript

HRAS

  • rs.HRAS.F1 rs.HRAS.R1 176 258
  • NCBIGene 36.3 3265
  • Single exon skipping, size difference: 82
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_005343

  • Changed! cd H_N_K_Ras_like 162aa 8e-91 in ref transcript
    • H-Ras/N-Ras/K-Ras subfamily. H-Ras, N-Ras, and K-Ras4A/4B are the prototypical members of the Ras family. These isoforms generate distinct signal outputs despite interacting with a common set of activators and effectors, and are strongly associated with oncogenic progression in tumor initiation. Mutated versions of Ras that are insensitive to GAP stimulation (and are therefore constitutively active) are found in a significant fraction of human cancers. Many Ras guanine nucleotide exchange factors (GEFs) have been identified. They are sequestered in the cytosol until activation by growth factors triggers recruitment to the plasma membrane or Golgi, where the GEF colocalizes with Ras. Active (GTP-bound) Ras interacts with several effector proteins that stimulate a variety of diverse cytoplasmic signaling activities. Some are known to positively mediate the oncogenic properties of Ras, including Raf, phosphatidylinositol 3-kinase (PI3K), RalGEFs, and Tiam1. Others are proposed to play negative regulatory roles in oncogenesis, including RASSF and NORE/MST1. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • Changed! smart RAS 163aa 6e-80 in ref transcript
    • Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
  • Changed! COG COG1100 165aa 2e-19 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].
  • Changed! cd H_N_K_Ras_like 148aa 1e-81 in modified transcript
  • Changed! smart RAS 147aa 2e-72 in modified transcript
  • Changed! COG COG1100 147aa 2e-17 in modified transcript

HS6ST2

  • rs.HS6ST2.F1 rs.HS6ST2.R1 200 320
  • NCBIGene 36.3 90161
  • Multiple exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077188

  • Changed! pfam HS6ST 353aa 1e-177 in ref transcript
    • Heparan sulfate 6-sulfotransferase (HS6ST). This family consists of several heparan sulfate 6-sulfotransferase (HS6ST) proteins. Heparan sulphate 6- O -sulphotransferase (HS6ST) catalyses the transfer of sulphate from adenosine 3'-phosphate, 5'-phosphosulphate to the 6th position of the N -sulphoglucosamine residue in heparan sulphate.
  • Changed! pfam HS6ST 313aa 0.0 in modified transcript

HSF4

  • rs.HSF4.F1 rs.HSF4.R1 100 114
  • NCBIGene 36.3 3299
  • Alternative 3-prime, size difference: 14
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001040667

  • pfam HSF_DNA-bind 194aa 1e-59 in ref transcript
    • HSF-type DNA-binding.
  • COG HSF1 182aa 1e-29 in ref transcript
    • Heat shock transcription factor [Transcription].

HSF4

  • rs.HSF4.F2 rs.HSF4.R2 111 215
  • NCBIGene 36.3 3299
  • Alternative 3-prime, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001040667

  • pfam HSF_DNA-bind 194aa 1e-59 in ref transcript
    • HSF-type DNA-binding.
  • COG HSF1 182aa 1e-29 in ref transcript
    • Heat shock transcription factor [Transcription].

HSPA8

  • rs.HSPA8.F1 rs.HSPA8.R1 91 550
  • NCBIGene 36.3 3312
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 459
  • Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006597

  • Changed! pfam HSP70 607aa 0.0 in ref transcript
    • Hsp70 protein. Hsp70 chaperones help to fold many proteins. Hsp70 assisted folding involves repeated cycles of substrate binding and release. Hsp70 activity is ATP dependent. Hsp70 proteins are made up of two regions: the amino terminus is the ATPase domain and the carboxyl terminus is the substrate binding region.
  • Changed! PTZ PTZ00009 612aa 0.0 in ref transcript
    • heat shock 70 kDa protein; Provisional.
  • Changed! pfam HSP70 457aa 0.0 in modified transcript
  • Changed! PTZ PTZ00009 462aa 0.0 in modified transcript

HTATIP2

  • rs.HTATIP2.F1 rs.HTATIP2.R1 189 303
  • NCBIGene 36.3 10553
  • Alternative 5-prime, size difference: 114
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001098520

  • Changed! TIGR hemA 97aa 0.004 in ref transcript
    • This enzyme, together with glutamate-1-semialdehyde-2,1-aminomutase (TIGR00713), leads to the production of delta-amino-levulinic acid from Glu-tRNA.
  • COG COG0702 173aa 2e-08 in ref transcript
    • Predicted nucleoside-diphosphate-sugar epimerases [Cell envelope biogenesis, outer membrane / Carbohydrate transport and metabolism].
  • Changed! TIGR hemA 78aa 0.006 in modified transcript

IARS

  • rs.IARS.F1 rs.IARS.R1 127 300
  • NCBIGene 36.3 3376
  • Alternative 5-prime, size difference: 173
  • Exclusion of the protein initiation site
  • Reference transcript: NM_013417

  • cd IleRS_core 235aa 5e-89 in ref transcript
    • This is the catalytic core domain of isoleucine amino-acyl tRNA synthetases (IleRS) . This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. IleRS has an insertion in the core domain, which is subject to both deletions and rearrangements. This editing region hydrolyzes mischarged cognate tRNAs and thus prevents the incorporation of chemically similar amino acids.
  • cd IleRS_core 143aa 2e-68 in ref transcript
  • TIGR ileS 860aa 0.0 in ref transcript
    • The isoleucyl tRNA synthetase (IleS) is a class I amino acyl-tRNA ligase and is particularly closely related to the valyl tRNA synthetase. This model may recognize IleS from every species, including eukaryotic cytosolic and mitochondrial forms.
  • PRK ileS 1029aa 0.0 in ref transcript
    • isoleucyl-tRNA synthetase; Reviewed.

ICA1L

  • rs.ICA1L.F1 rs.ICA1L.R1 246 316
  • NCBIGene 36.3 130026
  • Single exon skipping, size difference: 70
  • Inclusion in 5'UTR
  • Reference transcript: NM_138468

  • cd Arfaptin 205aa 3e-54 in ref transcript
    • Arfaptin domain; arfaptin is a ubiquitously expressed protein implicated in mediating cross-talk between Rac, a member of the Rho family, and Arf small GTPases; Arfaptin binds to GTP-bound Arf1 and Arf6, but binds Rac.GTP and Rac.GDP with similar affinities. Structures of Arfaptin with Rac bound to either GDP or the slowly hydrolysable analogue GMPPNP show that the switch regions adopt similar conformations in both complexes. Arf1 and Arf6 are thought to bind to the same surface as Rac.
  • pfam Arfaptin 206aa 3e-73 in ref transcript
    • Arfaptin-like domain. Arfaptin interacts with ARF1, a small GTPase involved in vesicle budding at the Golgi complex and immature secretory granules. The structure of arfaptin shows that upon binding to a small GTPase, arfaptin forms a an elongated, crescent-shaped dimer of three-helix coiled-coils. The N-terminal region of ICA69 is similar to arfaptin.
  • pfam ICA69 229aa 5e-54 in ref transcript
    • Islet cell autoantigen ICA69, C-terminal domain. This family includes a 69 kD protein which has been identified as an islet cell autoantigen in type I diabetes mellitus. Its precise function is unknown.

ICAM2

  • rs.ICAM2.F1 rs.ICAM2.R1 178 260
  • NCBIGene 36.3 3384
  • Single exon skipping, size difference: 82
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099786

  • pfam ICAM_N 111aa 2e-45 in ref transcript
    • Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signalling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, during the viral attachment process. This family is a family that is part of the Ig superfamily and is therefore related to the family ig (pfam00047).

ICAM2

  • rs.ICAM2.F2 rs.ICAM2.R2 112 229
  • NCBIGene 36.3 3384
  • Alternative 5-prime, size difference: 117
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099786

  • pfam ICAM_N 111aa 2e-45 in ref transcript
    • Intercellular adhesion molecule (ICAM), N-terminal domain. ICAMs normally functions to promote intercellular adhesion and signalling. However, The N-terminal domain of the receptor binds to the rhinovirus 'canyon' surrounding the icosahedral 5-fold axes, during the viral attachment process. This family is a family that is part of the Ig superfamily and is therefore related to the family ig (pfam00047).

IFI6

  • rs.IFI6.F1 rs.IFI6.R1 100 112
  • NCBIGene 36.3 2537
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022873

  • pfam Ifi-6-16 59aa 2e-11 in ref transcript
    • Interferon-induced 6-16 family.

IGFBP3

  • rs.IGFBP3.F1 rs.IGFBP3.R1 100 118
  • NCBIGene 36.3 3486
  • Alternative 5-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001013398

  • cd TY 70aa 3e-10 in ref transcript
    • Thyroglobulin type I repeats.; The N-terminal region of human thyroglobulin contains 11 type-1 repeats TY repeats are proposed to be inhibitors of cysteine proteases.
  • smart IB 78aa 4e-23 in ref transcript
    • Insulin growth factor-binding protein homologues. High affinity binding partners of insulin-like growth factors.
  • pfam Thyroglobulin_1 73aa 7e-15 in ref transcript
    • Thyroglobulin type-1 repeat. Thyroglobulin type 1 repeats are thought to be involved in the control of proteolytic degradation. The domain usually contains six conserved cysteines. These form three disulphide bridges. Cysteines 1 pairs with 2, 3 with 4 and 5 with 6.

IHPK1

  • rs.IHPK1.F1 rs.IHPK1.R1 138 489
  • NCBIGene 36.3 9807
  • Single exon skipping, size difference: 351
  • Exclusion of the protein initiation site
  • Reference transcript: NM_153273

  • Changed! pfam IPK 272aa 2e-68 in ref transcript
    • Inositol polyphosphate kinase. ArgRIII has has been demonstrated to be an inositol polyphosphate kinase.
  • Changed! pfam IPK 261aa 1e-67 in modified transcript

IKZF2

  • rs.IKZF2.F1 rs.IKZF2.R1 158 236
  • NCBIGene 36.3 22807
  • Alternative 5-prime, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016260

  • Changed! COG COG5048 83aa 0.007 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! COG COG5048 88aa 0.008 in modified transcript

IKZF2

  • rs.IKZF2.F2 rs.IKZF2.R2 103 140
  • NCBIGene 36.3 22807
  • Alternative 5-prime, size difference: 37
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016260

  • Changed! COG COG5048 83aa 0.007 in ref transcript
    • FOG: Zn-finger [General function prediction only].

IL17RE

  • rs.IL17RE.F1 rs.IL17RE.R1 154 290
  • NCBIGene 36.3 132014
  • Multiple exon skipping, size difference: 136
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153483

  • Changed! pfam SEFIR 75aa 1e-06 in ref transcript
    • SEFIR domain. This family comprises IL17 receptors (IL17Rs) and SEF proteins. The latter are feedback inhibitors of FGF signalling and are also thought to be receptors. Due to its similarity to the TIR domain (pfam01582), the SEFIR region is thought to be involved in homotypic interactions with other SEFIR/TIR-domain-containing proteins. Thus, SEFs and IL17Rs may be involved in TOLL/IL1R-like signalling pathways.

IL32

  • rs.IL32.F1 rs.IL32.R1 102 125
  • NCBIGene 36.3 9235
  • Alternative 5-prime, size difference: 23
  • Exclusion in 5'UTR
  • Reference transcript: NM_001012632

ILF3

  • rs.ILF3.F1 rs.ILF3.R1 100 543
  • NCBIGene 36.3 3609
  • Alternative 3-prime, size difference: 443
  • Exclusion of the stop codon
  • Reference transcript: NM_004516

  • cd DSRM 64aa 9e-09 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • cd DSRM 46aa 3e-05 in ref transcript
  • smart DZF 255aa 1e-102 in ref transcript
    • domain in DSRM or ZnF_C2H2 domain containing proteins.
  • smart DSRM 65aa 2e-11 in ref transcript
    • Double-stranded RNA binding motif.
  • smart DSRM 47aa 2e-07 in ref transcript
  • PRK rnc 37aa 3e-05 in ref transcript
    • ribonuclease III; Reviewed.
  • PRK rnc 32aa 0.006 in ref transcript

ILK

  • rs.ILK.F1 rs.ILK.R1 145 191
  • NCBIGene 36.3 3611
  • Alternative 3-prime, size difference: 46
  • Inclusion in 5'UTR
  • Reference transcript: NM_001014794

  • cd PTKc 243aa 1e-32 in ref transcript
    • Catalytic Domain of Protein Tyrosine Kinases. Protein Tyrosine Kinase (PTK) family, catalytic domain. This PTKc family is part of a larger superfamily that includes the catalytic domains of protein serine/threonine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. They can be classified into receptor and non-receptor tyr kinases. PTKs play important roles in many cellular processes including, lymphocyte activation, epithelium growth and maintenance, metabolism control, organogenesis regulation, survival, proliferation, differentiation, migration, adhesion, motility, and morphogenesis. Receptor tyr kinases (RTKs) are integral membrane proteins which contain an extracellular ligand-binding region, a transmembrane segment, and an intracellular tyr kinase domain. RTKs are usually activated through ligand binding, which causes dimerization and autophosphorylation of the intracellular tyr kinase catalytic domain, leading to intracellular signaling. Some RTKs are orphan receptors with no known ligands. Non-receptor (or cytoplasmic) tyr kinases are distributed in different intracellular compartments and are usually multi-domain proteins containing a catalytic tyr kinase domain as well as various regulatory domains such as SH3 and SH2. PTKs are usually autoinhibited and require a mechanism for activation. In many PTKs, the phosphorylation of tyr residues in the activation loop is essential for optimal activity. Aberrant expression of PTKs is associated with many development abnormalities and cancers.
  • cd ANK 113aa 2e-27 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • pfam Pkinase_Tyr 254aa 3e-36 in ref transcript
    • Protein tyrosine kinase.
  • pfam Ank 30aa 2e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • pfam Ank 33aa 4e-05 in ref transcript
  • TIGR trp 115aa 0.001 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • COG Arp 139aa 1e-14 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • PTZ PTZ00283 134aa 9e-11 in ref transcript
    • serine/threonine protein kinase; Provisional.

IMMT

  • rs.IMMT.F1 rs.IMMT.R1 142 175
  • NCBIGene 36.3 10989
  • Alternative 3-prime, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006839

  • Changed! pfam Mitofilin 753aa 0.0 in ref transcript
    • Mitochondrial inner membrane protein. Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence (residues 1-43) rich in positive-charged and hydroxylated residues and a membrane anchor domain (residues 47-66). In addition, it has three centrally located coiled coil domains (residues 200-240,280-310 and 400-420).
  • PRK PRK02995 107aa 9e-04 in ref transcript
    • DNA-directed RNA polymerase subunit beta'; Provisional.
  • Changed! pfam Mitofilin 742aa 0.0 in modified transcript

INCENP

  • rs.INCENP.F1 rs.INCENP.R1 107 119
  • NCBIGene 36.3 3619
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040694

  • pfam INCENP_ARK-bind 59aa 2e-19 in ref transcript
    • Inner centromere protein, ARK binding region. This region of the inner centromere protein has been found to be necessary and sufficient for binding to aurora-related kinase. This interaction has been implicated in the coordination of chromosome segregation with cell division in yeast.

INPP4B

  • rs.INPP4B.F1 rs.INPP4B.R1 97 196
  • NCBIGene 36.3 8821
  • Single exon skipping, size difference: 99
  • Exclusion in 5'UTR
  • Reference transcript: NM_003866

IQCB1

  • rs.IQCB1.F1 rs.IQCB1.R1 103 502
  • NCBIGene 36.3 9657
  • Multiple exon skipping, size difference: 399
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001023570

  • Changed! smart IQ 22aa 0.005 in modified transcript
    • Short calmodulin-binding motif containing conserved Ile and Gln residues. Calmodulin-binding motif.

IQCE

  • rs.IQCE.F1 rs.IQCE.R1 255 303
  • NCBIGene 36.3 23288
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152558

  • TIGR SMC_prok_A 282aa 6e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • COG Smc 283aa 2e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

IQCH

  • rs.IQCH.F1 rs.IQCH.R1 102 375
  • NCBIGene 36.3 64799
  • Single exon skipping, size difference: 273
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001031715

IRAK1

  • rs.IRAK1.F1 rs.IRAK1.R1 125 362
  • NCBIGene 36.3 3654
  • Single exon skipping, size difference: 237
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001569

  • cd S_TKc 205aa 4e-35 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 205aa 6e-36 in ref transcript
    • Protein kinase domain.
  • pfam Death 75aa 3e-07 in ref transcript
    • Death domain.
  • COG SPS1 206aa 6e-24 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

IRAK1

  • rs.IRAK1.F2 rs.IRAK1.R2 242 332
  • NCBIGene 36.3 3654
  • Alternative 3-prime, size difference: 90
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001569

  • cd S_TKc 205aa 4e-35 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 205aa 6e-36 in ref transcript
    • Protein kinase domain.
  • pfam Death 75aa 3e-07 in ref transcript
    • Death domain.
  • COG SPS1 206aa 6e-24 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

IRF2BP2

  • rs.IRF2BP2.F1 rs.IRF2BP2.R1 221 269
  • NCBIGene 36.3 359948
  • Alternative 5-prime, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_182972

ITGA6

  • rs.ITGA6.F1 rs.ITGA6.R1 102 232
  • NCBIGene 36.3 3655
  • Single exon skipping, size difference: 130
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001079818

  • pfam Integrin_alpha2 476aa 1e-110 in ref transcript
    • Integrin alpha. This domain is found in integrin alpha and integrin alpha precursors to the C terminus of a number of pfam01839 repeats and to the N-terminus of the pfam00357 cytoplasmic region.
  • smart Int_alpha 52aa 1e-11 in ref transcript
    • Integrin alpha (beta-propellor repeats). Integrins are cell adhesion molecules that mediate cell-extracellular matrix and cell-cell interactions. They contain both alpha and beta subunits. Alpha integrins are proposed to contain a domain containing a 7-fold repeat that adopts a beta-propellor fold. Some of these domains contain an inserted von Willebrand factor type-A domain. Some repeats contain putative calcium-binding sites. The 7-fold repeat domain is homologous to a similar domain in phosphatidylinositol-glycan-specific phospholipase D.
  • smart Int_alpha 55aa 1e-07 in ref transcript
  • smart Int_alpha 62aa 2e-05 in ref transcript

ITGB1

  • rs.ITGB1.F1 rs.ITGB1.R1 113 131
  • NCBIGene 36.3 3688
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033667

  • pfam Integrin_beta 431aa 0.0 in ref transcript
    • Integrin, beta chain. Integrins have been found in animals and their homologues have also been found in cyanobacteria, probably due to horizontal gene transfer. The sequences repeats have been trimmed due to an overlap with EGF.
  • pfam Integrin_B_tail 89aa 4e-24 in ref transcript
    • Integrin beta tail domain. This is the beta tail domain of the Integrin protein. Integrins are receptors which are involved in cell-cell and cell-extracellular matrix interactions.
  • Changed! pfam Integrin_b_cyt 21aa 5e-06 in ref transcript
    • Integrin beta cytoplasmic domain. Integrins are a group of transmembrane proteins which function as extracellular matrix receptors and in cell adhesion. Integrins are ubiquitously expressed and are heterodimeric, each composed of an alpha and beta subunit. Several variations of the the alpha and beta subunits exist, and association of different alpha and beta subunits can have different a different binding specificity. This domain corresponds to the cytoplasmic domain of the beta subunit.
  • Changed! pfam Integrin_b_cyt 20aa 7e-06 in modified transcript

ITPR1

  • rs.ITPR1.F1 rs.ITPR1.R1 201 246
  • NCBIGene 36.3 3708
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001099952

  • pfam Ins145_P3_rec 222aa 4e-95 in ref transcript
    • Inositol 1,4,5-trisphosphate/ryanodine receptor. This domain corresponds to the ligand binding region on inositol 1,4,5-trisphosphate receptor, and the N terminal region of the ryanodine receptor. Both receptors are involved in Ca2+ release. They can couple to the activation of neurotransmitter-gated receptors and voltage-gated Ca2+ channels on the plasma membrane, thus allowing the endoplasmic reticulum discriminate between different types of neuronal activity.
  • pfam RYDR_ITPR 206aa 9e-67 in ref transcript
    • RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3.
  • Changed! pfam MIR 202aa 3e-66 in ref transcript
    • MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
  • pfam RIH_assoc 120aa 4e-55 in ref transcript
    • RyR and IP3R Homology associated. This eukaryotic domain is found in ryanodine receptors (RyR) and inositol 1,4,5-trisphosphate receptors (IP3R) which together form a superfamily of homotetrameric ligand-gated intracellular Ca2+ channels. There seems to be no known function for this domain. Also see the IP3-binding domain pfam01365 and pfam02815.
  • pfam RYDR_ITPR 180aa 8e-55 in ref transcript
  • pfam Ion_trans 224aa 5e-06 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • COG PMT1 142aa 0.004 in ref transcript
    • Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam MIR 187aa 5e-68 in modified transcript

ITSN2

  • rs.ITSN2.F1 rs.ITSN2.R1 426 507
  • NCBIGene 36.3 50618
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006277

  • cd RhoGEF 184aa 2e-35 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases; Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains.
  • cd C2 89aa 2e-18 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • cd EH 67aa 3e-17 in ref transcript
    • Eps15 homology domain; found in proteins implicated in endocytosis, vesicle transport, and signal transduction. The alignment contains a pair of EF-hand motifs, typically one of them is canonical and binds to Ca2+, while the other may not bind to Ca2+. A hydrophobic binding pocket is formed by residues from both EF-hand motifs. The EH domain binds to proteins containing NPF (class I), [WF]W or SWG (class II), or H[TS]F (class III) sequence motifs.
  • cd EH 58aa 2e-14 in ref transcript
  • cd SH3 52aa 4e-14 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 54aa 2e-11 in ref transcript
  • cd SH3 51aa 5e-11 in ref transcript
  • cd SH3 52aa 6e-10 in ref transcript
  • cd SH3 58aa 9e-08 in ref transcript
  • cd PH 106aa 7e-04 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • smart RhoGEF 181aa 2e-40 in ref transcript
    • Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases. Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases Also called Dbl-homologous (DH) domain. It appears that PH domains invariably occur C-terminal to RhoGEF/DH domains. Improved coverage.
  • smart EH 93aa 4e-22 in ref transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
  • smart EH 80aa 6e-22 in ref transcript
  • smart C2 97aa 1e-20 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • smart SH3 55aa 2e-16 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 55aa 4e-13 in ref transcript
  • smart SH3 56aa 1e-12 in ref transcript
  • smart SH3 53aa 3e-12 in ref transcript
  • smart SH3 61aa 5e-09 in ref transcript
  • Changed! pfam SMC_N 227aa 3e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • smart PH 109aa 7e-05 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • COG ROM1 255aa 6e-11 in ref transcript
    • RhoGEF, Guanine nucleotide exchange factor for Rho/Rac/Cdc42-like GTPases [Signal transduction mechanisms].
  • COG COG5038 96aa 3e-08 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
  • Changed! COG Smc 227aa 1e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 200aa 5e-08 in modified transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 200aa 2e-08 in modified transcript

JARID1A

  • rs.JARID1A.F1 rs.JARID1A.R1 241 323
  • NCBIGene 36.3 5927
  • Single exon skipping, size difference: 82
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001042603

  • cd BAH_plant_2 30aa 0.003 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • pfam PLU-1 333aa 1e-106 in ref transcript
    • PLU-1-like protein. Sequences in this family bear similarity to the central region of PLU-1. This is a nuclear protein that may have a role in DNA-binding and transcription, and is closely associated with the malignant phenotype of breast cancer. This region is found in various other Jumonji/ARID domain-containing proteins (see pfam02373, pfam01388).
  • pfam JmjC 117aa 2e-47 in ref transcript
    • JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
  • pfam ARID 109aa 2e-33 in ref transcript
    • ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
  • pfam zf-C5HC2 54aa 1e-19 in ref transcript
    • C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function.
  • pfam PHD 49aa 1e-11 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • smart JmjN 35aa 2e-09 in ref transcript
    • Small domain found in the jumonji family of transcription factors. To date, this domain always co-occurs with the JmjC domain (although the reverse is not true).
  • Changed! pfam PHD 47aa 3e-06 in ref transcript
  • smart PHD 52aa 4e-05 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.

JARID1A

  • rs.JARID1A.F2 rs.JARID1A.R2 100 115
  • NCBIGene 36.3 5927
  • Alternative 5-prime and 3-prime, size difference: 15
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001042603

  • cd BAH_plant_2 30aa 0.003 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • pfam PLU-1 333aa 1e-106 in ref transcript
    • PLU-1-like protein. Sequences in this family bear similarity to the central region of PLU-1. This is a nuclear protein that may have a role in DNA-binding and transcription, and is closely associated with the malignant phenotype of breast cancer. This region is found in various other Jumonji/ARID domain-containing proteins (see pfam02373, pfam01388).
  • pfam JmjC 117aa 2e-47 in ref transcript
    • JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.
  • pfam ARID 109aa 2e-33 in ref transcript
    • ARID/BRIGHT DNA binding domain. This domain is know as ARID for AT-Rich Interaction Domain, and also known as the BRIGHT domain.
  • pfam zf-C5HC2 54aa 1e-19 in ref transcript
    • C5HC2 zinc finger. Predicted zinc finger with eight potential zinc ligand binding residues. This domain is found in Jumonji. This domain may have a DNA binding function.
  • pfam PHD 49aa 1e-11 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • smart JmjN 35aa 2e-09 in ref transcript
    • Small domain found in the jumonji family of transcription factors. To date, this domain always co-occurs with the JmjC domain (although the reverse is not true).
  • pfam PHD 47aa 3e-06 in ref transcript
  • smart PHD 52aa 4e-05 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.

JMJD1C

  • rs.JMJD1C.F1 rs.JMJD1C.R1 260 314
  • NCBIGene 36.3 221037
  • Alternative 3-prime, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032776

  • pfam JmjC 93aa 6e-08 in ref transcript
    • JmjC domain. The JmjC domain belongs to the Cupin superfamily. JmjC-domain proteins may be protein hydroxylases that catalyse a novel histone modification.

KCNG1

  • rs.KCNG1.F1 rs.KCNG1.R1 103 377
  • NCBIGene 36.3 3755
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 274
  • Inclusion in 5'UTR, Inclusion in 5'UTR
  • Reference transcript: NM_002237

  • pfam K_tetra 97aa 6e-27 in ref transcript
    • K+ channel tetramerisation domain. The N-terminal, cytoplasmic tetramerisation domain (T1) of voltage-gated K+ channels encodes molecular determinants for subfamily-specific assembly of alpha-subunits into functional tetrameric channels. It is distantly related to the BTB/POZ domain pfam00651.
  • pfam Ion_trans 190aa 7e-13 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • PRK PRK10537 91aa 1e-09 in ref transcript
    • voltage-gated potassium channel; Provisional.

KCNIP2

  • rs.KCNIP2.F1 rs.KCNIP2.R1 102 147
  • NCBIGene 36.3 30819
  • Alternative 3-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014591

  • cd EFh 63aa 6e-08 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 72aa 1e-07 in ref transcript
  • smart EH 87aa 7e-04 in ref transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
  • COG FRQ1 167aa 3e-18 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

KCNIP2

  • rs.KCNIP2.F2 rs.KCNIP2.R2 119 140
  • NCBIGene 36.3 30819
  • Alternative 3-prime, size difference: 21
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_014591

  • cd EFh 63aa 6e-08 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 72aa 1e-07 in ref transcript
  • smart EH 87aa 7e-04 in ref transcript
    • Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
  • Changed! COG FRQ1 167aa 3e-18 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! COG FRQ1 174aa 1e-17 in modified transcript

KCNJ16

  • rs.KCNJ16.F1 rs.KCNJ16.R1 159 240
  • NCBIGene 36.3 3773
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 81
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_018658

  • pfam IRK 315aa 1e-152 in ref transcript
    • Inward rectifier potassium channel.

KCNK2

  • rs.KCNK2.F1 rs.KCNK2.R1 149 273
  • NCBIGene 36.3 3776
  • Alternative 5-prime, size difference: 124
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001017425

  • pfam Ion_trans_2 52aa 2e-13 in ref transcript
    • Ion channel. This family includes the two membrane helix type ion channels found in bacteria.
  • pfam Ion_trans_2 60aa 7e-09 in ref transcript

KCNMA1

  • rs.KCNMA1.F1 rs.KCNMA1.R1 100 112
  • NCBIGene 36.3 3778
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001014797

  • pfam BK_channel_a 99aa 5e-29 in ref transcript
    • Calcium-activated BK potassium channel alpha subunit.
  • pfam Ion_trans 172aa 8e-14 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam TrkA_N 114aa 0.004 in ref transcript
    • TrkA-N domain. This domain is found in a wide variety of proteins. These protein include potassium channels, phosphoesterases, and various other transporters. This domain binds to NAD.
  • PRK PRK10537 76aa 7e-11 in ref transcript
    • voltage-gated potassium channel; Provisional.

KDELR2

  • rs.KDELR2.F1 rs.KDELR2.R1 280 533
  • NCBIGene 36.3 11014
  • Single exon skipping, size difference: 253
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006854

  • Changed! pfam ER_lumen_recept 143aa 3e-50 in ref transcript
    • ER lumen protein retaining receptor.
  • Changed! COG ERD2 212aa 3e-51 in ref transcript
    • ER lumen protein retaining receptor [Intracellular trafficking and secretion].
  • Changed! pfam ER_lumen_recept 90aa 6e-30 in modified transcript
  • Changed! COG ERD2 117aa 2e-24 in modified transcript

KIAA0232

  • rs.KIAA0232.F1 rs.KIAA0232.R1 198 282
  • NCBIGene 36.3 9778
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_014743

KIAA1191

  • rs.KIAA1191.F1 rs.KIAA1191.R1 195 282
  • NCBIGene 36.3 57179
  • Single exon skipping, size difference: 87
  • Exclusion of the protein initiation site
  • Reference transcript: NM_020444

KIAA1191

  • rs.KIAA1191.F2 rs.KIAA1191.R2 205 313
  • NCBIGene 36.3 57179
  • Single exon skipping, size difference: 108
  • Exclusion in 5'UTR
  • Reference transcript: NM_020444

KIAA1217

  • rs.KIAA1217.F1 rs.KIAA1217.R1 297 402
  • NCBIGene 36.3 56243
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_019590

KIAA1543

  • rs.KIAA1543.F1 rs.KIAA1543.R1 196 277
  • NCBIGene 36.3 57662
  • Multiple exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001080429

  • pfam DUF1781 127aa 2e-68 in ref transcript
    • Protein of unknown function (DUF1781). This is a family of uncharacterised proteins. The structure of a murine hypothetical protein from RIKEN cDNA has shown it to adopt a mainly beta barrel structure with an alpha hairpin.
  • TIGR PDHac_trf_mito 71aa 0.004 in ref transcript
    • This model represents one of several closely related clades of the dihydrolipoamide acetyltransferase subunit of the pyruvate dehydrogenase complex. It includes sequences from mitochondria and from alpha and beta branches of the proteobacteria, as well as from some other bacteria. Sequences from Gram-positive bacteria are not included. The non-enzymatic homolog protein X, which serves as an E3 component binding protein, falls within the clade phylogenetically but is rejected by its low score.

KIAA1833

  • rs.KIAA1833.F1 rs.KIAA1833.R1 107 227
  • NCBIGene 36.3 727957
  • Single exon skipping, size difference: 120
  • Exclusion in 5'UTR
  • Reference transcript: NM_032450

KIF13A

  • rs.KIF13A.F1 rs.KIF13A.R1 134 173
  • NCBIGene 36.3 63971
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022113

  • cd KISc_KIF1A_KIF1B 356aa 1e-152 in ref transcript
    • Kinesin motor domain, KIF1_like proteins. KIF1A (Unc104) transports synaptic vesicles to the nerve terminal, KIF1B has been implicated in transport of mitochondria. Both proteins are expressed in neurons. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. In contrast to the majority of dimeric kinesins, most KIF1A/Unc104 kinesins are monomeric motors. A lysine-rich loop in KIF1A binds to the negatively charged C-terminus of tubulin and compensates for the lack of a second motor domain, allowing KIF1A to move processively.
  • cd FHA 98aa 8e-08 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • smart KISc 355aa 1e-129 in ref transcript
    • Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
  • pfam FHA 65aa 3e-07 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • TIGR SMC_prok_B 219aa 0.002 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG KIP1 359aa 1e-69 in ref transcript
    • Kinesin-like protein [Cytoskeleton].
  • COG Smc 166aa 0.001 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

KIF13A

  • rs.KIF13A.F2 rs.KIF13A.R2 232 337
  • NCBIGene 36.3 63971
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022113

  • cd KISc_KIF1A_KIF1B 356aa 1e-152 in ref transcript
    • Kinesin motor domain, KIF1_like proteins. KIF1A (Unc104) transports synaptic vesicles to the nerve terminal, KIF1B has been implicated in transport of mitochondria. Both proteins are expressed in neurons. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Kinesins are microtubule-dependent molecular motors that play important roles in intracellular transport and in cell division. In most kinesins, the motor domain is found at the N-terminus (N-type). N-type kinesins are (+) end-directed motors, i.e. they transport cargo towards the (+) end of the microtubule. In contrast to the majority of dimeric kinesins, most KIF1A/Unc104 kinesins are monomeric motors. A lysine-rich loop in KIF1A binds to the negatively charged C-terminus of tubulin and compensates for the lack of a second motor domain, allowing KIF1A to move processively.
  • cd FHA 98aa 8e-08 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • smart KISc 355aa 1e-129 in ref transcript
    • Kinesin motor, catalytic domain. ATPase. Microtubule-dependent molecular motors that play important roles in intracellular transport of organelles and in cell division.
  • pfam FHA 65aa 3e-07 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • TIGR SMC_prok_B 219aa 0.002 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG KIP1 359aa 1e-69 in ref transcript
    • Kinesin-like protein [Cytoskeleton].
  • COG Smc 166aa 0.001 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

KIT

  • rs.KIT.F1 rs.KIT.R1 100 112
  • NCBIGene 36.3 3815
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000222

  • cd PTKc_Kit 376aa 0.0 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Kit. Protein Tyrosine Kinase (PTK) family; Kit (or c-Kit); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Kit is a member of the Platelet Derived Growth Factor Receptor (PDGFR) subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular ligand-binding region with five immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. The binding of Kit to its ligand, the stem-cell factor (SCF), leads to receptor dimerization, trans phosphorylation and activation, and intracellular signaling. Kit is important in the development of melanocytes, germ cells, mast cells, hematopoietic stem cells, the interstitial cells of Cajal, and the pacemaker cells of the GI tract. Kit signaling is involved in major cellular functions including cell survival, proliferation, differentiation, adhesion, and chemotaxis. Mutations in Kit, which result in constitutive ligand-independent activation, are found in human cancers such as gastrointestinal stromal tumor (GIST) and testicular germ cell tumor (TGCT). The aberrant expression of Kit and/or SCF is associated with other tumor types such as systemic mastocytosis and cancers of the breast, neurons, lung, prostate, colon, and rectum. Although the structure of the human Kit catalytic domain is known, it is excluded from this specific alignment model because it contains a deletion in its sequence.
  • cd IG 75aa 6e-06 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam Pkinase_Tyr 154aa 4e-73 in ref transcript
    • Protein tyrosine kinase.
  • smart STYKc 304aa 1e-30 in ref transcript
    • Protein kinase; unclassified specificity. Phosphotransferases. The specificity of this class of kinases can not be predicted. Possible dual-specificity Ser/Thr/Tyr kinase.
  • smart IG_like 87aa 8e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • COG SPS1 206aa 2e-12 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG SPS1 136aa 2e-05 in ref transcript

KITLG

  • rs.KITLG.F1 rs.KITLG.R1 452 536
  • NCBIGene 36.3 4254
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000899

  • Changed! pfam SCF 273aa 1e-134 in ref transcript
    • Stem cell factor. Stem cell factor (SCF) is a homodimer involved in hematopoiesis. SCF binds to and activates the SCF receptor (SCFR), a receptor tyrosine kinase. The crystal structure of human SCF has been resolved and a potential receptor-binding site identified.
  • Changed! pfam SCF 245aa 1e-121 in modified transcript

KLK15

  • rs.KLK15.F1 rs.KLK15.R1 186 323
  • NCBIGene 36.3 55554
  • Single exon skipping, size difference: 137
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017509

  • Changed! cd Tryp_SPc 228aa 3e-65 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! smart Tryp_SPc 225aa 9e-73 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 235aa 4e-17 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Tryp_SPc 127aa 5e-36 in modified transcript
  • Changed! smart Tryp_SPc 127aa 2e-37 in modified transcript
  • Changed! COG COG5640 99aa 0.001 in modified transcript

KLK2

  • rs.KLK2.F1 rs.KLK2.R1 193 230
  • NCBIGene 36.3 3817
  • Alternative 5-prime, size difference: 37
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_005551

  • Changed! cd Tryp_SPc 232aa 1e-59 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! smart Tryp_SPc 230aa 2e-67 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 235aa 1e-12 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Tryp_SPc 185aa 3e-48 in modified transcript
  • Changed! smart Tryp_SPc 186aa 1e-54 in modified transcript
  • Changed! COG COG5640 51aa 6e-06 in modified transcript

KLK3

  • rs.KLK3.F1 rs.KLK3.R1 107 549
  • NCBIGene 36.3 354
  • Alternative 3-prime, size difference: 442
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001648

  • Changed! cd Tryp_SPc 232aa 3e-64 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! smart Tryp_SPc 230aa 5e-71 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 250aa 1e-21 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Tryp_SPc 186aa 8e-45 in modified transcript
  • Changed! smart Tryp_SPc 186aa 9e-50 in modified transcript
  • Changed! COG COG5640 205aa 2e-08 in modified transcript

KLK6

  • rs.KLK6.F1 rs.KLK6.R1 150 198
  • NCBIGene 36.3 5653
  • Single exon skipping, size difference: 48
  • Exclusion of the protein initiation site
  • Reference transcript: NM_002774

  • Changed! cd Tryp_SPc 218aa 6e-64 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • Changed! smart Tryp_SPc 215aa 9e-72 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 239aa 1e-22 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].

KLK8

  • rs.KLK8.F1 rs.KLK8.R1 264 399
  • NCBIGene 36.3 11202
  • Alternative 3-prime, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144505

  • cd Tryp_SPc 223aa 2e-55 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • smart Tryp_SPc 221aa 3e-63 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! COG COG5640 236aa 4e-13 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG COG5640 244aa 1e-13 in modified transcript

KRAS

  • rs.KRAS.F1 rs.KRAS.R1 182 306
  • NCBIGene 36.3 3845
  • Single exon skipping, size difference: 124
  • Exclusion of the stop codon
  • Reference transcript: NM_033360

  • Changed! cd H_N_K_Ras_like 162aa 3e-89 in ref transcript
    • H-Ras/N-Ras/K-Ras subfamily. H-Ras, N-Ras, and K-Ras4A/4B are the prototypical members of the Ras family. These isoforms generate distinct signal outputs despite interacting with a common set of activators and effectors, and are strongly associated with oncogenic progression in tumor initiation. Mutated versions of Ras that are insensitive to GAP stimulation (and are therefore constitutively active) are found in a significant fraction of human cancers. Many Ras guanine nucleotide exchange factors (GEFs) have been identified. They are sequestered in the cytosol until activation by growth factors triggers recruitment to the plasma membrane or Golgi, where the GEF colocalizes with Ras. Active (GTP-bound) Ras interacts with several effector proteins that stimulate a variety of diverse cytoplasmic signaling activities. Some are known to positively mediate the oncogenic properties of Ras, including Raf, phosphatidylinositol 3-kinase (PI3K), RalGEFs, and Tiam1. Others are proposed to play negative regulatory roles in oncogenesis, including RASSF and NORE/MST1. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • Changed! smart RAS 163aa 2e-78 in ref transcript
    • Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
  • Changed! COG COG1100 185aa 3e-18 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].
  • Changed! cd H_N_K_Ras_like 162aa 9e-90 in modified transcript
  • Changed! smart RAS 161aa 6e-78 in modified transcript
  • Changed! COG COG1100 163aa 5e-18 in modified transcript

KRT80

  • rs.KRT80.F1 rs.KRT80.R1 107 142
  • NCBIGene 36.3 144501
  • Alternative 3-prime, size difference: 35
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_182507

  • pfam Filament 312aa 1e-69 in ref transcript
    • Intermediate filament protein.
  • COG Smc 253aa 2e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

KTN1

  • rs.KTN1.F1 rs.KTN1.R1 121 208
  • NCBIGene 36.3 3895
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001079521

  • TIGR SMC_prok_B 226aa 1e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! TIGR SMC_prok_B 278aa 2e-05 in ref transcript
  • Changed! COG Smc 222aa 0.002 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 272aa 3e-05 in modified transcript

KTN1

  • rs.KTN1.F2 rs.KTN1.R2 163 247
  • NCBIGene 36.3 3895
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001079521

  • TIGR SMC_prok_B 226aa 1e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • TIGR SMC_prok_B 278aa 2e-05 in ref transcript
  • COG Smc 222aa 0.002 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

KTN1

  • rs.KTN1.F3 rs.KTN1.R3 148 204
  • NCBIGene 36.3 3895
  • Single exon skipping, size difference: 56
  • Exclusion of the stop codon
  • Reference transcript: NM_182926

  • TIGR SMC_prok_B 226aa 1e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • TIGR SMC_prok_B 278aa 2e-05 in ref transcript
  • COG Smc 222aa 0.002 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

KTN1

  • rs.KTN1.F4 FOX.KTN1.R1 117 186
  • NCBIGene 36.3 3895
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001079521

  • TIGR SMC_prok_B 226aa 1e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! TIGR SMC_prok_B 278aa 2e-05 in ref transcript
  • Changed! COG Smc 222aa 0.002 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! TIGR SMC_prok_B 255aa 1e-04 in modified transcript

KTN1

  • rs.KTN1.F5 rs.KTN1.R5 277 401
  • NCBIGene 36.3 3895
  • Single exon skipping, size difference: 124
  • Exclusion in 5'UTR
  • Reference transcript: NM_182926

  • TIGR SMC_prok_B 226aa 1e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • TIGR SMC_prok_B 278aa 2e-05 in ref transcript
  • COG Smc 222aa 0.002 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

L1CAM

  • rs.L1CAM.F1 rs.L1CAM.R1 100 112
  • NCBIGene 36.3 3897
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000425

  • cd IGcam 85aa 4e-14 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 88aa 9e-14 in ref transcript
  • cd FN3 97aa 8e-12 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd IGcam 92aa 6e-11 in ref transcript
  • cd FN3 86aa 7e-10 in ref transcript
  • cd IGcam 87aa 1e-09 in ref transcript
  • cd IGcam 86aa 6e-08 in ref transcript
  • cd FN3 88aa 1e-06 in ref transcript
  • cd FN3 82aa 7e-06 in ref transcript
  • smart IGc2 64aa 1e-14 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 82aa 1e-10 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 96aa 1e-10 in ref transcript
    • Fibronectin type III domain.
  • smart IG_like 83aa 2e-10 in ref transcript
  • pfam I-set 90aa 2e-10 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 90aa 9e-10 in ref transcript
  • smart FN3 85aa 1e-09 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam fn3 82aa 5e-08 in ref transcript
  • smart IG_like 85aa 2e-07 in ref transcript

LAMA2

  • rs.LAMA2.F1 rs.LAMA2.R1 100 112
  • NCBIGene 36.3 3908
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000426

  • cd LamG 150aa 1e-29 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 153aa 2e-28 in ref transcript
  • Changed! cd LamG 162aa 2e-25 in ref transcript
  • cd LamG 163aa 4e-22 in ref transcript
  • cd LamG 160aa 1e-19 in ref transcript
  • cd EGF_Lam 47aa 1e-08 in ref transcript
    • Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies.
  • cd EGF_Lam 45aa 1e-07 in ref transcript
  • cd EGF_Lam 49aa 5e-07 in ref transcript
  • cd EGF_Lam 53aa 7e-07 in ref transcript
  • cd EGF_Lam 43aa 2e-06 in ref transcript
  • cd EGF_Lam 50aa 3e-06 in ref transcript
  • cd EGF_Lam 49aa 7e-06 in ref transcript
  • cd EGF_Lam 58aa 1e-05 in ref transcript
  • cd EGF_Lam 46aa 2e-05 in ref transcript
  • cd EGF_Lam 53aa 1e-04 in ref transcript
  • cd EGF_Lam 48aa 0.001 in ref transcript
  • cd EGF_Lam 42aa 0.002 in ref transcript
  • cd EGF_Lam 58aa 0.007 in ref transcript
  • pfam Laminin_N 247aa 1e-88 in ref transcript
    • Laminin N-terminal (Domain VI).
  • pfam Laminin_I 266aa 1e-65 in ref transcript
    • Laminin Domain I. coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
  • pfam Laminin_G_1 129aa 8e-40 in ref transcript
    • Laminin G domain.
  • smart LamB 133aa 5e-38 in ref transcript
    • Laminin B domain.
  • smart LamB 136aa 1e-37 in ref transcript
  • pfam Laminin_II 137aa 1e-36 in ref transcript
    • Laminin Domain II. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
  • Changed! pfam Laminin_G_1 140aa 3e-34 in ref transcript
  • pfam Laminin_G_1 141aa 4e-30 in ref transcript
  • smart LamG 130aa 8e-29 in ref transcript
    • Laminin G domain.
  • pfam Laminin_G_1 142aa 1e-26 in ref transcript
  • TIGR SMC_prok_B 521aa 1e-14 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart EGF_Lam 45aa 8e-10 in ref transcript
    • Laminin-type epidermal growth factor-like domai.
  • smart EGF_Lam 47aa 1e-09 in ref transcript
  • smart EGF_Lam 44aa 9e-08 in ref transcript
  • pfam Laminin_EGF 47aa 1e-07 in ref transcript
    • Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6.
  • smart EGF_Lam 58aa 3e-07 in ref transcript
  • smart EGF_Lam 44aa 1e-06 in ref transcript
  • pfam Laminin_EGF 49aa 1e-06 in ref transcript
  • smart EGF_Lam 44aa 2e-06 in ref transcript
  • pfam Laminin_EGF 47aa 5e-06 in ref transcript
  • pfam Laminin_EGF 41aa 8e-06 in ref transcript
  • smart EGF_Lam 53aa 3e-05 in ref transcript
  • pfam Laminin_EGF 56aa 3e-04 in ref transcript
  • pfam Laminin_EGF 56aa 9e-04 in ref transcript
  • COG SbcC 608aa 2e-20 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! cd LamG 158aa 2e-26 in modified transcript
  • Changed! pfam Laminin_G_1 136aa 3e-32 in modified transcript

LAMA4

  • rs.LAMA4.F1 rs.LAMA4.R1 99 116
  • NCBIGene 36.3 3910
  • Alternative 5-prime, size difference: 17
  • Inclusion in 5'UTR
  • Reference transcript: NM_001105206

  • cd LamG 136aa 8e-30 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 151aa 7e-25 in ref transcript
  • cd LamG 159aa 6e-17 in ref transcript
  • cd LamG 140aa 7e-15 in ref transcript
  • cd LamG 178aa 8e-13 in ref transcript
  • cd EGF_Lam 52aa 3e-11 in ref transcript
    • Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies.
  • cd EGF_Lam 44aa 8e-08 in ref transcript
  • cd EGF_Lam 46aa 5e-07 in ref transcript
  • cd SPEC 197aa 3e-04 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • pfam Laminin_I 248aa 1e-72 in ref transcript
    • Laminin Domain I. coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
  • smart LamG 134aa 5e-31 in ref transcript
    • Laminin G domain.
  • pfam Laminin_II 128aa 3e-30 in ref transcript
    • Laminin Domain II. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
  • smart LamG 133aa 1e-26 in ref transcript
  • smart LamG 138aa 6e-19 in ref transcript
  • smart LamG 155aa 6e-16 in ref transcript
  • pfam Laminin_G_2 114aa 4e-13 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • pfam Laminin_EGF 47aa 6e-11 in ref transcript
    • Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6.
  • TIGR SMC_prok_B 556aa 2e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart EGF_Lam 37aa 3e-07 in ref transcript
    • Laminin-type epidermal growth factor-like domai.
  • pfam Laminin_EGF 41aa 8e-07 in ref transcript
  • pfam VSP 201aa 8e-04 in ref transcript
    • Giardia variant-specific surface protein.
  • COG Smc 304aa 1e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • PRK PRK02224 345aa 2e-06 in ref transcript
    • chromosome segregation protein; Provisional.

LAMA4

  • rs.LAMA4.F2 rs.LAMA4.R2 127 148
  • NCBIGene 36.3 3910
  • Alternative 5-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001105206

  • cd LamG 136aa 8e-30 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 151aa 7e-25 in ref transcript
  • cd LamG 159aa 6e-17 in ref transcript
  • cd LamG 140aa 7e-15 in ref transcript
  • cd LamG 178aa 8e-13 in ref transcript
  • cd EGF_Lam 52aa 3e-11 in ref transcript
    • Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies.
  • cd EGF_Lam 44aa 8e-08 in ref transcript
  • cd EGF_Lam 46aa 5e-07 in ref transcript
  • cd SPEC 197aa 3e-04 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • pfam Laminin_I 248aa 1e-72 in ref transcript
    • Laminin Domain I. coiled-coil structure. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
  • smart LamG 134aa 5e-31 in ref transcript
    • Laminin G domain.
  • pfam Laminin_II 128aa 3e-30 in ref transcript
    • Laminin Domain II. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure.
  • smart LamG 133aa 1e-26 in ref transcript
  • smart LamG 138aa 6e-19 in ref transcript
  • smart LamG 155aa 6e-16 in ref transcript
  • pfam Laminin_G_2 114aa 4e-13 in ref transcript
    • Laminin G domain. This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
  • pfam Laminin_EGF 47aa 6e-11 in ref transcript
    • Laminin EGF-like (Domains III and V). This family is like pfam00008 but has 8 conserved cysteines instead of 6.
  • TIGR SMC_prok_B 556aa 2e-09 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart EGF_Lam 37aa 3e-07 in ref transcript
    • Laminin-type epidermal growth factor-like domai.
  • pfam Laminin_EGF 41aa 8e-07 in ref transcript
  • pfam VSP 201aa 8e-04 in ref transcript
    • Giardia variant-specific surface protein.
  • COG Smc 304aa 1e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • PRK PRK02224 345aa 2e-06 in ref transcript
    • chromosome segregation protein; Provisional.

LARP4

  • rs.LARP4.F1 rs.LARP4.R1 99 117
  • NCBIGene 36.3 113251
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199188

  • cd RRM 67aa 0.004 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart LA 79aa 5e-29 in ref transcript
    • Domain in the RNA-binding Lupus La protein; unknown function.

LDB3

  • rs.LDB3.F1 rs.LDB3.R1 100 289
  • NCBIGene 36.3 11155
  • Single exon skipping, size difference: 189
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007078

  • cd PDZ_signaling 77aa 3e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 81aa 5e-17 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • pfam LIM 56aa 8e-12 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 53aa 9e-10 in ref transcript
  • pfam LIM 56aa 2e-08 in ref transcript
  • smart ZM 26aa 3e-05 in ref transcript
    • ZASP-like motif. Short motif (26 amino acids) present in an alpha-actinin-binding protein, ZASP, and similar molecules.

LDB3

  • rs.LDB3.F2 rs.LDB3.R2 381 522
  • NCBIGene 36.3 11155
  • Mutually exclusive exon skipping, size difference: 141
  • Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007078

  • cd PDZ_signaling 77aa 3e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 81aa 5e-17 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • pfam LIM 56aa 8e-12 in ref transcript
    • LIM domain. This family represents two copies of the LIM structural domain.
  • pfam LIM 53aa 9e-10 in ref transcript
  • pfam LIM 56aa 2e-08 in ref transcript
  • Changed! smart ZM 26aa 3e-05 in ref transcript
    • ZASP-like motif. Short motif (26 amino acids) present in an alpha-actinin-binding protein, ZASP, and similar molecules.
  • Changed! smart ZM 26aa 3e-06 in modified transcript

LGALS9

  • rs.LGALS9.F1 rs.LGALS9.R1 111 207
  • NCBIGene 36.3 3965
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_009587

  • cd GLECT 131aa 2e-36 in ref transcript
    • Galectin/galactose-binding lectin. This domain exclusively binds beta-galactosides, such as lactose, and does not require metal ions for activity. GLECT domains occur as homodimers or tandemly repeated domains. They are developmentally regulated and may be involved in differentiation, cell-cell interaction and cellular regulation.
  • cd GLECT 128aa 5e-33 in ref transcript
  • smart GLECT 131aa 4e-40 in ref transcript
    • Galectin. Galectin - galactose-binding lectin.
  • pfam Gal-bind_lectin 128aa 3e-35 in ref transcript
    • Galactoside-binding lectin. This family contains galactoside binding lectins. The family also includes enzymes such as human eosinophil lysophospholipase (EC:3.1.1.5).

LILRB5

  • rs.LILRB5.F1 rs.LILRB5.R1 125 425
  • NCBIGene 36.3 10990
  • Single exon skipping, size difference: 300
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001081442

  • smart IG_like 67aa 0.010 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.

LINS1

  • rs.LINS1.F1 rs.LINS1.R1 199 308
  • NCBIGene 36.3 55180
  • Single exon skipping, size difference: 109
  • Exclusion in 5'UTR
  • Reference transcript: NM_018148

LOC100129583

  • rs.LOC100129583.F1 rs.LOC100129583.R1 404 449
  • NCBIGene 36.3 100129583
  • Alternative 5-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001721012

LOC100129720

  • rs.LOC100129720.F1 rs.LOC100129720.R1 286 391
  • NCBIGene 36.3 100129720
  • Alternative 3-prime, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001725772

LOC100130008

  • rs.LOC100130008.F1 rs.LOC100130008.R1 199 316
  • NCBIGene 36.3 100130008
  • Alternative 5-prime, size difference: 117
  • Exclusion in 5'UTR
  • Reference transcript: XM_001725170

LOC100130771

  • rs.LOC100130771.F1 rs.LOC100130771.R1 131 299
  • NCBIGene 36.3 100130771
  • Alternative 3-prime, size difference: 168
  • Exclusion of the stop codon
  • Reference transcript: XM_001725809

LOC100130890

  • rs.LOC100130890.F1 rs.LOC100130890.R1 180 266
  • NCBIGene 36.3 100130890
  • Single exon skipping, size difference: 86
  • Exclusion in 5'UTR
  • Reference transcript: XM_001721588

  • cd RHOD_HSP67B2 67aa 9e-15 in ref transcript
    • Member of the Rhodanese Homology Domain superfamily. This CD includes the heat shock protein 67B2 of Drosophila melanogaster and other similar proteins, many of which are uncharacterized.
  • pfam Rhodanese 63aa 9e-06 in ref transcript
    • Rhodanese-like domain. Rhodanese has an internal duplication. This Pfam represents a single copy of this duplicated domain. The domain is found as a single copy in other proteins, including phosphatases and ubiquitin C-terminal hydrolases.
  • PRK PRK07878 58aa 4e-05 in ref transcript
    • molybdopterin biosynthesis-like protein MoeZ; Validated.

LOC100131626

  • rs.LOC100131626.F1 rs.LOC100131626.R1 287 340
  • NCBIGene 36.3 100131626
  • Alternative 3-prime, size difference: 53
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001726222

LOC100132134

  • rs.LOC100132134.F1 rs.LOC100132134.R1 112 173
  • NCBIGene 36.3 100132134
  • Alternative 5-prime, size difference: 61
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001719469

  • Changed! pfam Glyco_hydro_2 61aa 8e-04 in ref transcript
    • Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities.

LOC100132215

  • rs.LOC100132215.F1 rs.LOC100132215.R1 144 259
  • NCBIGene 36.3 100132215
  • Alternative 5-prime and 3-prime, size difference: 115
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: XM_001725462

LOC100132352

  • rs.LOC100132352.F1 rs.LOC100132352.R1 138 177
  • NCBIGene 36.3 100132352
  • Alternative 3-prime, size difference: 39
  • Exclusion of the stop codon
  • Reference transcript: XM_001720717

LOC100132565

  • rs.LOC100132565.F1 rs.LOC100132565.R1 149 242
  • NCBIGene 36.3 100132565
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 93
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001724162

  • TIGR SMC_prok_A 151aa 0.003 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.

LOC100132771

  • rs.LOC100132771.F1 rs.LOC100132771.R1 408 487
  • NCBIGene 36.3 100132771
  • Alternative 3-prime, size difference: 79
  • Inclusion in 3'UTR
  • Reference transcript: XM_001719907

LOC116412

  • rs.LOC116412.F1 rs.LOC116412.R1 158 260
  • NCBIGene 36.3 116412
  • Alternative 5-prime, size difference: 102
  • Exclusion in 5'UTR
  • Reference transcript: XM_375665

  • pfam zf-C2H2 23aa 0.006 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
  • COG COG5048 156aa 5e-06 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 217aa 3e-05 in ref transcript

LOC26010

  • rs.LOC26010.F1 rs.LOC26010.R1 175 382
  • NCBIGene 36.3 26010
  • Single exon skipping, size difference: 207
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015535

  • Changed! pfam DUF1387 310aa 1e-124 in ref transcript
    • Protein of unknown function (DUF1387). This family represents a conserved region approximately 300 residues long within a number of hypothetical proteins of unknown function that seem to be restricted to mammals.
  • Changed! pfam DUF1387 163aa 1e-71 in modified transcript
  • Changed! pfam DUF1387 90aa 2e-19 in modified transcript

LOC284701

  • rs.LOC284701.F1 rs.LOC284701.R1 128 350
  • NCBIGene 36.3 284701
  • Alternative 5-prime, size difference: 222
  • Exclusion in 5'UTR
  • Reference transcript: XM_001723603

LOC391322

  • rs.LOC391322.F1 rs.LOC391322.R1 227 266
  • NCBIGene 36.3 391322
  • Alternative 5-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001726974

  • Changed! pfam MIF 108aa 5e-34 in ref transcript
    • Macrophage migration inhibitory factor (MIF).
  • Changed! pfam MIF 95aa 8e-37 in modified transcript

LOC439992

  • rs.LOC439992.F1 rs.LOC439992.R1 262 412
  • NCBIGene 36.3 439992
  • Alternative 5-prime and 3-prime, size difference: 150
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_926528

  • Changed! pfam Ribosomal_S3Ae 199aa 4e-89 in ref transcript
    • Ribosomal S3Ae family.
  • Changed! COG RPS1A 217aa 2e-48 in ref transcript
    • Ribosomal protein S3AE [Translation, ribosomal structure and biogenesis].
  • Changed! pfam Ribosomal_S3Ae 149aa 2e-55 in modified transcript
  • Changed! COG RPS1A 167aa 6e-30 in modified transcript

LOC440434

  • rs.LOC440434.F1 rs.LOC440434.R1 111 191
  • NCBIGene 36.3 440434
  • Single exon skipping, size difference: 80
  • Inclusion in the protein causing a frameshift
  • Reference transcript: XM_001725426

  • Changed! pfam Peptidase_M1 285aa 3e-88 in ref transcript
    • Peptidase family M1. Members of this family are aminopeptidases. The members differ widely in specificity, hydrolysing acidic, basic or neutral N-terminal residues. This family includes leukotriene-A4 hydrolase, this enzyme also has an aminopeptidase activity.
  • Changed! COG PepN 273aa 4e-44 in ref transcript
    • Aminopeptidase N [Amino acid transport and metabolism].
  • Changed! pfam Peptidase_M1 93aa 5e-10 in modified transcript

LOC51149

  • rs.LOC51149.F1 rs.LOC51149.R1 146 311
  • NCBIGene 36.3 51149
  • Multiple exon skipping, size difference: 165
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016175

LOC642393

  • rs.LOC642393.F1 rs.LOC642393.R1 212 305
  • NCBIGene 36.3 642393
  • Alternative 3-prime, size difference: 93
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_001724428

  • Changed! TIGR rplT_bact 73aa 4e-11 in ref transcript
    • This protein binds directly to 23s ribosomal RNA and is necessary for the in vitro assembly process of the 50s ribosomal subunit. It is not involved in the protein synthesizing functions of that subunit. GO process changed accordingly (SS 5/09/03).
  • Changed! PRK rplT 69aa 1e-11 in ref transcript
    • 50S ribosomal protein L20; Provisional.
  • Changed! TIGR rplT_bact 75aa 3e-12 in modified transcript
  • Changed! PRK rplT 71aa 1e-12 in modified transcript

LOC642393

  • rs.LOC642393.F2 rs.LOC642393.R2 144 489
  • NCBIGene 36.3 642393
  • Alternative 3-prime, size difference: 345
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_001724426

  • Changed! TIGR rplT_bact 98aa 2e-19 in ref transcript
    • This protein binds directly to 23s ribosomal RNA and is necessary for the in vitro assembly process of the 50s ribosomal subunit. It is not involved in the protein synthesizing functions of that subunit. GO process changed accordingly (SS 5/09/03).
  • Changed! PRK rplT 97aa 2e-22 in ref transcript
    • 50S ribosomal protein L20; Provisional.
  • Changed! TIGR rplT_bact 74aa 5e-12 in modified transcript
  • Changed! PRK rplT 70aa 2e-12 in modified transcript

LOC643680

  • rs.LOC643680.F1 rs.LOC643680.R1 227 427
  • NCBIGene 36.3 643680
  • Alternative 3-prime, size difference: 200
  • Exclusion of the stop codon
  • Reference transcript: XM_931901

  • pfam CD20 40aa 2e-04 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.

LOC653117

  • rs.LOC653117.F1 rs.LOC653117.R1 101 449
  • NCBIGene 36.3 653117
  • Single exon skipping, size difference: 348
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001722839

  • smart SPRY 123aa 3e-16 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • smart PRY 50aa 2e-10 in ref transcript
    • associated with SPRY domains.
  • Changed! pfam V-set 106aa 6e-07 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

LOC653188

  • rs.LOC653188.F1 rs.LOC653188.R1 250 311
  • NCBIGene 36.3 653188
  • Alternative 5-prime, size difference: 61
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001721330

  • Changed! pfam Glyco_hydro_2 61aa 9e-05 in ref transcript
    • Glycosyl hydrolases family 2, immunoglobulin-like beta-sandwich domain. This family contains beta-galactosidase, beta-mannosidase and beta-glucuronidase activities.
  • Changed! PRK PRK10150 62aa 0.006 in ref transcript
    • beta-D-glucuronidase; Provisional.

LOC653550

  • rs.LOC653550.F1 rs.LOC653550.R1 191 315
  • NCBIGene 36.3 653550
  • Alternative 5-prime, size difference: 124
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001725815

LOC727737

  • rs.LOC727737.F1 rs.LOC727737.R1 99 115
  • NCBIGene 36.3 727737
  • Alternative 5-prime, size difference: 16
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001126088

  • pfam Peptidase_C54 301aa 1e-102 in ref transcript
    • Peptidase family C54.

LOC727761

  • rs.LOC727761.F1 rs.LOC727761.R1 114 243
  • NCBIGene 36.3 727761
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 129
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001126211

  • Changed! cd TMPK 185aa 3e-23 in ref transcript
    • Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
  • Changed! pfam Thymidylate_kin 181aa 3e-54 in ref transcript
    • Thymidylate kinase.
  • Changed! COG Tmk 188aa 2e-32 in ref transcript
    • Thymidylate kinase [Nucleotide transport and metabolism].
  • Changed! cd TMPK 142aa 2e-11 in modified transcript
  • Changed! pfam Thymidylate_kin 138aa 2e-31 in modified transcript
  • Changed! COG Tmk 145aa 4e-16 in modified transcript

LOC727761

  • rs.LOC727761.F2 rs.LOC727761.R2 111 183
  • NCBIGene 36.3 727761
  • Alternative 3-prime, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: XM_001126211

  • Changed! cd TMPK 185aa 3e-23 in ref transcript
    • Thymidine monophosphate kinase (TMPK), also known as thymidylate kinase, catalyzes the phosphorylation of thymidine monophosphate (TMP) to thymidine diphosphate (TDP) utilizing ATP as its preferred phophoryl donor. TMPK represents the rate-limiting step in either de novo or salvage biosynthesis of thymidine triphosphate (TTP).
  • Changed! pfam Thymidylate_kin 181aa 3e-54 in ref transcript
    • Thymidylate kinase.
  • Changed! COG Tmk 188aa 2e-32 in ref transcript
    • Thymidylate kinase [Nucleotide transport and metabolism].
  • Changed! cd TMPK 161aa 4e-17 in modified transcript
  • Changed! pfam Thymidylate_kin 157aa 7e-41 in modified transcript
  • Changed! COG Tmk 164aa 2e-23 in modified transcript

LOC729870

  • rs.LOC729870.F1 rs.LOC729870.R1 253 392
  • NCBIGene 36.3 729870
  • Alternative 3-prime, size difference: 139
  • Inclusion in 5'UTR
  • Reference transcript: XM_001725930

LOC92482

  • rs.LOC92482.F1 rs.LOC92482.R1 147 304
  • NCBIGene 36.3 92482
  • Single exon skipping, size difference: 157
  • Exclusion in the protein causing a frameshift
  • Reference transcript: XM_001726024

LPHN1

  • rs.LPHN1.F1 rs.LPHN1.R1 100 115
  • NCBIGene 36.3 22859
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008701

  • pfam OLF 257aa 1e-105 in ref transcript
    • Olfactomedin-like domain.
  • pfam Latrophilin 362aa 1e-100 in ref transcript
    • Latrophilin Cytoplasmic C-terminal region. This family consists of the cytoplasmic C-terminal region in latrophilin. Latrophilin is a synaptic Ca2+ independent alpha- latrotoxin (LTX) receptor and is a novel member of the secretin family of G-protein coupled receptors that are involved in secretion. Latrophilin mRNA is present only in neuronal tissue. Lactrophillin interacts with G-alpha O.
  • pfam 7tm_2 247aa 5e-74 in ref transcript
    • 7 transmembrane receptor (Secretin family). This family is known as Family B, the secretin-receptor family or family 2 of the G-protein-coupled receptors (GCPRs).They have been described in many animal species, but not in plants, fungi or prokaryotes. Three distinct sub-families are recognised. Subfamily B1 contains classical hormone receptors, such as receptors for secretin and glucagon, that are all involved in cAMP-mediated signalling pathways. Subfamily B2 contains receptors with long extracellular N-termini, such as the leukocyte cell-surface antigen CD97; calcium-independent receptors for latrotoxin, and brain-specific angiogenesis inhibitors amongst others. Subfamily B3 includes Methuselah and other Drosophila proteins. Other than the typical seven-transmembrane region, characteristic structural features include an amino-terminal extracellular domain involved in ligand binding, and an intracellular loop (IC3) required for specific G-protein coupling.
  • pfam Gal_Lectin 81aa 2e-25 in ref transcript
    • Galactose binding lectin domain.
  • smart GPS 53aa 3e-15 in ref transcript
    • G-protein-coupled receptor proteolytic site domain. Present in latrophilin/CL-1, sea urchin REJ and polycystin.
  • smart HormR 64aa 7e-14 in ref transcript
    • Domain present in hormone receptors.

LRRCC1

  • rs.LRRCC1.F1 rs.LRRCC1.R1 122 328
  • NCBIGene 36.3 85444
  • Single exon skipping, size difference: 206
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_033402

  • Changed! pfam SMC_N 259aa 3e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! TIGR SMC_prok_B 251aa 0.002 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 351aa 0.002 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

LRRFIP2

  • rs.LRRFIP2.F1 rs.LRRFIP2.R1 142 337
  • NCBIGene 36.3 9209
  • Multiple exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006309

  • Changed! pfam DUF2051 354aa 1e-54 in ref transcript
    • Double stranded RNA binding protein (DUF2051). This is a novel protein identified as interacting with the leucine-rich repeat domain of human flightless-I, FliI protein.
  • Changed! COG SbcC 333aa 3e-07 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].
  • Changed! pfam DUF2051 289aa 1e-80 in modified transcript
  • Changed! COG Smc 255aa 2e-06 in modified transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

LRRN3

  • rs.LRRN3.F1 rs.LRRN3.R1 267 374
  • NCBIGene 36.3 54674
  • Single exon skipping, size difference: 107
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099660

  • cd IGcam 89aa 2e-10 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 68aa 0.006 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • pfam I-set 88aa 9e-11 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 68aa 2e-05 in ref transcript
    • Fibronectin type III domain.
  • smart LRRCT 52aa 2e-04 in ref transcript
    • Leucine rich repeat C-terminal domain.
  • smart LRRNT 45aa 0.009 in ref transcript
    • Leucine rich repeat N-terminal domain.
  • COG COG4886 201aa 4e-05 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

LSP1

  • rs.LSP1.F1 rs.LSP1.R1 111 550
  • NCBIGene 36.3 4046
  • Alternative 5-prime, size difference: 439
  • Exclusion in 5'UTR
  • Reference transcript: NM_001013254

  • pfam Caldesmon 92aa 4e-04 in ref transcript
    • Caldesmon.

LTBP4

  • rs.LTBP4.F1 rs.LTBP4.R1 100 119
  • NCBIGene 36.3 8425
  • Alternative 3-prime, size difference: 19
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001042544

  • Changed! cd EGF_CA 34aa 0.003 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • Changed! cd EGF_CA 33aa 0.007 in ref transcript
  • Changed! smart EGF_CA 37aa 7e-05 in ref transcript
    • Calcium-binding EGF-like domain.
  • Changed! smart EGF_CA 34aa 1e-04 in ref transcript
  • Changed! smart EGF_CA 36aa 4e-04 in ref transcript
  • Changed! pfam TB 41aa 6e-04 in ref transcript
    • TB domain. This domain is also known as the 8 cysteine domain. This family includes the hybrid domains. This cysteine rich repeat is found in TGF binding protein and fibrillin.
  • Changed! smart EGF_CA 31aa 0.001 in ref transcript
  • Changed! smart EGF_CA 38aa 0.004 in ref transcript
  • Changed! pfam EGF_CA 41aa 0.004 in ref transcript
    • Calcium binding EGF domain.
  • Changed! smart EGF_CA 37aa 0.008 in ref transcript

MAGEA6

  • rs.MAGEA6.F1 rs.MAGEA6.R1 123 187
  • NCBIGene 36.3 4105
  • Alternative 3-prime, size difference: 64
  • Inclusion in 5'UTR
  • Reference transcript: NM_005363

  • pfam MAGE 171aa 5e-69 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.

MAGEC3

  • rs.MAGEC3.F1 rs.MAGEC3.R1 143 536
  • NCBIGene 36.3 139081
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 393
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein causing a frameshift, Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138702

  • Changed! pfam MAGE 114aa 2e-35 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
  • pfam MAGE 113aa 4e-35 in ref transcript

MAGED4B

  • rs.MAGED4B.F1 rs.MAGED4B.R1 102 114
  • NCBIGene 36.3 81557
  • Alternative 5-prime, size difference: 12
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_177535

  • pfam MAGE 170aa 8e-75 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.
  • PRK PRK07003 168aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.

MAGIX

  • rs.MAGIX.F1 rs.MAGIX.R1 160 388
  • NCBIGene 36.3 79917
  • Single exon skipping, size difference: 228
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024859

  • Changed! cd PDZ_signaling 59aa 7e-10 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • Changed! smart PDZ 56aa 2e-11 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • Changed! COG Prc 72aa 2e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • Changed! cd PDZ_signaling 38aa 3e-06 in modified transcript
  • Changed! smart PDZ 40aa 7e-08 in modified transcript

MAGIX

  • rs.MAGIX.F2 rs.MAGIX.R2 129 174
  • NCBIGene 36.3 79917
  • Alternative 3-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024859

  • cd PDZ_signaling 59aa 7e-10 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 56aa 2e-11 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG Prc 72aa 2e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

MAGIX

  • rs.MAGIX.F3 rs.MAGIX.R3 100 115
  • NCBIGene 36.3 79917
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024859

  • cd PDZ_signaling 59aa 7e-10 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart PDZ 56aa 2e-11 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • COG Prc 72aa 2e-04 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

MAP2K5

  • rs.MAP2K5.F1 rs.MAP2K5.R1 229 259
  • NCBIGene 36.3 5607
  • Single exon skipping, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145160

  • Changed! cd PKc_MKK5 279aa 1e-172 in ref transcript
    • Protein kinases (PKs), MAP kinase kinase 5 (MKK5) subfamily, catalytic (c) domain. PKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine or tyrosine residues on protein substrates. The MKK5 subfamily is part of a larger superfamily that includes the catalytic domains of other protein serine/threonine kinases, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. The mitogen-activated protein (MAP) kinase signaling pathways are important mediators of cellular responses to extracellular signals. The pathways involve a triple kinase core cascade comprising of the MAP kinase (MAPK), which is phosphorylated and activated by a MAPK kinase (MAPKK or MKK), which itself is phosphorylated and activated by a MAPK kinase kinase (MAPKKK or MKKK). MKK5, also referred to as MEK5, is a dual-specificity PK that phosphorylates its downstream target, extracellular signal-regulated kinase 5 (ERK5), on specific threonine and tyrosine residues. MKK5 is activated by MEKK2 and MEKK3 in response to mitogenic and stress stimuli. The ERK5 cascade promotes cell proliferation, differentiation, neuronal survival, and neuroprotection. This cascade plays an essential role in heart development. Mice deficient in either ERK5 or MKK5 die around embryonic day 10 due to cardiovascular defects including underdevelopment of the myocardium. In addition, MKK5 is associated with metastasis and unfavorable prognosis in prostate cancer.
  • cd PB1_Map2k5 91aa 3e-47 in ref transcript
    • PB1 domain is essential part of the mitogen-activated protein kinase kinase 5 (Map2k5, alias MEK5) one of the key member of the signaling kinases cascade which involved in angiogenesis and early cardiovascular development. The PB1 domain of Map2k5 interacts with the PB1 domain of another members of kinase cascade MEKK2 (or MEKK3). A canonical PB1-PB1 interaction, involving heterodimerization of two PB1 domain, is required for the formation of macromolecular signaling complexes ensuring specificity and fidelity during cellular signaling. The interaction between two PB1 domain depends on the type of PB1. There are three types of PB1 domains: type I which contains an OPCA motif, acidic aminoacid cluster, type II which contains a basic cluster, and type I/II which contains both an OPCA motif and a basic cluster. The Map2k5 protein contains a type I PB1 domain.
  • Changed! smart S_TKc 242aa 1e-62 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam PB1 67aa 7e-10 in ref transcript
    • PB1 domain.
  • Changed! COG SPS1 255aa 6e-32 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd PKc_MKK5 269aa 1e-163 in modified transcript
  • Changed! smart S_TKc 232aa 1e-58 in modified transcript
  • Changed! COG SPS1 245aa 3e-29 in modified transcript

MAP4K1

  • rs.MAP4K1.F1 rs.MAP4K1.R1 154 252
  • NCBIGene 36.3 11184
  • Single exon skipping, size difference: 98
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007181

  • cd STKc_MAP4K3_like 260aa 1e-149 in ref transcript
    • Serine/threonine kinases (STKs), mitogen-activated protein kinase (MAPK) kinase kinase kinase 3 (MAPKKKK3 or MAP4K3)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MAP4K3-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily includes MAP4K3, MAP4K1, MAP4K2, MAP4K5, and related proteins. Vertebrate members contain an N-terminal catalytic domain and a C-terminal citron homology (CNH) regulatory domain, similar to MAP4K4/6. MAP4Ks are involved in some MAPK signaling pathways that are important in mediating cellular responses to extracellular signals by activating a MAPK kinase kinase (MAPKKK or MAP3K or MKKK). Each MAPK cascade is activated either by a small GTP-binding protein or by an adaptor protein, which transmits the signal either directly to a MAP3K to start the triple kinase core cascade or indirectly through a mediator kinase, a MAP4K. MAP4K1, also called haematopoietic progenitor kinase 1 (HPK1), is a hematopoietic-specific STK involved in many cellular signaling cascades including MAPK, antigen receptor, apoptosis, growth factor, and cytokine signaling. It participates in the regulation of T cell receptor signaling and T cell-mediated immune responses. MAP4K2 was referred to as germinal center (GC) kinase because of its preferred location in GC B cells. MAP4K3 plays a role in the nutrient-responsive pathway of mTOR (mammalian target of rapamycin) signaling. It is required in the activation of S6 kinase by amino acids and for the phosphorylation of the mTOR-regulated inhibitor of eukaryotic initiation factor 4E. MAP4K5, also called germinal center kinase-related enzyme (GCKR), has been shown to activate the MAPK c-Jun N-terminal kinase (JNK).
  • Changed! smart CNH 308aa 9e-84 in ref transcript
    • Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
  • smart S_TKc 248aa 6e-69 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • COG SPS1 354aa 8e-37 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! smart CNH 308aa 5e-85 in modified transcript

MAPK10

  • rs.MAPK10.F1 rs.MAPK10.R1 148 175
  • NCBIGene 36.3 5602
  • Alternative 5-prime, size difference: 27
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_138982

  • Changed! cd S_TKc 297aa 1e-66 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 284aa 1e-67 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 292aa 1e-34 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

MAPK14

  • rs.MAPK14.F1 rs.MAPK14.R1 159 238
  • NCBIGene 36.3 1432
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_139012

  • Changed! cd S_TKc 276aa 7e-66 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 265aa 4e-64 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 269aa 5e-38 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 220aa 1e-62 in modified transcript
  • Changed! smart S_TKc 209aa 2e-60 in modified transcript
  • Changed! PTZ PTZ00024 213aa 3e-37 in modified transcript

MAPK14

  • rs.MAPK14.F2 rs.MAPK14.R2 101 180
  • NCBIGene 36.3 1432
  • Single exon skipping, size difference: 79
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_139012

  • Changed! cd S_TKc 276aa 7e-66 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 265aa 4e-64 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 269aa 5e-38 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 220aa 1e-62 in modified transcript
  • Changed! smart S_TKc 209aa 2e-60 in modified transcript
  • Changed! PTZ PTZ00024 213aa 3e-37 in modified transcript

MAPK3

  • rs.MAPK3.F1 rs.MAPK3.R1 155 287
  • NCBIGene 36.3 5595
  • Single exon skipping, size difference: 132
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002746

  • Changed! cd S_TKc 290aa 4e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • Changed! smart S_TKc 279aa 1e-69 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! PTZ PTZ00024 300aa 5e-44 in ref transcript
    • cyclin-dependent protein kinase; Provisional.
  • Changed! cd S_TKc 246aa 9e-68 in modified transcript
  • Changed! smart S_TKc 235aa 1e-69 in modified transcript
  • Changed! PTZ PTZ00024 256aa 3e-37 in modified transcript

MATK

  • rs.MATK.F1 rs.MATK.R1 316 539
  • NCBIGene 36.3 4145
  • Single exon skipping, size difference: 223
  • Exclusion of the protein initiation site
  • Reference transcript: NM_139355

  • cd PTKc_Chk 254aa 1e-146 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Csk homologous kinase. Protein Tyrosine Kinase (PTK) family; Csk homologous kinase (Chk); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. Csk subfamily kinases are cytoplasmic (or nonreceptor) tyr kinases containing the Src homology domains, SH3 and SH2, N-terminal to the catalytic tyr kinase domain. They negatively regulate the activity of Src kinases that are anchored to the plasma membrane. Chk is also referred to as megakaryocyte-associated tyrosine kinase (Matk). To inhibit Src kinases, Chk is translocated to the membrane via binding to specific transmembrane proteins, G-proteins, or adaptor proteins near the membrane. Chk inhibit Src kinases using a noncatalytic mechanism by simply binding to them. As a negative regulator of Src kinases, Chk may play important roles in cell proliferation, survival, and differentiation, and consequently, in cancer development and progression. Chk is expressed in brain and hematopoietic cells. Studies in mice reveal that Chk is not functionally redundant with Csk and that it plays an important role as a regulator of immune responses. Chk also plays a role in neural differentiation in a manner independent of Src by enhancing Mapk activation via Ras-mediated signaling.
  • cd SH2 90aa 1e-19 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • cd SH3 43aa 0.009 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart TyrKc 244aa 1e-99 in ref transcript
    • Tyrosine kinase, catalytic domain. Phosphotransferases. Tyrosine-specific kinase subfamily.
  • smart SH2 81aa 2e-22 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
  • smart SH3 45aa 1e-04 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • COG SPS1 218aa 4e-22 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

MAZ

  • rs.MAZ.F1 rs.MAZ.R1 299 524
  • NCBIGene 36.3 4150
  • Single exon skipping, size difference: 225
  • Exclusion of the stop codon
  • Reference transcript: NM_001042539

  • COG SFP1 58aa 0.004 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

MBP

  • rs.MBP.F1 rs.MBP.R1 115 148
  • NCBIGene 36.3 4155
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025101

  • Changed! pfam Myelin_MBP 154aa 1e-59 in ref transcript
    • Myelin basic protein.
  • Changed! pfam Myelin_MBP 143aa 8e-52 in modified transcript

MCL1

  • rs.MCL1.F1 rs.MCL1.R1 133 381
  • NCBIGene 36.3 4170
  • Single exon skipping, size difference: 248
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_021960

  • Changed! cd Bcl-2_like 144aa 7e-37 in ref transcript
    • Apoptosis regulator proteins of the Bcl-2 family, named after B-cell lymphoma 2. This alignment model spans what have been described as Bcl-2 homology regions BH1, BH2, BH3, and BH4. Many members of this family have an additional C-terminal transmembrane segment. Some homologous proteins, which are not included in this model, may miss either the BH4 (Bax, Bak) or the BH2 (Bcl-X(S)) region, and some appear to only share the BH3 region (Bik, Bim, Bad, Bid, Egl-1). This family is involved in the regulation of the outer mitochondrial membrane's permeability and in promoting or preventing the release of apoptogenic factors, which in turn may trigger apoptosis by activating caspases. Bcl-2 and the closely related Bcl-X(L) are anti-apoptotic key regulators of programmed cell death. They are assumed to function via heterodimeric protein-protein interactions, binding pro-apoptotic proteins such as Bad (BCL2-antagonist of cell death), Bid, and Bim, by specifically interacting with their BH3 regions. Interfering with this heterodimeric interaction via small-molecule inhibitors may prove effective in targeting various cancers. This family also includes the Caenorhabditis elegans Bcl-2 homolog CED-9, which binds to CED-4, the C. Elegans homolog of mammalian Apaf-1. Apaf-1, however, does not seem to be inhibited by Bcl-2 directly.
  • Changed! pfam Bcl-2 100aa 5e-32 in ref transcript
    • Apoptosis regulator proteins, Bcl-2 family.
  • Changed! cd Bcl-2_like 56aa 2e-04 in modified transcript

MED24

  • rs.MED24.F1 rs.MED24.R1 124 146
  • NCBIGene 36.3 9862
  • Alternative 3-prime, size difference: 22
  • Inclusion in 5'UTR
  • Reference transcript: NM_014815

MED24

  • rs.MED24.F2 rs.MED24.R2 108 147
  • NCBIGene 36.3 9862
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014815

MEN1

  • rs.MEN1.F1 rs.MEN1.R1 103 118
  • NCBIGene 36.3 4221
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130803

  • Changed! pfam Menin 615aa 0.0 in ref transcript
    • Menin. MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23.
  • Changed! pfam Menin 610aa 0.0 in modified transcript

MGC13057

  • rs.MGC13057.F1 rs.MGC13057.R1 139 158
  • NCBIGene 36.3 84281
  • Alternative 5-prime, size difference: 19
  • Exclusion in 5'UTR
  • Reference transcript: NM_032321

MGC45438

  • rs.MGC45438.F1 rs.MGC45438.R1 102 460
  • NCBIGene 36.3 146556
  • Alternative 3-prime, size difference: 358
  • Exclusion of the stop codon
  • Reference transcript: NM_152459

MID1IP1

  • rs.MID1IP1.F1 rs.MID1IP1.R1 143 247
  • NCBIGene 36.3 58526
  • Single exon skipping, size difference: 104
  • Exclusion in 5'UTR
  • Reference transcript: NM_001098790

  • pfam Spot_14 182aa 9e-66 in ref transcript
    • Thyroid hormone-inducible hepatic protein Spot 14. This family consists of several thyroid hormone-inducible hepatic protein (Spot 14 or S14) sequences. Mainly expressed in tissues that synthesise triglycerides, the mRNA coding for Spot 14 has been shown to be increased in rat liver by insulin, dietary carbohydrates, glucose in hepatocyte culture medium, as well as thyroid hormone. In contrast, dietary fats and polyunsaturated fatty acids, have been shown to decrease the amount of Spot 14 mRNA, while an elevated level of cAMP acts as a dominant negative factor. In addition, liver-specific factors or chromatin organisation of the gene have been shown to contribute to the regulation of its expression. Spot 14 protein is thought to be required for induction of hepatic lipogenesis.

MIER1

  • rs.MIER1.F1 rs.MIER1.R1 115 189
  • NCBIGene 36.3 57708
  • Single exon skipping, size difference: 74
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001077700

  • Changed! cd SANT 44aa 7e-04 in ref transcript
    • 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
  • Changed! pfam ELM2 54aa 2e-09 in ref transcript
    • ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
  • Changed! smart SANT 46aa 6e-05 in ref transcript
    • SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.

MITF

  • rs.MITF.F1 rs.MITF.R1 97 115
  • NCBIGene 36.3 4286
  • Alternative 3-prime, size difference: 18
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_198159

  • cd HLH 61aa 4e-11 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam HLH 54aa 3e-11 in ref transcript
    • Helix-loop-helix DNA-binding domain.

MIZF

  • rs.MIZF.F1 rs.MIZF.R1 336 453
  • NCBIGene 36.3 25988
  • Single exon skipping, size difference: 117
  • Exclusion in 5'UTR
  • Reference transcript: NM_015517

  • COG COG5236 133aa 0.002 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

MLLT4

  • rs.MLLT4.F1 rs.MLLT4.R1 187 226
  • NCBIGene 36.3 4301
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 39
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040001

  • cd AF6_RA_repeat1 112aa 3e-53 in ref transcript
    • The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
  • cd AF6_RA_repeat2 100aa 2e-42 in ref transcript
    • The AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region. AF6 acts downstream of the Egfr (Epidermal Growth Factor-receptor)/Ras signalling pathway and provides a link from Egfr to cytoskeletal elements.
  • cd PDZ_signaling 84aa 3e-12 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd FHA 90aa 2e-05 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • pfam DIL 107aa 5e-26 in ref transcript
    • DIL domain. The DIL domain has no known function.
  • smart RA 94aa 7e-17 in ref transcript
    • Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
  • smart PDZ 88aa 4e-14 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart RA 100aa 2e-13 in ref transcript
  • smart FHA 52aa 4e-08 in ref transcript
    • Forkhead associated domain. Found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain.
  • COG Prc 133aa 0.001 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

MLPH

  • rs.MLPH.F1 rs.MLPH.R1 121 205
  • NCBIGene 36.3 79083
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024101

  • pfam MOBP 68aa 2e-20 in ref transcript
    • Myelin-associated oligodendrocytic basic protein (MOBP). MOBP is abundantly expressed in central nervous system myelin, and shares several characteristics with myelin basic protein (MBP), in terms of regional distribution and function. MOBP has been shown to be essential for normal arrangement of the radial component in central nervous system myelin.

MMD2

  • rs.MMD2.F1 rs.MMD2.R1 109 181
  • NCBIGene 36.3 221938
  • Alternative 3-prime, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100600

  • Changed! TIGR hlyIII 225aa 1e-47 in ref transcript
    • This family includes proteins from pathogenic and non-pathogenic bacteria, Homo sapiens and Drosophila. In Bacillus cereus, a pathogen, it has been show to function as a channel-forming cytolysin. The human protein is expressed preferentially in mature macrophages, consistent with a role cytolytic role.
  • Changed! COG COG1272 243aa 8e-22 in ref transcript
    • Predicted membrane protein, hemolysin III homolog [General function prediction only].
  • Changed! TIGR hlyIII 201aa 2e-50 in modified transcript
  • Changed! COG COG1272 219aa 6e-24 in modified transcript

MOBKL3

  • rs.MOBKL3.F1 rs.MOBKL3.R1 123 186
  • NCBIGene 36.3 25843
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015387

  • pfam Mob1_phocein 167aa 3e-61 in ref transcript
    • Mob1/phocein family. Mob1 is an essential Saccharomyces cerevisiae protein, identified from a two-hybrid screen, that binds Mps1p, a protein kinase essential for spindle pole body duplication and mitotic checkpoint regulation. Mob1 contains no known structural motifs; however MOB1 is a member of a conserved gene family and shares sequence similarity with a nonessential yeast gene, MOB2. Mob1 is a phosphoprotein in vivo and a substrate for the Mps1p kinase in vitro. Conditional alleles of MOB1 cause a late nuclear division arrest at restrictive temperature. This family also includes phocein, a rat protein that by yeast two hybrid interacts with striatin.

MOCS1

  • rs.MOCS1.F1 rs.MOCS1.R1 100 115
  • NCBIGene 36.3 4337
  • Alternative 5-prime, size difference: 15
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_005942

  • Changed! cd MoaC_PE 140aa 2e-40 in ref transcript
    • MoaC family, prokaryotic and eukaryotic. Members of this family are involved in molybdenum cofactor (Moco) biosynthesis, an essential cofactor of a diverse group of redox enzymes. MoaC, a small hexameric protein, converts, together with MoaA, a guanosine derivative to the precursor Z by inserting the carbon-8 of the purine between the 2' and 3' ribose carbon atoms, which is the first of three phases of Moco biosynthesis.
  • cd Radical_SAM 198aa 2e-19 in ref transcript
    • Radical SAM superfamily. Enzymes of this family generate radicals by combining a 4Fe-4S cluster and S-adenosylmethionine (SAM) in close proximity. They are characterized by a conserved CxxxCxxC motif, which coordinates the conserved iron-sulfur cluster. Mechanistically, they share the transfer of a single electron from the iron-sulfur cluster to SAM, which leads to its reductive cleavage to methionine and a 5'-deoxyadenosyl radical, which, in turn, abstracts a hydrogen from the appropriately positioned carbon atom. Depending on the enzyme, SAM is consumed during this process or it is restored and reused. Radical SAM enzymes catalyze steps in metabolism, DNA repair, the biosynthesis of vitamins and coenzymes, and the biosynthesis of many antibiotics. Examples are biotin synthase (BioB), lipoyl synthase (LipA), pyruvate formate-lyase (PFL), coproporphyrinogen oxidase (HemN), lysine 2,3-aminomutase (LAM), anaerobic ribonucleotide reductase (ARR), and MoaA, an enzyme of the biosynthesis of molybdopterin.
  • Changed! TIGR moaA 306aa 1e-114 in ref transcript
    • The model for this family describes molybdenum cofactor biosynthesis protein A, or MoaA, as found in bacteria. It does not include the family of probable functional equivalent proteins from the archaea. MoaA works together with MoaC to synthesize precursor Z from guanine.
  • Changed! pfam MoaC 136aa 2e-43 in ref transcript
    • MoaC family. Members of this family are involved in molybdenum cofactor biosynthesis. However their molecular function is not known.
  • Changed! PRK moaA 327aa 1e-116 in ref transcript
    • molybdenum cofactor biosynthesis protein A; Reviewed.
  • Changed! PRK moaC 157aa 5e-48 in ref transcript
    • molybdenum cofactor biosynthesis protein C; Provisional.
  • Changed! TIGR moaA 325aa 1e-114 in modified transcript
  • Changed! PRK moaA 332aa 1e-117 in modified transcript

MORC4

  • rs.MORC4.F1 rs.MORC4.R1 200 238
  • NCBIGene 36.3 79710
  • Alternative 5-prime, size difference: 38
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024657

  • cd HATPase_c 99aa 3e-04 in ref transcript
    • Histidine kinase-like ATPases; This family includes several ATP-binding proteins for example: histidine kinase, DNA gyrase B, topoisomerases, heat shock protein HSP90, phytochrome-like ATPases and DNA mismatch repair proteins.
  • pfam zf-CW 46aa 3e-17 in ref transcript
    • CW-type Zinc Finger. This domain appears to be a zinc finger. The alignment shows four conserved cysteine residues and a conserved tryptophan. It was first identified by, and is predicted to be a "highly specialised mononuclear four-cysteine zinc finger...that plays a role in DNA binding and/or promoting protein-protein interactions in complicated eukaryotic processes including...chromatin methylation status and early embryonic development." Weak homology to pfam00628 further evidences these predictions (personal obs: C Yeats). Twelve different CW-domain-containing protein subfamilies are described, with different subfamilies being characteristic of vertebrates, higher plants and other animals in which these domain is found.
  • pfam HATPase_c 101aa 3e-06 in ref transcript
    • Histidine kinase-, DNA gyrase B-, and HSP90-like ATPase. This family represents the structurally related ATPase domains of histidine kinase, DNA gyrase B and HSP90.
  • COG MutL 63aa 0.001 in ref transcript
    • DNA mismatch repair enzyme (predicted ATPase) [DNA replication, recombination, and repair].

MPP3

  • rs.MPP3.F1 rs.MPP3.R1 99 120
  • NCBIGene 36.3 4356
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001932

  • Changed! cd GMPK 119aa 1e-24 in ref transcript
    • Guanosine monophosphate kinase (GMPK, EC 2.7.4.8), also known as guanylate kinase (GKase), catalyzes the reversible phosphoryl transfer from adenosine triphosphate (ATP) to guanosine monophosphate (GMP) to yield adenosine diphosphate (ADP) and guanosine diphosphate (GDP). It plays an essential role in the biosynthesis of guanosine triphosphate (GTP). This enzyme is also important for the activation of some antiviral and anticancer agents, such as acyclovir, ganciclovir, carbovir, and thiopurines.
  • cd PDZ_signaling 81aa 1e-11 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd SH3 59aa 2e-05 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! smart GuKc 188aa 4e-51 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • smart PDZ 82aa 2e-12 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart L27 54aa 1e-09 in ref transcript
    • domain in receptor targeting proteins Lin-2 and Lin-7.
  • pfam L27 51aa 1e-06 in ref transcript
    • L27 domain. The L27 domain is found in receptor targeting proteins Lin-2 and Lin-7.
  • smart SH3 62aa 1e-05 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! COG Gmk 187aa 9e-22 in ref transcript
    • Guanylate kinase [Nucleotide transport and metabolism].
  • COG Prc 77aa 6e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

MRE11A

  • rs.MRE11A.F1 rs.MRE11A.R1 136 220
  • NCBIGene 36.3 4361
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005591

  • TIGR mre11 393aa 1e-140 in ref transcript
    • All proteins in this family for which functions are known are subunits of a nuclease complex made up of multiple proteins including MRE11 and RAD50 homologs. The functions of this nuclease complex include recombinational repair and non-homolgous end joining. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). The proteins in this family are distantly related to proteins in the SbcCD complex of bacteria.
  • COG SbcD 309aa 2e-20 in ref transcript
    • DNA repair exonuclease [DNA replication, recombination, and repair].

MRGPRF

  • rs.MRGPRF.F1 rs.MRGPRF.R1 99 110
  • NCBIGene 36.3 219928
  • Alternative 5-prime and 3-prime, size difference: 11
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_145015

  • pfam 7tm_1 225aa 2e-11 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

MRPL21

  • rs.MRPL21.F1 rs.MRPL21.R1 109 122
  • NCBIGene 36.3 219927
  • Alternative 5-prime, size difference: 13
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_181514

  • Changed! pfam Ribosomal_L21p 73aa 1e-15 in ref transcript
    • Ribosomal prokaryotic L21 protein.
  • Changed! COG RplU 68aa 2e-06 in ref transcript
    • Ribosomal protein L21 [Translation, ribosomal structure and biogenesis].

MRPL39

  • rs.MRPL39.F1 rs.MRPL39.R1 101 190
  • NCBIGene 36.3 54148
  • Single exon skipping, size difference: 89
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_080794

  • cd TGS_ThrRS_N 55aa 1e-13 in ref transcript
    • TGS _ThrRS_N: ThrRS (threonyl-tRNA Synthetase) is a class II tRNA synthetase that couples threonine to its cognate tRNA. In addition to its catalytic and anticodon-binding domains, ThrRS has an N-terminal TGS domain, named after the ThrRS, GTPase, and SpoT proteins where it occurs. The TGS domain is thought to interact with the tRNA acceptor arm along with an adjacent N-terminal domain. The specific function of TGS is not well understood.
  • TIGR thrS 133aa 2e-05 in ref transcript
    • This model represents the threonyl-tRNA synthetase found in most organisms. This protein is a class II tRNA synthetase, and is recognized by the pfam HMM tRNA-synt_2b. Note that B. subtilis has closely related isozymes thrS and thrZ. The N-terminal regions are quite dissimilar between archaeal and eubacterial forms, while some eukaryotic forms are missing sequence there altogether.
  • PRK thrS 197aa 2e-12 in ref transcript
    • threonyl-tRNA synthetase; Reviewed.

MRPL42

  • rs.MRPL42.F1 rs.MRPL42.R1 115 135
  • NCBIGene 36.3 28977
  • Single exon skipping, size difference: 20
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_172178

  • Changed! pfam MRP-S32 99aa 3e-42 in ref transcript
    • Mitochondrial 28S ribosomal protein S32. This entry is of a family of short, approximately 100 amino acid residues, proteins which are mitochondrial 28S ribosomal proteins named as MRP-S32. Their exact function could not be confirmed.

MRPL55

  • rs.MRPL55.F1 rs.MRPL55.R1 104 216
  • NCBIGene 36.3 128308
  • Alternative 5-prime, size difference: 112
  • Exclusion in 5'UTR
  • Reference transcript: NM_181462

  • pfam Mitoc_L55 118aa 1e-32 in ref transcript
    • Mitochondrial ribosomal protein L55. Members of this family are involved in mitochondrial biogenesis and G2/M phase cell cycle progression. They form a component of the mitochondrial ribosome large subunit (39S) which comprises a 16S rRNA and about 50 distinct proteins.

MRPS11

  • rs.MRPS11.F1 rs.MRPS11.R1 265 364
  • NCBIGene 36.3 64963
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022839

  • Changed! TIGR bact_S11 106aa 3e-18 in ref transcript
    • This model describes the bacterial 30S ribosomal protein S11. Cutoffs are set such that the model excludes archaeal and eukaryotic ribosomal proteins, but many chloroplast and mitochondrial equivalents of S11 are detected.
  • Changed! PRK PRK05309 109aa 2e-19 in ref transcript
    • 30S ribosomal protein S11; Validated.
  • Changed! TIGR bact_S11 84aa 3e-13 in modified transcript
  • Changed! PRK PRK05309 87aa 5e-14 in modified transcript

MRVI1

  • rs.MRVI1.F1 rs.MRVI1.R1 239 293
  • NCBIGene 36.3 10335
  • Mutually exclusive exon skipping, size difference: 54
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001100163

  • pfam MRVI1 593aa 1e-141 in ref transcript
    • MRVI1 protein. This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins.

MRVI1

  • rs.MRVI1.F2 rs.MRVI1.R2 257 362
  • NCBIGene 36.3 10335
  • Single exon skipping, size difference: 105
  • Exclusion in 5'UTR
  • Reference transcript: NM_001100163

  • pfam MRVI1 593aa 1e-141 in ref transcript
    • MRVI1 protein. This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins.

MRVI1

  • rs.MRVI1.F3 rs.MRVI1.R3 414 518
  • NCBIGene 36.3 10335
  • Single exon skipping, size difference: 104
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001098579

  • Changed! pfam MRVI1 593aa 1e-144 in ref transcript
    • MRVI1 protein. This family consists of mammalian MRVI1 proteins which are related to the lymphoid-restricted membrane protein (JAW1) and the IP3 receptor associated cGMP kinase substrates A and B (IRAGA and IRAGB). The function of MRVI1 is unknown although mutations in the Mrvi1 gene induces myeloid leukaemia by altering the expression of a gene important for myeloid cell growth and/or differentiation so it has been speculated that Mrvi1 is a tumour suppressor gene. IRAG is very similar in sequence to MRVI1 and is an essential NO/cGKI-dependent regulator of IP3-induced calcium release. Activation of cGKI decreases IP3-stimulated elevations in intracellular calcium, induces smooth muscle relaxation and contributes to the antiproliferative and pro-apoptotic effects of NO/cGMP. Jaw1 is a member of a class of proteins with COOH-terminal hydrophobic membrane anchors and is structurally similar to proteins involved in vesicle targeting and fusion. This suggests that the function and/or the structure of the ER in lymphocytes may be modified by lymphoid-restricted resident ER proteins.

MS4A13

  • rs.MS4A13.F1 rs.MS4A13.R1 108 228
  • NCBIGene 36.3 503497
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012417

  • Changed! pfam CD20 139aa 7e-14 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
  • Changed! pfam CD20 99aa 2e-09 in modified transcript

MS4A15

  • rs.MS4A15.F1 rs.MS4A15.R1 149 436
  • NCBIGene 36.3 219995
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 287
  • Exclusion of the protein initiation site, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001098835

  • Changed! pfam CD20 118aa 3e-30 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.

MSLN

  • rs.MSLN.F1 rs.MSLN.R1 118 142
  • NCBIGene 36.3 10232
  • Alternative 3-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013404

  • Changed! pfam Mesothelin 606aa 0.0 in ref transcript
    • Pre-pro-megakaryocyte potentiating factor precursor (Mesothelin). This family consists of several mammalian pre-pro-megakaryocyte potentiating factor precursor (MPF) or mesothelin proteins. Mesothelin is a glycosylphosphatidylinositol-linked glycoprotein highly expressed in mesothelial cells, mesotheliomas, and ovarian cancer, but the biological function of the protein is not known.
  • Changed! pfam Mesothelin 598aa 0.0 in modified transcript

MSR1

  • rs.MSR1.F1 rs.MSR1.R1 142 331
  • NCBIGene 36.3 4481
  • Single exon skipping, size difference: 189
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138715

  • Changed! smart SR 101aa 1e-39 in ref transcript
    • Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
  • pfam Macscav_rec 49aa 3e-18 in ref transcript
    • Macrophage scavenger receptor.
  • pfam Collagen 63aa 3e-13 in ref transcript
    • Collagen triple helix repeat (20 copies). Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxyproline. Collagens are post translationally modified by proline hydroxylase to form the hydroxyproline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure.
  • Changed! pfam SMC_N 83aa 0.007 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! pfam SMC_N 173aa 0.009 in ref transcript
  • Changed! smart SR 42aa 3e-12 in modified transcript

MST4

  • rs.MST4.F1 rs.MST4.R1 102 333
  • NCBIGene 36.3 51765
  • Single exon skipping, size difference: 231
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016542

  • Changed! cd STKc_MST4 277aa 1e-146 in ref transcript
    • Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 4 (MST4) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST4 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MST4 is sometimes referred to as MASK (MST3 and SOK1-related kinase). It plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. It influences cell growth and transformation by modulating the extracellular signal-regulated kinase (ERK) pathway. MST4 may also play a role in tumor formation and progression. It localizes in the Golgi apparatus by interacting with the Golgi matrix protein GM130 and may play a role in cell migration.
  • Changed! smart S_TKc 241aa 4e-70 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 266aa 3e-32 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd STKc_MST3_like 204aa 1e-119 in modified transcript
    • Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 3 (MST3)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST3-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily is composed of MST3, MST4, STK25, Schizosaccharomyces pombe Nak1 and Sid1, Saccharomyces cerevisiae sporulation-specific protein 1 (SPS1), and related proteins. Nak1 is required by fission yeast for polarizing the tips of actin cytoskeleton and is involved in cell growth, cell separation, cell morphology and cell-cycle progression. Sid1 is a component in the septation initiation network (SIN) signaling pathway, and plays a role in cytokinesis. SPS1 plays a role in regulating proteins required for spore wall formation. MST4 plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. MST3 phosphorylates the STK NDR and may play a role in cell cycle progression and cell morphology. STK25 may play a role in the regulation of cell migration and polarization.
  • Changed! smart S_TKc 182aa 1e-60 in modified transcript
  • Changed! COG SPS1 201aa 2e-27 in modified transcript

MST4

  • rs.MST4.F2 rs.MST4.R2 325 511
  • NCBIGene 36.3 51765
  • Single exon skipping, size difference: 186
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016542

  • Changed! cd STKc_MST4 277aa 1e-146 in ref transcript
    • Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 4 (MST4) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST4 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. MST4 is sometimes referred to as MASK (MST3 and SOK1-related kinase). It plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. It influences cell growth and transformation by modulating the extracellular signal-regulated kinase (ERK) pathway. MST4 may also play a role in tumor formation and progression. It localizes in the Golgi apparatus by interacting with the Golgi matrix protein GM130 and may play a role in cell migration.
  • Changed! smart S_TKc 241aa 4e-70 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG SPS1 266aa 3e-32 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! cd STKc_MST3_like 212aa 6e-90 in modified transcript
    • Serine/threonine kinases (STKs), mammalian Ste20-like protein kinase 3 (MST3)-like subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The MST3-like subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily is composed of MST3, MST4, STK25, Schizosaccharomyces pombe Nak1 and Sid1, Saccharomyces cerevisiae sporulation-specific protein 1 (SPS1), and related proteins. Nak1 is required by fission yeast for polarizing the tips of actin cytoskeleton and is involved in cell growth, cell separation, cell morphology and cell-cycle progression. Sid1 is a component in the septation initiation network (SIN) signaling pathway, and plays a role in cytokinesis. SPS1 plays a role in regulating proteins required for spore wall formation. MST4 plays a role in mitogen-activated protein kinase (MAPK) signaling during cytoskeletal rearrangement, morphogenesis, and apoptosis. MST3 phosphorylates the STK NDR and may play a role in cell cycle progression and cell morphology. STK25 may play a role in the regulation of cell migration and polarization.
  • Changed! smart S_TKc 166aa 5e-47 in modified transcript
  • Changed! COG SPS1 208aa 2e-21 in modified transcript

MTERFD3

  • rs.MTERFD3.F1 rs.MTERFD3.R1 90 100
  • NCBIGene 36.3 80298
  • Alternative 3-prime, size difference: 10
  • Inclusion in 5'UTR
  • Reference transcript: NM_025198

  • pfam mTERF 282aa 3e-10 in ref transcript
    • mTERF. This family contains one sequence of known function Human mitochondrial transcription termination factor (mTERF) the rest of the family consists of hypothetical proteins none of which have any functional information. mTERF is a multizipper protein possessing three putative leucine zippers one of which is bipartite. The protein binds DNA as a monomer. The leucine zippers are not implicated in a dimerisation role as in other leucine zippers.

MTMR14

  • rs.MTMR14.F1 rs.MTMR14.R1 267 423
  • NCBIGene 36.3 64419
  • Single exon skipping, size difference: 156
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077525

MTUS1

  • rs.MTUS1.F1 rs.MTUS1.R1 287 449
  • NCBIGene 36.3 57509
  • Single exon skipping, size difference: 162
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001001924

  • TIGR SMC_prok_B 297aa 3e-14 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 320aa 2e-11 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

MUC1

  • rs.MUC1.F1 rs.MUC1.R1 101 128
  • NCBIGene 36.3 4582
  • Alternative 3-prime, size difference: 27
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_002456

  • smart SEA 116aa 2e-31 in ref transcript
    • Domain found in sea urchin sperm protein, enterokinase, agrin. Proposed function of regulating or binding carbohydrate sidechains.

MUC1

  • rs.MUC1.F3 rs.MUC1.R3 137 191
  • NCBIGene 36.3 4582
  • Alternative 3-prime, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002456

  • Changed! smart SEA 116aa 2e-31 in ref transcript
    • Domain found in sea urchin sperm protein, enterokinase, agrin. Proposed function of regulating or binding carbohydrate sidechains.
  • Changed! smart SEA 117aa 1e-26 in modified transcript

MVP

  • rs.MVP.F1 rs.MVP.R1 130 171
  • NCBIGene 36.3 9961
  • Alternative 3-prime, size difference: 41
  • Inclusion in 5'UTR
  • Reference transcript: NM_005115

  • pfam Vault 53aa 8e-12 in ref transcript
    • Major Vault Protein repeat. The vault is a ubiquitous and highly conserved ribonucleoprotein particle of approximately 13 mDa of unknown function. This family corresponds to a repeat found in the amino terminal half of the major vault protein.
  • pfam Vault 53aa 2e-09 in ref transcript
  • pfam Vault 58aa 9e-09 in ref transcript
  • pfam Vault 49aa 9e-08 in ref transcript
  • pfam Vault 62aa 3e-06 in ref transcript
  • pfam Vault 48aa 6e-05 in ref transcript

MYH14

  • rs.MYH14.F1 rs.MYH14.R1 124 148
  • NCBIGene 36.3 79784
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077186

  • Changed! cd MYSc_type_II 707aa 0.0 in ref transcript
    • Myosin motor domain, type II myosins. Myosin II mediates cortical contraction in cell motility, and is the motor in smooth and skeletal muscle. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • Changed! smart MYSc 709aa 0.0 in ref transcript
    • Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
  • pfam Myosin_tail_1 499aa 1e-30 in ref transcript
    • Myosin tail. The myosin molecule is a multi-subunit complex made up of two heavy chains and four light chains it is a fundamental contractile protein found in all eukaryote cell types. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament. The coiled-coil region provides the structural backbone the thick filament.
  • pfam Myosin_tail_1 238aa 4e-14 in ref transcript
  • Changed! COG COG5022 780aa 0.0 in ref transcript
    • Myosin heavy chain [Cytoskeleton].
  • Changed! cd MYSc_type_II 699aa 0.0 in modified transcript
  • Changed! smart MYSc 701aa 0.0 in modified transcript
  • Changed! COG COG5022 772aa 0.0 in modified transcript

MYO3B

  • rs.MYO3B.F1 rs.MYO3B.R1 134 215
  • NCBIGene 36.3 140469
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138995

  • cd MYSc_type_III 699aa 0.0 in ref transcript
    • Myosin motor domain, type III myosins. Myosin III has been shown to play a role in the vision process in insects and in hearing in mammals. Myosin III, an unconventional myosin, does not form dimers. This catalytic (head) domain has ATPase activity and belongs to the larger group of P-loop NTPases. Myosins are actin-dependent molecular motors that play important roles in muscle contraction, cell motility, and organelle transport. The head domain is a molecular motor, which utilizes ATP hydrolysis to generate directed movement toward the plus end along actin filaments. A cyclical interaction between myosin and actin provides the driving force. Rates of ATP hydrolysis and consequently the speed of movement along actin filaments vary widely, from about 0.04 micrometer per second for myosin I to 4.5 micrometer per second for myosin II in skeletal muscle. Myosin II moves in discrete steps about 5-10 nm long and generates 1-5 piconewtons of force. Upon ATP binding, the myosin head dissociates from an actin filament. ATP hydrolysis causes the head to pivot and associate with a new actin subunit. The release of Pi causes the head to pivot and move the filament (power stroke). Release of ADP completes the cycle.
  • cd STKc_myosinIIIB 291aa 0.0 in ref transcript
    • Serine/threonine kinases (STKs), class IIIB myosin subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The class III myosin subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Class III myosins are motor proteins containing an N-terminal kinase catalytic domain and a C-terminal actin-binding domain. Class III myosins may play an important role in maintaining the structural integrity of photoreceptor cell microvilli. They may also function as cargo carriers during light-dependent translocation, in photoreceptor cells, of proteins such as transducin and arrestin. Class IIIB myosin is expressed highly in retina. It is also present in the brain and testis. The human class IIIB myosin gene maps to a region that overlaps the locus for Bardet-Biedl syndrome, which is characterized by dysmorphic extremities, retinal dystrophy, obesity, male hypogenitalism, and renal abnormalities.
  • smart MYSc 696aa 0.0 in ref transcript
    • Myosin. Large ATPases. ATPase; molecular motor. Muscle contraction consists of a cyclical interaction between myosin and actin. The core of the myosin structure is similar in fold to that of kinesin.
  • smart S_TKc 257aa 2e-67 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • Changed! COG COG5022 755aa 1e-168 in ref transcript
    • Myosin heavy chain [Cytoskeleton].
  • COG SPS1 268aa 3e-33 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • Changed! COG COG5022 688aa 1e-168 in modified transcript

MYOM1

  • rs.MYOM1.F1 rs.MYOM1.R1 102 390
  • NCBIGene 36.3 8736
  • Single exon skipping, size difference: 288
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003803

  • cd FN3 94aa 2e-15 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 90aa 8e-14 in ref transcript
  • cd FN3 94aa 9e-14 in ref transcript
  • cd IGcam 92aa 9e-14 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 91aa 1e-12 in ref transcript
  • cd FN3 95aa 3e-12 in ref transcript
  • cd IGcam 89aa 2e-11 in ref transcript
  • cd IGcam 80aa 1e-05 in ref transcript
  • cd IGcam 78aa 0.005 in ref transcript
  • pfam I-set 85aa 4e-21 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 89aa 1e-12 in ref transcript
    • Fibronectin type III domain.
  • smart FN3 84aa 4e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • Changed! smart IG_like 88aa 9e-12 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 87aa 2e-11 in ref transcript
  • pfam fn3 90aa 2e-10 in ref transcript
  • pfam I-set 69aa 5e-10 in ref transcript
  • pfam fn3 85aa 2e-09 in ref transcript
  • pfam I-set 70aa 2e-05 in ref transcript
  • pfam I-set 70aa 3e-04 in ref transcript
  • Changed! pfam I-set 93aa 9e-12 in modified transcript

MYST3

  • rs.MYST3.F1 rs.MYST3.R1 255 299
  • NCBIGene 36.3 7994
  • Alternative 5-prime, size difference: 44
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099412

  • cd BAH_plant_2 34aa 8e-05 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • pfam MOZ_SAS 188aa 1e-101 in ref transcript
    • MOZ/SAS family. This region of these proteins has been suggested to be homologous to acetyltransferases.
  • pfam PHD 52aa 3e-10 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • pfam PHD 58aa 7e-06 in ref transcript
  • smart H15 78aa 8e-06 in ref transcript
    • Domain in histone families 1 and 5.
  • COG SAS2 331aa 3e-90 in ref transcript
    • Histone acetyltransferase (MYST family) [Chromatin structure and dynamics].

NAE1

  • rs.NAE1.F1 rs.NAE1.R1 190 288
  • NCBIGene 36.3 8883
  • Single exon skipping, size difference: 98
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_003905

  • Changed! cd APPBP1_RUB 389aa 1e-178 in ref transcript
    • Ubiquitin activating enzyme (E1) subunit APPBP1. APPBP1 is part of the heterodimeric activating enzyme (E1), specific for the Rub family of ubiquitin-like proteins (Ubls). E1 enzymes are part of a conjugation cascade to attach Ub or Ubls, covalently to substrate proteins consisting of activating (E1), conjugating (E2), and/or ligating (E3) enzymes. E1 activates ubiquitin(-like) by C-terminal adenylation, and subsequently forms a highly reactive thioester bond between its catalytic cysteine and Ubls C-terminus. E1 also associates with E2 and promotes ubiquitin transfer to the E2's catalytic cysteine. Post-translational modification by Rub family of ubiquitin-like proteins (Ublps) activates SCF ubiquitin ligases and is involved in cell cycle control, signaling and embryogenesis. ABPP1 contains part of the adenylation domain.
  • Changed! cd APPBP1_RUB 84aa 3e-17 in ref transcript
  • Changed! TIGR Ube1 141aa 4e-16 in ref transcript
    • This model represents the full length, over a thousand amino acids, of a multicopy family of eukaryotic proteins, many of which are designated ubiquitin-activating enzyme E1. Members have two copies of the ThiF family domain (pfam00899), a repeat found in ubiquitin-activating proteins (pfam02134), and other regions.
  • Changed! TIGR Ube1 104aa 7e-07 in ref transcript
  • Changed! COG ThiF 185aa 4e-22 in ref transcript
    • Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Coenzyme metabolism].

NARF

  • rs.NARF.F1 rs.NARF.R1 109 247
  • NCBIGene 36.3 26502
  • Single exon skipping, size difference: 138
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_012336

  • Changed! pfam Fe_hyd_lg_C 307aa 1e-107 in ref transcript
    • Iron only hydrogenase large subunit, C-terminal domain.
  • Changed! pfam Fe_hyd_SSU 54aa 2e-06 in ref transcript
    • Iron hydrogenase small subunit. This family represents the small subunit of the Fe-only hydrogenases EC:1.18.99.1. The subunit is comprised of alternating random coil and alpha helical structures that encompasses the large subunit in a novel protein fold.
  • Changed! COG COG4624 412aa 6e-50 in ref transcript
    • Iron only hydrogenase large subunit, C-terminal domain [General function prediction only].
  • Changed! pfam Fe_hyd_lg_C 182aa 2e-64 in modified transcript
  • Changed! COG COG4624 224aa 7e-36 in modified transcript

NARF

  • rs.NARF.F2 rs.NARF.R2 102 246
  • NCBIGene 36.3 26502
  • Single exon skipping, size difference: 144
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_012336

  • Changed! pfam Fe_hyd_lg_C 307aa 1e-107 in ref transcript
    • Iron only hydrogenase large subunit, C-terminal domain.
  • pfam Fe_hyd_SSU 54aa 2e-06 in ref transcript
    • Iron hydrogenase small subunit. This family represents the small subunit of the Fe-only hydrogenases EC:1.18.99.1. The subunit is comprised of alternating random coil and alpha helical structures that encompasses the large subunit in a novel protein fold.
  • Changed! COG COG4624 412aa 6e-50 in ref transcript
    • Iron only hydrogenase large subunit, C-terminal domain [General function prediction only].
  • Changed! pfam Fe_hyd_lg_C 302aa 1e-106 in modified transcript
  • Changed! COG COG4624 387aa 1e-43 in modified transcript

NARG1L

  • rs.NARG1L.F1 rs.NARG1L.R1 108 215
  • NCBIGene 36.3 79612
  • Single exon skipping, size difference: 107
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024561

  • cd TPR 100aa 6e-06 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • TIGR PEP_TPR_lipo 219aa 6e-06 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • COG COG4783 165aa 3e-06 in ref transcript
    • Putative Zn-dependent protease, contains TPR repeats [General function prediction only].

NARG2

  • rs.NARG2.F1 rs.NARG2.R1 317 477
  • NCBIGene 36.3 79664
  • Alternative 3-prime, size difference: 160
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024611

  • Changed! pfam NARG2_C 214aa 3e-74 in ref transcript
    • NMDA receptor-regulated gene protein 2. The transition of neuronal cells from pre-cursor to mature state is regulated by the N-methyl-d-aspartate (NMDA) receptor, a glutamate-gated ion channel that is permeable to Ca2+. NMDA receptors probably mediate this activity by permitting expression of NARG2. NARG2 is transiently expressed, being a regulatory protein that is present in the nucleus of dividing cells and then down-regulated as progenitors exit the cell cycle and begin to differentiate. NARG2 contains repeats of (S/T)PXX, (11 in mouse, six in human), a putative DNA-binding motif that is found in many gene-regulatory proteins including Kruppel, Hunchback and Antennapedi.

NBEAL1

  • rs.NBEAL1.F1 rs.NBEAL1.R1 129 336
  • NCBIGene 36.3 65065
  • Multiple exon skipping, size difference: 207
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001099273

  • Changed! cd Beach 281aa 1e-128 in ref transcript
    • BEACH (Beige and Chediak-Higashi) domains, implicated in membrane trafficking, are present in a family of proteins conserved throughout eukaryotes. This group contains human lysosomal trafficking regulator (LYST), LPS-responsive and beige-like anchor (LRBA) and neurobeachin. Disruption of LYST leads to Chediak-Higashi syndrome, characterized by severe immunodeficiency, albinism, poor blood coagulation and neurologic problems. Neurobeachin is a candidate gene linked to autism. LBRA seems to be upregulated in several cancer types. It has been shown that the BEACH domain itself is important for the function of these proteins.
  • cd WD40 274aa 2e-15 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd Neurobeachin 84aa 9e-11 in ref transcript
    • Neurobeachin Pleckstrin homology-like domain. This domain is found in the large multi-domain eukaryotic protein Nerubeachin, N-terminal to the BEACH domain. This PH-like domain interacts with the BEACH domain in the same manner used by other PH-like domains to bind peptides.
  • Changed! pfam Beach 281aa 1e-132 in ref transcript
    • Beige/BEACH domain.
  • COG COG2319 285aa 4e-08 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! cd Beach 275aa 1e-124 in modified transcript
  • Changed! pfam Beach 275aa 1e-128 in modified transcript

NBN

  • rs.NBN.F1 rs.NBN.R1 345 395
  • NCBIGene 36.3 4683
  • Single exon skipping, size difference: 50
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002485

  • Changed! cd FHA 109aa 2e-14 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • Changed! cd BRCT 69aa 4e-04 in ref transcript
    • Breast Cancer Suppressor Protein (BRCA1), carboxy-terminal domain. The BRCT domain is found within many DNA damage repair and cell cycle checkpoint proteins. The unique diversity of this domain superfamily allows BRCT modules to interact forming homo/hetero BRCT multimers, BRCT-non-BRCT interactions, and interactions within DNA strand breaks.
  • Changed! pfam Nbs1_C 65aa 3e-30 in ref transcript
    • DNA damage repair protein Nbs1. This C terminal region of the DNA damage repair protein Nbs1 has been identified to be necessary for the binding of Mre11 and Tel1.
  • Changed! pfam FHA 77aa 2e-08 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • Changed! pfam BRCT 67aa 0.008 in ref transcript
    • BRCA1 C Terminus (BRCT) domain. The BRCT domain is found predominantly in proteins involved in cell cycle checkpoint functions responsive to DNA damage. It has been suggested that the Retinoblastoma protein contains a divergent BRCT domain, this has not been included in this family. The BRCT domain of XRCC1 forms a homodimer in the crystal structure. This suggests that pairs of BRCT domains associate as homo- or heterodimers.
  • Changed! cd FHA 49aa 3e-05 in modified transcript

NCALD

  • rs.NCALD.F1 rs.NCALD.R1 114 201
  • NCBIGene 36.3 83988
  • Single exon skipping, size difference: 87
  • Exclusion in 5'UTR
  • Reference transcript: NM_001040625

  • cd EFh 62aa 1e-10 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • cd EFh 75aa 4e-10 in ref transcript
  • smart EFh 29aa 0.001 in ref transcript
    • EF-hand, calcium binding motif. EF-hands are calcium-binding motifs that occur at least in pairs. Links between disease states and genes encoding EF-hands, particularly the S100 subclass, are emerging. Each motif consists of a 12 residue loop flanked on either side by a 12 residue alpha-helix. EF-hands undergo a conformational change unpon binding calcium ions.
  • COG FRQ1 166aa 2e-26 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].

NCAM1

  • rs.NCAM1.F1 rs.NCAM1.R1 170 275
  • NCBIGene 36.3 4684
  • Multiple exon skipping, size difference: 105
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_181351

  • cd IGcam 91aa 5e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 87aa 8e-10 in ref transcript
  • cd IGcam 104aa 1e-09 in ref transcript
  • cd IGcam 73aa 2e-09 in ref transcript
  • cd FN3 98aa 5e-07 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 88aa 2e-04 in ref transcript
  • pfam I-set 83aa 1e-15 in ref transcript
    • Immunoglobulin I-set domain.
  • smart IG_like 91aa 2e-11 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IGc2 69aa 8e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart IG_like 85aa 2e-08 in ref transcript
  • pfam fn3 89aa 7e-08 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 83aa 1e-06 in ref transcript

NCBP2

  • rs.NCBP2.F1 rs.NCBP2.R1 227 386
  • NCBIGene 36.3 22916
  • Alternative 5-prime and 3-prime, size difference: 159
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_007362

  • Changed! cd RRM 75aa 5e-18 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! smart RRM_2 74aa 7e-18 in ref transcript
    • RNA recognition motif.
  • Changed! COG COG0724 83aa 1e-11 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

NCOA3

  • rs.NCOA3.F1 rs.NCOA3.R1 108 120
  • NCBIGene 36.3 8202
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_181659

  • cd PAS 66aa 2e-08 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd HLH 54aa 0.002 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • pfam Nuc_rec_co-act 48aa 2e-16 in ref transcript
    • Nuclear receptor coactivator. This region is found on eukaryotic nuclear receptor coactivators and forms an alpha helical structure.
  • pfam SRC-1 87aa 6e-15 in ref transcript
    • Steroid receptor coactivator. This domain is found in steroid/nuclear receptor coactivators and contains two LXXLL motifs that are involved in receptor binding. The family includes SRC-1/NcoA-1, NcoA-2/TIF2, pCIP/ACTR/GRIP-1/AIB1.
  • smart PAS 58aa 8e-10 in ref transcript
    • PAS domain. PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels ([1]; Ponting & Aravind, in press).
  • pfam DUF1518 58aa 4e-08 in ref transcript
    • Domain of unknown function (DUF1518). This domain, which is usually found tandemly repeated, is found various receptor co-activating proteins.
  • smart HLH 55aa 3e-06 in ref transcript
    • helix loop helix domain.

NCOR2

  • rs.NCOR2.F1 rs.NCOR2.R1 102 153
  • NCBIGene 36.3 9612
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006312

  • cd SANT 44aa 2e-07 in ref transcript
    • 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
  • cd SANT 43aa 0.003 in ref transcript
  • pfam Myb_DNA-binding 44aa 1e-09 in ref transcript
    • Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
  • pfam Myb_DNA-binding 43aa 6e-04 in ref transcript

NCOR2

  • rs.NCOR2.F2 rs.NCOR2.R2 173 311
  • NCBIGene 36.3 9612
  • Alternative 5-prime, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006312

  • cd SANT 44aa 2e-07 in ref transcript
    • 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
  • cd SANT 43aa 0.003 in ref transcript
  • pfam Myb_DNA-binding 44aa 1e-09 in ref transcript
    • Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.
  • pfam Myb_DNA-binding 43aa 6e-04 in ref transcript

NF1

  • rs.NF1.F1 rs.NF1.R1 212 275
  • NCBIGene 36.3 4763
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042492

  • Changed! cd RasGAP_Neurofibromin 347aa 0.0 in ref transcript
    • Neurofibromin is the product of the neurofibromatosis type 1 gene (NF1) and shares a region of similarity with catalytic domain of the mammalian p120RasGAP protein and an extended similarity with the Saccharomyces cerevisiae RasGAP proteins Ira1 and Ira2. Neurofibromin has been shown to function as a GAP (GTPase-activating protein) which inhibits low molecular weight G proteins such as Ras by stimulating their intrinsic GTPase activity. NF1 is a common genetic disorder characterized by various symptoms ranging from predisposition for the development of tumors to learning disability or mental retardation. Loss of neurofibromin activity can be correlated to the increase in Ras-GTP concentration in neurofibromas of NF1 of patients, supporting the notion that unregulated Ras signaling may contribute to their development.
  • cd SEC14 147aa 1e-09 in ref transcript
    • Sec14p-like lipid-binding domain. Found in secretory proteins, such as S. cerevisiae phosphatidylinositol transfer protein (Sec14p), and in lipid regulated proteins such as RhoGAPs, RhoGEFs and neurofibromin (NF1). SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
  • Changed! smart RasGAP 371aa 2e-95 in ref transcript
    • GTPase-activator protein for Ras-like GTPases. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position. Improved domain limits from structure.
  • smart SEC14 141aa 1e-17 in ref transcript
    • Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p). Domain in homologues of a S. cerevisiae phosphatidylinositol transfer protein (Sec14p) and in RhoGAPs, RhoGEFs and the RasGAP, neurofibromin (NF1). Lipid-binding domain. The SEC14 domain of Dbl is known to associate with G protein beta/gamma subunits.
  • Changed! COG IQG1 219aa 2e-07 in ref transcript
    • Protein involved in regulation of cellular morphogenesis/cytokinesis [Cell division and chromosome partitioning / Signal transduction mechanisms].
  • Changed! cd RasGAP_Neurofibromin 326aa 0.0 in modified transcript
  • Changed! smart RasGAP 350aa 1e-98 in modified transcript
  • Changed! COG IQG1 198aa 5e-08 in modified transcript

NF2

  • rs.NF2.F1 rs.NF2.R1 125 140
  • NCBIGene 36.3 4771
  • Alternative 5-prime, size difference: 15
  • Exclusion in 3'UTR
  • Reference transcript: NM_181832

  • cd FERM_C 93aa 1e-22 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • smart B41 204aa 3e-53 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam FERM_C 87aa 9e-33 in ref transcript
    • FERM C-terminal PH-like domain.
  • pfam ERM 218aa 2e-31 in ref transcript
    • Ezrin/radixin/moesin family. This family of proteins contain a band 4.1 domain (pfam00373), at their amino terminus. This family represents the rest of these proteins.

NFKB2

  • rs.NFKB2.F1 rs.NFKB2.R1 102 276
  • NCBIGene 36.3 4791
  • Alternative 5-prime, size difference: 174
  • Exclusion in 5'UTR
  • Reference transcript: NM_001077494

  • cd IPT_NFkappaB 100aa 9e-45 in ref transcript
    • IPT domain of the transcription factor NFkappaB and related transcription factors. NFkappaB is considered a central regulator of stress responses, activated by different stressful conditions, including physical stress, oxidative stress, and exposure to certain chemicals. NFkappaB blocking cell apoptosis in several cell types, gives it an important role in cell proliferation and differentiation.
  • cd ANK 138aa 1e-21 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • cd ANK 132aa 1e-18 in ref transcript
  • pfam RHD 181aa 4e-66 in ref transcript
    • Rel homology domain (RHD). Proteins containing the Rel homology domain (RHD) are eukaryotic transcription factors. The RHD is composed of two structural domains. This is the N-terminal domain that is similar to that found in P53. The C-terminal domain has an immunoglobulin-like fold (See pfam01833) that binds to DNA.
  • smart IPT 100aa 4e-13 in ref transcript
    • ig-like, plexins, transcription factors.
  • TIGR trp 177aa 3e-06 in ref transcript
    • after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
  • COG Arp 108aa 7e-11 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • COG Arp 167aa 8e-07 in ref transcript

NLRP1

  • rs.NLRP1.F1 rs.NLRP1.R1 101 113
  • NCBIGene 36.3 22861
  • Alternative 5-prime, size difference: 12
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_033004

  • cd LRR_RI 179aa 8e-30 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • cd AAA 95aa 0.009 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • pfam NACHT 170aa 5e-45 in ref transcript
    • NACHT domain. This NTPase domain is found in apoptosis proteins as well as those involved in MHC transcription activation. This family is closely related to pfam00931.
  • pfam CARD 82aa 2e-16 in ref transcript
    • Caspase recruitment domain. Motif contained in proteins involved in apoptotic signaling. Predicted to possess a DEATH (pfam00531) domain-like fold.
  • pfam PAAD_DAPIN 58aa 2e-08 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.
  • smart LRR_RI 28aa 0.002 in ref transcript
    • Leucine rich repeat, ribonuclease inhibitor type.
  • COG RNA1 142aa 0.004 in ref transcript
    • Ran GTPase-activating protein (RanGAP) involved in mRNA processing and transport [Signal transduction mechanisms / RNA processing and modification].

NOVA1

  • rs.NOVA1.F1 rs.NOVA1.R1 361 433
  • NCBIGene 36.3 4857
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002515

  • Changed! cd PCBP_like_KH 66aa 1e-12 in ref transcript
    • K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
  • cd PCBP_like_KH 67aa 3e-12 in ref transcript
  • cd KH-I 66aa 3e-09 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • pfam KH_1 64aa 2e-10 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • pfam KH_1 66aa 1e-09 in ref transcript
  • smart KH 70aa 3e-08 in ref transcript
    • K homology RNA-binding domain.
  • Changed! COG Pnp 73aa 0.007 in ref transcript
    • Polyribonucleotide nucleotidyltransferase (polynucleotide phosphorylase) [Translation, ribosomal structure and biogenesis].
  • Changed! cd PCBP_like_KH 66aa 1e-12 in modified transcript
  • Changed! TIGR arCOG04150 160aa 0.005 in modified transcript
    • This family of proteins is universal among the 41 archaeal genomes analyzed in and is not observed outside of the archaea. The proteins contain a single KH domain (pfam00013) which is likely to confer the ability to bind RNA.
  • Changed! COG COG1094 176aa 1e-04 in modified transcript
    • Predicted RNA-binding protein (contains KH domains) [General function prediction only].

NOXO1

  • rs.NOXO1.F1 rs.NOXO1.R1 100 115
  • NCBIGene 36.3 124056
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_172168

  • Changed! cd PX_NoxO1 125aa 3e-45 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Nox Organizing protein 1. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions such as cell signaling, vesicular trafficking, protein sorting, and lipid modification, among others. Nox Organizing protein 1 (NoxO1) is a critical regulator of enzyme kinetics of the nonphagocytic NADPH oxidase Nox1, which catalyzes the transfer of electrons from NADPH to molecular oxygen to form superoxide. Nox1 is expressed in colon, stomach, uterus, prostate, and vascular smooth muscle cells. NoxO1, a homolog of the p47phox subunit of phagocytic NADPH oxidase, is involved in targeting activator subunits (such as NoxA1) to Nox1. It is co-localized with Nox1 in the membranes of resting cells and directs the subcellular localization of Nox1. The PX domain is involved in targeting of proteins to PI-enriched membranes, and may also be involved in protein-protein interaction. The PX domain of NoxO1 preferentially binds phosphatidylinositol-3,5-bisphosphate [PI(3,5)P2], PI5P, and PI4P.
  • cd SH3 50aa 4e-06 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 54aa 2e-04 in ref transcript
  • smart SH3 45aa 2e-06 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! smart PX 101aa 2e-04 in ref transcript
    • PhoX homologous domain, present in p47phox and p40phox. Eukaryotic domain of unknown function present in phox proteins, PLD isoforms, a PI3K isoform.
  • smart SH3 33aa 0.001 in ref transcript
  • Changed! cd PX_NoxO1 120aa 4e-47 in modified transcript
  • Changed! smart PX 96aa 9e-06 in modified transcript

NPC1L1

  • rs.NPC1L1.F1 rs.NPC1L1.R1 298 379
  • NCBIGene 36.3 29881
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013389

  • Changed! TIGR 2A060601 1261aa 0.0 in ref transcript
    • The model describes Niemann-Pick C type protein in eukaryotes. The defective protein has been associated with Niemann-Pick disease which is described in humans as autosomal recessive lipidosis. It is characterized by the lysosomal accumulation of unestrified cholesterol. It is an integral membrane protein, which indicates that this protein is most likely involved in cholesterol transport or acts as some component of cholesterol homeostasis.
  • COG COG1033 119aa 5e-05 in ref transcript
    • Predicted exporters of the RND superfamily [General function prediction only].
  • Changed! TIGR 2A060601 1234aa 0.0 in modified transcript

NQO1

  • rs.NQO1.F1 rs.NQO1.R1 104 206
  • NCBIGene 36.3 1728
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000903

  • Changed! pfam Flavodoxin_2 213aa 5e-56 in ref transcript
    • Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones and protect cells against damage by free radicals and reactive oxygen species. This enzyme uses a FAD co-factor. The equation for this reaction is:- NAD(P)H + acceptor <=> NAD(P)(+) + reduced acceptor. This enzyme is also involved in the bioactivation of prodrugs used in chemotherapy. The family also includes acyl carrier protein phosphodiesterase EC:3.1.4.14. This enzyme converts holo-ACP to apo-ACP by hydrolytic cleavage of the phosphopantetheine residue from ACP. This family is related to pfam03358 and pfam00258.
  • Changed! COG MdaB 212aa 7e-32 in ref transcript
    • Putative NADPH-quinone reductase (modulator of drug activity B) [General function prediction only].
  • Changed! pfam Flavodoxin_2 179aa 2e-38 in modified transcript
  • Changed! COG MdaB 178aa 2e-21 in modified transcript

NQO1

  • rs.NQO1.F2 rs.NQO1.R2 110 224
  • NCBIGene 36.3 1728
  • Single exon skipping, size difference: 114
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000903

  • Changed! pfam Flavodoxin_2 213aa 5e-56 in ref transcript
    • Flavodoxin-like fold. This family consists of a domain with a flavodoxin-like fold. The family includes bacterial and eukaryotic NAD(P)H dehydrogenase (quinone) EC:1.6.99.2. These enzymes catalyse the NAD(P)H-dependent two-electron reductions of quinones and protect cells against damage by free radicals and reactive oxygen species. This enzyme uses a FAD co-factor. The equation for this reaction is:- NAD(P)H + acceptor <=> NAD(P)(+) + reduced acceptor. This enzyme is also involved in the bioactivation of prodrugs used in chemotherapy. The family also includes acyl carrier protein phosphodiesterase EC:3.1.4.14. This enzyme converts holo-ACP to apo-ACP by hydrolytic cleavage of the phosphopantetheine residue from ACP. This family is related to pfam03358 and pfam00258.
  • Changed! COG MdaB 212aa 7e-32 in ref transcript
    • Putative NADPH-quinone reductase (modulator of drug activity B) [General function prediction only].
  • Changed! pfam Flavodoxin_2 175aa 2e-34 in modified transcript
  • Changed! COG MdaB 174aa 2e-13 in modified transcript

NR1I3

  • rs.NR1I3.F1 rs.NR1I3.R1 195 335
  • NCBIGene 36.3 9970
  • Single exon skipping, size difference: 140
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001077482

  • cd NR_LBD_PXR_like 250aa 9e-91 in ref transcript
    • The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • Changed! pfam zf-C4 71aa 2e-29 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • pfam Hormone_recep 190aa 5e-23 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • Changed! pfam zf-C4 46aa 9e-13 in modified transcript

NR1I3

  • rs.NR1I3.F2 rs.NR1I3.R2 105 117
  • NCBIGene 36.3 9970
  • Alternative 3-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077482

  • Changed! cd NR_LBD_PXR_like 250aa 9e-91 in ref transcript
    • The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 71aa 2e-29 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • Changed! pfam Hormone_recep 190aa 5e-23 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • Changed! cd NR_LBD_PXR_like 246aa 2e-92 in modified transcript
  • Changed! pfam Hormone_recep 186aa 1e-22 in modified transcript

NR1I3

  • rs.NR1I3.F3 rs.NR1I3.R3 103 257
  • NCBIGene 36.3 9970
  • Alternative 3-prime, size difference: 154
  • Exclusion of the stop codon
  • Reference transcript: NM_001077482

  • Changed! cd NR_LBD_PXR_like 250aa 9e-91 in ref transcript
    • The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 71aa 2e-29 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • Changed! pfam Hormone_recep 190aa 5e-23 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • Changed! cd NR_LBD_PXR_like 210aa 1e-81 in modified transcript
  • Changed! pfam Hormone_recep 149aa 1e-23 in modified transcript

NR1I3

  • rs.NR1I3.F4 rs.NR1I3.R4 100 115
  • NCBIGene 36.3 9970
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077482

  • Changed! cd NR_LBD_PXR_like 250aa 9e-91 in ref transcript
    • The ligand binding domain of xenobiotic receptors:pregnane X receptor and constitutive androstane receptor. The ligand binding domain of xenobiotic receptors: This xenobiotic receptor family includes pregnane X receptor (PXR), constitutive androstane receptor (CAR) and other related nuclear receptors. They function as sensors of toxic byproducts of cell metabolism and of exogenous chemicals, to facilitate their elimination. The nuclear receptor pregnane X receptor (PXR) is a ligand-regulated transcription factor that responds to a diverse array of chemically distinct ligands, including many endogenous compounds and clinical drugs. The ligand binding domain of PXR shows remarkable flexibility to accommodate both large and small molecules. PXR functions as a heterodimer with retinoic X receptor-alpha (RXRa) and binds to a variety of response elements in the promoter regions of a diverse set of target genes involved in the metabolism, transport, and elimination of these molecules from the cell. Constitutive androstane receptor (CAR) is a closest mammalian relative of PXR, which has also been proposed to function as a xenosensor. CAR is activated by some of the same ligands as PXR and regulates a subset of common genes. The sequence homology and functional similarity suggests that the CAR gene arose from a duplication of an ancestral PXR gene. Like other nuclear receptors, xenobiotic receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 71aa 2e-29 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • Changed! pfam Hormone_recep 190aa 5e-23 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • Changed! cd NR_LBD_PXR_like 245aa 1e-92 in modified transcript
  • Changed! pfam Hormone_recep 185aa 9e-25 in modified transcript

NR6A1

  • rs.NR6A1.F1 rs.NR6A1.R1 106 118
  • NCBIGene 36.3 2649
  • Alternative 3-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_033334

  • cd NR_LBD_DHR4_like 212aa 1e-103 in ref transcript
    • The ligand binding domain of orphan nuclear receptor Ecdysone-induced receptor DHR4. The ligand binding domain of Ecdysone-induced receptor DHR4: Ecdysone-induced orphan receptor DHR4 is a member of the nuclear receptor family. DHR4 is expressed during the early Drosophila larval development and is induced by ecdysone. DHR4 coordinates growth and maturation in Drosophila by mediating endocrine response to the attainment of proper body size during larval development. Mutations in DHR4 result in shorter larval development which translates into smaller and lighter flies. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, DHR4 has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 76aa 3e-32 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • pfam Hormone_recep 165aa 1e-26 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.

NRD1

  • rs.NRD1.F1 rs.NRD1.R1 339 543
  • NCBIGene 36.3 4898
  • Multiple exon skipping, size difference: 204
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002525

  • Changed! pfam Peptidase_M16 127aa 4e-37 in ref transcript
    • Insulinase (Peptidase family M16).
  • pfam Peptidase_M16_C 176aa 3e-20 in ref transcript
    • Peptidase M16 inactive domain. Peptidase M16 consists of two structurally related domains. One is the active peptidase, whereas the other is inactive. The two domains hold the substrate like a clamp.
  • pfam Peptidase_M16_C 183aa 1e-08 in ref transcript
  • COG Ptr 879aa 1e-148 in ref transcript
    • Secreted/periplasmic Zn-dependent peptidases, insulinase-like [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam Peptidase_M16 139aa 3e-38 in modified transcript

NRG1

  • rs.NRG1.F1 rs.NRG1.R1 191 293
  • NCBIGene 36.3 3084
  • Multiple exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013956

  • cd IGcam 89aa 2e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam Neuregulin 396aa 1e-109 in ref transcript
    • Neuregulin family.
  • smart IG_like 87aa 1e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam EGF 32aa 0.006 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.

NRG1

  • rs.NRG1.F2 rs.NRG1.R2 251 393
  • NCBIGene 36.3 3084
  • Single exon skipping, size difference: 142
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_013956

  • cd IGcam 89aa 2e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! pfam Neuregulin 396aa 1e-109 in ref transcript
    • Neuregulin family.
  • smart IG_like 87aa 1e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • Changed! pfam EGF 32aa 0.006 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
  • Changed! pfam Neuregulin 190aa 2e-57 in modified transcript

NRG1

  • rs.NRG1.F3 rs.NRG1.R3 119 143
  • NCBIGene 36.3 3084
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013956

  • cd IGcam 89aa 2e-05 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam Neuregulin 396aa 1e-109 in ref transcript
    • Neuregulin family.
  • smart IG_like 87aa 1e-07 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam EGF 32aa 0.006 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.

NRG2

  • rs.NRG2.F1 rs.NRG2.R1 127 151
  • NCBIGene 36.3 9542
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013982

  • cd IGcam 88aa 2e-11 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam Neuregulin 321aa 1e-39 in ref transcript
    • Neuregulin family.
  • pfam I-set 90aa 3e-15 in ref transcript
    • Immunoglobulin I-set domain.

NRG2

  • rs.NRG2.F2 rs.NRG2.R2 110 134
  • NCBIGene 36.3 9542
  • Single exon skipping, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013983

  • cd IGcam 88aa 2e-11 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam Neuregulin 321aa 8e-40 in ref transcript
    • Neuregulin family.
  • pfam I-set 90aa 3e-15 in ref transcript
    • Immunoglobulin I-set domain.

NRP1

  • rs.NRP1.F1 rs.NRP1.R1 112 217
  • NCBIGene 36.3 8829
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003873

  • cd MAM 154aa 2e-45 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd FA58C 147aa 1e-36 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • cd CUB 114aa 5e-29 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 118aa 1e-26 in ref transcript
  • cd FA58C 148aa 2e-23 in ref transcript
  • smart MAM 160aa 2e-46 in ref transcript
    • Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
  • pfam CUB 116aa 5e-33 in ref transcript
    • CUB domain.
  • smart FA58C 151aa 9e-33 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • pfam CUB 112aa 2e-31 in ref transcript
  • pfam F5_F8_type_C 135aa 1e-18 in ref transcript
    • F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.

NRP2

  • rs.NRP2.F1 rs.NRP2.R1 100 115
  • NCBIGene 36.3 8828
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018534

  • cd FA58C 147aa 4e-39 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • cd MAM 148aa 3e-36 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd CUB 114aa 4e-31 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 117aa 3e-27 in ref transcript
  • cd FA58C 154aa 2e-24 in ref transcript
  • smart MAM 154aa 9e-40 in ref transcript
    • Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
  • smart FA58C 151aa 4e-34 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • pfam CUB 112aa 6e-31 in ref transcript
    • CUB domain.
  • pfam CUB 116aa 3e-29 in ref transcript
  • pfam F5_F8_type_C 141aa 3e-21 in ref transcript
    • F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.

NRP2

  • rs.NRP2.F2 rs.NRP2.R2 100 115
  • NCBIGene 36.3 8828
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201266

  • cd FA58C 147aa 2e-39 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain; Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • cd MAM 148aa 4e-36 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd CUB 114aa 5e-31 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 117aa 4e-27 in ref transcript
  • cd FA58C 154aa 8e-25 in ref transcript
  • smart MAM 154aa 8e-40 in ref transcript
    • Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
  • smart FA58C 151aa 2e-34 in ref transcript
    • Coagulation factor 5/8 C-terminal domain, discoidin domain. Cell surface-attached carbohydrate-binding domain, present in eukaryotes and assumed to have horizontally transferred to eubacterial genomes.
  • pfam CUB 112aa 7e-31 in ref transcript
    • CUB domain.
  • pfam CUB 116aa 4e-29 in ref transcript
  • pfam F5_F8_type_C 141aa 2e-21 in ref transcript
    • F5/8 type C domain. This domain is also known as the discoidin (DS) domain family.

NRXN3

  • rs.NRXN3.F1 rs.NRXN3.R1 143 233
  • NCBIGene 36.3 9369
  • Single exon skipping, size difference: 90
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_004796

  • cd LamG 154aa 2e-23 in ref transcript
    • Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
  • cd LamG 170aa 2e-22 in ref transcript
  • Changed! cd LamG 150aa 3e-18 in ref transcript
  • cd LamG 156aa 2e-12 in ref transcript
  • cd EGF_CA 34aa 3e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • smart LamG 150aa 5e-28 in ref transcript
    • Laminin G domain.
  • smart LamG 137aa 2e-26 in ref transcript
  • Changed! smart LamG 127aa 2e-20 in ref transcript
  • smart LamG 136aa 4e-19 in ref transcript
  • smart LamG 40aa 0.001 in ref transcript
  • pfam EGF 33aa 0.008 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
  • Changed! cd LamG 120aa 3e-15 in modified transcript
  • Changed! smart LamG 157aa 7e-17 in modified transcript

NSL1

  • rs.NSL1.F1 rs.NSL1.R1 327 412
  • NCBIGene 36.3 25936
  • Single exon skipping, size difference: 85
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_015471

  • Changed! pfam Mis14 145aa 5e-30 in ref transcript
    • Kinetochore protein Mis14 like. Mis14 is a kinetochore protein which is known to be recruited to kinetochores independently of CENP-A.
  • Changed! pfam Mis14 100aa 1e-19 in modified transcript

NTRK1

  • rs.NTRK1.F1 NTRK1.R1 100 118
  • NCBIGene 36.3 4914
  • Single exon skipping, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002529

  • cd PTKc_TrkA 280aa 1e-164 in ref transcript
    • Catalytic Domain of the Protein Tyrosine Kinase, Tropomyosin Related Kinase A. Protein Tyrosine Kinase (PTK) family; Tropomyosin related kinase A (TrkA); catalytic (c) domain. The PTKc family is part of a larger superfamily that includes the catalytic domains of other kinases such as protein serine/threonine kinases, RIO kinases, and phosphoinositide 3-kinase (PI3K). PTKs catalyze the transfer of the gamma-phosphoryl group from ATP to tyrosine (tyr) residues in protein substrates. TrkA is a member of the Trk subfamily of proteins, which are receptor tyr kinases (RTKs) containing an extracellular region with arrays of leucine-rich motifs flanked by two cysteine-rich clusters followed by two immunoglobulin-like domains, a transmembrane segment, and an intracellular catalytic domain. Binding of TrkA to its ligand, nerve growth factor (NGF), results in receptor oligomerization and activation of the catalytic domain. TrkA is expressed mainly in neural-crest-derived sensory and sympathetic neurons of the peripheral nervous system, and in basal forebrain cholinergic neurons of the central nervous system. It is critical for neuronal growth, differentiation and survival. Alternative TrkA splicing has been implicated as a pivotal regulator of neuroblastoma (NB) behavior. Normal TrkA expression is associated with better NB prognosis, while the hypoxia-regulated TrkAIII splice variant promotes NB pathogenesis and progression. Aberrant TrkA expression has also been demonstrated in non-neural tumors including prostate, breast, lung, and pancreatic cancers.
  • cd IGcam 71aa 0.002 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • pfam Pkinase_Tyr 272aa 1e-104 in ref transcript
    • Protein tyrosine kinase.
  • TIGR PCC 67aa 3e-05 in ref transcript
    • Note: this model is restricted to the amino half because a full-length model is incompatible with the HMM software package.
  • pfam I-set 85aa 4e-04 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 70aa 0.002 in ref transcript
  • COG SPS1 265aa 1e-23 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

NUMB

  • rs.NUMB.F1 rs.NUMB.R1 104 137
  • NCBIGene 36.3 8650
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001005743

  • Changed! cd Numb 149aa 2e-72 in ref transcript
    • Numb Phosphotyrosine-binding (PTB) domain. Numb is a membrane associated adaptor protein, which is a determinant of asymmetric cell division. Numb has an N-terminal PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
  • Changed! pfam PID 134aa 7e-34 in ref transcript
    • Phosphotyrosine interaction domain (PTB/PID).
  • pfam NumbF 82aa 8e-32 in ref transcript
    • NUMB domain. This presumed domain is found in the Numb family of proteins adjacent to the PTB domain.
  • Changed! cd Numb 138aa 5e-75 in modified transcript
  • Changed! pfam PID 123aa 3e-33 in modified transcript

NUP50

  • rs.NUP50.F1 rs.NUP50.R1 123 313
  • NCBIGene 36.3 10762
  • Single exon skipping, size difference: 190
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_007172

  • Changed! cd RanBD 112aa 2e-16 in ref transcript
    • Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
  • Changed! pfam NUP50 303aa 5e-71 in ref transcript
    • NUP50 (Nucleoporin 50 kDa). Nucleoporin 50 kDa (NUP50) acts as a cofactor for the importin-alpha:importin-beta heterodimer, which in turn allows for transportation of many nuclear-targeted proteins through nuclear pore complexes. The C terminus of NUP50 binds importin-beta through RAN-GTP, the N terminus binds the C terminus of importin-alpha, while a central domain binds importin-beta. NUP50:importin-alpha:importin-beta then binds cargo and can stimulate nuclear import. The N-terminal domain of NUP50 is also able to actively displace nuclear localisation signals from importin-alpha.
  • Changed! pfam Ran_BP1 110aa 3e-05 in ref transcript
    • RanBP1 domain.
  • Changed! pfam NUP50 23aa 1e-07 in modified transcript

NUP98

  • rs.NUP98.F1 rs.NUP98.R1 143 365
  • NCBIGene 36.3 4928
  • Single exon skipping, size difference: 222
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016320

  • pfam Nucleoporin2 156aa 2e-63 in ref transcript
    • Nucleoporin autopeptidase.

NUP98

  • rs.NUP98.F2 rs.NUP98.R2 256 307
  • NCBIGene 36.3 4928
  • Alternative 5-prime, size difference: 51
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_016320

  • pfam Nucleoporin2 156aa 2e-63 in ref transcript
    • Nucleoporin autopeptidase.

NUPR1

  • rs.NUPR1.F1 rs.NUPR1.R1 231 285
  • NCBIGene 36.3 26471
  • Alternative 3-prime, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042483

  • pfam Phospho_p8 29aa 8e-10 in ref transcript
    • DNA-binding nuclear phosphoprotein p8. P8 is a short 80-82 amino acid protein that is conserved from nematodes to humans. It carries at least one protein kinase C domain suggesting a possible role in signal transduction and it is thought to be a phosphoprotein, but the sites of phosphorylation and the kinases involved remain to be determined.

NYD-SP21

  • rs.NYD-SP21.F1 rs.NYD-SP21.R1 248 299
  • NCBIGene 36.3 84689
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032597

  • Changed! pfam CD20 127aa 1e-33 in ref transcript
    • CD20/IgE Fc receptor beta subunit family. This family includes the CD20 protein and the beta subunit of the high affinity receptor for IgE Fc. The high affinity receptor for IgE is a tetrameric structure consisting of a single IgE-binding alpha subunit, a single beta subunit, and two disulfide-linked gamma subunits. The alpha subunit of Fc epsilon RI and most Fc receptors are homologous members of the Ig superfamily. By contrast, the beta and gamma subunits from Fc epsilon RI are not homologous to the Ig superfamily. Both molecules have four putative transmembrane segments and a probably topology where both amino- and carboxy termini protrude into the cytoplasm.
  • Changed! pfam CD20 110aa 1e-24 in modified transcript

OCIAD1

  • rs.OCIAD1.F1 rs.OCIAD1.R1 153 400
  • NCBIGene 36.3 54940
  • Alternative 5-prime, size difference: 247
  • Exclusion in 5'UTR
  • Reference transcript: NM_001079839

  • pfam OCIA 108aa 4e-48 in ref transcript
    • Ovarian carcinoma immunoreactive antigen (OCIA). This family consists of several ovarian carcinoma immunoreactive antigen (OCIA) and related eukaryotic sequences. The function of this family is unknown.

OGG1

  • rs.OGG1.F1 rs.OGG1.R1 98 145
  • NCBIGene 36.3 4968
  • Alternative 3-prime, size difference: 47
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_016828

  • Changed! cd ENDO3c 179aa 3e-18 in ref transcript
    • endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases.
  • TIGR ogg 306aa 1e-137 in ref transcript
    • All proteins in this family for which functions are known are 8-oxo-guanaine DNA glycosylases that function in base excision repair. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). This family is distantly realted to the Nth-MutY superfamily.
  • COG AlkA 229aa 3e-25 in ref transcript
    • 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [DNA replication, recombination, and repair].
  • Changed! cd ENDO3c 179aa 2e-17 in modified transcript

OGN

  • rs.OGN.F1 rs.OGN.R1 122 404
  • NCBIGene 36.3 4969
  • Alternative 3-prime, size difference: 282
  • Inclusion in 5'UTR
  • Reference transcript: NM_014057

OPA1

  • rs.OPA1.F1 rs.OPA1.R1 100 208
  • NCBIGene 36.3 4976
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130837

  • pfam Dynamin_N 179aa 4e-38 in ref transcript
    • Dynamin family.

OPA1

  • rs.OPA1.F2 rs.OPA1.R2 105 216
  • NCBIGene 36.3 4976
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_130837

  • pfam Dynamin_N 179aa 4e-38 in ref transcript
    • Dynamin family.

OPTN

  • rs.OPTN.F1 rs.OPTN.R1 101 116
  • NCBIGene 36.3 10133
  • Alternative 3-prime, size difference: 15
  • Inclusion in 5'UTR
  • Reference transcript: NM_001008213

  • pfam SMC_N 178aa 0.001 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • PRK PRK05771 248aa 0.010 in ref transcript
    • V-type ATP synthase subunit I; Validated.

OSBPL2

  • rs.OSBPL2.F1 rs.OSBPL2.R1 121 157
  • NCBIGene 36.3 9885
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_144498

  • pfam Oxysterol_BP 351aa 4e-64 in ref transcript
    • Oxysterol-binding protein.

OSBPL3

  • rs.OSBPL3.F1 rs.OSBPL3.R1 148 256
  • NCBIGene 36.3 26031
  • Single exon skipping, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015550

  • cd PH_oxysterol_bp 91aa 2e-18 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam Oxysterol_BP 300aa 2e-56 in ref transcript
    • Oxysterol-binding protein.
  • smart PH 91aa 3e-09 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

OSBPL9

  • rs.OSBPL9.F1 rs.OSBPL9.R1 102 171
  • NCBIGene 36.3 114883
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_148909

  • cd PH_oxysterol_bp 93aa 1e-28 in ref transcript
    • Oxysterol binding protein (OSBP) Pleckstrin homology (PH) domain. Oxysterol binding proteins are a multigene family that is conserved in yeast, flies, worms, mammals and plants. They all contain a C-terminal oxysterol binding domain, and most contain an N-terminal PH domain. OSBP PH domains bind to membrane phosphoinositides and thus likely play an important role in intracellular targeting. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • pfam Oxysterol_BP 342aa 2e-49 in ref transcript
    • Oxysterol-binding protein.
  • smart PH 95aa 6e-13 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.

OSCAR

  • rs.OSCAR.F1 rs.OSCAR.R1 105 117
  • NCBIGene 36.3 126014
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206818

PACRG

  • rs.PACRG.F1 rs.PACRG.R1 276 393
  • NCBIGene 36.3 135138
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152410

  • Changed! pfam ParcG 223aa 2e-83 in ref transcript
    • Parkin co-regulated protein. This family of proteins is transcribed anti-sense along the DNA to the Parkin gene product and the two appear to be transcribed under the same promoter. The protein has predicted alpha-helical and beta-sheet domains which suggest its function is in the ubiquitin/proteasome system. Mutations in parkin are the genetic cause of early-onset and autosomal recessive juvenile parkinsonism.
  • Changed! pfam ParcG 184aa 9e-89 in modified transcript

PACS2

  • rs.PACS2.F1 rs.PACS2.R1 102 114
  • NCBIGene 36.3 23241
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100913

  • pfam Pacs-1 213aa 3e-93 in ref transcript
    • PACS-1 cytosolic sorting protein. PACS-1 is a cytosolic sorting protein that directs the localisation of membrane proteins in the trans-Golgi network (TGN)/endosomal system. PACS-1 connects the clathrin adaptor AP-1 to acidic cluster sorting motifs contained in the cytoplasmic domain of cargo proteins such as furin, the cation-independent mannose-6-phosphate receptor and in viral proteins such as human immunodeficiency virus type 1 Nef.
  • pfam Pacs-1 149aa 3e-61 in ref transcript

PAICS

  • rs.PAICS.F1 rs.PAICS.R1 110 131
  • NCBIGene 36.3 10606
  • Single exon skipping, size difference: 21
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_006452

  • Changed! cd SAICAR_synt_Ade5 252aa 1e-142 in ref transcript
    • Ade5_like 5-aminoimidazole-4-(N-succinylcarboxamide) ribonucleotide (SAICAR) synthase. Eukaryotic group of SAICAR synthetases represented by the Drosophila melanogaster, N-terminal, SAICAR synthetase domain of bifunctional Ade5. The Ade5 gene product (CAIR-SAICARs) catalyzes the sixth and seventh steps of the de novo biosynthesis of purine nucleotides (also reported as seventh and eighth steps). SAICAR synthetase converts 5-aminoimidazole-4-carboxyribonucleotide (CAIR), ATP, and L-aspartate into 5-aminoimidazole-4-(N-succinylcarboxamide) ribonucleotide (SAICAR), ADP, and phosphate.
  • Changed! pfam SAICAR_synt 245aa 8e-92 in ref transcript
    • SAICAR synthetase. Also known as Phosphoribosylaminoimidazole-succinocarboxamide synthase.
  • TIGR purE 154aa 1e-57 in ref transcript
    • Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. This model represents PurK, an N5-CAIR mutase.
  • Changed! PRK PRK09362 252aa 5e-54 in ref transcript
    • phosphoribosylaminoimidazole-succinocarboxamide synthase; Reviewed.
  • COG PurE 138aa 3e-19 in ref transcript
    • Phosphoribosylcarboxyaminoimidazole (NCAIR) mutase [Nucleotide transport and metabolism].
  • Changed! cd SAICAR_synt_Ade5 259aa 1e-139 in modified transcript
  • Changed! pfam SAICAR_synt 252aa 5e-89 in modified transcript
  • Changed! PRK PRK09362 259aa 1e-50 in modified transcript

PAK4

  • rs.PAK4.F1 rs.PAK4.R1 91 550
  • NCBIGene 36.3 10298
  • Single exon skipping, size difference: 459
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005884

  • cd STKc_PAK_II 285aa 1e-173 in ref transcript
    • Serine/threonine kinases (STKs), p21-activated kinase (PAK) subfamily, Group II, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The PAK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PAKs are Rho family GTPase-regulated kinases that serve as important mediators in the function of Cdc42 (cell division cycle 42) and Rac. PAKs from higher eukaryotes are classified into two groups (I and II), according to their biochemical and structural features. Group II PAKs, also called non-conventional PAKs, include PAK4, PAK5, and PAK6. Group II PAKs contain PBD (p21-binding domain) and catalytic domains, but lack other motifs found in group I PAKs, such as an AID (autoinhibitory domain) and SH3 binding sites. Since group II PAKs do not contain an obvious AID, they may be regulated differently from group I PAKs. While group I PAKs interact with the SH3 containing proteins Nck, Grb2 and PIX, no such binding has been demonstrated for group II PAKs. Some known substrates of group II PAKs are also substrates of group I PAKs such as Raf, BAD, LIMK and GEFH1. Unique group II substrates include MARK/Par-1 and PDZ-RhoGEF. Group II PAKs play important roles in filopodia formation, neuron extension, cytoskeletal organization, and cell survival.
  • cd CRIB_PAK_like 39aa 4e-11 in ref transcript
    • PAK (p21 activated kinase) Binding Domain (PBD), binds Cdc42p- and/or Rho-like small GTPases; also known as the Cdc42/Rac interactive binding (CRIB) motif; has been shown to inhibit transcriptional activation and cell transformation mediated by the Ras-Rac pathway. This subgroup of CRIB/PBD-domains is found N-terminal of Serine/Threonine kinase domains in PAK and PAK-like proteins.
  • smart S_TKc 242aa 5e-76 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam PBD 55aa 1e-11 in ref transcript
    • P21-Rho-binding domain. Small domains that bind Cdc42p- and/or Rho-like small GTPases. Also known as the Cdc42/Rac interactive binding (CRIB).
  • COG SPS1 248aa 2e-35 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].

PAP2D

  • rs.PAP2D.F1 rs.PAP2D.R1 113 128
  • NCBIGene 36.3 163404
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001037317

  • cd PAP2_wunen 150aa 4e-52 in ref transcript
    • PAP2, wunen subfamily. Most likely a family of membrane associated phosphatidic acid phosphatases. Wunen is a drosophila protein expressed in the central nervous system, which provides repellent activity towards primordial germ cells (PGCs), controls the survival of PGCs and is essential in the migration process of these cells towards the somatic gonadal precursors.
  • smart acidPPc 91aa 8e-11 in ref transcript
    • Acid phosphatase homologues.

PAPSS2

  • rs.PAPSS2.F1 rs.PAPSS2.R1 100 115
  • NCBIGene 36.3 9060
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001015880

  • Changed! cd ATPS 369aa 1e-140 in ref transcript
    • ATP-sulfurylase (ATPS), also known as sulfate adenylate transferase, catalyzes the transfer of an adenylyl group from ATP to sulfate, forming adenosine 5'-phosphosulfate (APS). This reaction is generally accompanied by a further reaction, catalyzed by APS kinase, in which APS is phosphorylated to yield 3'-phospho-APS (PAPS). In some organisms the APS kinase is a separate protein, while in others it is incorporated with ATP sulfurylase in a bifunctional enzyme that catalyzes both reactions. In bifunctional proteins, the domain that performs the kinase activity can be attached at the N-terminal end of the sulfurylase unit or at the C-terminal end, depending on the organism. While the reaction is ubiquitous among organisms, the physiological role of the reaction varies. In some organisms it is used to generate APS from sulfate and ATP, while in others it proceeds in the opposite direction to generate ATP from APS and pyrophosphate. ATP sulfurylase can be a monomer, a homodimer, or a homo-oligomer, depending on the organism. ATPS belongs to a large superfamily of nucleotidyltransferases that includes pantothenate synthetase (PanC), phosphopantetheine adenylyltransferase (PPAT), and the amino-acyl tRNA synthetases. The enzymes of this family are structurally similar and share a dinucleotide-binding domain.
  • cd APSK 151aa 9e-73 in ref transcript
    • Adenosine 5'-phosphosulfate kinase (APSK) catalyzes the phosphorylation of adenosine 5'-phosphosulfate to form 3'-phosphoadenosine 5'-phosphosulfate (PAPS). The end-product PAPS is a biologically "activated" sulfate form important for the assimilation of inorganic sulfate.
  • Changed! pfam ATP-sulfurylase 324aa 1e-120 in ref transcript
    • ATP-sulfurylase. This family consists of ATP-sulfurylase or sulfate adenylyltransferase EC:2.7.7.4 some of which are part of a bifunctional polypeptide chain associated with adenosyl phosphosulphate (APS) kinase pfam01583. Both enzymes are required for PAPS (phosphoadenosine-phosphosulfate) synthesis from inorganic sulphate. ATP sulfurylase catalyses the synthesis of adenosine-phosphosulfate APS from ATP and inorganic sulphate.
  • pfam APS_kinase 159aa 4e-78 in ref transcript
    • Adenylylsulphate kinase. Enzyme that catalyses the phosphorylation of adenylylsulphate to 3'-phosphoadenylylsulfate. This domain contains an ATP binding P-loop motif.
  • PRK PRK05506 201aa 4e-73 in ref transcript
    • bifunctional sulfate adenylyltransferase subunit 1/adenylylsulfate kinase protein; Provisional.
  • Changed! PRK sat 389aa 4e-56 in ref transcript
    • sulfate adenylyltransferase; Reviewed.
  • Changed! cd ATPS 364aa 1e-140 in modified transcript
  • Changed! pfam ATP-sulfurylase 322aa 1e-119 in modified transcript
  • Changed! PRK sat 384aa 1e-54 in modified transcript

PARK2

  • rs.PARK2.F1 rs.PARK2.R1 183 267
  • NCBIGene 36.3 5071
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004562

  • cd parkin_N 70aa 9e-31 in ref transcript
    • parkin_N parkin protein is a RING-type E3 ubiquitin ligase with an amino-terminal ubiquitin-like (Ubl) domain and an RBR signature consisting of two RING finger domains separated by an IBR/DRIL domain. Naturally occurring mutations in parkin are thought to cause the disease AR_JP (autosomal-recessive juvenile parkinsonism). Parkin binds the Rpn10 subunit of 26S proteasomes through its Ubl domain.
  • pfam ubiquitin 61aa 2e-15 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.
  • pfam IBR 63aa 2e-06 in ref transcript
    • IBR domain. The IBR (In Between Ring fingers) domain is often found to occur between pairs of ring fingers (pfam00097). This domain has also been called the C6HC domain and DRIL (for double RING finger linked) domain. Proteins that contain two Ring fingers and an IBR domain (these proteins are also termed RBR family proteins) are thought to exist in all eukaryotic organisms. RBR family members play roles in protein quality control and can indirectly regulate transcription. Evidence suggests that RBR proteins are often parts of cullin-containing ubiquitin ligase complexes. The ubiquitin ligase Parkin is an RBR family protein whose mutations are involved in forms of familial Parkinson's disease.
  • PTZ PTZ00044 54aa 7e-06 in ref transcript
    • ubiquitin; Provisional.

PARP2

  • rs.PARP2.F1 rs.PARP2.R1 258 297
  • NCBIGene 36.3 10038
  • Alternative 5-prime, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005484

  • cd parp_like 344aa 1e-129 in ref transcript
    • Poly(ADP-ribose) polymerase (parp) catalytic domain catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active. Poly(ADP-ribose)-like polymerases (PARPS 1-3, VPARP, tankyrase) catalyze the addition of up to 100 ADP_ribose units from NAD+. PARPs 1 and 2 are localized in the nucleaus, bind DNA, and are activated by DNA damage. VPARP is part of the vault ribonucleoprotein complex. Tankyrases regulates telomere length through interactions with telomere repeat binding factor 1.
  • pfam PARP 213aa 1e-80 in ref transcript
    • Poly(ADP-ribose) polymerase catalytic domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
  • pfam PARP_reg 130aa 5e-30 in ref transcript
    • Poly(ADP-ribose) polymerase, regulatory domain. Poly(ADP-ribose) polymerase catalyses the covalent attachment of ADP-ribose units from NAD+ to itself and to a limited number of other DNA binding proteins, which decreases their affinity for DNA. Poly(ADP-ribose) polymerase is a regulatory component induced by DNA damage. The carboxyl-terminal region is the most highly conserved region of the protein. Experiments have shown that a carboxyl 40 kDa fragment is still catalytically active.
  • smart WGR 86aa 3e-25 in ref transcript
    • Proposed nucleic acid binding domain. This domain is named after its most conserved central motif. It is found in a variety of polyA polymerases as well as in molybdate metabolism regulators (e.g. in E.coli) and other proteins of unknown function. The domain is found in isolation in some proteins and is between 70 and 80 residues in length. It is proposed that it may be a nucleic acid binding domain.

PAX3

  • rs.PAX3.F1 rs.PAX3.R1 110 133
  • NCBIGene 36.3 5077
  • Alternative 5-prime and 3-prime, size difference: 23
  • Inclusion in the protein causing a new stop codon, Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_181459

  • cd PAX 128aa 2e-61 in ref transcript
    • Paired Box domain.
  • cd homeodomain 58aa 6e-18 in ref transcript
    • Homeodomain; DNA binding domains involved in the transcriptional regulation of key eukaryotic developmental processes; may bind to DNA as monomers or as homo- and/or heterodimers, in a sequence-specific manner.
  • pfam PAX 126aa 7e-69 in ref transcript
    • 'Paired box' domain.
  • pfam Homeobox 57aa 5e-24 in ref transcript
    • Homeobox domain.
  • COG COG5576 76aa 5e-10 in ref transcript
    • Homeodomain-containing transcription factor [Transcription].

PC

  • rs.PC.F1 rs.PC.R1 93 281
  • NCBIGene 36.3 5091
  • Single exon skipping, size difference: 188
  • Exclusion in 5'UTR
  • Reference transcript: NM_001040716

  • cd biotinyl_domain 67aa 7e-20 in ref transcript
    • The biotinyl-domain or biotin carboxyl carrier protein (BCCP) domain is present in all biotin-dependent enzymes, such as acetyl-CoA carboxylase, pyruvate carboxylase, propionyl-CoA carboxylase, methylcrotonyl-CoA carboxylase, geranyl-CoA carboxylase, oxaloacetate decarboxylase, methylmalonyl-CoA decarboxylase, transcarboxylase and urea amidolyase. This domain functions in transferring CO2 from one subsite to another, allowing carboxylation, decarboxylation, or transcarboxylation. During this process, biotin is covalently attached to a specific lysine.
  • TIGR pyruv_carbox 1140aa 0.0 in ref transcript
    • This enzyme plays a role in gluconeogensis but not glycolysis.
  • PRK PRK12999 1146aa 0.0 in ref transcript
    • pyruvate carboxylase; Reviewed.

PCBP2

  • rs.PCBP2.F1 rs.PCBP2.R1 113 206
  • NCBIGene 36.3 5094
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005016

  • cd PCBP_like_KH 65aa 5e-16 in ref transcript
    • K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
  • cd PCBP_like_KH 62aa 4e-14 in ref transcript
  • cd KH-I 63aa 7e-12 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • pfam KH_1 61aa 2e-11 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • pfam KH_1 63aa 3e-11 in ref transcript
  • pfam KH_1 64aa 6e-10 in ref transcript
  • COG NusA 122aa 6e-04 in ref transcript
    • Transcription elongation factor [Transcription].

PCBP2

  • rs.PCBP2.F2 rs.PCBP2.R2 97 109
  • NCBIGene 36.3 5094
  • Alternative 5-prime and 3-prime, size difference: 12
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005016

  • cd PCBP_like_KH 65aa 5e-16 in ref transcript
    • K homology RNA-binding domain, PCBP_like. Members of this group possess KH domains in a tandem arrangement. Most members, similar to the poly(C) binding proteins (PCBPs) and Nova, containing three KH domains, with the first and second domains, which are represented here, in tandem arrangement, followed by a large spacer region, with the third domain near the C-terminal end of the protein. The poly(C) binding proteins (PCBPs) can be divided into two groups, hnRNPs K/J and the alphaCPs, which share a triple KH domain configuration and poly(C) binding specificity. They play roles in mRNA stabilization, translational activation, and translational silencing. Nova-1 and Nova-2 are nuclear RNA-binding proteins that regulate splicing. This group also contains plant proteins that seem to have two tandem repeat arrrangements, like Hen4, a protein that plays a role in AGAMOUS (AG) pre-mRNA processing and important step in plant development. In general, KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA.
  • cd PCBP_like_KH 62aa 4e-14 in ref transcript
  • cd KH-I 63aa 7e-12 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • pfam KH_1 61aa 2e-11 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • pfam KH_1 63aa 3e-11 in ref transcript
  • pfam KH_1 64aa 6e-10 in ref transcript
  • COG NusA 122aa 6e-04 in ref transcript
    • Transcription elongation factor [Transcription].

PCMTD2

  • rs.PCMTD2.F1 rs.PCMTD2.R1 263 344
  • NCBIGene 36.3 55251
  • Alternative 3-prime, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018257

  • cd AdoMet_MTases 110aa 6e-04 in ref transcript
    • S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
  • Changed! pfam PCMT 209aa 4e-21 in ref transcript
    • Protein-L-isoaspartate(D-aspartate) O-methyltransferase (PCMT).
  • Changed! COG Pcm 197aa 1e-17 in ref transcript
    • Protein-L-isoaspartate carboxylmethyltransferase [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam PCMT 183aa 4e-20 in modified transcript
  • Changed! COG Pcm 172aa 1e-13 in modified transcript

PCSK6

  • rs.PCSK6.F1 rs.PCSK6.R1 171 210
  • NCBIGene 36.3 5046
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_138320

  • cd FU 51aa 6e-07 in ref transcript
    • Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
  • cd FU 43aa 2e-06 in ref transcript
  • cd FU 47aa 3e-06 in ref transcript
  • cd FU 53aa 1e-05 in ref transcript
  • pfam Peptidase_S8 301aa 2e-94 in ref transcript
    • Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold containing a 7-stranded parallel beta sheet, order 2314567.
  • pfam P_proprotein 91aa 4e-34 in ref transcript
    • Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic domain.
  • smart FU 48aa 2e-08 in ref transcript
    • Furin-like repeats.
  • smart FU 44aa 2e-08 in ref transcript
  • smart FU 44aa 1e-06 in ref transcript
  • pfam Furin-like 112aa 4e-06 in ref transcript
    • Furin-like cysteine rich region.
  • pfam VSP 115aa 5e-04 in ref transcript
    • Giardia variant-specific surface protein.
  • COG AprE 324aa 1e-20 in ref transcript
    • Subtilisin-like serine proteases [Posttranslational modification, protein turnover, chaperones].
  • COG COG4935 91aa 1e-15 in ref transcript
    • Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones].
  • COG NapH 83aa 0.009 in ref transcript
    • Polyferredoxin [Energy production and conversion].

PCSK6

  • rs.PCSK6.F2 rs.PCSK6.R2 227 273
  • NCBIGene 36.3 5046
  • Single exon skipping, size difference: 46
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_138324

  • pfam Peptidase_S8 301aa 3e-91 in ref transcript
    • Subtilase family. Subtilases are a family of serine proteases. They appear to have independently and convergently evolved an Asp/Ser/His catalytic triad, like that found in the trypsin serine proteases (see pfam00089). Structure is an alpha/beta fold containing a 7-stranded parallel beta sheet, order 2314567.
  • Changed! pfam P_proprotein 84aa 2e-28 in ref transcript
    • Proprotein convertase P-domain. A unique feature of the eukaryotic subtilisin-like proprotein convertases is the presence of an additional highly conserved sequence of approximately 150 residues (P domain) located immediately downstream of the catalytic domain.
  • COG AprE 324aa 4e-16 in ref transcript
    • Subtilisin-like serine proteases [Posttranslational modification, protein turnover, chaperones].
  • COG COG4935 69aa 4e-12 in ref transcript
    • Regulatory P domain of the subtilisin-like proprotein convertases and other proteases [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam P_proprotein 85aa 7e-29 in modified transcript

PCTK3

  • rs.PCTK3.F1 rs.PCTK3.R1 282 364
  • NCBIGene 36.3 5129
  • Alternative 5-prime, size difference: 82
  • Inclusion in 5'UTR
  • Reference transcript: NM_212503

  • cd S_TKc 282aa 3e-64 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 272aa 1e-68 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00024 284aa 1e-44 in ref transcript
    • cyclin-dependent protein kinase; Provisional.

PCTP

  • rs.PCTP.F1 rs.PCTP.R1 99 332
  • NCBIGene 36.3 58488
  • Alternative 5-prime, size difference: 233
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_021213

  • Changed! cd START 204aa 4e-30 in ref transcript
    • START(STeroidogenic Acute Regulatory (STAR) related lipid Transfer) Domain. These domains are 200-210 amino acid in length and occur in proteins involved in lipid transport (phosphatidylcholine) and metabolism, signal transduction, and transcriptional regulation. The most striking feature of the START domain structure is a predominantly hydrophobic tunnel extending nearly the entire protein and used to binding a single molecule of large lipophilic compounds, like cholesterol.
  • Changed! smart START 204aa 5e-40 in ref transcript
    • in StAR and phosphatidylcholine transfer protein. putative lipid-binding domain in StAR and phosphatidylcholine transfer protein.

PDE8A

  • rs.PDE8A.F1 rs.PDE8A.R1 120 258
  • NCBIGene 36.3 5151
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002605

  • Changed! cd PAS 99aa 3e-06 in ref transcript
    • PAS domain; PAS motifs appear in archaea, eubacteria and eukarya. Probably the most surprising identification of a PAS domain was that in EAG-like K+-channels. PAS domains have been found to bind ligands, and to act as sensors for light and oxygen in signal transduction.
  • cd HDc 195aa 4e-05 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • pfam PDEase_I 255aa 2e-55 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.
  • Changed! pfam PAS 106aa 2e-07 in ref transcript
    • PAS fold. The PAS fold corresponds to the structural domain that has previously been defined as PAS and PAC motifs. The PAS fold appears in archaea, eubacteria and eukarya.
  • pfam Response_reg 117aa 5e-07 in ref transcript
    • Response regulator receiver domain. This domain receives the signal from the sensor partner in bacterial two-component systems. It is usually found N-terminal to a DNA binding effector domain.
  • Changed! COG AtoS 113aa 1e-04 in ref transcript
    • FOG: PAS/PAC domain [Signal transduction mechanisms].

PDE9A

  • rs.PDE9A.F1 rs.PDE9A.R1 278 333
  • NCBIGene 36.3 5152
  • Single exon skipping, size difference: 55
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002606

  • Changed! cd HDc 174aa 6e-10 in ref transcript
    • Metal dependent phosphohydrolases with conserved 'HD' motif.
  • Changed! pfam PDEase_I 223aa 8e-56 in ref transcript
    • 3'5'-cyclic nucleotide phosphodiesterase.

PDS5A

  • rs.PDS5A.F1 rs.PDS5A.R1 100 471
  • NCBIGene 36.3 23244
  • Alternative 5-prime and 3-prime, size difference: 371
  • Exclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_001100399

PELI3

  • rs.PELI3.F1 rs.PELI3.R1 338 410
  • NCBIGene 36.3 246330
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145065

  • Changed! pfam Pellino 394aa 1e-175 in ref transcript
    • Pellino. Pellino is involved in Toll-like signalling pathways, and associates with the kinase domain of the Pelle Ser/Thr kinase.
  • Changed! pfam Pellino 416aa 0.0 in modified transcript

PFAAP5

  • rs.PFAAP5.F1 rs.PFAAP5.R1 102 152
  • NCBIGene 36.3 10443
  • Alternative 5-prime, size difference: 50
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_033111

PH-4

  • rs.PH-4.F1 rs.PH-4.R1 142 325
  • NCBIGene 36.3 54681
  • Alternative 5-prime and 3-prime, size difference: 183
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_177938

  • cd EFh 61aa 3e-05 in ref transcript
    • EF-hand, calcium binding motif; A diverse superfamily of calcium sensors and calcium signal modulators; most examples in this alignment model have 2 active canonical EF hands. Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. EF-hands tend to occur in pairs or higher copy numbers.
  • Changed! smart P4Hc 98aa 3e-12 in ref transcript
    • Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
  • Changed! smart P4Hc 85aa 4e-07 in ref transcript
  • COG FRQ1 127aa 8e-05 in ref transcript
    • Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only].
  • Changed! smart P4Hc 213aa 1e-23 in modified transcript

PHACTR2

  • rs.PHACTR2.F1 rs.PHACTR2.R1 310 550
  • NCBIGene 36.3 9749
  • Single exon skipping, size difference: 240
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100164

  • pfam RPEL 26aa 0.003 in ref transcript
    • RPEL repeat. The RPEL repeat is named after four conserved amino acids it contains. The function of the RPEL repeat is unknown however it might be a DNA binding repeat based on the observation that a member from Drosophila melanogaster contains a pfam02037 domain that is also implicated in DNA binding.
  • pfam RPEL 26aa 0.006 in ref transcript

PHF20L1

  • rs.PHF20L1.F1 rs.PHF20L1.R1 108 337
  • NCBIGene 36.3 51105
  • Alternative 3-prime, size difference: 229
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_016018

  • cd TUDOR 46aa 2e-06 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • smart TUDOR 54aa 3e-06 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
  • smart MBT 66aa 3e-05 in ref transcript
    • Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
  • Changed! pfam PHD 44aa 1e-04 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.

PHF21A

  • rs.PHF21A.F1 rs.PHF21A.R1 202 343
  • NCBIGene 36.3 51317
  • Mutually exclusive exon skipping, size difference: 141
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001101802

  • cd BAH_plant_2 39aa 3e-04 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • pfam PHD 44aa 1e-12 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • Changed! COG TNG2 163aa 7e-05 in ref transcript
    • Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics].

PHYHD1

  • rs.PHYHD1.F1 rs.PHYHD1.R1 124 146
  • NCBIGene 36.3 254295
  • Single exon skipping, size difference: 22
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_174933

  • Changed! pfam PhyH 204aa 5e-26 in ref transcript
    • Phytanoyl-CoA dioxygenase (PhyH). This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins, ectoine hydroxylases and a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues.
  • Changed! pfam PhyH 250aa 9e-47 in modified transcript
  • Changed! COG COG5285 258aa 1e-11 in modified transcript
    • Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism].

PHYHD1

  • rs.PHYHD1.F2 rs.PHYHD1.R2 124 187
  • NCBIGene 36.3 254295
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100876

  • Changed! pfam PhyH 250aa 9e-47 in ref transcript
    • Phytanoyl-CoA dioxygenase (PhyH). This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins, ectoine hydroxylases and a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues.
  • Changed! COG COG5285 258aa 1e-11 in ref transcript
    • Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism].
  • Changed! pfam PhyH 229aa 5e-44 in modified transcript
  • Changed! COG COG5285 237aa 1e-12 in modified transcript

PHYHIP

  • rs.PHYHIP.F1 rs.PHYHIP.R1 280 402
  • NCBIGene 36.3 9796
  • Single exon skipping, size difference: 122
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099335

  • cd FN3 108aa 7e-04 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.

PICALM

  • rs.PICALM.F1 rs.PICALM.R1 116 266
  • NCBIGene 36.3 8301
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007166

  • cd ANTH_AP180_CALM 117aa 1e-37 in ref transcript
    • ANTH domain family; composed of adaptor protein 180 (AP180), clathrin assembly lymphoid myeloid leukemia protein (CALM) and similar proteins. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. AP180 and CALM play important roles in clathrin-mediated endocytosis. AP180 is a brain-specific clathrin-binding protein which stimulates clathrin assembly during the recycling of synaptic vesicles. The ANTH domain is structurally similar to the VHS domain and is composed of a superhelix of eight alpha helices. ANTH domains bind both inositol phospholipids and proteins, and contribute to the nucleation and formation of clathrin coats on membranes. ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the trans-Golgi network, which suggests that the ANTH domain is a universal component of the machinery for clathrin-mediated membrane budding.
  • pfam ANTH 265aa 7e-80 in ref transcript
    • ANTH domain. AP180 is an endocytotic accessory proteins that has been implicated in the formation of clathrin-coated pits. The domain is involved in phosphatidylinositol 4,5-bisphosphate binding and is a universal adaptor for nucleation of clathrin coats.

PID1

  • rs.PID1.F1 rs.PID1.R1 123 334
  • NCBIGene 36.3 55022
  • Single exon skipping, size difference: 211
  • Exclusion of the protein initiation site
  • Reference transcript: NM_017933

  • cd CG8312 134aa 5e-10 in ref transcript
    • CG8312 Phosphotyrosine-binding (PTB) domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.

PILRB

  • rs.PILRB.F1 rs.PILRB.R1 146 355
  • NCBIGene 36.3 29990
  • Alternative 3-prime, size difference: 209
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_178238

  • Changed! pfam V-set 92aa 1e-05 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

PIP5K3

  • rs.PIP5K3.F1 rs.PIP5K3.R1 146 437
  • NCBIGene 36.3 200576
  • Multiple exon skipping, size difference: 291
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015040

  • cd Fab1_TCP 245aa 1e-104 in ref transcript
    • TCP-1 like domain of the eukaryotic phosphatidylinositol 3-phosphate (PtdIns3P) 5-kinase Fab1. Fab1p is important for vacuole size regulation, presumably by modulating PtdIns(3,5)P2 effector activity. In the human homolog p235/PIKfyve deletion of this domain leads to loss of catalytic activity. However no exact function this domain has been defined. In general, chaperonins are involved in productive folding of proteins.
  • cd PIPKc 314aa 5e-98 in ref transcript
    • Phosphatidylinositol phosphate kinases (PIPK) catalyze the phosphorylation of phosphatidylinositol phosphate on the fourth or fifth hydroxyl of the inositol ring, to form phosphatidylinositol bisphosphate. CD alignment includes type II phosphatidylinositol phosphate kinases (PIPKII-beta), type I andII PIPK (-alpha, -beta, and -gamma) kinases and related yeast Fab1p and Mss4p kinases. Signaling by phosphorylated species of phosphatidylinositol regulates secretion, vesicular trafficking, membrane translocation, cell adhesion, chemotaxis, DNA synthesis, and cell cycling. The catalytic core domains of PIPKs are structurally similar to PI3K, PI4K, and cAMP-dependent protein kinases (PKA), the dimerization region is a unique feature of the PIPKs.
  • cd DEP_PIKfyve 82aa 1e-38 in ref transcript
    • DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGEF (GDP/GTP exchange factor) PIKfyve-like proteins. PIKfyve contains N-terminal Fyve finger and DEP domains, a central chaperonin-like domain and a C-terminal PIPK (phosphatidylinositol phosphate kinase) domain. PIKfyve-like proteins are important phosphatidylinositol (3)-monophosphate (PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which plays a major role in multivesicular body (MVB) sorting and control of retrograde traffic from the vacuole back to the endosome and/or Golgi. PIKfyve itself has been shown to be play a role in regulating early-endosome-to-trans-Golgi network (TGN) retrograde trafficking.
  • Changed! cd FYVE 53aa 2e-15 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • smart PIPKc 238aa 5e-72 in ref transcript
    • Phosphatidylinositol phosphate kinases.
  • TIGR chap_CCT_gamma 224aa 8e-32 in ref transcript
    • Members of this family, all eukaryotic, are part of the group II chaperonin complex called CCT (chaperonin containing TCP-1) or TRiC. The archaeal equivalent group II chaperonin is often called the thermosome. Both are somewhat related to the group I chaperonin of bacterial, GroEL/GroES. This family consists exclusively of the CCT gamma chain (part of a paralogous family) from animals, plants, fungi, and other eukaryotes.
  • Changed! pfam FYVE 60aa 9e-20 in ref transcript
    • FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
  • smart DEP 73aa 3e-17 in ref transcript
    • Domain found in Dishevelled, Egl-10, and Pleckstrin. Domain of unknown function present in signalling proteins that contain PH, rasGEF, rhoGEF, rhoGAP, RGS, PDZ domains. DEP domain in Drosophila dishevelled is essential to rescue planar polarity defects and induce JNK signalling (Cell 94, 109-118).
  • smart PIPKc 56aa 6e-17 in ref transcript
  • COG MSS4 288aa 9e-54 in ref transcript
    • Phosphatidylinositol-4-phosphate 5-kinase [Signal transduction mechanisms].
  • COG GroL 219aa 2e-17 in ref transcript
    • Chaperonin GroEL (HSP60 family) [Posttranslational modification, protein turnover, chaperones].

PLAGL1

  • rs.PLAGL1.F1 rs.PLAGL1.R1 104 161
  • NCBIGene 36.3 5325
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 57
  • Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001080951

  • COG SFP1 81aa 0.004 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

PLAGL1

  • rs.PLAGL1.F2 rs.PLAGL1.R2 176 216
  • NCBIGene 36.3 5325
  • Single exon skipping, size difference: 40
  • Exclusion in 5'UTR
  • Reference transcript: NM_001080954

  • COG SFP1 81aa 0.004 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

PLAGL1

  • rs.PLAGL1.F3 rs.PLAGL1.R3 99 113
  • NCBIGene 36.3 5325
  • Alternative 3-prime, size difference: 14
  • Inclusion in 5'UTR
  • Reference transcript: NM_001080951

  • COG SFP1 81aa 0.004 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

PLAT

  • rs.PLAT.F1 rs.PLAT.R1 245 383
  • NCBIGene 36.3 5327
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000930

  • cd Tryp_SPc 249aa 6e-67 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • cd KR 86aa 4e-20 in ref transcript
    • Kringle domain; Kringle domains are believed to play a role in binding mediators, such as peptides, other proteins, membranes, or phospholipids. They are autonomous structural domains, found in a varying number of copies, in blood clotting and fibrinolytic proteins, some serine proteases and plasma proteins. Plasminogen-like kringles possess affinity for free lysine and lysine-containing peptides.
  • cd KR 85aa 5e-17 in ref transcript
  • Changed! cd FN1 43aa 3e-04 in ref transcript
    • Fibronectin type 1 domain, approximately 40 residue long with two conserved disulfide bridges. FN1 is one of three types of internal repeats which combine to form larger domains within fibronectin. Fibronectin, a plasma protein that binds cell surfaces and various compounds including collagen, fibrin, heparin, DNA, and actin, usually exists as a dimer in plasma and as an insoluble multimer in extracellular matrices. Dimers of nearly identical subunits are linked by a disulfide bond close to their C-terminus. FN1 domains also found in coagulation factor XII, HGF activator, and tissue-type plasminogen activator. In tissue plasminogen activator, FN1 domains may form functional fibrin-binding units with EGF-like domains C-terminal to FN1.
  • smart Tryp_SPc 247aa 1e-71 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • pfam Kringle 82aa 3e-26 in ref transcript
    • Kringle domain. Kringle domains have been found in plasminogen, hepatocyte growth factors, prothrombin, and apolipoprotein A. Structure is disulfide-rich, nearly all-beta.
  • pfam Kringle 82aa 4e-21 in ref transcript
  • Changed! smart FN1 43aa 4e-06 in ref transcript
    • Fibronectin type 1 domain. One of three types of internal repeat within the plasma protein, fibronectin. Found also in coagulation factor XII, HGF activator and tissue-type plasminogen activator. In t-PA and fibronectin, this domain type contributes to fibrin-binding.
  • COG COG5640 251aa 2e-23 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].

PLAUR

  • rs.PLAUR.F1 rs.PLAUR.R1 206 341
  • NCBIGene 36.3 5329
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002659

  • cd LU 82aa 8e-10 in ref transcript
    • Ly-6 antigen / uPA receptor -like domain; occurs singly in GPI-linked cell-surface glycoproteins (Ly-6 family,CD59, thymocyte B cell antigen, Sgp-2) or as three-fold repeated domain in urokinase-type plasminogen activator receptor. Topology of these domains is similar to that of snake venom neurotoxins.
  • Changed! cd LU 87aa 1e-06 in ref transcript
  • smart LU 81aa 8e-12 in ref transcript
    • Ly-6 antigen / uPA receptor -like domain. Three-fold repeated domain in urokinase-type plasminogen activator receptor; occurs singly in other GPI-linked cell-surface glycoproteins (Ly-6 family, CD59, thymocyte B cell antigen, Sgp-2). Topology of these domains is similar to that of snake venom neurotoxins.
  • Changed! smart LU 85aa 4e-11 in ref transcript
  • smart LU 77aa 3e-06 in ref transcript

PLCB4

  • rs.PLCB4.F1 rs.PLCB4.R1 122 159
  • NCBIGene 36.3 5332
  • Single exon skipping, size difference: 37
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000933

  • cd PLCc 142aa 3e-41 in ref transcript
    • Phospholipase C, catalytic domain; Phosphoinositide-specific phospholipases C catalyze hydrolysis of phosphatidylinositol-4,5-bisphosphate (PIP2) to D-myo-inositol-1,4,5-trisphosphate (1,4,5-IP3) and sn-1,2-diacylglycerol (DAG). Both products function as second messengers in eukaryotic signal transduction cascades. 1,4,5-IP3 triggers inflow of calcium from intracellular stores; the membrane resident product DAG controls cellular protein phosphorylation states by activating various protein kinase C isozymes. The enzyme comprises 2 regions (X and Y) connected via a linker which may contain inserted domains, X and Y together form a TIM barrel-like structure containing the active site residues.
  • cd PH_PLC 121aa 3e-26 in ref transcript
    • Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
  • cd PLCc 99aa 2e-17 in ref transcript
  • cd C2_2 117aa 7e-14 in ref transcript
    • Protein kinase C conserved region 2, subgroup 2; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing phospholipases C and D( PLC-1, PLD) and specific protein kinases C (PKC) subtypes, the C-terminal beta strand occupies the position of what is the N-terminal strand in subgroup 1.
  • smart PLCYc 115aa 7e-55 in ref transcript
    • Phospholipase C, catalytic domain (part); domain Y. Phosphoinositide-specific phospholipases C. These enzymes contain 2 regions (X and Y) which together form a TIM barrel-like structure containing the active site residues. Phospholipase C enzymes (PI-PLC) act as signal transducers that generate two second messengers, inositol-1,4,5-trisphosphate and diacylglycerol. The bacterial enzyme [6] appears to be a homologue of the mammalian PLCs.
  • pfam PI-PLC-X 139aa 4e-51 in ref transcript
    • Phosphatidylinositol-specific phospholipase C, X domain. This associates with pfam00387 to form a single structural unit.
  • pfam efhand_like 92aa 4e-23 in ref transcript
    • Phosphoinositide-specific phospholipase C, efhand-like. Members of this family are predominantly found in phosphoinositide-specific phospholipase C. They adopt a structure consisting of a core of four alpha helices, in an EF like fold, and are required for functioning of the enzyme.
  • pfam DUF1154 47aa 6e-13 in ref transcript
    • Protein of unknown function (DUF1154). This family represents a small conserved region of unknown function within eukaryotic phospholipase C (EC:3.1.4.3). All members also contain pfam00387 and pfam00388.
  • smart C2 98aa 1e-11 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • Changed! pfam SMC_N 256aa 7e-06 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! COG Smc 258aa 5e-05 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! pfam SMC_N 243aa 4e-07 in modified transcript
  • Changed! COG Smc 249aa 1e-04 in modified transcript

PNMA5

  • rs.PNMA5.F1 rs.PNMA5.R1 100 114
  • NCBIGene 36.3 114824
  • Alternative 5-prime, size difference: 14
  • Exclusion in 5'UTR
  • Reference transcript: NM_001103151

PNMAL1

  • rs.PNMAL1.F1 rs.PNMAL1.R1 106 311
  • NCBIGene 36.3 55228
  • Alternative 5-prime, size difference: 205
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_018215

PNPLA7

  • rs.PNPLA7.F1 rs.PNPLA7.R1 332 407
  • NCBIGene 36.3 375775
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098537

  • cd Pat_PNPLA6_PNPLA7 306aa 1e-176 in ref transcript
    • Patatin-like phospholipase domain containing protein 6 and protein 7. Patatin-like phospholipase domain containing protein 6 (PNPLA6) and protein 7 (PNPLA7) are 60% identical to each other. PNPLA6 is commonly known as Neuropathy Target Esterase (NTE). NTE has at least two functional domains: the N-terminal domain putatively regulatory domain and the C-terminal catalytic domain which shows esterase activity. NTE shows phospholipase activity for lysophosphatidylcholine (LPC) and phosphatidylcholine (PC). Exposure of NTE to organophosphates leads to organophosphate-induced delayed neurotoxicity (OPIDN). OPIDN is a progressive neurological condition that is characterized by weakness, paralysis, pain, and paresthesia. PNPLA7 is an insulin-regulated phospholipase that is homologous to Neuropathy Target Esterase (NTE or PNPLA6) and is also known as NTE-related esterase (NRE). Human NRE is predominantly expressed in prostate, white adipose, and pancreatic tissue. NRE hydrolyzes sn-1 esters in lysophosphatidylcholine and lysophosphatidic acid, but shows no lipase activity with substrates like triacylglycerols (TG), cholesteryl esters, retinyl esters (RE), phosphatidylcholine (PC), or monoacylglycerol (MG). This family includes PNPLA6 and PNPLA7 from Homo sapiens, YMF9 from Yeast, and Swiss Cheese protein (sws) from Drosophila melanogaster.
  • cd CAP_ED 109aa 1e-15 in ref transcript
    • effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of the structural integrity of the beta-barrel; CooA is a homodimeric transcription factor that belongs to CAP family; cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain; cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain; cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section; also found in vertebrate cyclic nucleotide-gated ion-channels.
  • cd CAP_ED 109aa 1e-12 in ref transcript
  • cd CAP_ED 122aa 6e-11 in ref transcript
  • pfam Patatin 151aa 3e-28 in ref transcript
    • Patatin-like phospholipase. This family consists of various patatin glycoproteins from plants. The patatin protein accounts for up to 40% of the total soluble protein in potato tubers. Patatin is a storage protein but it also has the enzymatic activity of lipid acyl hydrolase, catalysing the cleavage of fatty acids from membrane lipids. Members of this family have been found also in vertebrates.
  • pfam cNMP_binding 91aa 4e-14 in ref transcript
    • Cyclic nucleotide-binding domain.
  • pfam cNMP_binding 91aa 1e-13 in ref transcript
  • pfam cNMP_binding 98aa 4e-09 in ref transcript
  • COG RssA 274aa 1e-35 in ref transcript
    • Predicted esterase of the alpha-beta hydrolase superfamily [General function prediction only].
  • COG Crp 116aa 1e-11 in ref transcript
    • cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms].
  • COG Crp 130aa 3e-11 in ref transcript
  • COG Crp 143aa 2e-10 in ref transcript

PODXL

  • rs.PODXL.F1 rs.PODXL.R1 305 401
  • NCBIGene 36.3 5420
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001018111

  • pfam CD34_antigen 201aa 2e-62 in ref transcript
    • CD34/Podocalyxin family. This family consists of several mammalian CD34 antigen proteins. The CD34 antigen is a human leukocyte membrane protein expressed specifically by lymphohematopoietic progenitor cells. CD34 is a phosphoprotein. Activation of protein kinase C (PKC) has been found to enhance CD34 phosphorylation. This family contains several eukaryotic podocalyxin proteins. Podocalyxin is a major membrane protein of the glomerular epithelium and is thought to be involved in maintenance of the architecture of the foot processes and filtration slits characteristic of this unique epithelium by virtue of its high negative charge. Podocalyxin functions as an anti-adhesin that maintains an open filtration pathway between neighbouring foot processes in the glomerular epithelium by charge repulsion.

POLR3H

  • rs.POLR3H.F1 rs.POLR3H.R1 123 210
  • NCBIGene 36.3 171568
  • Single exon skipping, size difference: 87
  • Exclusion in 5'UTR
  • Reference transcript: NM_001018050

POMT1

  • rs.POMT1.F1 rs.POMT1.R1 214 366
  • NCBIGene 36.3 10585
  • Single exon skipping, size difference: 152
  • Exclusion of the protein initiation site
  • Reference transcript: NM_007171

  • Changed! pfam PMT 215aa 1e-58 in ref transcript
    • Dolichyl-phosphate-mannose-protein mannosyltransferase. This is a family of Dolichyl-phosphate-mannose-protein mannosyltransferase proteins EC:2.4.1.109. These proteins are responsible for O-linked glycosylation of proteins, they catalyse the reaction:- Dolichyl phosphate D-mannose + protein <=> dolichyl phosphate + O-D-mannosyl-protein. Also in this family is Drosophila rotated abdomen protein which is a putative mannosyltransferase. This family appears to be distantly related to pfam02516 (A Bateman pers. obs.).
  • pfam MIR 173aa 2e-10 in ref transcript
    • MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
  • Changed! COG PMT1 729aa 2e-97 in ref transcript
    • Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam PMT 180aa 7e-49 in modified transcript
  • Changed! COG PMT1 683aa 1e-88 in modified transcript

POMT1

  • rs.POMT1.F2 rs.POMT1.R2 157 223
  • NCBIGene 36.3 10585
  • Alternative 5-prime, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007171

  • Changed! pfam PMT 215aa 1e-58 in ref transcript
    • Dolichyl-phosphate-mannose-protein mannosyltransferase. This is a family of Dolichyl-phosphate-mannose-protein mannosyltransferase proteins EC:2.4.1.109. These proteins are responsible for O-linked glycosylation of proteins, they catalyse the reaction:- Dolichyl phosphate D-mannose + protein <=> dolichyl phosphate + O-D-mannosyl-protein. Also in this family is Drosophila rotated abdomen protein which is a putative mannosyltransferase. This family appears to be distantly related to pfam02516 (A Bateman pers. obs.).
  • pfam MIR 173aa 2e-10 in ref transcript
    • MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
  • Changed! COG PMT1 729aa 2e-97 in ref transcript
    • Dolichyl-phosphate-mannose--protein O-mannosyl transferase [Posttranslational modification, protein turnover, chaperones].
  • Changed! pfam PMT 215aa 1e-57 in modified transcript
  • Changed! COG PMT1 707aa 2e-98 in modified transcript

PON2

  • rs.PON2.F1 rs.PON2.R1 152 188
  • NCBIGene 36.3 5445
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000305

  • pfam Arylesterase 86aa 6e-38 in ref transcript
    • Arylesterase. This family consists of arylesterases (Also known as serum paraoxonase) EC:3.1.1.2. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity. Human arylesterase (PON1) is associated with HDL and may protect against LDL oxidation.
  • Changed! COG COG3386 192aa 4e-09 in ref transcript
    • Gluconolactonase [Carbohydrate transport and metabolism].
  • Changed! COG COG3386 138aa 1e-08 in modified transcript

POT1

  • rs.POT1.F1 rs.POT1.R1 256 353
  • NCBIGene 36.3 25913
  • Single exon skipping, size difference: 97
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015450

  • cd hPOT1_OB2 118aa 2e-39 in ref transcript
    • hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
  • cd hPOT1_OB1_like 133aa 2e-33 in ref transcript
    • hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
  • pfam Telo_bind 103aa 4e-14 in ref transcript
    • Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.

POT1

  • rs.POT1.F2 rs.POT1.R2 107 238
  • NCBIGene 36.3 25913
  • Single exon skipping, size difference: 131
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015450

  • Changed! cd hPOT1_OB2 118aa 2e-39 in ref transcript
    • hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
  • Changed! cd hPOT1_OB1_like 133aa 2e-33 in ref transcript
    • hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
  • Changed! pfam Telo_bind 103aa 4e-14 in ref transcript
    • Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.
  • Changed! cd hPOT1_OB1_like 37aa 4e-06 in modified transcript

POT1

  • rs.POT1.F3 rs.POT1.R3 102 191
  • NCBIGene 36.3 25913
  • Single exon skipping, size difference: 89
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015450

  • cd hPOT1_OB2 118aa 2e-39 in ref transcript
    • hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
  • cd hPOT1_OB1_like 133aa 2e-33 in ref transcript
    • hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
  • pfam Telo_bind 103aa 4e-14 in ref transcript
    • Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.

POT1

  • rs.POT1.F4 rs.POT1.R4 106 226
  • NCBIGene 36.3 25913
  • Single exon skipping, size difference: 120
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_015450

  • cd hPOT1_OB2 118aa 2e-39 in ref transcript
    • hPOT1_OB2: A subfamily of OB folds similar to the second OB fold (OB2) of human protection of telomeres 1 protein (hPOT1). POT1 proteins bind to the single-stranded (ss) 3-prime ends of the telomere. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. hPOT1 is implicated in telomere length regulation.
  • cd hPOT1_OB1_like 133aa 2e-33 in ref transcript
    • hPOT1_OB1_like: A subfamily of OB folds similar to the first OB fold (OB1) of human protection of telomeres 1 protein (hPOT1), the single OB fold of the N-terminal domain of Schizosaccharomyces pombe POT1 (SpPOT1), and the first OB fold of the N-terminal domain of the alpha subunit (OB1Nalpha) of Oxytricha nova telomere end binding protein (OnTEBP). POT1 proteins recognize single-stranded (ss) 3-prime ends of the telomere. A 3-prime ss overhang is conserved in ciliated protozoa, yeast, and mammals. SpPOT1 is essential for telomere maintenance. It binds specifically to the ss G-rich telomeric sequence (GGTTAC) of S. pombe. hPOT1 binds specifically to ss telomeric DNA repeats ending with the sequence GGTTAG. Deletion of the S. pombe pot1+ gene results in a rapid loss of telomere sequences, chromosome mis-segregation and chromosome circularization. hPOT1 is implicated in telomere length regulation. The hPOT1 monomer consists of two closely connected OB folds (OB1-OB2) which cooperate to bind telomeric ssDNA. OB1 makes more extensive contact with the ssDNA than OB2. OB2 protects the 3' end of the ssDNA. A second OB fold has not been predicted in S. pombe POT1. OnTEBP binds the extreme 3-prime end of telomeric DNA. It is heterodimeric and contains four OB folds - three in the alpha subunit (two in the N-terminal domain and one in the C-terminal domain) and one in the beta subunit. OB1Nalpha, together with the second OB fold of the N-terminal domain of OnTEBP alpha subunit and the beta subunit OB fold, forms a deep cleft that binds ssDNA.
  • pfam Telo_bind 103aa 4e-14 in ref transcript
    • Telomere-binding protein alpha subunit, central domain. The telomere-binding protein forms a heterodimer in ciliates consisting of an alpha and a beta subunit. This complex may function as a protective cap for the single-stranded telomeric overhang. Alpha subunit consists of 3 structural domains, all with the same beta-barrel OB fold.

PPARA

  • rs.PPARA.F1 rs.PPARA.R1 127 211
  • NCBIGene 36.3 5465
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_005036

  • cd NR_LBD_PPAR 267aa 1e-108 in ref transcript
    • The ligand binding domain of peroxisome proliferator-activated receptors. The ligand binding domain (LBD) of peroxisome proliferator-activated receptors (PPAR): Peroxisome proliferator-activated receptors (PPARs) are members of the nuclear receptor superfamily of ligand-activated transcription factors. PPARs play important roles in regulating cellular differentiation, development and lipid metabolism. Activated PPAR forms a heterodimer with the retinoid X receptor (RXR) that binds to the hormone response element located upstream of the peroxisome proliferator responsive genes and interacts with co-activators. There are three subtypes of peroxisome proliferator activated receptors, alpha, beta (or delta), and gamma, each with a distinct tissue distribution. Several essential fatty acids, oxidized lipids and prostaglandin J derivatives can bind and activate PPAR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, PPAR has a central well conserved DNA binding domain (DBD), a variable N-terminal regulatory domain, a flexible hinge a nd a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 75aa 1e-33 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • pfam Hormone_recep 182aa 5e-29 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.

PPARG

  • rs.PPARG.F1 rs.PPARG.R1 143 217
  • NCBIGene 36.3 5468
  • Single exon skipping, size difference: 74
  • Exclusion in 5'UTR
  • Reference transcript: NM_138712

  • cd NR_LBD_PPAR 268aa 1e-111 in ref transcript
    • The ligand binding domain of peroxisome proliferator-activated receptors. The ligand binding domain (LBD) of peroxisome proliferator-activated receptors (PPAR): Peroxisome proliferator-activated receptors (PPARs) are members of the nuclear receptor superfamily of ligand-activated transcription factors. PPARs play important roles in regulating cellular differentiation, development and lipid metabolism. Activated PPAR forms a heterodimer with the retinoid X receptor (RXR) that binds to the hormone response element located upstream of the peroxisome proliferator responsive genes and interacts with co-activators. There are three subtypes of peroxisome proliferator activated receptors, alpha, beta (or delta), and gamma, each with a distinct tissue distribution. Several essential fatty acids, oxidized lipids and prostaglandin J derivatives can bind and activate PPAR. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, PPAR has a central well conserved DNA binding domain (DBD), a variable N-terminal regulatory domain, a flexible hinge a nd a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 75aa 1e-35 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • pfam Hormone_recep 182aa 4e-27 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.

PPCS

  • rs.PPCS.F1 rs.PPCS.R1 100 545
  • NCBIGene 36.3 79717
  • Alternative 5-prime, size difference: 445
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024664

  • Changed! pfam DFP 244aa 2e-54 in ref transcript
    • DNA / pantothenate metabolism flavoprotein. The DNA/pantothenate metabolism flavoprotein (EC:4.1.1.36) affects synthesis of DNA, and pantothenate metabolism.
  • Changed! COG Dfp 85aa 4e-09 in ref transcript
    • Phosphopantothenoylcysteine synthetase/decarboxylase [Coenzyme metabolism].
  • Changed! PRK PRK05579 63aa 1e-06 in ref transcript
    • bifunctional phosphopantothenoylcysteine decarboxylase/phosphopantothenate synthase; Validated.

PPM1B

  • rs.PPM1B.F1 rs.PPM1B.R1 102 522
  • NCBIGene 36.3 5495
  • Alternative 3-prime, size difference: 420
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_002706

  • cd PP2Cc 273aa 1e-68 in ref transcript
    • Serine/threonine phosphatases, family 2C, catalytic domain; The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
  • smart PP2Cc 277aa 6e-76 in ref transcript
    • Serine/threonine phosphatases, family 2C, catalytic domain. The protein architecture and deduced catalytic mechanism of PP2C phosphatases are similar to the PP1, PP2A, PP2B family of protein Ser/Thr phosphatases, with which PP2C shares no sequence similarity.
  • pfam PP2C_C 81aa 3e-26 in ref transcript
    • Protein serine/threonine phosphatase 2C, C-terminal domain. Protein phosphatase 2C (PP2C) is involved in regulating cellular responses to stress in various eukaryotes. It consists of two domains: an N-terminal catalytic domain and a C-terminal domain characteristic of mammalian PP2Cs. This domain consists of three antiparallel alpha helices, one of which packs against two corresponding alpha-helices of the N-terminal domain. The C-terminal domain does not seem to play a role in catalysis, but it may provide protein substrate specificity due to the cleft that is created between it and the catalytic domain.
  • PTZ PTZ00224 297aa 2e-46 in ref transcript
    • protein phosphatase 2C; Provisional.

PPP2R5D

  • rs.PPP2R5D.F1 rs.PPP2R5D.R1 195 291
  • NCBIGene 36.3 5528
  • Alternative 5-prime and 3-prime, size difference: 96
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006245

  • Changed! pfam B56 408aa 0.0 in ref transcript
    • Protein phosphatase 2A regulatory B subunit (B56 family). Protein phosphatase 2A (PP2A) is a major intracellular protein phosphatase that regulates multiple aspects of cell growth and metabolism. The ability of this widely distributed heterotrimeric enzyme to act on a diverse array of substrates is largely controlled by the nature of its regulatory B subunit. There are multiple families of B subunits (See also pfam01240), this family is called the B56 family.
  • Changed! pfam B56 400aa 0.0 in modified transcript

PPP4R1

  • rs.PPP4R1.F1 rs.PPP4R1.R1 254 305
  • NCBIGene 36.3 9989
  • Alternative 3-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042388

PQBP1

  • rs.PQBP1.F1 rs.PQBP1.R1 99 110
  • NCBIGene 36.3 10084
  • Alternative 5-prime, size difference: 11
  • Exclusion in 5'UTR
  • Reference transcript: NM_001032383

  • pfam WW 31aa 0.004 in ref transcript
    • WW domain. The WW domain is a protein module with two highly conserved tryptophans that binds proline-rich peptide motifs in vitro.

PRDM10

  • rs.PRDM10.F1 rs.PRDM10.R1 103 115
  • NCBIGene 36.3 56980
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020228

PRDM7

  • rs.PRDM7.F1 rs.PRDM7.R1 100 383
  • NCBIGene 36.3 11105
  • Single exon skipping, size difference: 283
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001098173

  • smart KRAB 59aa 4e-08 in ref transcript
    • krueppel associated box.
  • pfam SSXRD 33aa 1e-05 in ref transcript
    • SSXRD motif. SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localise to this region.
  • Changed! smart SET 111aa 1e-04 in ref transcript
    • SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues.

PRELP

  • rs.PRELP.F1 rs.PRELP.R1 90 100
  • NCBIGene 36.3 5549
  • Alternative 5-prime, size difference: 10
  • Exclusion in 5'UTR
  • Reference transcript: NM_002725

  • cd LRR_RI 220aa 4e-04 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • smart LRRNT 36aa 2e-04 in ref transcript
    • Leucine rich repeat N-terminal domain.
  • COG COG4886 219aa 9e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

PREPL

  • rs.PREPL.F1 rs.PREPL.R1 133 331
  • NCBIGene 36.3 9581
  • Single exon skipping, size difference: 198
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006036

  • pfam Peptidase_S9 162aa 1e-20 in ref transcript
    • Prolyl oligopeptidase family.
  • Changed! pfam Peptidase_S9_N 320aa 2e-18 in ref transcript
    • Prolyl oligopeptidase, N-terminal beta-propeller domain. This unusual 7-stranded beta-propeller domain protects the catalytic triad of prolyl oligopeptidase (see pfam00326), excluding larger peptides and proteins from proteolysis in the cytosol.
  • Changed! COG PtrB 551aa 7e-83 in ref transcript
    • Protease II [Amino acid transport and metabolism].
  • Changed! pfam Peptidase_S9_N 255aa 4e-17 in modified transcript
  • Changed! COG PtrB 213aa 3e-49 in modified transcript
  • Changed! COG PtrB 273aa 3e-25 in modified transcript
  • Changed! COG DAP2 451aa 6e-07 in modified transcript
    • Dipeptidyl aminopeptidases/acylaminoacyl-peptidases [Amino acid transport and metabolism].

PREPL

  • rs.PREPL.F2 rs.PREPL.R2 162 348
  • NCBIGene 36.3 9581
  • Single exon skipping, size difference: 186
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006036

  • pfam Peptidase_S9 162aa 1e-20 in ref transcript
    • Prolyl oligopeptidase family.
  • Changed! pfam Peptidase_S9_N 320aa 2e-18 in ref transcript
    • Prolyl oligopeptidase, N-terminal beta-propeller domain. This unusual 7-stranded beta-propeller domain protects the catalytic triad of prolyl oligopeptidase (see pfam00326), excluding larger peptides and proteins from proteolysis in the cytosol.
  • Changed! COG PtrB 551aa 7e-83 in ref transcript
    • Protease II [Amino acid transport and metabolism].
  • Changed! pfam Peptidase_S9_N 212aa 5e-14 in modified transcript
  • Changed! COG PtrB 267aa 7e-52 in modified transcript
  • Changed! COG PtrB 212aa 6e-20 in modified transcript

PRF1

  • rs.PRF1.F1 rs.PRF1.R1 103 129
  • NCBIGene 36.3 5551
  • Alternative 3-prime, size difference: 26
  • Inclusion in 5'UTR
  • Reference transcript: NM_005041

  • cd C2 82aa 1e-11 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • smart MACPF 203aa 4e-44 in ref transcript
    • membrane-attack complex / perforin.
  • pfam C2 81aa 5e-15 in ref transcript
    • C2 domain.
  • COG COG5038 97aa 6e-04 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].

PRKCD

  • rs.PRKCD.F1 rs.PRKCD.R1 180 292
  • NCBIGene 36.3 5580
  • Single exon skipping, size difference: 112
  • Exclusion in 5'UTR
  • Reference transcript: NM_006254

  • cd STKc_nPKC_delta 316aa 1e-179 in ref transcript
    • STKc_nPKC_delta: Serine/Threonine Kinases (STKs), Novel Protein Kinase C (nPKC), delta isoform, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The nPKC subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. PKCs are classified into three groups (classical, atypical, and novel) depending on their mode of activation and the structural characteristics of their regulatory domain. nPKCs are calcium-independent, but require DAG (1,2-diacylglycerol) and phosphatidylserine (PS) for activity. There are four nPKC isoforms, delta, epsilon, eta, and theta. PKC-delta plays a role in cell cycle regulation and programmed cell death in many cell types. It slows down cell proliferation, inducing cell cycle arrest and enhancing cell differentiation. PKC-delta is also involved in the regulation of transcription as well as immune and inflammatory responses. It plays a central role in the genotoxic stress response that leads to DNA damaged-induced apoptosis.
  • cd C1 50aa 2e-13 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • cd C1 50aa 8e-10 in ref transcript
  • smart S_TKc 245aa 8e-73 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam C1_1 53aa 6e-18 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
  • smart S_TK_X 64aa 2e-17 in ref transcript
    • Extension to Ser/Thr-type protein kinases.
  • pfam C1_1 50aa 2e-11 in ref transcript
  • PTZ PTZ00263 323aa 1e-68 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

PRKDC

  • rs.PRKDC.F1 rs.PRKDC.R1 305 398
  • NCBIGene 36.3 5591
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006904

  • Changed! cd PIKKc_DNA-PK 297aa 1e-115 in ref transcript
    • DNA-dependent protein kinase (DNA-PK), catalytic domain; The DNA-PK catalytic domain subfamily is part of a larger superfamily that includes the catalytic domains of other kinases such as the typical serine/threonine/tyrosine protein kinases (PKs), aminoglycoside phosphotransferase, choline kinase, and RIO kinases. DNA-PK is a member of the phosphoinositide 3-kinase-related protein kinase (PIKK) subfamily. PIKKs have intrinsic serine/threonine kinase activity and are distinguished from other PKs by their unique catalytic domain, similar to that of lipid PI3K, and their large molecular weight (240-470 kDa). DNA-PK is comprised of a regulatory subunit, containing the Ku70/80 subunit, and a catalytic subunit, which contains a NUC194 domain of unknown function, a FAT (FRAP, ATM and TRRAP) domain, a catalytic domain, and a FATC domain at the C-terminus. It is part of a multi-component system involved in non-homologous end joining (NHEJ), a process of repairing double strand breaks (DSBs) by joining together two free DNA ends of little homology. DNA-PK functions as a molecular sensor for DNA damage that enhances the signal via phosphorylation of downstream targets. It may also act as a protein scaffold that aids the localization of DNA repair proteins to the site of DNA damage. DNA-PK also plays a role in the maintenance of telomeric stability and the prevention of chromosomal end fusion.
  • pfam NUC194 398aa 0.0 in ref transcript
    • NUC194 domain. This is domain B in the catalytic subunit of DNA-dependent protein kinases.
  • Changed! pfam PI3_PI4_kinase 268aa 1e-57 in ref transcript
    • Phosphatidylinositol 3- and 4-kinase. Some members of this family probably do not have lipid kinase activity and are protein kinases.
  • pfam FAT 448aa 2e-52 in ref transcript
    • FAT domain. The FAT domain is named after FRAP, ATM and TRRAP.
  • pfam FATC 32aa 5e-05 in ref transcript
    • FATC domain. The FATC domain is named after FRAP, ATM, TRRAP C-terminal. The solution structure of the FATC domain suggests it plays a role in redox-dependent structural and cellular stability.
  • Changed! COG TEL1 579aa 2e-48 in ref transcript
    • Phosphatidylinositol kinase and protein kinases of the PI-3 kinase family [Signal transduction mechanisms / Cell division and chromosome partitioning / Chromatin structure and dynamics / DNA replication, recombination, and repair / Intracellular trafficking and secretion].
  • Changed! cd PIKKc_DNA-PK 266aa 1e-114 in modified transcript
  • Changed! pfam PI3_PI4_kinase 237aa 1e-51 in modified transcript
  • Changed! COG TEL1 548aa 7e-44 in modified transcript

PRR3

  • rs.PRR3.F1 rs.PRR3.R1 194 257
  • NCBIGene 36.3 80742
  • Single exon skipping, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025263

PRR5

  • rs.PRR5.F1 rs.PRR5.R1 128 245
  • NCBIGene 36.3 55615
  • Single exon skipping, size difference: 117
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001017528

  • Changed! pfam HbrB 132aa 6e-40 in ref transcript
    • HbrB-like. HbrB is involved hyphal growth and polarity.
  • Changed! pfam HbrB 71aa 4e-22 in modified transcript

PSAT1

  • rs.PSAT1.F1 rs.PSAT1.R1 208 346
  • NCBIGene 36.3 29968
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_058179

  • Changed! cd PSAT_like 359aa 1e-175 in ref transcript
    • Phosphoserine aminotransferase (PSAT) family. This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). The major group in this CD corresponds to phosphoserine aminotransferase (PSAT). PSAT is active as a dimer and catalyzes the conversion of phosphohydroxypyruvate to phosphoserine.
  • Changed! TIGR serC_1 354aa 1e-174 in ref transcript
    • This model represents the common form of the phosphoserine aminotransferase SerC. The phosphoserine aminotransferase of the archaeon Methanosarcina barkeri and putative phosphoserine aminotransferase of Mycobacterium tuberculosis are represented by separate models. All are members of the class V aminotransferases (pfam00266).
  • Changed! PRK PRK05355 359aa 1e-161 in ref transcript
    • phosphoserine aminotransferase; Provisional.
  • Changed! cd PSAT_like 313aa 1e-148 in modified transcript
  • Changed! TIGR serC_1 308aa 1e-147 in modified transcript
  • Changed! PRK PRK05355 313aa 1e-137 in modified transcript

PSG11

  • rs.PSG11.F1 rs.PSG11.R1 195 263
  • NCBIGene 36.3 5680
  • Alternative 3-prime, size difference: 68
  • Inclusion in 3'UTR
  • Reference transcript: NM_002785

  • cd IGcam 79aa 3e-07 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 81aa 3e-04 in ref transcript
  • pfam V-set 93aa 3e-10 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.
  • smart IG_like 74aa 1e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 61aa 7e-04 in ref transcript

PSMA4

  • rs.PSMA4.F1 rs.PSMA4.R1 104 180
  • NCBIGene 36.3 5685
  • Alternative 5-prime, size difference: 76
  • Exclusion in 5'UTR
  • Reference transcript: NM_002789

  • cd proteasome_alpha_type_4 214aa 1e-108 in ref transcript
    • proteasome_alpha_type_4. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
  • TIGR arc_protsome_A 233aa 8e-62 in ref transcript
    • This protein family describes the archaeal proteasome alpha subunit, homologous to both the beta subunit and to the alpha and beta subunits of eukaryotic proteasome subunits. This family is universal in the first 29 complete archaeal genomes but occasionally is duplicated.
  • PTZ PTZ00246 237aa 7e-86 in ref transcript
    • proteasome subunit alpha; Provisional.

PSMA4

  • rs.PSMA4.F2 rs.PSMA4.R2 120 283
  • NCBIGene 36.3 5685
  • Single exon skipping, size difference: 163
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001102667

  • Changed! cd proteasome_alpha_type_4 214aa 1e-108 in ref transcript
    • proteasome_alpha_type_4. The 20S proteasome, multisubunit proteolytic complex, is the central enzyme of nonlysosomal protein degradation in both the cytosol and nucleus. It is composed of 28 subunits arranged as four homoheptameric rings that stack on top of one another forming an elongated alpha-beta-beta-alpha cylinder with a central cavity. The proteasome alpha and beta subunits are members of the N-terminal nucleophile (Ntn)-hydrolase superfamily. Their N-terminal threonine residues are exposed as a nucleophile in peptide bond hydrolysis. Mammals have 7 alpha and 7 beta proteasome subunits while archaea have one of each.
  • Changed! TIGR arc_protsome_A 233aa 8e-62 in ref transcript
    • This protein family describes the archaeal proteasome alpha subunit, homologous to both the beta subunit and to the alpha and beta subunits of eukaryotic proteasome subunits. This family is universal in the first 29 complete archaeal genomes but occasionally is duplicated.
  • Changed! PTZ PTZ00246 237aa 7e-86 in ref transcript
    • proteasome subunit alpha; Provisional.
  • Changed! cd proteasome_alpha_type_4 14aa 6e-04 in modified transcript
  • Changed! PTZ PTZ00246 16aa 9e-05 in modified transcript

PSMD13

  • rs.PSMD13.F1 rs.PSMD13.R1 217 296
  • NCBIGene 36.3 5719
  • Single exon skipping, size difference: 79
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_175932

  • Changed! pfam PCI 105aa 5e-14 in ref transcript
    • PCI domain. This domain has also been called the PINT motif (Proteasome, Int-6, Nip-1 and TRIP-15).

PTBP1

  • rs.PTBP1.F1 rs.PTBP1.R1 103 124
  • NCBIGene 36.3 5725
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002819

  • cd RRM 71aa 4e-10 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 72aa 2e-08 in ref transcript
  • cd RRM 73aa 6e-07 in ref transcript
  • cd RRM 71aa 7e-05 in ref transcript
  • Changed! TIGR hnRNP-L_PTB 501aa 1e-180 in ref transcript
    • Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
  • COG COG0724 171aa 9e-07 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! TIGR hnRNP-L_PTB 494aa 1e-175 in modified transcript

PTCH1

  • rs.PTCH1.F1 PTCH.R5 100 117
  • NCBIGene 36.3 5727
  • Alternative 5-prime, size difference: 17
  • Exclusion in 5'UTR
  • Reference transcript: NM_001083604

  • TIGR 2A060602 1055aa 0.0 in ref transcript
  • COG COG1033 173aa 2e-09 in ref transcript
    • Predicted exporters of the RND superfamily [General function prediction only].
  • COG COG1033 108aa 1e-04 in ref transcript

PTCH1

  • rs.PTCH1.F2 rs.PTCH1.R2 216 370
  • NCBIGene 36.3 5727
  • Alternative 5-prime, size difference: 154
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001083603

  • Changed! TIGR 2A060602 1141aa 0.0 in ref transcript
  • Changed! COG COG1033 173aa 2e-09 in ref transcript
    • Predicted exporters of the RND superfamily [General function prediction only].
  • Changed! COG COG1033 108aa 1e-04 in ref transcript

PTGER3

  • rs.PTGER3.F1 rs.PTGER3.R1 186 255
  • NCBIGene 36.3 5733
  • Mutually exclusive exon skipping, size difference: 69
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_000957

  • pfam 7tm_1 252aa 1e-09 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

PTGER3

  • rs.PTGER3.F2 rs.PTGER3.R2 163 255
  • NCBIGene 36.3 5733
  • Single exon skipping, size difference: 92
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_000957

  • pfam 7tm_1 252aa 1e-09 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

PTGS1

  • rs.PTGS1.F1 rs.PTGS1.R1 135 246
  • NCBIGene 36.3 5742
  • Alternative 5-prime, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000962

  • cd EGF_CA 38aa 0.005 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • Changed! pfam An_peroxidase 356aa 2e-86 in ref transcript
    • Animal haem peroxidase.
  • pfam EGF 31aa 0.001 in ref transcript
    • EGF-like domain. There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
  • Changed! pfam An_peroxidase 319aa 9e-78 in modified transcript

PTPN13

  • rs.PTPN13.F1 PTPN13.R3 104 119
  • NCBIGene 36.3 5783
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080685

  • cd PTPc 228aa 3e-79 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PDZ_signaling 85aa 7e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 80aa 1e-15 in ref transcript
  • Changed! cd PDZ_signaling 90aa 2e-13 in ref transcript
  • cd FERM_C 94aa 2e-13 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • cd PDZ_signaling 79aa 2e-08 in ref transcript
  • cd PDZ_signaling 88aa 6e-08 in ref transcript
  • smart PTPc 253aa 5e-91 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart KIND 188aa 2e-42 in ref transcript
    • kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
  • smart B41 210aa 4e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam FERM_C 88aa 2e-17 in ref transcript
    • FERM C-terminal PH-like domain.
  • smart PDZ 90aa 1e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • Changed! smart PDZ 92aa 2e-15 in ref transcript
  • smart PDZ 83aa 1e-13 in ref transcript
  • smart PDZ 92aa 2e-11 in ref transcript
  • smart PDZ 84aa 9e-08 in ref transcript
  • COG PTP2 267aa 1e-40 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG Prc 80aa 9e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 54aa 3e-04 in ref transcript
  • COG Prc 76aa 3e-04 in ref transcript
  • COG Prc 68aa 0.006 in ref transcript
  • Changed! cd PDZ_signaling 85aa 3e-15 in modified transcript
  • Changed! smart PDZ 87aa 4e-17 in modified transcript

PTPN13

  • rs.PTPN13.F2 rs.PTPN13.R2 137 194
  • NCBIGene 36.3 5783
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080685

  • cd PTPc 228aa 3e-79 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PDZ_signaling 85aa 7e-16 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 80aa 1e-15 in ref transcript
  • cd PDZ_signaling 90aa 2e-13 in ref transcript
  • cd FERM_C 94aa 2e-13 in ref transcript
    • The FERM_C domain is the third structural domain within the FERM domain. The FERM domain is found in the cytoskeletal-associated proteins such as ezrin, moesin, radixin, 4.1R, and merlin. These proteins provide a link between the membrane and cytoskeleton and are involved in signal transduction pathways. The FERM_C domain is also found in protein tyrosine phosphatases (PTPs) , the tryosine kinases FAKand JAK, in addition to other proteins involved in signaling. This domain is structuraly similar to the PH and PTB domains and consequently is capable of binding to both peptides and phospholipids at different sites.
  • cd PDZ_signaling 79aa 2e-08 in ref transcript
  • cd PDZ_signaling 88aa 6e-08 in ref transcript
  • smart PTPc 253aa 5e-91 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart KIND 188aa 2e-42 in ref transcript
    • kinase non-catalytic C-lobe domain. It is an interaction domain identified as being similar to the C-terminal protein kinase catalytic fold (C lobe). Its presence at the N terminus of signalling proteins and the absence of the active-site residues in the catalytic and activation loops suggest that it folds independently and is likely to be non-catalytic. The occurrence of KIND only in metazoa implies that it has evolved from the catalytic protein kinase domain into an interaction domain possibly by keeping the substrate-binding features.
  • smart B41 210aa 4e-40 in ref transcript
    • Band 4.1 homologues. Also known as ezrin/radixin/moesin (ERM) protein domains. Present in myosins, ezrin, radixin, moesin, protein tyrosine phosphatases. Plasma membrane-binding domain. These proteins play structural and regulatory roles in the assembly and stabilization of specialized plasmamembrane domains. Some PDZ domain containing proteins bind one or more of this family. Now includes JAKs.
  • pfam FERM_C 88aa 2e-17 in ref transcript
    • FERM C-terminal PH-like domain.
  • smart PDZ 90aa 1e-16 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 92aa 2e-15 in ref transcript
  • smart PDZ 83aa 1e-13 in ref transcript
  • smart PDZ 92aa 2e-11 in ref transcript
  • smart PDZ 84aa 9e-08 in ref transcript
  • COG PTP2 267aa 1e-40 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG Prc 80aa 9e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • COG Prc 54aa 3e-04 in ref transcript
  • COG Prc 76aa 3e-04 in ref transcript
  • COG Prc 68aa 0.006 in ref transcript

PTPN20B

  • rs.PTPN20A.F2 rs.PTPN20A.R2 136 293
  • NCBIGene 36.3 26095
  • Single exon skipping, size difference: 157
  • Inclusion in 5'UTR
  • Reference transcript: NM_001042363

  • cd PTPc 227aa 8e-77 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • smart PTPc 253aa 9e-87 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • COG PTP2 237aa 1e-45 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].

PTPRA

  • rs.PTPRA.F1 rs.PTPRA.R1 101 191
  • NCBIGene 36.3 5786
  • Single exon skipping, size difference: 90
  • Inclusion in 5'UTR
  • Reference transcript: NM_002836

  • cd PTPc 231aa 3e-89 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 5e-85 in ref transcript
  • smart PTPc 259aa 1e-97 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 261aa 1e-93 in ref transcript
  • COG PTP2 231aa 2e-43 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 241aa 3e-43 in ref transcript

PTPRD

  • rs.PTPRD.F1 rs.PTPRD.R1 102 114
  • NCBIGene 36.3 5789
  • Single exon skipping, size difference: 12
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_002839

  • cd PTPc 230aa 5e-92 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 8e-90 in ref transcript
  • cd FN3 92aa 3e-13 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 90aa 7e-13 in ref transcript
  • cd IGcam 88aa 2e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 91aa 4e-12 in ref transcript
  • cd FN3 95aa 8e-12 in ref transcript
  • cd FN3 93aa 2e-11 in ref transcript
  • cd FN3 89aa 3e-11 in ref transcript
  • cd FN3 107aa 4e-09 in ref transcript
  • cd FN3 82aa 2e-06 in ref transcript
  • cd IGcam 78aa 1e-05 in ref transcript
  • smart PTPc 256aa 1e-101 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 5e-16 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam fn3 85aa 8e-15 in ref transcript
    • Fibronectin type III domain.
  • pfam fn3 88aa 2e-13 in ref transcript
  • pfam fn3 83aa 2e-13 in ref transcript
  • smart FN3 79aa 3e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart IGc2 74aa 7e-11 in ref transcript
    • Immunoglobulin C-2 Type.
  • smart FN3 83aa 2e-10 in ref transcript
  • pfam fn3 100aa 5e-10 in ref transcript
  • smart IG_like 79aa 7e-08 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart FN3 81aa 4e-05 in ref transcript
  • COG PTP2 268aa 1e-48 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 281aa 9e-45 in ref transcript

PTPRM

  • rs.PTPRM.F1 rs.PTPRM.R1 326 365
  • NCBIGene 36.3 5797
  • Multiple exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001105244

  • cd PTPc 228aa 3e-89 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 233aa 7e-68 in ref transcript
  • cd MAM 156aa 8e-41 in ref transcript
    • Meprin, A5 protein, and protein tyrosine phosphatase Mu (MAM) domain. MAM is an extracellular domain which mediates protein-protein interactions and is found in a diverse set of proteins, many of which are known to function in cell adhesion. Members include: type IIB receptor protein tyrosine phosphatases (such as RPTPmu), meprins (plasma membrane metalloproteases), neuropilins (receptors of secreted semaphorins), and zonadhesins (sperm-specific membrane proteins which bind to the extracellular matrix of the egg). In meprin A and neuropilin-1 and -2, MAM is involved in homo-oligomerization. In RPTPmu, it has been associated with both homophilic adhesive (trans) interactions and lateral (cis) receptor oligomerization. In a GPI-anchored protein that is expressed in cells in the embryonic chicken spinal chord, MDGA1, the MAM domain has been linked to heterophilic interactions with axon-rich region.
  • cd FN3 83aa 1e-06 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 95aa 8e-04 in ref transcript
  • cd FN3 93aa 0.004 in ref transcript
  • smart PTPc 255aa 1e-102 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 263aa 3e-74 in ref transcript
  • smart MAM 160aa 2e-49 in ref transcript
    • Domain in meprin, A5, receptor protein tyrosine phosphatase mu (and others). Likely to have an adhesive function. Mutations in the meprin MAM domain affect noncovalent associations within meprin oligomers. In receptor tyrosine phosphatase mu-like molecules the MAM domain is important for homophilic cell-cell interactions.
  • pfam fn3 93aa 4e-07 in ref transcript
    • Fibronectin type III domain.
  • smart FN3 85aa 4e-05 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart IG_like 88aa 3e-04 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart FN3 80aa 0.002 in ref transcript
  • COG PTP2 228aa 3e-45 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 247aa 2e-23 in ref transcript

PTPRS

  • rs.PTPRS.F1 rs.PTPRS.R1 101 113
  • NCBIGene 36.3 5802
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002850

  • cd PTPc 230aa 5e-95 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • cd PTPc 231aa 2e-90 in ref transcript
  • cd FN3 90aa 6e-14 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 89aa 2e-12 in ref transcript
  • cd IGcam 87aa 4e-12 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 93aa 4e-11 in ref transcript
  • cd IGcam 91aa 5e-11 in ref transcript
  • cd FN3 90aa 2e-07 in ref transcript
  • cd FN3 80aa 2e-07 in ref transcript
  • cd FN3 106aa 2e-07 in ref transcript
  • cd IGcam 83aa 2e-06 in ref transcript
  • cd FN3 83aa 6e-05 in ref transcript
  • cd FN3 64aa 0.001 in ref transcript
  • smart PTPc 256aa 1e-106 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain.
  • smart PTPc 260aa 1e-100 in ref transcript
  • pfam I-set 92aa 2e-17 in ref transcript
    • Immunoglobulin I-set domain.
  • smart FN3 79aa 1e-12 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • smart FN3 80aa 2e-12 in ref transcript
  • smart FN3 83aa 5e-11 in ref transcript
  • smart IGc2 74aa 1e-10 in ref transcript
    • Immunoglobulin C-2 Type.
  • pfam fn3 80aa 7e-09 in ref transcript
    • Fibronectin type III domain.
  • smart IG_like 79aa 7e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • pfam fn3 91aa 4e-08 in ref transcript
  • pfam fn3 99aa 2e-07 in ref transcript
  • smart FN3 79aa 6e-05 in ref transcript
  • smart FN3 62aa 3e-04 in ref transcript
  • COG PTP2 268aa 6e-51 in ref transcript
    • Protein tyrosine phosphatase [Signal transduction mechanisms].
  • COG PTP2 266aa 2e-48 in ref transcript

PUS1

  • rs.PUS1.F1 rs.PUS1.R1 137 529
  • NCBIGene 36.3 80324
  • Alternative 5-prime, size difference: 392
  • Exclusion of the protein initiation site
  • Reference transcript: NM_025215

  • cd PseudoU_synth_PUS1_PUS2 253aa 8e-91 in ref transcript
    • PseudoU_synth_PUS1_PUS2: Pseudouridine synthase, PUS1/ PUS2 like. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus1p, S. cerevisiae Pus2p, Caenorhabditis elegans Pus1p and human PUS1. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. S. cerevisiae Pus1p catalyzes the formation of psi34 and psi36 in the intron-containing tRNAIle, psi35 in the intron-containing tRNATyr, psi27 and/or psi28 in several yeast cytoplasmic tRNAs and, psi44 in U2 small nuclear RNA (U2 snRNA). The presence of the intron is required for the formation of psi 34, 35 and 36. In addition S. cerevisiae PUS1 makes are psi 26, 65 and 67. C. elegans Pus1p does not modify psi44 in U2 snRNA. Mouse Pus1p makes psi27/28 in pre- tRNASer , tRNAVal and tRNAIle, psi 34/36 in tRNAIle and, psi 32 and potentially 67 in tRNAVal. Psi44 in U2 snRNA and psi32 in tRNAs are highly phylogenetically conserved. Psi 26,27,28,34,35,36,65 and 67 in tRNAs are less highly conserved. Mouse Pus1p regulates nuclear receptor activity through pseudouridylation of Steroid Receptor RNA Activator. Missense mutation in human PUS1 causes mitochondrial myopathy and sideroblastic anemia (MLASA).
  • TIGR hisT_truA 252aa 1e-36 in ref transcript
    • universal so far, single copy in all prokaryotes, 3 in yeast. Trusted cutoff for orthology is about 100 based on 1 match only in complete prokaryote with length > 200.
  • COG TruA 280aa 2e-44 in ref transcript
    • Pseudouridylate synthase [Translation, ribosomal structure and biogenesis].

PUS7L

  • rs.PUS7L.F1 rs.PUS7L.R1 122 537
  • NCBIGene 36.3 83448
  • Alternative 5-prime, size difference: 415
  • Exclusion in 5'UTR
  • Reference transcript: NM_001098615

  • cd PseudoU_synth_ScPUS7 401aa 1e-111 in ref transcript
    • PseudoU_synth_ScPUS7: Pseudouridine synthase, TruD family. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). Saccharomyces cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA), psi13 in cytoplasmic tRNAs and psi35 in pre-tRNATyr. Psi35 in yeast U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35.
  • cd PseudoU_synth_ScPUS7 28aa 0.002 in ref transcript
  • TIGR TIGR00094 400aa 1e-33 in ref transcript
    • MJ11364 is a strong partial match from 50 to 230 aa.
  • COG COG0585 405aa 1e-50 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

PWWP2B

  • rs.PWWP2B.F1 rs.PWWP2B.R1 127 430
  • NCBIGene 36.3 170394
  • Alternative 5-prime, size difference: 303
  • Exclusion of the stop codon
  • Reference transcript: NM_138499

  • Changed! cd Dnmt3b_related 87aa 5e-32 in ref transcript
    • The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • Changed! pfam PWWP 71aa 4e-10 in ref transcript
    • PWWP domain. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. The function of the domain is currently unknown.

PXMP3

  • rs.PXMP3.F1 rs.PXMP3.R1 209 241
  • NCBIGene 36.3 5828
  • Single exon skipping, size difference: 32
  • Exclusion in 5'UTR
  • Reference transcript: NM_000318

  • pfam Pex2_Pex12 207aa 4e-47 in ref transcript
    • Pex2 / Pex12 amino terminal region. This region is found at the N terminal of a number of known and predicted peroxins including Pex2, Pex10 and Pex12. This conserved region is usually associated with a C terminal ring finger (pfam00097) domain.
  • smart RING 40aa 0.001 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • COG PEX10 46aa 0.002 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].

PYHIN1

  • rs.PYHIN1.F1 rs.PYHIN1.R1 139 166
  • NCBIGene 36.3 149628
  • Alternative 3-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152501

  • pfam HIN 159aa 3e-51 in ref transcript
    • HIN-200/IF120x domain. This domain has no know function. It is found in one or two copies per protein, and is found associated with the PAAD/DAPIN domain pfam02758.
  • pfam PAAD_DAPIN 79aa 6e-17 in ref transcript
    • PAAD/DAPIN/Pyrin domain. This domain is predicted to contain 6 alpha helices and to have the same fold as the pfam00531 domain. This similarity may mean that this is a protein-protein interaction domain.

RABEP1

  • rs.RABEP1.F1 rs.RABEP1.R1 283 382
  • NCBIGene 36.3 9135
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004703

  • Changed! pfam Rab5-bind 196aa 3e-25 in ref transcript
    • Rab5 binding. Members of this family are predominantly found in Rabaptin and allow for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes.
  • pfam Rabaptin 151aa 6e-20 in ref transcript
    • Rabaptin.
  • pfam Rabaptin 111aa 2e-17 in ref transcript
  • TIGR SMC_prok_B 296aa 4e-06 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 302aa 5e-08 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG Smc 264aa 5e-04 in ref transcript
  • Changed! pfam Rab5-bind 163aa 5e-20 in modified transcript
  • Changed! TIGR SMC_prok_B 280aa 1e-04 in modified transcript
  • Changed! COG Smc 289aa 1e-06 in modified transcript

RAC1

  • rs.RAC1.F2 rs.RAC1.R2 115 172
  • NCBIGene 36.3 5879
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_018890

  • Changed! cd Rac1_like 192aa 1e-107 in ref transcript
    • Rac1-like subfamily. The Rac1-like subfamily consists of Rac1, Rac2, and Rac3 proteins, plus the splice variant Rac1b that contains a 19-residue insertion near switch II relative to Rac1. While Rac1 is ubiquitously expressed, Rac2 and Rac3 are largely restricted to hematopoietic and neural tissues respectively. Rac1 stimulates the formation of actin lamellipodia and membrane ruffles. It also plays a role in cell-matrix adhesion and cell anoikis. In intestinal epithelial cells, Rac1 is an important regulator of migration and mediates apoptosis. Rac1 is also essential for RhoA-regulated actin stress fiber and focal adhesion complex formation. In leukocytes, Rac1 and Rac2 have distinct roles in regulating cell morphology, migration, and invasion, but are not essential for macrophage migration or chemotaxis. Rac3 has biochemical properties that are closely related to Rac1, such as effector interaction, nucleotide binding, and hydrolysis; Rac2 has a slower nucleotide association and is more efficiently activated by the RacGEF Tiam1. Both Rac1 and Rac3 have been implicated in the regulation of cell migration and invasion in human metastatic breast cancer. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • Changed! smart RHO 189aa 1e-98 in ref transcript
    • Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
  • Changed! COG COG1100 193aa 1e-27 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].
  • Changed! cd Rac1_like 173aa 1e-111 in modified transcript
  • Changed! smart RHO 170aa 1e-102 in modified transcript
  • Changed! COG COG1100 174aa 3e-31 in modified transcript

RAD17

  • rs.RAD17.F1 rs.RAD17.R1 154 338
  • NCBIGene 36.3 5884
  • Single exon skipping, size difference: 184
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_133339

  • Changed! cd AAA 37aa 0.001 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! TIGR rad24 640aa 0.0 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! PRK PRK04195 423aa 1e-16 in ref transcript
    • replication factor C large subunit; Provisional.

RAD17

  • rs.RAD17.F2 rs.RAD17.R2 114 372
  • NCBIGene 36.3 5884
  • Single exon skipping, size difference: 258
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_133343

  • cd AAA 37aa 0.001 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • Changed! TIGR rad24 640aa 0.0 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! PRK PRK04195 423aa 8e-17 in ref transcript
    • replication factor C large subunit; Provisional.
  • Changed! TIGR rad24 553aa 0.0 in modified transcript
  • Changed! PRK PRK04195 406aa 9e-12 in modified transcript

RAD17

  • rs.RAD17.F3 rs.RAD17.R3 271 375
  • NCBIGene 36.3 5884
  • Single exon skipping, size difference: 104
  • Exclusion in 5'UTR
  • Reference transcript: NM_133338

  • cd AAA 37aa 0.001 in ref transcript
    • The AAA+ (ATPases Associated with a wide variety of cellular Activities) superfamily represents an ancient group of ATPases belonging to the ASCE (for additional strand, catalytic E) division of the P-loop NTPase fold. The ASCE division also includes ABC, RecA-like, VirD4-like, PilT-like, and SF1/2 helicases. Members of the AAA+ ATPases function as molecular chaperons, ATPase subunits of proteases, helicases, or nucleic-acid stimulated ATPases. The AAA+ proteins contain several distinct features in addition to the conserved alpha-beta-alpha core domain structure and the Walker A and B motifs of the P-loop NTPases.
  • TIGR rad24 640aa 0.0 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • PRK PRK04195 423aa 8e-17 in ref transcript
    • replication factor C large subunit; Provisional.

RAD51

  • rs.RAD51.F1 rs.RAD51.R1 122 413
  • NCBIGene 36.3 5888
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 291
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_002875

  • Changed! cd Rad51_DMC1_radA 234aa 1e-98 in ref transcript
    • Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57.
  • Changed! TIGR recomb_RAD51 315aa 1e-178 in ref transcript
    • This eukaryotic sequence family consists of RAD51, a protein involved in DNA homologous recombination and repair. It is similar in sequence the exclusively meiotic recombinase DMC1 (TIGR02238), to archaeal families RadA (TIGR02236) and RadB (TIGR02237), and to bacterial RecA (TIGR02012).
  • Changed! PTZ PTZ00035 332aa 1e-125 in ref transcript
    • Rad51; Provisional.
  • Changed! cd Rad51_DMC1_radA 168aa 1e-59 in modified transcript
  • Changed! TIGR recomb_RAD51 170aa 4e-89 in modified transcript
  • Changed! TIGR recomb_RAD51 56aa 2e-20 in modified transcript
  • Changed! PTZ PTZ00035 173aa 3e-64 in modified transcript
  • Changed! PTZ PTZ00035 67aa 1e-10 in modified transcript

RAD51L3

  • rs.RAD51L3.F1 rs.RAD51L3.R1 98 434
  • NCBIGene 36.3 5892
  • Multiple exon skipping, size difference: 336
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002878

  • Changed! cd Rad51_DMC1_radA 236aa 6e-43 in ref transcript
    • Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57.
  • Changed! TIGR recomb_radA 306aa 2e-14 in ref transcript
    • This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA (TIGR02012), eukaryotic RAD51 (TIGR02239), and archaeal RadB (TIGR02237). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein.
  • Changed! PRK radB 223aa 3e-17 in ref transcript
    • DNA repair and recombination protein RadB; Provisional.
  • Changed! cd Rad51_DMC1_radA 170aa 4e-19 in modified transcript
  • Changed! PRK radB 84aa 4e-04 in modified transcript

RANBP3

  • rs.RANBP3.F1 rs.RANBP3.R1 111 126
  • NCBIGene 36.3 8498
  • Alternative 5-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007322

  • cd RanBD 125aa 3e-26 in ref transcript
    • Ran-binding domain; This domain of approximately 150 residues shares structural similarity to the PH domain, but lacks detectable sequence similarity. Ran is a Ras-like nuclear small GTPase, which regulates receptor-mediated transport between the nucleus and the cytoplasm. RanGTP hydrolysis is stimulated by RanGAP together with the Ran-binding domain containing acessory proteins RanBP1 and RanBP2. These accessory proteins stabilize the active GTP-bound form of Ran . The Ran-binding domain is found in multiple copies in Nuclear pore complex proteins.
  • smart RanBD 111aa 3e-15 in ref transcript
    • Ran-binding domain. Domain of apporximately 150 residues that stabilises the GTP-bound form of Ran (the Ras-like nuclear small GTPase).
  • COG YRB1 71aa 1e-04 in ref transcript
    • Ran GTPase-activating protein (Ran-binding protein) [Intracellular trafficking and secretion].

RAP1A

  • rs.RAP1A.F1 rs.RAP1A.R1 176 270
  • NCBIGene 36.3 5906
  • Single exon skipping, size difference: 94
  • Exclusion in 5'UTR
  • Reference transcript: NM_001010935

  • cd Rap_like 149aa 3e-84 in ref transcript
    • Rap-like subfamily. The Rap subfamily consists of the Rap1, Rap2, and RSR1. Rap subfamily proteins perform different cellular functions, depending on the isoform and its subcellular localization. For example, in rat salivary gland, neutrophils, and platelets, Rap1 localizes to secretory granules and is believed to regulate exocytosis or the formation of secretory granules. Rap1 has also been shown to localize in the Golgi of rat fibroblasts, zymogen granules, plasma membrane, and microsomal membrane of the pancreatic acini, as well as in the endocytic compartment of skeletal muscle cells and fibroblasts. Rap1 localizes in the nucleus of human oropharyngeal squamous cell carcinomas (SCCs) and cell lines. Rap1 plays a role in phagocytosis by controlling the binding of adhesion receptors (typically integrins) to their ligands. In yeast, Rap1 has been implicated in multiple functions, including activation and silencing of transcription and maintenance of telomeres. Rap2 is involved in multiple functions, including activation of c-Jun N-terminal kinase (JNK) to regulate the actin cytoskeleton and activation of the Wnt/beta-catenin signaling pathway in embryonic Xenopus. A number of effector proteins for Rap2 have been identified, including isoform 3 of the human mitogen-activated protein kinase kinase kinase kinase 4 (MAP4K4) and Traf2- and Nck-interacting kinase (TNIK), and the RalGEFs RalGDS, RGL, and Rlf, which also interact with Rap1 and Ras. RSR1 is the fungal homolog of Rap1 and Rap2. In budding yeasts, it is involved in selecting a site for bud growth, which directs the establishment of cell polarization. The Rho family GTPase Cdc42 and its GEF, Cdc24, then establish an axis of polarized growth. It is believed that Cdc42 interacts directly with RSR1 in vivo. In filamentous fungi such as Ashbya gossypii, RSR1 is a key regulator of polar growth in the hypha. Most Ras proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Ras proteins. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • smart RAS 151aa 8e-78 in ref transcript
    • Ras subfamily of RAS small GTPases. Similar in fold and function to the bacterial EF-Tu GTPase. p21Ras couples receptor Tyr kinases and G protein receptors to protein kinase cascades.
  • COG COG1100 150aa 1e-16 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

RAP1GDS1

  • rs.RAP1GDS1.F1 rs.RAP1GDS1.R1 193 340
  • NCBIGene 36.3 5910
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100426

  • cd ARM 114aa 1e-08 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • Changed! cd ARM 124aa 3e-07 in ref transcript
  • pfam Arm 32aa 0.003 in ref transcript
    • Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
  • Changed! COG SRP1 57aa 0.008 in ref transcript
    • Karyopherin (importin) alpha [Intracellular trafficking and secretion].
  • Changed! cd ARM 120aa 3e-09 in modified transcript
  • Changed! COG SRP1 318aa 9e-04 in modified transcript

RARB

  • rs.RARB.F1 rs.RARB.R1 137 494
  • NCBIGene 36.3 5915
  • Alternative 5-prime, size difference: 357
  • Exclusion of the protein initiation site
  • Reference transcript: NM_000965

  • cd NR_LBD_RAR 231aa 1e-131 in ref transcript
    • The ligand binding domain (LBD) of retinoic acid receptor (RAR), a members of the nuclear receptor superfamily. The ligand binding domain (LBD) of retinoic acid receptor (RAR): Retinoic acid receptors are members of the nuclear receptor (NR) superfamily of ligand-regulated transcription factors. RARs mediate the biological effect of retinoids, including both naturally dietary vitamin A (retinol) metabolites and active synthetic analogs. Retinoids play key roles in a wide variety of essential biological processes, such as vertebrate embryonic morphogenesis and organogenesis, differentiation and apoptosis, and homeostasis. RARs function as heterodimers with retinoic X receptors by binding to specific RAR response elements (RAREs) found in the promoter regions of retinoid target genes. In the absence of ligand, the RAR-RXR heterodimer recruits the corepressor proteins NCoR or AMRT, and associated factors such as histone deacetylases or DNA-methyltransferases, leading to an inactive condensed chromatin structure, preventing transcription. Upon ligand binding, the corepressors are released, and coactivator complexes such as histone acetyltransferase or histone arginine methyltransferases are recruited to activate transcription. There are three RAR subtypes (alpha, beta, gamma), originating from three distinct genes. For each subtype, several isoforms exist that differ in their N-terminal region, allowing retinoids to exert their pleiotropic effects. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, retinoic acid receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a non-conserved hinge and a C-terminal ligand binding domain (LBD).
  • Changed! pfam zf-C4 76aa 6e-38 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • pfam Hormone_recep 181aa 5e-31 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • Changed! pfam zf-C4 42aa 8e-17 in modified transcript

RASA4

  • rs.RASA4.F1 rs.RASA4.R1 141 279
  • NCBIGene 36.3 10156
  • Single exon skipping, size difference: 138
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006989

  • cd RasGAP_RASA4 331aa 1e-172 in ref transcript
    • Ras GTPase activating-like 4 protein (RASAL4), also known as Ca2+ -promoted Ras inactivator (CAPRI), is a member of the GAP1 family. Members of the GAP1 family are characterized by a conserved domain structure comprising N-terminal tandem C2 domains, a highly conserved central RasGAP domain, and a C-terminal pleckstrin-homology domain that is associated with a Bruton's tyrosine kinase motif. RASAL4, like RASAL, is a cytosolic protein that undergoes a rapid translocation to the plasma membrane in response to a receptor-mediated elevation in the concentration of intracellular free Ca2+ ([Ca2+]i). However, unlike RASAL, RASAL4 does not sense oscillations in [Ca2+]i.
  • Changed! cd PH_RasGAP_CG9209 103aa 1e-34 in ref transcript
    • RAS_GTPase activating protein (GAP)_CG9209 pleckstrin homology (PH) domain. This protein consists of two C2 domains, followed by a RasGAP domain, a PH domain and a BTK domain. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinsases, regulators of G-proteins, endocytotic GTPAses, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • cd C2_1 118aa 2e-29 in ref transcript
    • Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
  • cd C2 102aa 3e-20 in ref transcript
    • Protein kinase C conserved region 2 (CalB); Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands.
  • smart RasGAP 362aa 3e-97 in ref transcript
    • GTPase-activator protein for Ras-like GTPases. All alpha-helical domain that accelerates the GTPase activity of Ras, thereby "switching" it into an "off" position. Improved domain limits from structure.
  • pfam C2 82aa 2e-21 in ref transcript
    • C2 domain.
  • pfam C2 82aa 8e-20 in ref transcript
  • Changed! smart PH 106aa 6e-08 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • pfam BTK 28aa 2e-07 in ref transcript
    • BTK motif. Zinc-binding motif containing conserved cysteines and a histidine. Always found C-terminal to PH domains. The crystal structure shows this motif packs against the PH domain. The PH+Btk module pair has been called the Tec homology (TH) region.
  • COG COG5038 88aa 3e-10 in ref transcript
    • Ca2+-dependent lipid-binding protein, contains C2 domain [General function prediction only].
  • COG COG5038 81aa 4e-09 in ref transcript
  • COG IQG1 246aa 4e-05 in ref transcript
    • Protein involved in regulation of cellular morphogenesis/cytokinesis [Cell division and chromosome partitioning / Signal transduction mechanisms].

RASSF1

  • rs.RASSF1.F1 rs.RASSF1.R1 100 112
  • NCBIGene 36.3 11186
  • Alternative 3-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170714

  • cd RASSF1_RA 96aa 4e-39 in ref transcript
    • RASSF1 (also known as RASSF3 and NORE1) is a tumour suppressor protein with a C-terminal Ras-associating (RA) domain that binds Ras. RASSF1 also binds the proapoptotic protein kinase MST1 and is thus thought to regulate the proapoptotic signalling pathway. RASSF1 also associates with microtubule-associated proteins like MAP1B and regulates tubulin polymerization. RASSF1 also binds CDC20 and regulates mitosis by inhibiting the anaphase-promoting complex and preventing degradation of cyclin A and cyclin B until the spindle checkpoint becomes fully operational.
  • Changed! cd C1 54aa 1e-06 in ref transcript
    • Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
  • smart RA 88aa 6e-17 in ref transcript
    • Ras association (RalGDS/AF-6) domain. RasGTP effectors (in cases of AF6, canoe and RalGDS); putative RasGTP effectors in other cases. Kalhammer et al. have shown that not all RA domains bind RasGTP. Predicted structure similar to that determined, and that of the RasGTP-binding domain of Raf kinase. Predicted RA domains in PLC210 and nore1 found to bind RasGTP. Included outliers (Grb7, Grb14, adenylyl cyclases etc.).
  • Changed! pfam C1_1 54aa 2e-07 in ref transcript
    • Phorbol esters/diacylglycerol binding domain (C1 domain). This domain is also known as the Protein kinase C conserved region 1 (C1) domain.
  • Changed! cd C1 50aa 2e-08 in modified transcript
  • Changed! pfam C1_1 50aa 2e-08 in modified transcript

RBM23

  • rs.RBM23.F1 rs.RBM23.R1 107 155
  • NCBIGene 36.3 55147
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077351

  • cd RRM 73aa 2e-21 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 73aa 9e-10 in ref transcript
  • TIGR SF-CC1 292aa 3e-92 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • COG COG0724 162aa 3e-15 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

RBM23

  • rs.RBM23.F2 rs.RBM23.R2 217 271
  • NCBIGene 36.3 55147
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077351

  • cd RRM 73aa 2e-21 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 73aa 9e-10 in ref transcript
  • Changed! TIGR SF-CC1 292aa 3e-92 in ref transcript
    • A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.
  • COG COG0724 162aa 3e-15 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! TIGR SF-CC1 286aa 1e-90 in modified transcript

RBM33

  • rs.RBM33.F1 rs.RBM33.R1 173 203
  • NCBIGene 36.3 155435
  • Alternative 3-prime, size difference: 30
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008408

  • cd RRM 64aa 0.001 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM_2 64aa 3e-04 in ref transcript
    • RNA recognition motif.

RBM35A

  • rs.RBM35A.F1 rs.RBM35A.R1 100 112
  • NCBIGene 36.3 54845
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017697

  • cd RRM 76aa 1e-07 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 75aa 3e-07 in ref transcript
  • cd ERI-1_3'hExo_like 117aa 3e-07 in ref transcript
    • This subfamily is composed of Caenorhabditis elegans ERI-1, human 3' exonuclease (3'hExo), Drosophila exonuclease snipper (snp), and similar proteins from eukaryotes and bacteria. These are DEDDh-type DnaQ-like 3'-5' exonucleases containing three conserved sequence motifs termed ExoI, ExoII and ExoIII, with a specific Hx(4)D conserved pattern at ExoIII. These motifs are clustered around the active site and contain four conserved acidic residues that serve as ligands for the two metal ions required for catalysis. ERI-1 has been implicated in the degradation of small interfering RNAs (RNAi). 3'hExo participates in the degradation of histone mRNAs. Snp is a non-essential exonuclease that efficiently degrades structured RNA and DNA substrates as long as there is a minimum of 2 nucleotides in the 3' overhang to initiate degradation. Snp is not a functional homolog of either ERI-1 or 3'hExo.
  • smart RRM_2 75aa 6e-05 in ref transcript
    • RNA recognition motif.
  • smart RRM_2 74aa 3e-04 in ref transcript

RBM47

  • rs.RBM47.F1 rs.RBM47.R1 123 330
  • NCBIGene 36.3 54502
  • Single exon skipping, size difference: 207
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098634

  • cd RRM 72aa 1e-13 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 68aa 8e-12 in ref transcript
  • cd RRM 79aa 1e-06 in ref transcript
  • Changed! TIGR hnRNP-R-Q 573aa 0.0 in ref transcript
    • Sequences in this subfamily include the human heterogeneous nuclear ribonucleoproteins (hnRNP) R, Q and APOBEC-1 complementation factor (aka APOBEC-1 stimulating protein). These proteins contain three RNA recognition domains (rrm: pfam00076) and a somewhat variable C-terminal domain.
  • COG COG0724 143aa 4e-09 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 71aa 2e-07 in ref transcript
  • Changed! TIGR hnRNP-R-Q 354aa 1e-178 in modified transcript
  • Changed! TIGR hnRNP-R-Q 175aa 6e-25 in modified transcript

RBMY1A1

  • rs.RBMY1A1.F1 rs.RBMY1A1.R1 345 523
  • NCBIGene 36.3 5940
  • Single exon skipping, size difference: 178
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_005058

  • Changed! cd RRM 73aa 1e-22 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • Changed! pfam RRM_1 70aa 2e-20 in ref transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
  • Changed! pfam RBM1CTR 44aa 2e-08 in ref transcript
    • RBM1CTR (NUC064) family. This C-terminal region is found in RBM1-like RNA binding hnRNPs.
  • Changed! COG COG0724 84aa 1e-09 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! cd RRM 65aa 2e-17 in modified transcript
  • Changed! pfam RRM_1 63aa 2e-16 in modified transcript
  • Changed! COG COG0724 79aa 2e-08 in modified transcript

RCC1

  • rs.RCC1.F1 rs.RCC1.R1 111 153
  • NCBIGene 36.3 1104
  • Alternative 5-prime, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001048194

  • pfam RCC1 50aa 5e-11 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 49aa 1e-10 in ref transcript
  • pfam RCC1 51aa 5e-10 in ref transcript
  • pfam RCC1 52aa 8e-10 in ref transcript
  • pfam RCC1 66aa 1e-08 in ref transcript
  • pfam RCC1 31aa 2e-04 in ref transcript
  • pfam RCC1 51aa 4e-04 in ref transcript
  • Changed! COG ATS1 350aa 5e-42 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].
  • Changed! COG ATS1 415aa 3e-42 in modified transcript

RCP9

  • rs.RCP9.F1 rs.RCP9.R1 222 321
  • NCBIGene 36.3 27297
  • Single exon skipping, size difference: 99
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014478

  • Changed! pfam RNA_pol_III_C17 120aa 5e-36 in ref transcript
    • RNA polymerase III subunit C17. C17 (aka CGRP-RCP) is an essential subunit of RNA polymerase III. C17 forms a subcomplex with C25 which is likely to be the counterpart of subcomplex Rpb4/7 in Pol II.
  • Changed! pfam RNA_pol_III_C17 87aa 1e-22 in modified transcript

REG3A

  • rs.REG3A.F1 rs.REG3A.R1 107 130
  • NCBIGene 36.3 5068
  • Alternative 3-prime, size difference: 23
  • Inclusion in 5'UTR
  • Reference transcript: NM_138937

  • cd CLECT_REG-1_like 134aa 7e-41 in ref transcript
    • CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is unregulated in pancreatic, gastric, hepatocellular, and prostrate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin, SCA-1 and -2 are found at high concentration in the calcified egg shell layer of goose and ostrich, respectively and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.
  • smart CLECT 133aa 2e-23 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

REG3G

  • rs.REG3G.F1 rs.REG3G.R1 280 380
  • NCBIGene 36.3 130120
  • Alternative 3-prime, size difference: 100
  • Inclusion in 5'UTR
  • Reference transcript: NM_198448

  • cd CLECT_REG-1_like 134aa 5e-38 in ref transcript
    • CLECT_REG-1_like: C-type lectin-like domain (CTLD) of the type found in Human REG-1 (lithostathine), REG-4, and avian eggshell-specific proteins: ansocalcin, structhiocalcin-1(SCA-1), and -2(SCA-2). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. REG-1 is a proliferating factor which participates in various kinds of tissue regeneration including pancreatic beta-cell regeneration, regeneration of intestinal mucosa, regeneration of motor neurons, and perhaps in tissue regeneration of damaged heart. REG-1 may play a role on the pathophysiology of Alzheimer's disease and in the development of gastric cancers. Its expression is correlated with reduced survival from early-stage colorectal cancer. REG-1 also binds and aggregates several bacterial strains from the intestinal flora and it has been suggested that it is involved in the control of the intestinal bacterial ecosystem. Rat lithostathine has calcium carbonate crystal inhibitor activity in vitro. REG-IV is unregulated in pancreatic, gastric, hepatocellular, and prostrate adenocarcinomas. REG-IV activates the EGF receptor/Akt/AP-1 signaling pathway in colorectal carcinoma. Ansocalcin, SCA-1 and -2 are found at high concentration in the calcified egg shell layer of goose and ostrich, respectively and tend to form aggregates. Ansocalcin nucleates calcite crystal aggregates in vitro.
  • smart CLECT 133aa 2e-21 in ref transcript
    • C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.

REPIN1

  • rs.REPIN1.F1 rs.REPIN1.R1 274 399
  • NCBIGene 36.3 29803
  • Single exon skipping, size difference: 125
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001099695

  • Changed! COG COG5048 275aa 2e-06 in ref transcript
    • FOG: Zn-finger [General function prediction only].

RERE

  • rs.RERE.F1 rs.RERE.R1 207 392
  • NCBIGene 36.3 473
  • Single exon skipping, size difference: 185
  • Exclusion in 5'UTR
  • Reference transcript: NM_012102

  • cd BAH_MTA 206aa 4e-60 in ref transcript
    • BAH, or Bromo Adjacent Homology domain, as present in MTA1 and similar proteins. The Metastasis-associated protein MTA1 is part of the NURD (nucleosome remodeling and deacetylating) complex and plays a role in cellular transformation and metastasis. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
  • cd ZnF_GATA 55aa 4e-09 in ref transcript
    • Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C.
  • cd SANT 44aa 0.001 in ref transcript
    • 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
  • pfam Atrophin-1 622aa 1e-154 in ref transcript
    • Atrophin-1 family. Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
  • pfam Atrophin-1 134aa 4e-20 in ref transcript
  • smart ZnF_GATA 50aa 4e-11 in ref transcript
    • zinc finger binding to DNA consensus sequence [AT]GATA[AG].
  • smart BAH 177aa 3e-08 in ref transcript
    • Bromo adjacent homology domain.
  • pfam ELM2 53aa 2e-07 in ref transcript
    • ELM2 domain. The ELM2 (Egl-27 and MTA1 homology 2) domain is a small domain of unknown function. It is found in the MTA1 protein that is part of the NuRD complex. The domain is usually found to the N terminus of a myb-like DNA binding domain pfam00249. ELM2 is also found associated with an ARID DNA binding domain pfam01388 in a protein from Arabidopsis thaliana. This suggests that ELM2 may also be involved in DNA binding, or perhaps is a protein-protein interaction domain.
  • smart SANT 46aa 2e-04 in ref transcript
    • SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.

RGR

  • rs.RGR.F1 rs.RGR.R1 100 112
  • NCBIGene 36.3 5995
  • Alternative 3-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_002921

  • Changed! pfam 7tm_1 229aa 7e-11 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.
  • Changed! pfam 7tm_1 225aa 3e-12 in modified transcript

RGS11

  • rs.RGS11.F1 rs.RGS11.R1 94 105
  • NCBIGene 36.3 8786
  • Alternative 5-prime, size difference: 11
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_183337

  • Changed! cd DEP_RGS7-like 88aa 5e-25 in ref transcript
    • DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in RGS (regulator of G-protein signaling) proteins of the subfamily R7. This subgroup contains RGS7, RGS6, RGS9 and RGS11. They share a common domain architecture, containing, beside the RGS domain, a DEP domain and a GGL (G-protein gamma subunit-like ) domain. RGS proteins are GTPase-activating (GAP) proteins of heterotrimeric G proteins by increasing the rate of GTP hydrolysis of the alpha subunit. The fungal homologs, like yeast Sst2, share a related common domain architecture, containing RGS and DEP domains. Sst2 has been identified as the principal regulator of mating pheromone signaling and recently the DEP domain of Sst2 has been shown to be necessary and sufficient to mediate receptor interaction.
  • Changed! cd GGL 56aa 3e-13 in ref transcript
    • G protein gamma subunit-like motifs, the alpha-helical G-gamma chain dimerizes with the G-beta propeller subunit as part of the heterotrimeric G-protein complex; involved in signal transduction via G-protein-coupled receptors.
  • Changed! pfam RGS 115aa 7e-35 in ref transcript
    • Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
  • Changed! smart GGL 62aa 2e-18 in ref transcript
    • G protein gamma subunit-like motifs.
  • Changed! pfam DEP 80aa 1e-16 in ref transcript
    • Domain found in Dishevelled, Egl-10, and Pleckstrin. The DEP domain is responsible for mediating intracellular protein targeting and regulation of protein stability in the cell. The DEP domain is present in a number of signaling molecules, including Regulator of G protein Signaling (RGS) proteins, and has been implicated in membrane targeting. New findings in yeast, however, demonstrate a major role for a DEP domain in mediating the interaction of an RGS protein to the C-terminal tail of a GPCR, thus placing RGS in close proximity with its substrate G protein alpha subunit.

RGS8

  • rs.RGS8.F1 rs.RGS8.R1 153 282
  • NCBIGene 36.3 85397
  • Single exon skipping, size difference: 129
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_033345

  • Changed! pfam RGS 115aa 2e-38 in ref transcript
    • Regulator of G protein signaling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.

RHBDD2

  • rs.RHBDD2.F1 rs.RHBDD2.R1 295 417
  • NCBIGene 36.3 57414
  • Single exon skipping, size difference: 122
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001040456

  • Changed! pfam Rhomboid 146aa 6e-11 in ref transcript
    • Rhomboid family. This family contains integral membrane proteins that are related to Drosophila rhomboid protein. Members of this family are found in bacteria and eukaryotes. Rhomboid promotes the cleavage of the membrane-anchored TGF-alpha-like growth factor Spitz, allowing it to activate the Drosophila EGF receptor. Analysis has shown that Rhomboid-1 is an intramembrane serine protease (EC:3.4.21.105). Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite.
  • Changed! COG GlpG 79aa 0.005 in ref transcript
    • Uncharacterized membrane protein (homolog of Drosophila rhomboid) [General function prediction only].

RHOC

  • rs.RHOC.F1 rs.RHOC.R1 293 362
  • NCBIGene 36.3 389
  • Single exon skipping, size difference: 69
  • Exclusion in 5'UTR
  • Reference transcript: NM_175744

  • cd RhoA_like 175aa 1e-106 in ref transcript
    • RhoA-like subfamily. The RhoA subfamily consists of RhoA, RhoB, and RhoC. RhoA promotes the formation of stress fibers and focal adhesions, regulating cell shape, attachment, and motility. RhoA can bind to multiple effector proteins, thereby triggering different downstream responses. In many cell types, RhoA mediates local assembly of the contractile ring, which is necessary for cytokinesis. RhoA is vital for muscle contraction; in vascular smooth muscle cells, RhoA plays a key role in cell contraction, differentiation, migration, and proliferation. RhoA activities appear to be elaborately regulated in a time- and space-dependent manner to control cytoskeletal changes. Most Rho proteins contain a lipid modification site at the C-terminus, with a typical sequence motif CaaX, where a = an aliphatic amino acid and X = any amino acid. Lipid binding is essential for membrane attachment, a key feature of most Rho proteins. RhoA and RhoC are observed only in geranylgeranylated forms; however, RhoB can be present in palmitoylated, farnesylated, and geranylgeranylated forms. RhoA and RhoC are highly relevant for tumor progression and invasiveness; however, RhoB has recently been suggested to be a tumor suppressor. Due to the presence of truncated sequences in this CD, the lipid modification site is not available for annotation.
  • smart RHO 173aa 7e-99 in ref transcript
    • Rho (Ras homology) subfamily of Ras-like small GTPases. Members of this subfamily of Ras-like small GTPases include Cdc42 and Rac, as well as Rho isoforms.
  • COG COG1100 184aa 6e-32 in ref transcript
    • GTPase SAR1 and related small G proteins [General function prediction only].

RIMS2

  • rs.RIMS2.F1 rs.RIMS2.R1 154 220
  • NCBIGene 36.3 9699
  • Single exon skipping, size difference: 66
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001100117

  • cd C2_1 121aa 3e-28 in ref transcript
    • Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
  • cd C2_1 124aa 2e-22 in ref transcript
  • cd PDZ_signaling 82aa 2e-11 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart C2 104aa 5e-14 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • smart C2 108aa 2e-12 in ref transcript
  • smart PDZ 91aa 3e-11 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • pfam RPH3A_effector 132aa 2e-08 in ref transcript
    • Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas.
  • COG Prc 81aa 0.004 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

RIMS2

  • rs.RIMS2.F2 rs.RIMS2.R2 177 225
  • NCBIGene 36.3 9699
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100117

  • cd C2_1 121aa 3e-28 in ref transcript
    • Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
  • Changed! cd C2_1 124aa 2e-22 in ref transcript
  • cd PDZ_signaling 82aa 2e-11 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • smart C2 104aa 5e-14 in ref transcript
    • Protein kinase C conserved region 2 (CalB). Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotamins (among others). Some do not appear to contain Ca2+-binding sites. Particular C2s appear to bind phospholipids, inositol polyphosphates, and intracellular proteins. Unusual occurrence in perforin. Synaptotagmin and PLC C2s are permuted in sequence with respect to N- and C-terminal beta strands. SMART detects C2 domains using one or both of two profiles.
  • smart C2 108aa 2e-12 in ref transcript
  • smart PDZ 91aa 3e-11 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • pfam RPH3A_effector 132aa 2e-08 in ref transcript
    • Rabphilin-3A effector domain. This is a family of proteins involved in protein transport in synaptic vesicles. Rabphilin-3A has been shown to contact Rab3A, a small G protein important in neurotransmitter release, in two distinct areas.
  • COG Prc 81aa 0.004 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].
  • Changed! cd C2_1 125aa 6e-23 in modified transcript

RLN2

  • rs.RLN2.F1 rs.RLN2.R1 271 372
  • NCBIGene 36.3 6019
  • Single exon skipping, size difference: 101
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_134441

  • Changed! cd IlGF_relaxin_like 40aa 3e-10 in ref transcript
    • IlGF_like family, relaxin_like subgroup, specific to vertebrates. Members include a number of active peptides including (pro)relaxin, mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP; gene INSL4), and insulin-like peptides 5 (INSL5) and 6 (INSL6). Members of this subgroup are widely expressed in testes (INSL3, INSL6), decidua, placenta, prostate, corpus luteum, brain (various relaxins), GI tract, and kidney (INSL5) where they serve a variety of functions in parturition and development. Typically, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain: Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved to yield two separate chains linked covalently by the two disulfide bonds.
  • cd IlGF_relaxin_like 24aa 5e-05 in ref transcript
  • Changed! pfam Insulin 50aa 3e-08 in ref transcript
    • Insulin/IGF/Relaxin family. Superfamily includes insulins; relaxins; insulin-like growth factor; and bombyxin. All are secreted regulatory hormones. Disulfide rich, all-alpha fold. Alignment includes B chain, linker (which is processed out of the final product), and A chain.
  • Changed! smart IlGF 40aa 2e-06 in ref transcript
    • Insulin / insulin-like growth factor / relaxin family. Family of proteins including insulin, relaxin, and IGFs. Insulin decreases blood glucose concentration.
  • Changed! smart IlGF 37aa 3e-06 in modified transcript

RNASE1

  • rs.RNASE1.F1 rs.RNASE1.R1 100 112
  • NCBIGene 36.3 6035
  • Alternative 3-prime, size difference: 12
  • Inclusion in 5'UTR
  • Reference transcript: NM_198234

  • cd RNase_A_canonical 120aa 1e-39 in ref transcript
    • Canonical RNase A family includes all vertebrate homologues to the bovine pancreatic ribonuclease A (RNase A) that contain the catalytic site, necessary for RNase activity. In the human genome 8 RNases , refered to as "canonical" RNases, have been identified, pancreatic RNase (RNase 1), Eosinophil Derived Neurotoxin (SEDN/RNASE 2), Eosinophil Cationic Protein (ECP/RNase 3), RNase 4, Angiogenin (RNase 5), RNase 6 or k6, the skin derived RNase (RNase 7) and RNase 8. The eight human genes are all located in a cluster on chromosome 14. Canonical RNase A enzymes have special biological activities; for example, some stimulate the development of vascular endothelial cells, dendritic cells, and neurons, are cytotoxic/anti-tumoral and/or anti-pathogenic. RNase A is involved in endonucleolytic cleavage of 3'-phosphomononucleotides and 3'-phosphooligonucleotides ending in C-P or U-P with 2',3'-cyclic phosphate intermediates. The catalytic mechanism is a transphosphorylation of P-O 5' bonds on the 3' side of pyrimidines and subsequent hydrolysis to generate 3' phosphate groups. The canonical RNase A family proteins have a conserved catalytic triad (two histidines and one lysine). They also share 6 to 8 cysteines that form three to four disulfide bonds. Two disulfide bonds that are close to the N and C termini contribute most significantly to conformational stability. Angiogenin or RNAse 5 has been implicated in tumor-associated angiogenesis. Comparative analysis in mammals and birds indicates that the whole family may have originated from a RNase 5-like gene. This hypothesis is supported by the fact that only RNase 5-like RNases have been reported outside the mammalian class. The RNase 5 group would therefore be the most ancient form of this family, and all other members would have arisen during mammalian evolution.
  • smart RNAse_Pc 124aa 1e-54 in ref transcript
    • Pancreatic ribonuclease.

RNASEN

  • rs.RNASEN.F1 rs.RNASEN.R1 172 248
  • NCBIGene 36.3 29102
  • Single exon skipping, size difference: 76
  • Inclusion in 5'UTR
  • Reference transcript: NM_013235

  • cd RIBOc 133aa 1e-32 in ref transcript
    • RIBOc. Ribonuclease III C terminal domain. This group consists of eukaryotic, bacterial and archeal ribonuclease III (RNAse III) proteins. RNAse III is a double stranded RNA-specific endonuclease. Prokaryotic RNAse III is important in post-transcriptional control of mRNA stability and translational efficiency. It is involved in the processing of ribosomal RNA precursors. Prokaryotic RNAse III also plays a role in the maturation of tRNA precursors and in the processing of phage and plasmid transcripts. Eukaryotic RNase III's participate (through direct cleavage) in rRNA processing, in processing of small nucleolar RNAs (snoRNAs) and snRNA's (components of the spliceosome). In eukaryotes RNase III or RNaseIII like enzymes such as Dicer are involved in RNAi (RNA interference) and miRNA (micro-RNA) gene silencing.
  • cd RIBOc 115aa 2e-29 in ref transcript
  • cd DSRM 72aa 1e-11 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • TIGR RNaseIII 222aa 2e-47 in ref transcript
    • This family consists of bacterial examples of ribonuclease III. This enzyme cleaves double-stranded rRNA. It is involved in processing ribosomal RNA precursors. It is found even in minimal genones such as Mycoplasma genitalium and Buchnera aphidicola, and in some cases has been shown to be an essential gene. These bacterial proteins contain a double-stranded RNA binding motif (pfam00035) and a ribonuclease III domain (pfam00636). Eukaryotic homologs tend to be much longer proteins with additional domains, localized to the nucleus, and not included in this family.
  • smart RIBOc 133aa 7e-33 in ref transcript
    • Ribonuclease III family.
  • PRK rnc 229aa 1e-51 in ref transcript
    • ribonuclease III; Reviewed.
  • PRK rnc 109aa 8e-27 in ref transcript

RNASEN

  • rs.RNASEN.F2 rs.RNASEN.R2 131 242
  • NCBIGene 36.3 29102
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_013235

  • cd RIBOc 133aa 1e-32 in ref transcript
    • RIBOc. Ribonuclease III C terminal domain. This group consists of eukaryotic, bacterial and archeal ribonuclease III (RNAse III) proteins. RNAse III is a double stranded RNA-specific endonuclease. Prokaryotic RNAse III is important in post-transcriptional control of mRNA stability and translational efficiency. It is involved in the processing of ribosomal RNA precursors. Prokaryotic RNAse III also plays a role in the maturation of tRNA precursors and in the processing of phage and plasmid transcripts. Eukaryotic RNase III's participate (through direct cleavage) in rRNA processing, in processing of small nucleolar RNAs (snoRNAs) and snRNA's (components of the spliceosome). In eukaryotes RNase III or RNaseIII like enzymes such as Dicer are involved in RNAi (RNA interference) and miRNA (micro-RNA) gene silencing.
  • cd RIBOc 115aa 2e-29 in ref transcript
  • cd DSRM 72aa 1e-11 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • TIGR RNaseIII 222aa 2e-47 in ref transcript
    • This family consists of bacterial examples of ribonuclease III. This enzyme cleaves double-stranded rRNA. It is involved in processing ribosomal RNA precursors. It is found even in minimal genones such as Mycoplasma genitalium and Buchnera aphidicola, and in some cases has been shown to be an essential gene. These bacterial proteins contain a double-stranded RNA binding motif (pfam00035) and a ribonuclease III domain (pfam00636). Eukaryotic homologs tend to be much longer proteins with additional domains, localized to the nucleus, and not included in this family.
  • smart RIBOc 133aa 7e-33 in ref transcript
    • Ribonuclease III family.
  • PRK rnc 229aa 1e-51 in ref transcript
    • ribonuclease III; Reviewed.
  • PRK rnc 109aa 8e-27 in ref transcript

RNF13

  • rs.RNF13.F1 rs.RNF13.R1 90 100
  • NCBIGene 36.3 11342
  • Alternative 5-prime, size difference: 10
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_007282

  • Changed! cd PA_C_RZF_like 158aa 8e-50 in ref transcript
    • PA_C-RZF_ like: Protease-associated (PA) domain C_RZF-like. This group includes various PA domain-containing proteins similar to C-RZF (chicken embryo RING zinc finger) protein. These proteins contain a C3H2C3 RING finger. C-RZF is expressed in embryo cells and is restricted mainly to brain and heart, it is localized to both the nucleus and endosomes. Additional C3H2C3 RING finger proteins belonging to this group, include Arabidopsis ReMembR-H2 protein and mouse sperizin. ReMembR-H2 is likely to be an integral membrane protein, and to traffic through the endosomal pathway. Sperizin is expressed in haploid germ cells and localized in the cytoplasm, it may participate in spermatogenesis. The significance of the PA domain to these proteins has not been ascertained. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate.
  • Changed! cd RING 46aa 1e-10 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • Changed! pfam PA 92aa 2e-11 in ref transcript
    • PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
  • Changed! smart RING 42aa 7e-09 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! COG COG5540 253aa 1e-11 in ref transcript
    • RING-finger-containing ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd PA_C_RZF_like 116aa 1e-37 in modified transcript
  • Changed! pfam PA 66aa 7e-08 in modified transcript

RORA

  • rs.RORA.F1 rs.RORA.R1 215 239
  • NCBIGene 36.3 6095
  • Mutually exclusive exon skipping, size difference: 24
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_134260

  • cd NR_LBD_ROR_like 241aa 1e-127 in ref transcript
    • The ligand binding domain of Retinoid-related orphan receptors, of the nuclear receptor superfamily. The ligand binding domain (LBD) of Retinoid-related orphan receptors (RORs): Retinoid-related orphan receptors (RORs) are transcription factors belonging to the nuclear receptor superfamily. RORs are key regulators of many physiological processes during embryonic development. RORs bind as monomers to specific ROR response elements (ROREs) consisting of the consensus core motif AGGTCA preceded by a 5-bp A/T-rich sequence. Transcription regulation by RORs is mediated through certain corepressors, as well as coactivators. There are three subtypes of retinoid-related orphan receptors (RORs), alpha, beta, and gamma that differ only in N-terminal sequence and are distributed in distinct tissues. RORalpha plays a key role in the development of the cerebellum, particularly in the regulation of the maturation and survival of Purkinje cells. RORbeta expression is largely restricted to several regions of the brain, the retina, and pineal gland. RORgamma is essential for lymph node organogenesis. Recently, it has been su ggested that cholesterol or a cholesterol derivative is the natural ligand of RORalpha. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, retinoid-related orphan receptors have a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a non-conserved hinge and a C-terminal ligand binding domain (LBD).
  • pfam zf-C4 76aa 6e-39 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.
  • pfam Hormone_recep 183aa 3e-33 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.

RPL14

  • rs.RPL14.F1 rs.RPL14.R1 122 233
  • NCBIGene 36.3 9045
  • Alternative 5-prime, size difference: 111
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001034996

  • pfam Ribosomal_L14e 77aa 1e-20 in ref transcript
    • Ribosomal protein L14. This family includes the eukaryotic ribosomal protein L14.
  • PTZ PTZ00065 123aa 3e-19 in ref transcript
    • 60S ribosomal protein L14; Provisional.

RPP14

  • rs.RPP14.F1 rs.RPP14.R1 135 369
  • NCBIGene 36.3 11102
  • Alternative 5-prime, size difference: 234
  • Exclusion in 5'UTR
  • Reference transcript: NM_001098783

  • cd R_hydratase 127aa 1e-37 in ref transcript
    • (R)-hydratase [(R)-specific enoyl-CoA hydratase] catalyzes the hydration of trans-2-enoyl CoA to (R)-3-hydroxyacyl-CoA as part of the PHA (polyhydroxyalkanoate) biosynthetic pathway. (R)-hydratase contains a hot-dog fold similar to those of thioesterase II, and beta-hydroxydecanoyl-ACP dehydratase, MaoC dehydratase, Hydratase-Dehydrogenase-Epimerase protein (HDE), and the fatty acid synthase beta subunit. The active site lies within a substrate-binding tunnel formed by the (R)-hydratase homodimer. A subset of the bacterial (R)-hydratases contain a C-terminal phosphotransacetylase (PTA) domain.
  • pfam MaoC_dehydratas 108aa 2e-15 in ref transcript
    • MaoC like domain. The maoC gene is part of a operon with maoA which is involved in the synthesis of monoamine oxidase. The MaoC protein is found to share similarity with a wide variety of enzymes; estradiol 17 beta-dehydrogenase 4, peroxisomal hydratase-dehydrogenase-epimerase, fatty acid synthase beta subunit. Several bacterial proteins that are composed solely of this domain have (R)-specific enoyl-CoA hydratase activity. This domain is also present in the NodN nodulation protein N.
  • PRK PRK08190 131aa 5e-28 in ref transcript
    • bifunctional enoyl-CoA hydratase/phosphate acetyltransferase; Validated.

RPP38

  • rs.RPP38.F1 rs.RPP38.R1 104 461
  • NCBIGene 36.3 10557
  • Alternative 5-prime, size difference: 357
  • Exclusion in 5'UTR
  • Reference transcript: NM_183005

  • pfam Ribosomal_L7Ae 79aa 0.005 in ref transcript
    • Ribosomal protein L7Ae/L30e/S12e/Gadd45 family. This family includes: Ribosomal L7A from metazoa, Ribosomal L8-A and L8-B from fungi, 30S ribosomal protein HS6 from archaebacteria, 40S ribosomal protein S12 from eukaryotes, Ribosomal protein L30 from eukaryotes and archaebacteria. Gadd45 and MyD118.

RPS15A

  • rs.RPS15A.F1 rs.RPS15A.R1 474 540
  • NCBIGene 36.3 6210
  • Alternative 5-prime, size difference: 66
  • Exclusion in 5'UTR
  • Reference transcript: NM_001019

  • pfam Ribosomal_S8 125aa 3e-37 in ref transcript
    • Ribosomal protein S8.
  • PTZ PTZ00158 130aa 8e-68 in ref transcript
    • 40S ribosomal protein S15A; Provisional.

RPS24

  • rs.RPS24.F1 rs.RPS24.R1 112 134
  • NCBIGene 36.3 6229
  • Single exon skipping, size difference: 22
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001026

  • pfam Ribosomal_S24e 83aa 4e-16 in ref transcript
    • Ribosomal protein S24e.
  • PTZ PTZ00071 130aa 9e-29 in ref transcript
    • 40S ribosomal protein S24; Provisional.

RRBP1

  • rs.RRBP1.F1 rs.RRBP1.R1 112 189
  • NCBIGene 36.3 6238
  • Single exon skipping, size difference: 77
  • Exclusion in 5'UTR
  • Reference transcript: NM_001042576

  • pfam Rib_recp_KP_reg 144aa 7e-12 in ref transcript
    • Ribosome receptor lysine/proline rich region. This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.
  • TIGR SMC_prok_A 286aa 1e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_B 340aa 5e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • PRK PRK00409 94aa 0.003 in ref transcript
    • recombination and DNA strand exchange inhibitor protein; Reviewed.
  • PRK PRK02224 456aa 0.009 in ref transcript
    • chromosome segregation protein; Provisional.

RSRC2

  • rs.RSRC2.F1 rs.RSRC2.R1 100 122
  • NCBIGene 36.3 65117
  • Alternative 3-prime, size difference: 22
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_198261

RTKN

  • rs.RTKN.F1 rs.RTKN.R1 102 529
  • NCBIGene 36.3 6242
  • Single exon skipping, size difference: 427
  • Exclusion of the protein initiation site
  • Reference transcript: NM_033046

  • cd PH 99aa 5e-06 in ref transcript
    • Pleckstrin homology (PH) domain. PH domains are only found in eukaryotes. They share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains. PH domains are found in cellular signaling proteins such as serine/threonine kinase, tyrosine kinases, regulators of G-proteins, endocytotic GTPases, adaptors, as well as cytoskeletal associated molecules and in lipid associated enzymes.
  • Changed! smart Hr1 56aa 1e-08 in ref transcript
    • Rho effector or protein kinase C-related kinase homology region 1 homologues. Alpha-helical domain found in vertebrate PRK1 and yeast PKC1 protein kinases C. The HR1 in rhophilin bind RhoGTP; those in PRK1 bind RhoA and RhoB. Also called RBD - Rho-binding domain.
  • smart PH 99aa 1e-07 in ref transcript
    • Pleckstrin homology domain. Domain commonly found in eukaryotic signalling proteins. The domain family possesses multiple functions including the abilities to bind inositol phosphates, and various proteins. PH domains have been found to possess inserted domains (such as in PLC gamma, syntrophins) and to be inserted within other domains. Mutations in Brutons tyrosine kinase (Btk) within its PH domain cause X-linked agammaglobulinaemia (XLA) in patients. Point mutations cluster into the positively charged end of the molecule around the predicted binding site for phosphatidylinositol lipids.
  • Changed! smart Hr1 41aa 5e-04 in modified transcript

RTN2

  • rs.RTN2.F1 rs.RTN2.R1 157 376
  • NCBIGene 36.3 6253
  • Single exon skipping, size difference: 219
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_005619

  • Changed! pfam Reticulon 171aa 1e-30 in ref transcript
    • Reticulon. Reticulon, also know as neuroendocrine-specific protein (NSP), is a protein of unknown function which associates with the endoplasmic reticulum. This family represents the C-terminal domain of the three reticulon isoforms and their homologues.
  • Changed! pfam Reticulon 171aa 6e-32 in modified transcript

RUFY2

  • rs.RUFY2.F1 rs.RUFY2.R1 160 262
  • NCBIGene 36.3 55680
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017987

  • cd FYVE 53aa 2e-19 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • pfam FYVE 62aa 2e-24 in ref transcript
    • FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
  • TIGR SMC_prok_B 285aa 3e-13 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • Changed! COG Smc 308aa 1e-12 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • PTZ PTZ00303 65aa 2e-04 in ref transcript
    • phosphatidylinositol kinase; Provisional.
  • Changed! COG Smc 305aa 2e-12 in modified transcript

RUNX2

  • rs.RUNX2.F1 rs.RUNX2.R1 310 376
  • NCBIGene 36.3 860
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001024630

  • pfam Runt 134aa 9e-74 in ref transcript
    • Runt domain.
  • pfam RunxI 78aa 2e-32 in ref transcript
    • Runx inhibition domain. This domain lies to the C-terminus of Runx-related transcription factors and homologous proteins (AML, CBF-alpha, PEBP2). Its function might be to interact with functional cofactors.

RUSC1

  • rs.RUSC1.F1 rs.RUSC1.R1 121 439
  • NCBIGene 36.3 23623
  • Alternative 3-prime, size difference: 318
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001105203

  • cd SH3 50aa 3e-08 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! pfam RUN 137aa 5e-21 in ref transcript
    • RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
  • smart SH3 53aa 6e-10 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! pfam RUN 92aa 5e-16 in modified transcript

RUSC1

  • rs.RUSC1.F2 rs.RUSC1.R2 138 181
  • NCBIGene 36.3 23623
  • Alternative 5-prime, size difference: 43
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001105205

  • Changed! cd SH3 50aa 7e-09 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! pfam RUN 137aa 1e-20 in ref transcript
    • RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
  • Changed! smart SH3 53aa 1e-10 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

RYR1

  • rs.RYR1.F1 rs.RYR1.R1 125 140
  • NCBIGene 36.3 6261
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000540

  • pfam Ins145_P3_rec 199aa 1e-71 in ref transcript
    • Inositol 1,4,5-trisphosphate/ryanodine receptor. This domain corresponds to the ligand binding region on inositol 1,4,5-trisphosphate receptor, and the N terminal region of the ryanodine receptor. Both receptors are involved in Ca2+ release. They can couple to the activation of neurotransmitter-gated receptors and voltage-gated Ca2+ channels on the plasma membrane, thus allowing the endoplasmic reticulum discriminate between different types of neuronal activity.
  • pfam RYDR_ITPR 204aa 1e-62 in ref transcript
    • RIH domain. The RIH (RyR and IP3R Homology) domain is an extracellular domain from two types of calcium channels. This region is found in the ryanodine receptor and the inositol-1,4,5- trisphosphate receptor. This domain may form a binding site for IP3.
  • pfam MIR 179aa 3e-51 in ref transcript
    • MIR domain. The MIR (protein mannosyltransferase, IP3R and RyR) domain is a domain that may have a ligand transferase function.
  • pfam RYDR_ITPR 209aa 2e-49 in ref transcript
  • pfam RIH_assoc 133aa 7e-42 in ref transcript
    • RyR and IP3R Homology associated. This eukaryotic domain is found in ryanodine receptors (RyR) and inositol 1,4,5-trisphosphate receptors (IP3R) which together form a superfamily of homotetrameric ligand-gated intracellular Ca2+ channels. There seems to be no known function for this domain. Also see the IP3-binding domain pfam01365 and pfam02815.
  • pfam RyR 95aa 4e-41 in ref transcript
    • RyR domain. This domain is called RyR for Ryanodine receptor. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.
  • pfam RyR 95aa 3e-39 in ref transcript
  • pfam RR_TM4-6 225aa 3e-39 in ref transcript
    • Ryanodine Receptor TM 4-6. This region covers TM regions 4-6 of the ryanodine receptor 1 family.
  • pfam RyR 90aa 3e-38 in ref transcript
  • pfam RyR 89aa 4e-31 in ref transcript
  • smart SPRY 123aa 1e-29 in ref transcript
    • Domain in SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • pfam SPRY 137aa 9e-24 in ref transcript
    • SPRY domain. SPRY Domain is named from SPla and the RYanodine Receptor. Domain of unknown function. Distant homologues are domains in butyrophilin/marenostrin/pyrin homologues.
  • pfam SPRY 141aa 6e-23 in ref transcript

S100A4

  • rs.S100A4.F1 rs.S100A4.R1 136 185
  • NCBIGene 36.3 6275
  • Single exon skipping, size difference: 49
  • Exclusion in 5'UTR
  • Reference transcript: NM_019554

  • cd S-100 87aa 6e-25 in ref transcript
    • S-100: S-100 domain, which represents the largest family within the superfamily of proteins carrying the Ca-binding EF-hand motif. Note that this S-100 hierarchy contains only S-100 EF-hand domains, other EF-hands have been modeled separately. S100 proteins are expressed exclusively in vertebrates, and are implicated in intracellular and extracellular regulatory activities. Intracellularly, S100 proteins act as Ca-signaling or Ca-buffering proteins. The most unusual characteristic of certain S100 proteins is their occurrence in extracellular space, where they act in a cytokine-like manner through RAGE, the receptor for advanced glycation products. Structural data suggest that many S100 members exist within cells as homo- or heterodimers and even oligomers; oligomerization contributes to their functional diversification. Upon binding calcium, most S100 proteins change conformation to a more open structure exposing a hydrophobic cleft. This hydrophobic surface represents the interaction site of S100 proteins with their target proteins. There is experimental evidence showing that many S100 proteins have multiple binding partners with diverse mode of interaction with different targets. In addition to S100 proteins (such as S100A1,-3,-4,-6,-7,-10,-11,and -13), this group includes the ''fused'' gene family, a group of calcium binding S100-related proteins. The ''fused'' gene family includes multifunctional epidermal differentiation proteins - profilaggrin, trichohyalin, repetin, hornerin, and cornulin; functionally these proteins are associated with keratin intermediate filaments and partially crosslinked to the cell envelope. These ''fused'' gene proteins contain N-terminal sequence with two Ca-binding EF-hands motif, which may be associated with calcium signaling in epidermal cells and autoprocessing in a calcium-dependent manner. In contrast to S100 proteins, "fused" gene family proteins contain an extraordinary high number of almost perfect peptide repeats with regular array of polar and charged residues similar to many known cell envelope proteins.
  • pfam S_100 44aa 2e-13 in ref transcript
    • S-100/ICaBP type calcium binding domain. The S-100 domain is a subfamily of the EF-hand calcium binding proteins.

SCARB1

  • rs.SCARB1.F1 rs.SCARB1.R1 184 313
  • NCBIGene 36.3 949
  • Single exon skipping, size difference: 129
  • Exclusion of the stop codon
  • Reference transcript: NM_005505

  • pfam CD36 414aa 1e-168 in ref transcript
    • CD36 family. The CD36 family is thought to be a novel class of scavenger receptors. There is also evidence suggesting a possible role in signal transduction. CD36 is involved in cell adhesion.

SCARF2

  • rs.SCARF2.F1 rs.SCARF2.R1 100 115
  • NCBIGene 36.3 91179
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153334

SCN3A

  • rs.SCN3A.F1 rs.SCN3A.R1 163 310
  • NCBIGene 36.3 6328
  • Alternative 5-prime, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006922

  • pfam Na_trans_assoc 221aa 1e-65 in ref transcript
    • Sodium ion transport-associated. Members of this family contain a region found exclusively in eukaryotic sodium channels or their subunits, many of which are voltage-gated. Members very often also contain between one and four copies of pfam00520 and, less often, one copy of pfam00612.
  • pfam Ion_trans 226aa 9e-34 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • pfam Ion_trans 159aa 5e-25 in ref transcript
  • pfam Ion_trans 141aa 5e-25 in ref transcript
  • pfam Ion_trans 178aa 1e-24 in ref transcript
  • pfam Ion_trans 72aa 2e-13 in ref transcript

SCN5A

  • rs.SCN5A.F1 SCN5A.R5 271 325
  • NCBIGene 36.3 6331
  • Single exon skipping, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198056

  • pfam Na_trans_assoc 265aa 7e-60 in ref transcript
    • Sodium ion transport-associated. Members of this family contain a region found exclusively in eukaryotic sodium channels or their subunits, many of which are voltage-gated. Members very often also contain between one and four copies of pfam00520 and, less often, one copy of pfam00612.
  • pfam Ion_trans 209aa 2e-29 in ref transcript
    • Ion transport protein. This family contains Sodium, Potassium, Calcium ion channels. This family is 6 transmembrane helices in which the last two helices flank a loop which determines ion selectivity. In some sub-families (e.g. Na channels) the domain is repeated four times, whereas in others (e.g. K channels) the protein forms as a tetramer in the membrane. A bacterial structure of the protein is known for the last two helices but is not the Pfam family due to it lacking the first four helices.
  • Changed! pfam Ion_trans 229aa 8e-26 in ref transcript
  • pfam Ion_trans 123aa 2e-25 in ref transcript
  • pfam Ion_trans 176aa 5e-23 in ref transcript
  • pfam Ion_trans 74aa 2e-13 in ref transcript
  • Changed! pfam Ion_trans 211aa 9e-21 in modified transcript

SCYL1

  • rs.SCYL1.F1 rs.SCYL1.R1 123 174
  • NCBIGene 36.3 57410
  • Alternative 3-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020680

  • cd S_TKc 228aa 3e-16 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • pfam Pkinase 189aa 1e-09 in ref transcript
    • Protein kinase domain.
  • PTZ PTZ00267 251aa 5e-04 in ref transcript
    • NIMA-related protein kinase; Provisional.

SDCBP

  • rs.SDCBP.F1 rs.SDCBP.R1 102 120
  • NCBIGene 36.3 6386
  • Alternative 5-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001007067

  • cd PDZ_signaling 80aa 4e-13 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 74aa 3e-09 in ref transcript
  • smart PDZ 83aa 2e-10 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 76aa 1e-07 in ref transcript
  • COG Prc 127aa 7e-07 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

SEC24B

  • rs.SEC24B.F1 FOX.SEC24B.R1 220 325
  • NCBIGene 36.3 10427
  • Single exon skipping, size difference: 105
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006323

  • cd Sec24-like 244aa 1e-100 in ref transcript
    • Sec24-like: Protein and membrane traffic in eukaryotes is mediated by at least in part by the budding and fusion of intracellular transport vesicles that selectively carry cargo proteins and lipids from donor to acceptor organelles. The two main classes of vesicular carriers within the endocytic and the biosynthetic pathways are COP- and clathrin-coated vesicles. Formation of COPII vesicles requires the ordered assembly of the coat built from several cytosolic components GTPase Sar1, complexes of Sec23-Sec24 and Sec13-Sec31. The process is initiated by the conversion of GDP to GTP by the GTPase Sar1 which then recruits the heterodimeric complex of Sec23 and Sec24. This heterodimeric complex generates the pre-budding complex. The final step leading to membrane deformation and budding of COPII-coated vesicles is carried by the heterodimeric complex Sec13-Sec31. The members of this CD belong to the Sec23-like family. Sec 24 is very similar to Sec23. The Sec23 and Sec24 polypeptides fold into five distinct domains: a beta-barrel, a zinc finger, a vWA or trunk, an all helical region and a carboxy Gelsolin domain. The members of this subgroup carry a partial MIDAS motif and have the overall Para-Rossmann type fold that is characteristic of this superfamily.
  • pfam Sec23_trunk 240aa 5e-90 in ref transcript
    • Sec23/Sec24 trunk domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is known as the trunk domain and has an alpha/beta vWA fold and forms the dimer interface.
  • pfam Sec23_helical 103aa 2e-29 in ref transcript
    • Sec23/Sec24 helical domain. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is composed of five alpha helices.
  • pfam Sec23_BS 85aa 1e-28 in ref transcript
    • Sec23/Sec24 beta-sandwich domain.
  • pfam zf-Sec23_Sec24 39aa 2e-12 in ref transcript
    • Sec23/Sec24 zinc finger. COPII-coated vesicles carry proteins from the endoplasmic reticulum to the Golgi complex. This vesicular transport can be reconstituted by using three cytosolic components containing five proteins: the small GTPase Sar1p, the Sec23p/24p complex, and the Sec13p/Sec31p complex. This domain is found to be zinc binding domain.
  • pfam Gelsolin 75aa 7e-09 in ref transcript
    • Gelsolin repeat.
  • COG COG5028 797aa 1e-143 in ref transcript
    • Vesicle coat complex COPII, subunit SEC24/subunit SFB2/subunit SFB3 [Intracellular trafficking and secretion].

SEC31A

  • rs.SEC31A.F1 rs.SEC31A.R1 155 200
  • NCBIGene 36.3 22872
  • Alternative 5-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077207

  • cd WD40 244aa 4e-22 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • COG COG2319 306aa 6e-15 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

SEMA6D

  • rs.SEMA6D.F1 rs.SEMA6D.R1 152 209
  • NCBIGene 36.3 80031
  • Single exon skipping, size difference: 57
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_153618

  • pfam Sema 418aa 1e-121 in ref transcript
    • Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
  • pfam PSI 45aa 7e-07 in ref transcript
    • Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
  • pfam DUF1043 111aa 0.007 in ref transcript
    • Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.

SEMA6D

  • rs.SEMA6D.F2 rs.SEMA6D.R2 342 475
  • NCBIGene 36.3 80031
  • Alternative 5-prime, size difference: 133
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_153618

  • pfam Sema 418aa 1e-121 in ref transcript
    • Sema domain. The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in the hepatocyte growth factor receptor and the human Plexin-A3 precursor.
  • pfam PSI 45aa 7e-07 in ref transcript
    • Plexin repeat. A cysteine rich repeat found in several different extracellular receptors. The function of the repeat is unknown. Three copies of the repeat are found Plexin. Two copies of the repeat are found in mahogany protein. A related Caenorhabditis elegans protein contains four copies of the repeat. The Met receptor contains a single copy of the repeat. The Pfam alignment shows 6 conserved cysteine residues that may form three conserved disulphide bridges, whereas shows 8 conserved cysteines. The pattern of conservation suggests that cysteines 5 and 7 (that are not absolutely conserved) form a disulphide bridge (Personal observation. A Bateman).
  • Changed! pfam DUF1043 111aa 0.007 in ref transcript
    • Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function.

SENP6

  • rs.SENP6.F1 rs.SENP6.R1 121 142
  • NCBIGene 36.3 26054
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015571

  • pfam Peptidase_C48 120aa 9e-10 in ref transcript
    • Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn.
  • pfam Peptidase_C48 95aa 5e-08 in ref transcript
  • COG ULP1 112aa 3e-18 in ref transcript
    • Protease, Ulp1 family [Posttranslational modification, protein turnover, chaperones].
  • COG ULP1 91aa 2e-06 in ref transcript

SENP7

  • rs.SENP7.F1 rs.SENP7.R1 191 386
  • NCBIGene 36.3 57337
  • Single exon skipping, size difference: 195
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_020654

  • pfam Peptidase_C48 242aa 3e-20 in ref transcript
    • Ulp1 protease family, C-terminal catalytic domain. This domain contains the catalytic triad Cys-His-Asn.
  • COG ULP1 114aa 7e-21 in ref transcript
    • Protease, Ulp1 family [Posttranslational modification, protein turnover, chaperones].
  • COG ULP1 81aa 0.009 in ref transcript

SEPT8

  • rs.SEPT8.F1 rs.SEPT8.R1 137 405
  • NCBIGene 36.3 23176
  • Alternative 3-prime, size difference: 268
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001098812

  • cd CDC_Septin 269aa 1e-110 in ref transcript
    • CDC/Septin. Septins are a conserved family of GTP-binding proteins associated with diverse processes in dividing and non-dividing cells. They were first discovered in the budding yeast S. cerevisiae as a set of genes (CDC3, CDC10, CDC11 and CDC12) required for normal bud morphology. Septins are also present in metazoan cells, where they are required for cytokinesis in some systems, and implicated in a variety of other processes involving organization of the cell cortex and exocytosis. In humans, 12 septin genes generate dozens of polypeptides, many of which comprise heterooligomeric complexes. Since septin mutants are commonly defective in cytokinesis and formation of the neck formation of the neck filaments/septin rings, septins have been considered to be the primary constituents of the neck filaments. Septins belong to the GTPase superfamily for their conserved GTPase motifs and enzymatic activities.
  • pfam Septin 268aa 3e-84 in ref transcript
    • Septin. Members of this family include CDC3, CDC10, CDC11 and CDC12/Septin. Members of this family bind GTP. As regards the septins, these are polypeptides of 30-65kDa with three characteristic GTPase motifs (G-1, G-3 and G-4) that are similar to those of the Ras family. The G-4 motif is strictly conserved with a unique septin consensus of AKAD. Most septins are thought to have at least one coiled-coil region, which in some cases is necessary for intermolecular interactions that allow septins to polymerise to form rod-shaped complexes. In turn, these are arranged into tandem arrays to form filaments. They are multifunctional proteins, with roles in cytokinesis, sporulation, germ cell development, exocytosis and apoptosis.
  • COG CDC3 322aa 6e-74 in ref transcript
    • Septin family protein [Cell division and chromosome partitioning / Cytoskeleton].

SERBP1

  • rs.SERBP1.F1 rs.SERBP1.R1 103 148
  • NCBIGene 36.3 26135
  • Alternative 3-prime, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001018067

  • Changed! pfam HABP4_PAI-RBP1 125aa 2e-16 in ref transcript
    • Hyaluronan / mRNA binding family. This family includes the HABP4 family of hyaluronan-binding proteins, and the PAI-1 mRNA-binding protein, PAI-RBP1. HABP4 has been observed to bind hyaluronan (a glucosaminoglycan), but it is not known whether this is its primary role in vivo. It has also been observed to bind RNA, but with a lower affinity than that for hyaluronan. PAI-1 mRNA-binding protein specifically binds the mRNA of type-1 plasminogen activator inhibitor (PAI-1), and is thought to be involved in regulation of mRNA stability. However, in both cases, the sequence motifs predicted to be important for ligand binding are not conserved throughout the family, so it is not known whether members of this family share a common function.
  • Changed! pfam HABP4_PAI-RBP1 110aa 9e-18 in modified transcript

SERPINB8

  • rs.SERPINB8.F1 rs.SERPINB8.R1 100 117
  • NCBIGene 36.3 5271
  • Alternative 5-prime, size difference: 17
  • Exclusion in 5'UTR
  • Reference transcript: NM_198833

  • cd PAI-2 371aa 1e-137 in ref transcript
    • Plasminogen Activator Inhibitor-2 (PAI-2). PAI-2 is a serine protease inhibitor that belongs to the ov-serpin branch of the serpin superfamily. It is is an effective inhibitor of urinary plasminogen activator (urokinase or uPA) and is involved in cell differentiation, tissue growth and regeneration.
  • smart SERPIN 362aa 1e-116 in ref transcript
    • SERine Proteinase INhibitors.
  • COG COG4826 372aa 1e-60 in ref transcript
    • Serine protease inhibitor [Posttranslational modification, protein turnover, chaperones].

SEZ6

  • rs.SEZ6.F1 rs.SEZ6.R1 103 119
  • NCBIGene 36.3 124925
  • Alternative 3-prime, size difference: 16
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_178860

  • cd CUB 111aa 9e-19 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 109aa 6e-16 in ref transcript
  • cd CCP 57aa 1e-10 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CCP 59aa 2e-08 in ref transcript
  • cd CCP 60aa 1e-07 in ref transcript
  • cd CCP 56aa 2e-06 in ref transcript
  • cd CCP 57aa 3e-04 in ref transcript
  • smart CUB 99aa 1e-15 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • smart CUB 100aa 1e-14 in ref transcript
  • pfam Sushi 56aa 2e-12 in ref transcript
    • Sushi domain (SCR repeat).
  • smart CCP 60aa 7e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 58aa 7e-08 in ref transcript
  • pfam Sushi 55aa 5e-07 in ref transcript
  • smart CCP 56aa 4e-05 in ref transcript

SEZ6L2

  • rs.SEZ6L2.F1 rs.SEZ6L2.R1 123 333
  • NCBIGene 36.3 26470
  • Alternative 3-prime, size difference: 210
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_201575

  • cd CUB 91aa 6e-11 in ref transcript
    • CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.
  • cd CUB 69aa 7e-11 in ref transcript
  • cd CCP 57aa 4e-09 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
  • cd CUB 113aa 3e-08 in ref transcript
  • cd CCP 59aa 3e-08 in ref transcript
  • cd CCP 60aa 2e-07 in ref transcript
  • cd CCP 56aa 2e-06 in ref transcript
  • pfam CUB 91aa 2e-08 in ref transcript
    • CUB domain.
  • smart CCP 56aa 3e-08 in ref transcript
    • Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
  • smart CCP 56aa 5e-08 in ref transcript
  • pfam Sushi 58aa 1e-07 in ref transcript
    • Sushi domain (SCR repeat).
  • smart CCP 60aa 3e-07 in ref transcript
  • smart CUB 59aa 3e-06 in ref transcript
    • Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.
  • smart CUB 102aa 5e-04 in ref transcript
  • pfam Sushi 57aa 0.003 in ref transcript

SFRS12

  • rs.SFRS12.F1 rs.SFRS12.R1 372 496
  • NCBIGene 36.3 140890
  • Single exon skipping, size difference: 124
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001077199

  • Changed! cd RRM 73aa 2e-14 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 74aa 2e-05 in ref transcript
  • Changed! smart RRM_2 72aa 5e-14 in ref transcript
    • RNA recognition motif.
  • Changed! smart RRM 71aa 1e-04 in ref transcript
    • RNA recognition motif.
  • Changed! COG COG0724 106aa 7e-05 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! pfam RRM_1 69aa 1e-05 in modified transcript
    • RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain). The RRM motif is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment The LA proteins have a N terminus rrm which is included in the seed. There is a second region towards the C terminus that has some features of a rrm but does not appear to have the important structural core of a rrm. The LA proteins are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease.
  • Changed! COG COG0724 86aa 7e-04 in modified transcript

SFRS18

  • rs.SFRS18.F1 rs.SFRS18.R1 203 283
  • NCBIGene 36.3 25957
  • Single exon skipping, size difference: 80
  • Exclusion in 5'UTR
  • Reference transcript: NM_032870

SGCE

  • rs.SGCE.F1 rs.SGCE.R1 163 198
  • NCBIGene 36.3 8910
  • Single exon skipping, size difference: 35
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001099401

  • pfam Sarcoglycan_2 389aa 1e-168 in ref transcript
    • Sarcoglycan alpha/epsilon. Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear. This family contains alpha and epsilon members.

SGCE

  • rs.SGCE.F2 rs.SGCE.R2 147 222
  • NCBIGene 36.3 8910
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001099401

  • Changed! pfam Sarcoglycan_2 389aa 1e-168 in ref transcript
    • Sarcoglycan alpha/epsilon. Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear. This family contains alpha and epsilon members.
  • Changed! pfam Sarcoglycan_2 386aa 1e-169 in modified transcript

SGCE

  • rs.SGCE.F3 rs.SGCE.R3 137 164
  • NCBIGene 36.3 8910
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001099401

  • Changed! pfam Sarcoglycan_2 389aa 1e-168 in ref transcript
    • Sarcoglycan alpha/epsilon. Sarcoglycans are a subcomplex of transmembrane proteins which are part of the dystrophin-glycoprotein complex. They are expressed in the skeletal, cardiac and smooth muscle. Although numerous studies have been conducted on the sarcoglycan subcomplex in skeletal and cardiac muscle, the manner of the distribution and localisation of these proteins along the nonjunctional sarcolemma is not clear. This family contains alpha and epsilon members.
  • Changed! pfam Sarcoglycan_2 380aa 1e-162 in modified transcript

SGSM1

  • rs.SGSM1.F1 rs.SGSM1.R1 174 339
  • NCBIGene 36.3 129049
  • Single exon skipping, size difference: 165
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001039948

  • smart TBC 145aa 2e-30 in ref transcript
    • Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
  • pfam RUN 146aa 5e-21 in ref transcript
    • RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
  • COG COG5210 212aa 2e-18 in ref transcript
    • GTPase-activating protein [General function prediction only].

SGSM2

  • rs.SGSM2.F1 rs.SGSM2.R1 329 464
  • NCBIGene 36.3 9905
  • Single exon skipping, size difference: 135
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014853

  • smart TBC 147aa 1e-31 in ref transcript
    • Domain in Tre-2, BUB2p, and Cdc16p. Probable Rab-GAPs. Widespread domain present in Gyp6 and Gyp7, thereby giving rise to the notion that it performs a GTP-activator activity on Rab-like GTPases.
  • pfam RUN 144aa 6e-25 in ref transcript
    • RUN domain. This domain is present in several proteins that are linked to the functions of GTPases in the Rap and Rab families. They could hence play important roles in multiple Ras-like GTPase signalling pathways. The domain is comprises six conserved regions, which in some proteins have considerable insertions between them. The domain core is thought to take up a predominantly alpha fold, with basic amino acids in regions A and D possibly playing a functional role in interactions with Ras GTPases.
  • COG COG5210 212aa 1e-19 in ref transcript
    • GTPase-activating protein [General function prediction only].

SH2D5

  • rs.SH2D5.F1 rs.SH2D5.R1 170 299
  • NCBIGene 36.3 400745
  • Single exon skipping, size difference: 129
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001103161

  • cd SH2 88aa 2e-09 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • Changed! cd PTB 119aa 2e-07 in ref transcript
    • Phosphotyrosine-binding (PTB) domain; PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. The PTB domain of SHC binds to a NPXpY sequence. More recent studies have found that some types of PTB domains such as the neuronal protein X11 and in the cell-fate determinant protein Numb can bind to peptides which are not tyrosine phosphorylated; whereas, other PTB domains can bind motifs lacking tyrosine residues altogether.
  • Changed! smart PTB 121aa 3e-09 in ref transcript
    • Phosphotyrosine-binding domain, phosphotyrosine-interaction (PI) domain. PTB/PI domain structure similar to those of pleckstrin homology (PH) and IRS-1-like PTB domains.
  • smart SH2 77aa 0.002 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.

SHMT1

  • rs.SHMT1.F1 rs.SHMT1.R1 215 332
  • NCBIGene 36.3 6470
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004169

  • Changed! cd SHMT 427aa 0.0 in ref transcript
    • Serine-glycine hydroxymethyltransferase (SHMT). This family belongs to pyridoxal phosphate (PLP)-dependent aspartate aminotransferase superfamily (fold I). SHMT carries out interconversion of serine and glycine; it catalyzes the transfer of hydroxymethyl group of N5, N10-methylene tetrahydrofolate to glycine resulting in the formation of serine and tetrahydrofolate. Both eukaryotic and prokaryotic SHMT enzymes form tight obligate homodimers; the mammalian enzyme forms a homotetramer comprising four pyridoxal phosphate-bound active sites.
  • Changed! pfam SHMT 400aa 0.0 in ref transcript
    • Serine hydroxymethyltransferase.
  • Changed! PTZ PTZ00094 457aa 0.0 in ref transcript
    • serine hydroxymethyltransferase; Provisional.
  • Changed! cd SHMT 388aa 1e-167 in modified transcript
  • Changed! pfam SHMT 361aa 1e-178 in modified transcript
  • Changed! PTZ PTZ00094 418aa 0.0 in modified transcript

SHPRH

  • rs.SHPRH.F1 rs.SHPRH.R1 100 447
  • NCBIGene 36.3 257218
  • Alternative 3-prime, size difference: 347
  • Exclusion of the stop codon
  • Reference transcript: NM_001042683

  • cd H15 73aa 5e-11 in ref transcript
    • linker histone 1 and histone 5 domains; the basic subunit of chromatin is the nucleosome, consisting of an octamer of core histones, two full turns of DNA, a linker histone (H1 or H5) and a variable length of linker DNA; H1/H5 are chromatin-associated proteins that bind to the exterior of nucleosomes and dramatically stabilize the highly condensed states of chromatin fibers; stabilization of higher order folding occurs through electrostatic neutralization of the linker DNA segments, through a highly positively charged carboxy- terminal domain known as the AKP helix (Ala, Lys, Pro); thought to be involved in specific protein-protein and protein-DNA interactions and play a role in suppressing core histone tail domain acetylation in the chromatin fiber.
  • cd HELICc 121aa 1e-08 in ref transcript
    • Helicase superfamily c-terminal domain; associated with DEXDc-, DEAD-, and DEAH-box proteins, yeast initiation factor 4A, Ski2p, and Hepatitis C virus NS3 helicases; this domain is found in a wide variety of helicases and helicase related proteins; may not be an autonomously folding unit, but an integral part of the helicase; 4 helicase superfamilies at present according to the organization of their signature motifs; all helicases share the ability to unwind nucleic acid duplexes with a distinct directional polarity; they utilize the free energy from nucleoside triphosphate hydrolysis to fuel their translocation along DNA, unwinding the duplex in the process.
  • cd RING 51aa 1e-04 in ref transcript
    • RING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H).
  • pfam SNF2_N 217aa 2e-14 in ref transcript
    • SNF2 family N-terminal domain. This domain is found in proteins involved in a variety of processes including transcription regulation (e.g., SNF2, STH1, brahma, MOT1), DNA repair (e.g., ERCC6, RAD16, RAD5), DNA recombination (e.g., RAD54), and chromatin unwinding (e.g., ISWI) as well as a variety of other proteins with little functional information (e.g., lodestar, ETL1).
  • pfam Linker_histone 73aa 2e-08 in ref transcript
    • linker histone H1 and H5 family. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures Histone H1 is replaced by histone H5 in some cell types.
  • smart HELICc 81aa 7e-06 in ref transcript
    • helicase superfamily c-terminal domain.
  • pfam SNF2_N 25aa 1e-04 in ref transcript
  • smart RING 47aa 0.001 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • smart PHD 32aa 0.006 in ref transcript
    • PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the RI NG finger and the FYV E finger. It is not yet known if PHD fingers have a common molecular function. Several reports suggest that it can function as a protein-protein interacton domain and it was recently demonstrated that the PHD finger of p300 can cooperate with the adjacent BR OMO domain in nucleosome binding in vitro. Other reports suggesting that the PHD finger is a ubiquitin ligase have been refuted as these domains were RI NG fingers misidentified as PHD fingers.
  • Changed! COG HepA 168aa 1e-16 in ref transcript
    • Superfamily II DNA/RNA helicases, SNF2 family [Transcription / DNA replication, recombination, and repair].
  • COG HepA 282aa 3e-13 in ref transcript
  • COG PEX10 73aa 0.009 in ref transcript
    • RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG HepA 138aa 2e-15 in modified transcript

SIKE

  • rs.SIKE.F1 rs.SIKE.R1 100 112
  • NCBIGene 36.3 80143
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001102396

  • Changed! pfam DUF837 176aa 7e-57 in ref transcript
    • Protein of unknown function (DUF837). This family consists of several eukaryotic proteins of unknown function. One of the family members is a circulating cathodic antigen (CCA) found in Schistosoma mansoni (Blood fluke).
  • Changed! pfam DUF837 172aa 8e-58 in modified transcript

SLA

  • rs.SLA.F1 rs.SLA.R1 105 383
  • NCBIGene 36.3 6503
  • Single exon skipping, size difference: 278
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001045557

  • Changed! cd SH2 93aa 3e-20 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • Changed! smart SH2 85aa 6e-21 in ref transcript
    • Src homology 2 domains. Src homology 2 domains bind phosphotyrosine-containing polypeptides via 2 surface pockets. Specificity is provided via interaction with residues that are distinct from the phosphotyrosine. Only a single occurrence of a SH2 domain has been found in S. cerevisiae.
  • Changed! smart SH3 38aa 5e-04 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.

SLC12A6

  • rs.SLC12A6.F1 rs.SLC12A6.R1 134 179
  • NCBIGene 36.3 9990
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_133647

  • TIGR 2a30 1042aa 0.0 in ref transcript
  • COG PotE 244aa 2e-16 in ref transcript
    • Amino acid transporters [Amino acid transport and metabolism].
  • COG PotE 201aa 5e-08 in ref transcript

SLC12A6

  • rs.SLC12A6.F2 rs.SLC12A6.R2 243 309
  • NCBIGene 36.3 9990
  • Alternative 3-prime, size difference: 66
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001042496

  • TIGR 2a30 1042aa 0.0 in ref transcript
  • COG PotE 244aa 3e-16 in ref transcript
    • Amino acid transporters [Amino acid transport and metabolism].
  • COG PotE 201aa 7e-08 in ref transcript

SLC17A3

  • rs.SLC17A3.F1 rs.SLC17A3.R1 130 364
  • NCBIGene 36.3 10786
  • Single exon skipping, size difference: 234
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098486

  • Changed! cd MFS 356aa 9e-21 in ref transcript
    • The Major Facilitator Superfamily (MFS) is a large and diverse group of secondary transporters that includes uniporters, symporters, and antiporters. MFS proteins facilitate the transport across cytoplasmic or internal membranes of a variety of substrates including ions, sugar phosphates, drugs, neurotransmitters, nucleosides, amino acids, and peptides. They do so using the electrochemical potential of the transported substrates. Uniporters transport a single substrate, while symporters and antiporters transport two substrates in the same or in opposite directions, respectively, across membranes. MFS proteins are typically 400 to 600 amino acids in length, and the majority contain 12 transmembrane alpha helices (TMs) connected by hydrophilic loops. The N- and C-terminal halves of these proteins display weak similarity and may be the result of a gene duplication/fusion event. Based on kinetic studies and the structures of a few bacterial superfamily members, GlpT (glycerol-3-phosphate transporter), LacY (lactose permease), and EmrD (multidrug transporter), MFS proteins are thought to function through a single substrate binding site, alternating-access mechanism involving a rocker-switch type of movement. Bacterial members function primarily for nutrient uptake, and as drug-efflux pumps to confer antibiotic resistance. Some MFS proteins have medical significance in humans such as the glucose transporter Glut4, which is impaired in type II diabetes, and glucose-6-phosphate transporter (G6PT), which causes glycogen storage disease when mutated.
  • Changed! TIGR 2A0114euk 475aa 1e-102 in ref transcript
  • Changed! COG UhpC 228aa 2e-10 in ref transcript
    • Sugar phosphate permease [Carbohydrate transport and metabolism].
  • Changed! cd MFS 272aa 3e-10 in modified transcript
  • Changed! TIGR 2A0114euk 313aa 2e-72 in modified transcript
  • Changed! TIGR 2A0114euk 50aa 6e-07 in modified transcript
  • Changed! COG UhpC 166aa 4e-05 in modified transcript

SLC23A1

  • rs.SLC23A1.F1 rs.SLC23A1.R1 97 109
  • NCBIGene 36.3 9963
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152685

  • Changed! pfam Xan_ur_permease 437aa 8e-61 in ref transcript
    • Permease family. This family includes permeases for diverse substrates such as xanthine, uracil and vitamin C. However many members of this family are functionally uncharacterised and may transport other substrates. Members of this family have ten predicted transmembrane helices.
  • Changed! COG UraA 499aa 2e-43 in ref transcript
    • Xanthine/uracil permeases [Nucleotide transport and metabolism].
  • Changed! pfam Xan_ur_permease 433aa 1e-62 in modified transcript
  • Changed! COG UraA 495aa 2e-44 in modified transcript

SLC25A45

  • rs.SLC25A45.F1 rs.SLC25A45.R1 117 172
  • NCBIGene 36.3 283130
  • Single exon skipping, size difference: 55
  • Exclusion of the protein initiation site
  • Reference transcript: NM_182556

  • pfam Mito_carr 92aa 5e-19 in ref transcript
    • Mitochondrial carrier protein.
  • pfam Mito_carr 92aa 8e-19 in ref transcript
  • Changed! pfam Mito_carr 76aa 2e-15 in ref transcript
  • PTZ PTZ00169 163aa 1e-11 in ref transcript
    • ADP/ATP transporter on adenylate translocase; Provisional.
  • Changed! PTZ PTZ00169 166aa 2e-11 in ref transcript

SLC35C2

  • rs.SLC35C2.F1 rs.SLC35C2.R1 142 205
  • NCBIGene 36.3 51006
  • Alternative 3-prime, size difference: 63
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_173179

  • pfam TPT 149aa 5e-16 in ref transcript
    • Triose-phosphate Transporter family. This family includes transporters with a specificity for triose phosphate.

SLC35C2

  • rs.SLC35C2.F2 rs.SLC35C2.R2 108 303
  • NCBIGene 36.3 51006
  • Alternative 3-prime, size difference: 195
  • Inclusion in 5'UTR
  • Reference transcript: NM_015945

  • pfam TPT 149aa 5e-16 in ref transcript
    • Triose-phosphate Transporter family. This family includes transporters with a specificity for triose phosphate.

SLC38A1

  • rs.SLC38A1.F1 rs.SLC38A1.R1 191 530
  • NCBIGene 36.3 81539
  • Alternative 5-prime, size difference: 339
  • Exclusion in 5'UTR
  • Reference transcript: NM_030674

  • pfam Aa_trans 386aa 2e-42 in ref transcript
    • Transmembrane amino acid transporter protein. This transmembrane region is found in many amino acid transporters including UNC-47 and MTR. UNC-47 encodes a vesicular amino butyric acid (GABA) transporter, (VGAT). UNC-47 is predicted to have 10 transmembrane domains. MTR is a N system amino acid transporter system protein involved in methyltryptophan resistance. Other members of this family include proline transporters and amino acid permeases.
  • PTZ PTZ00206 364aa 6e-22 in ref transcript
    • amino acid transporter; Provisional.

SLC3A2

  • rs.SLC3A2.F1 rs.SLC3A2.R1 313 403
  • NCBIGene 36.3 6520
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012661

  • TIGR trehalose_treC 136aa 4e-16 in ref transcript
    • Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
  • pfam Alpha-amylase 244aa 8e-14 in ref transcript
    • Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain.
  • TIGR treS_nterm 120aa 3e-08 in ref transcript
    • Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PMID:15378530).
  • COG AmyA 340aa 3e-18 in ref transcript
    • Glycosidases [Carbohydrate transport and metabolism].

SLC3A2

  • rs.SLC3A2.F2 rs.SLC3A2.R2 231 324
  • NCBIGene 36.3 6520
  • Single exon skipping, size difference: 93
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001012661

  • TIGR trehalose_treC 136aa 4e-16 in ref transcript
    • Trehalose is a glucose disaccharide that serves in many biological systems as a compatible solute for protection against hyperosmotic and thermal stress. This family describes trehalose-6-phosphate hydrolase, product of the treC (or treA) gene, which is often found together with a trehalose uptake transporter and a trehalose operon repressor.
  • pfam Alpha-amylase 244aa 8e-14 in ref transcript
    • Alpha amylase, catalytic domain. Alpha amylase is classified as family 13 of the glycosyl hydrolases. The structure is an 8 stranded alpha/beta barrel containing the active site, interrupted by a ~70 a.a. calcium-binding domain protruding between beta strand 3 and alpha helix 3, and a carboxyl-terminal Greek key beta-barrel domain.
  • TIGR treS_nterm 120aa 3e-08 in ref transcript
    • Trehalose synthase interconverts maltose and alpha, alpha-trehalose by transglucosylation. This is one of at least three mechanisms for biosynthesis of trehalose, an important and widespread compatible solute. However, it is not driven by phosphate activation of sugars and its physiological role may tend toward trehalose degradation. This view is accentuated by numerous examples of fusion to a probable maltokinase domain. The sequence region described by this model is found both as the whole of a trehalose synthase and as the N-terminal region of a larger fusion protein that includes trehalose synthase activity. Several of these fused trehalose synthases have a domain homologous to proteins with maltokinase activity from Actinoplanes missouriensis and Streptomyces coelicolor (PMID:15378530).
  • COG AmyA 340aa 3e-18 in ref transcript
    • Glycosidases [Carbohydrate transport and metabolism].

SLC47A2

  • rs.SLC47A2.F1 rs.SLC47A2.R1 99 207
  • NCBIGene 36.3 146802
  • Alternative 3-prime, size difference: 108
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152908

  • Changed! TIGR matE 373aa 6e-35 in ref transcript
    • The MATE family consists of probable efflux proteins including a functionally characterized multi drug efflux system from Vibrio parahaemolyticus, a putative ethionine resistance protein of Saccharomyces cerevisiae, and the functionally uncharacterized DNA damage-inducible protein F (DinF) of E. coli. These proteins have 12 probable TMS.
  • Changed! COG NorM 471aa 3e-36 in ref transcript
    • Na+-driven multidrug efflux pump [Defense mechanisms].
  • Changed! TIGR matE 337aa 6e-39 in modified transcript
  • Changed! COG NorM 435aa 7e-40 in modified transcript

SLC4A3

  • rs.SLC4A3.F1 rs.SLC4A3.R1 191 251
  • NCBIGene 36.3 6508
  • Alternative 3-prime, size difference: 60
  • Inclusion in 5'UTR
  • Reference transcript: NM_201574

  • TIGR ae 910aa 0.0 in ref transcript
    • They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.

SLC5A10

  • rs.SLC5A10.F1 rs.SLC5A10.R1 385 433
  • NCBIGene 36.3 125206
  • Alternative 3-prime, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_152351

  • Changed! pfam SSF 446aa 3e-88 in ref transcript
    • Sodium:solute symporter family.
  • Changed! PRK PRK10484 592aa 5e-41 in ref transcript
    • putative symporter YidK; Provisional.
  • Changed! pfam SSF 430aa 3e-90 in modified transcript
  • Changed! PRK PRK10484 576aa 3e-42 in modified transcript

SLC7A3

  • rs.SLC7A3.F1 rs.SLC7A3.R1 123 147
  • NCBIGene 36.3 84889
  • Alternative 3-prime, size difference: 24
  • Inclusion in 5'UTR
  • Reference transcript: NM_001048164

  • TIGR 2A0303 583aa 1e-180 in ref transcript
  • COG PotE 418aa 2e-34 in ref transcript
    • Amino acid transporters [Amino acid transport and metabolism].

SLC7A6

  • rs.SLC7A6.F1 rs.SLC7A6.R1 270 357
  • NCBIGene 36.3 9057
  • Single exon skipping, size difference: 87
  • Exclusion in 5'UTR
  • Reference transcript: NM_001076785

  • TIGR 2A0308 485aa 1e-147 in ref transcript
  • COG PotE 456aa 9e-35 in ref transcript
    • Amino acid transporters [Amino acid transport and metabolism].

SLC9A6

  • rs.SLC9A6.F1 rs.SLC9A6.R1 154 250
  • NCBIGene 36.3 10479
  • Alternative 5-prime, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042537

  • Changed! TIGR b_cpa1 408aa 7e-72 in ref transcript
    • This Hmm is specific for the eukaryotic members members of this family.
  • Changed! COG NhaP 359aa 7e-42 in ref transcript
    • NhaP-type Na+/H+ and K+/H+ antiporters [Inorganic ion transport and metabolism].
  • Changed! TIGR b_cpa1 459aa 1e-72 in modified transcript
  • Changed! COG NhaP 361aa 6e-41 in modified transcript

SLFN11

  • rs.SLFN11.F1 rs.SLFN11.R1 147 255
  • NCBIGene 36.3 91607
  • Alternative 5-prime, size difference: 108
  • Exclusion in 5'UTR
  • Reference transcript: NM_001104587

  • pfam AAA_4 101aa 5e-16 in ref transcript
    • Divergent AAA domain. This family is related to the pfam00004 family, and presumably has the same function (ATP-binding).
  • COG COG2865 87aa 2e-06 in ref transcript
    • Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription].

SLFN11

  • rs.SLFN11.F2 rs.SLFN11.R2 100 162
  • NCBIGene 36.3 91607
  • Alternative 5-prime, size difference: 62
  • Exclusion in 5'UTR
  • Reference transcript: NM_001104587

  • pfam AAA_4 101aa 5e-16 in ref transcript
    • Divergent AAA domain. This family is related to the pfam00004 family, and presumably has the same function (ATP-binding).
  • COG COG2865 87aa 2e-06 in ref transcript
    • Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription].

SLFN11

  • rs.SLFN11.F3 rs.SLFN11.R3 140 300
  • NCBIGene 36.3 91607
  • Single exon skipping, size difference: 160
  • Exclusion in 5'UTR
  • Reference transcript: NM_001104587

  • pfam AAA_4 101aa 5e-16 in ref transcript
    • Divergent AAA domain. This family is related to the pfam00004 family, and presumably has the same function (ATP-binding).
  • COG COG2865 87aa 2e-06 in ref transcript
    • Predicted transcriptional regulator containing an HTH domain and an uncharacterized domain shared with the mammalian protein Schlafen [Transcription].

SLTM

  • rs.SLTM.F1 rs.SLTM.R1 264 318
  • NCBIGene 36.3 79811
  • Alternative 3-prime, size difference: 54
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024755

  • cd RRM 75aa 6e-12 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart RRM 71aa 1e-11 in ref transcript
    • RNA recognition motif.
  • smart SAP 35aa 8e-06 in ref transcript
    • Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation.
  • COG COG0724 112aa 0.001 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].

SMAD5

  • rs.SMAD5.F1 rs.SMAD5.R1 176 260
  • NCBIGene 36.3 4090
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_001001419

  • cd MH2 176aa 1e-73 in ref transcript
    • MH2 domain; C terminal domain of SMAD family proteins, responsible for receptor interaction, transactivation, and homo- and heterooligomerisation; also known as Domain B in dwarfin family proteins.
  • cd MH1 121aa 4e-34 in ref transcript
    • MH1 is a small DNA binding domain, binding in an unusal way involving a beta hairpin structure binding to the major groove. MH1 is present in Smad proteins, an important family of proteins involved in TGF-beta signalling and frequent targets of tumorigenic mutations. Also known as Domain A in dwarfin family proteins.
  • pfam MH2 176aa 1e-80 in ref transcript
    • MH2 domain. This is the MH2 (MAD homology 2) domain found at the carboxy terminus of MAD related proteins such as Smads. This domain is separated from the MH1 domain by a non-conserved linker region. The MH2 domain mediates interaction with a wide variety of proteins and provides specificity and selectivity to Smad function and also is critical for mediating interactions in Smad oligomers. Unlike MH1, MH2 does not bind DNA. The well-studied MH2 domain of Smad4 is composed of five alpha helices and three loops enclosing a beta sandwich. Smads are involved in the propagation of TGF-beta signals by direct association with the TGF-beta receptor kinase which phosphorylates the last two Ser of a conserved 'SSXS' motif located at the C-terminus of MH2.
  • pfam MH1 76aa 2e-35 in ref transcript
    • MH1 domain. The MH1 (MAD homology 1) domain is found at the amino terminus of MAD related proteins such as Smads. This domain is separated from the MH2 domain by a non-conserved linker region. The crystal structure of the MH1 domain shows that a highly conserved 11 residue beta hairpin is used to bind the DNA consensus sequence GNCN in the major groove, shown to be vital for the transcriptional activation of target genes. Not all examples of MH1 can bind to DNA however. Smad2 cannot bind DNA and has a large insertion within the hairpin that presumably abolishes DNA binding. A basic helix (H2) in MH1 with the nuclear localisation signal KKLKK has been shown to be essential for Smad3 nuclear import. Smads also use the MH1 domain to interact with transcription factors such as Jun, TFE3, Sp1, and Runx.

SMAP1

  • rs.SMAP1.F1 rs.SMAP1.R1 151 232
  • NCBIGene 36.3 60682
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001044305

  • pfam ArfGap 100aa 8e-37 in ref transcript
    • Putative GTPase activating protein for Arf. Putative zinc fingers with GTPase activating proteins (GAPs) towards the small GTPase, Arf. The GAP of ARD1 stimulates GTPase hydrolysis for ARD1 but not ARFs.
  • COG COG5347 91aa 2e-29 in ref transcript
    • GTPase-activating protein that regulates ARFs (ADP-ribosylation factors), involved in ARF-mediated vesicular transport [Intracellular trafficking and secretion].

SMN1

  • rs.SMN1.F1 rs.SMN1.R1 118 214
  • NCBIGene 36.3 6606
  • Single exon skipping, size difference: 96
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000344

  • cd TUDOR 48aa 2e-08 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • Changed! pfam SMN 166aa 3e-40 in ref transcript
    • Survival motor neuron protein (SMN). This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.
  • Changed! pfam SMN 35aa 2e-11 in ref transcript
  • Changed! pfam SMN 229aa 7e-52 in modified transcript

SMN2

  • rs.SMN2.F1 rs.SMN2.R1 123 177
  • NCBIGene 36.3 6607
  • Single exon skipping, size difference: 54
  • Exclusion of the stop codon
  • Reference transcript: NM_017411

  • cd TUDOR 48aa 2e-08 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • pfam SMN 166aa 3e-40 in ref transcript
    • Survival motor neuron protein (SMN). This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.
  • Changed! pfam SMN 35aa 2e-11 in ref transcript
  • Changed! pfam SMN 27aa 2e-08 in modified transcript

SNHG3-RCC1

  • rs.SNHG3-RCC1.F1 rs.SNHG3-RCC1.R1 103 246
  • NCBIGene 36.3 751867
  • Single exon skipping, size difference: 143
  • Exclusion in 5'UTR
  • Reference transcript: NM_001048197

  • pfam RCC1 50aa 1e-10 in ref transcript
    • Regulator of chromosome condensation (RCC1).
  • pfam RCC1 49aa 2e-10 in ref transcript
  • pfam RCC1 51aa 1e-09 in ref transcript
  • pfam RCC1 52aa 2e-09 in ref transcript
  • pfam RCC1 66aa 3e-08 in ref transcript
  • pfam RCC1 31aa 2e-04 in ref transcript
  • pfam RCC1 51aa 7e-04 in ref transcript
  • COG ATS1 350aa 3e-42 in ref transcript
    • Alpha-tubulin suppressor and related RCC1 domain-containing proteins [Cell division and chromosome partitioning / Cytoskeleton].

SNRK

  • rs.SNRK.F1 rs.SNRK.R1 338 400
  • NCBIGene 36.3 54861
  • Single exon skipping, size difference: 62
  • Exclusion in 5'UTR
  • Reference transcript: NM_017719

  • cd S_TKc 254aa 6e-82 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • smart S_TKc 243aa 3e-84 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • PTZ PTZ00263 258aa 5e-42 in ref transcript
    • protein kinase A catalytic subunit; Provisional.

SNUPN

  • rs.SNUPN.F1 rs.SNUPN.R1 201 306
  • NCBIGene 36.3 10073
  • Alternative 5-prime, size difference: 105
  • Exclusion in 5'UTR
  • Reference transcript: NM_005701

SNX14

  • rs.SNX14.F1 rs.SNX14.R1 161 293
  • NCBIGene 36.3 57231
  • Multiple exon skipping, size difference: 132
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_153816

  • cd PX_SNX14 123aa 3e-50 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 14. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. SNX14 may be involved in recruiting other proteins to the membrane via protein-protein and protein-ligand interaction. It is expressed in the embryonic nervous system of mice, and is co-expressed in the motoneurons and the anterior pituary with Islet-1. SNX14 shows a similar domain architecture as SNX13, containing an N-terminal PXA domain, a regulator of G protein signaling (RGS) domain, a PX domain, and a C-terminal domain that is conserved in some SNXs.
  • pfam Nexin_C 106aa 5e-25 in ref transcript
    • Sorting nexin C terminal. This region is found a the C terminal of proteins belonging to the sorting nexin family. It is found on proteins which also contain pfam00787.
  • Changed! smart PXA 156aa 6e-22 in ref transcript
    • Domain associated with PX domains. unpubl. observations.
  • pfam PX 105aa 2e-17 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • smart RGS 128aa 2e-07 in ref transcript
    • Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
  • COG COG5391 140aa 0.002 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
  • Changed! smart PXA 131aa 2e-12 in modified transcript

SNX21

  • rs.SNX21.F1 rs.SNX21.R1 99 110
  • NCBIGene 36.3 90203
  • Single exon skipping, size difference: 11
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_033421

  • Changed! cd PX_SNX21 112aa 1e-55 in ref transcript
    • The phosphoinositide binding Phox Homology domain of Sorting Nexin 21. The PX domain is a phosphoinositide (PI) binding module present in many proteins with diverse functions. Sorting nexins (SNXs) make up the largest group among PX domain containing proteins. They are involved in regulating membrane traffic and protein sorting in the endosomal system. The PX domain of SNXs binds PIs and targets the protein to PI-enriched membranes. SNXs differ from each other in PI-binding specificity and affinity, and the presence of other protein-protein interaction domains, which help determine subcellular localization and specific function in the endocytic pathway. Some SNXs are localized in early endosome structures such as clathrin-coated pits, while others are located in late structures of the endocytic pathway. SNX21, also called SNX-L, is distinctly and highly-expressed in fetal liver and may be involved in protein sorting and degradation during embryonic liver development.
  • Changed! pfam PX 106aa 8e-12 in ref transcript
    • PX domain. PX domains bind to phosphoinositides.
  • Changed! COG COG5391 112aa 3e-04 in ref transcript
    • Phox homology (PX) domain protein [Intracellular trafficking and secretion / General function prediction only].
  • Changed! cd PX_SNX21 19aa 1e-04 in modified transcript

SOD2

  • rs.SOD2.F1 rs.SOD2.R1 283 400
  • NCBIGene 36.3 6648
  • Single exon skipping, size difference: 117
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001024465

  • Changed! pfam Sod_Fe_C 106aa 3e-49 in ref transcript
    • Iron/manganese superoxide dismutases, C-terminal domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to hydrogen peroxide and molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. In humans, there is a cytoplasmic Cu/Zn SOD, and a mitochondrial Mn/Fe SOD. C-terminal domain is a mixed alpha/beta fold.
  • Changed! pfam Sod_Fe_N 82aa 3e-34 in ref transcript
    • Iron/manganese superoxide dismutases, alpha-hairpin domain. superoxide dismutases (SODs) catalyse the conversion of superoxide radicals to hydrogen peroxide and molecular oxygen. Three evolutionarily distinct families of SODs are known, of which the Mn/Fe-binding family is one. In humans, there is a cytoplasmic Cu/Zn SOD, and a mitochondrial Mn/Fe SOD. N-terminal domain is a long alpha antiparallel hairpin. A small fragment of YTRE_LEPBI matches well - sequencing error?.
  • Changed! COG SodA 200aa 1e-69 in ref transcript
    • Superoxide dismutase [Inorganic ion transport and metabolism].
  • Changed! pfam Sod_Fe_C 103aa 3e-47 in modified transcript
  • Changed! pfam Sod_Fe_N 59aa 6e-21 in modified transcript
  • Changed! COG SodA 161aa 3e-52 in modified transcript

SORBS1

  • rs.SORBS1.F1 rs.SORBS1.R1 129 195
  • NCBIGene 36.3 10580
  • Mutually exclusive exon skipping, size difference: 66
  • Inclusion in the protein (no stop codon or frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034954

  • cd SH3 54aa 3e-12 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 52aa 8e-11 in ref transcript
  • cd SH3 52aa 6e-10 in ref transcript
  • smart SH3 59aa 4e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 53aa 3e-12 in ref transcript
  • smart SH3 59aa 7e-11 in ref transcript
  • pfam Sorb 41aa 4e-10 in ref transcript
    • Sorbin homologous domain.

SORBS1

  • rs.SORBS1.F2 rs.SORBS1.R2 106 208
  • NCBIGene 36.3 10580
  • Single exon skipping, size difference: 102
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001034954

  • cd SH3 54aa 3e-12 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 52aa 8e-11 in ref transcript
  • cd SH3 52aa 6e-10 in ref transcript
  • smart SH3 59aa 4e-13 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 53aa 3e-12 in ref transcript
  • smart SH3 59aa 7e-11 in ref transcript
  • pfam Sorb 41aa 4e-10 in ref transcript
    • Sorbin homologous domain.

SORBS2

  • rs.SORBS2.F1 rs.SORBS2.R1 156 206
  • NCBIGene 36.3 8470
  • Single exon skipping, size difference: 50
  • Inclusion in 5'UTR
  • Reference transcript: NM_021069

  • cd SH3 52aa 8e-14 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • cd SH3 54aa 9e-12 in ref transcript
  • cd SH3 54aa 3e-11 in ref transcript
  • pfam Sorb 50aa 9e-21 in ref transcript
    • Sorbin homologous domain.
  • smart SH3 53aa 5e-16 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • smart SH3 59aa 2e-12 in ref transcript
  • smart SH3 59aa 2e-12 in ref transcript

SP8

  • rs.SP8.F1 rs.SP8.R1 128 224
  • NCBIGene 36.3 221833
  • Single exon skipping, size difference: 96
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_182700

SPAG11B

  • rs.SPAG11B.F1 rs.SPAG11B.R1 195 271
  • NCBIGene 36.3 10407
  • Single exon skipping, size difference: 76
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_058207

  • Changed! pfam Defensin_beta 36aa 1e-04 in ref transcript
    • Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonisation.

SPAG11B

  • rs.SPAG11B.F2 rs.SPAG11B.R2 117 195
  • NCBIGene 36.3 10407
  • Single exon skipping, size difference: 78
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_058200

  • pfam Sperm_Ag_HE2 70aa 2e-28 in ref transcript
    • Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.

SPAG11B

  • rs.SPAG11B.F3 rs.SPAG11B.R3 180 256
  • NCBIGene 36.3 10407
  • Single exon skipping, size difference: 76
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_058201

  • Changed! pfam Sperm_Ag_HE2 70aa 1e-30 in ref transcript
    • Sperm antigen HE2. This family consists of several variants of the human and chimpanzee sperm antigen proteins (HE2 and EP2 respectively). The EP2 gene codes for a family of androgen-dependent, epididymis-specific secretory proteins.The EP2 gene uses alternative promoters and differential splicing to produce a family of variant messages. The translated putative protein variants differ significantly from each other. Some of these putative proteins have similarity to beta-defensins, a family of antimicrobial peptides.
  • Changed! pfam Defensin_beta 36aa 2e-06 in ref transcript
    • Beta defensin. The beta defensins are antimicrobial peptides implicated in the resistance of epithelial surfaces to microbial colonisation.
  • Changed! pfam Sperm_Ag_HE2 74aa 1e-29 in modified transcript

SPAG6

  • rs.SPAG6.F1 rs.SPAG6.R1 131 277
  • NCBIGene 36.3 9576
  • Single exon skipping, size difference: 146
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_012443

  • cd ARM 119aa 7e-16 in ref transcript
    • Armadillo/beta-catenin-like repeats. An approximately 40 amino acid long tandemly repeated sequence motif first identified in the Drosophila segment polarity gene armadillo; these repeats were also found in the mammalian armadillo homolog beta-catenin, the junctional plaque protein plakoglobin, the adenomatous polyposis coli (APC) tumor suppressor protein, and a number of other proteins. ARM has been implicated in mediating protein-protein interactions, but no common features among the target proteins recognized by the ARM repeats have been identified; related to the HEAT domain; three consecutive copies of the repeat are represented by this alignment model.
  • cd ARM 119aa 4e-14 in ref transcript
  • cd ARM 119aa 6e-10 in ref transcript
  • cd ARM 78aa 0.006 in ref transcript
  • pfam Arm 41aa 3e-05 in ref transcript
    • Armadillo/beta-catenin-like repeat. Approx. 40 amino acid repeat. Tandem repeats form super-helix of helices that is proposed to mediate interaction of beta-catenin with its ligands. CAUTION: This family does not contain all known armadillo repeats.
  • pfam Adaptin_N 359aa 0.003 in ref transcript
    • Adaptin N terminal region. This family consists of the N terminal region of various alpha, beta and gamma subunits of the AP-1, AP-2 and AP-3 adaptor protein complexes. The adaptor protein (AP) complexes are involved in the formation of clathrin-coated pits and vesicles. The N-terminal region of the various adaptor proteins (APs) is constant by comparison to the C-terminal which is variable within members of the AP-2 family; and it has been proposed that this constant region interacts with another uniform component of the coated vesicles.
  • COG SRP1 312aa 1e-19 in ref transcript
    • Karyopherin (importin) alpha [Intracellular trafficking and secretion].

SPG3A

  • rs.SPG3A.F1 rs.SPG3A.R1 123 138
  • NCBIGene 36.3 51062
  • Single exon skipping, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_015915

  • cd GBP 247aa 1e-55 in ref transcript
    • Guanylate-binding protein (GBP), N-terminal domain. Guanylate-binding proteins (GBPs) define a group of proteins that are synthesized after activation of the cell by interferons. The biochemical properties of GBPs are clearly different from those of Ras-like and heterotrimeric GTP-binding proteins. They bind guanine nucleotides with low affinity (micromolar range), are stable in their absence and have a high turnover GTPase. In addition to binding GDP/GTP, they have the unique ability to bind GMP with equal affinity and hydrolyze GTP not only to GDP, but also to GMP. Furthermore, two unique regions around the base and the phosphate-binding areas, the guanine and the phosphate caps, respectively, give the nucleotide-binding site a unique appearance not found in the canonical GTP-binding proteins. The phosphate cap, which constitutes the region analogous to switch I, completely shields the phosphate-binding site from solvent such that a potential GTPase-activating protein (GAP) cannot approach.
  • pfam GBP 272aa 1e-101 in ref transcript
    • Guanylate-binding protein, N-terminal domain. Transcription of the anti-viral guanylate-binding protein (GBP) is induced by interferon-gamma during macrophage induction. This family contains GBP1 and GPB2, both GTPases capable of binding GTP, GDP and GMP.

SPOP

  • rs.SPOP.F1 rs.SPOP.R1 193 245
  • NCBIGene 36.3 8405
  • Single exon skipping, size difference: 52
  • Exclusion in 5'UTR
  • Reference transcript: NM_001007230

  • cd MATH_SPOP 139aa 8e-70 in ref transcript
    • Speckle-type POZ protein (SPOP) family, MATH domain; composed of proteins with similarity to human SPOP. SPOP was isolated as a novel antigen recognized by serum from a scleroderma patient, whose overexpression in COS cells results in a discrete speckled pattern in the nuclei. It contains an N-terminal MATH domain and a C-terminal BTB (also called POZ) domain. Together with Cul3, SPOP constitutes an ubiquitin E3 ligase which is able to ubiquitinate the PcG protein BMI1, the variant histone macroH2A1 and the death domain-associated protein Daxx. Therefore, SPOP may be involved in the regulation of these proteins and may play a role in transcriptional regulation, apoptosis and X-chromosome inactivation. Cul3 binds to the BTB domain of SPOP whereas Daxx and the macroH2A1 nonhistone region have been shown to bind to the MATH domain. Both MATH and BTB domains are necessary for the nuclear speckled accumulation of SPOP. There are many proteins, mostly uncharacterized, containing both MATH and BTB domains from C. elegans and plants which are excluded from this family.
  • pfam BTB 108aa 3e-21 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • smart MATH 102aa 8e-10 in ref transcript
    • meprin and TRAF homology.

SPP1

  • rs.SPP1.F1 rs.SPP1.R1 250 331
  • NCBIGene 36.3 6696
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040058

  • Changed! pfam Osteopontin 314aa 1e-96 in ref transcript
    • Osteopontin.
  • Changed! pfam Osteopontin 287aa 2e-76 in modified transcript

SPP1

  • rs.SPP1.F2 rs.SPP1.R2 373 415
  • NCBIGene 36.3 6696
  • Single exon skipping, size difference: 42
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001040058

  • Changed! pfam Osteopontin 314aa 1e-96 in ref transcript
    • Osteopontin.
  • Changed! pfam Osteopontin 300aa 9e-89 in modified transcript

SPPL2B

  • rs.SPPL2B.F1 rs.SPPL2B.R1 149 212
  • NCBIGene 36.3 56928
  • Alternative 5-prime, size difference: 63
  • Exclusion in 3'UTR
  • Reference transcript: NM_001077238

  • cd PA_hSPPL_like 120aa 6e-51 in ref transcript
    • PA_hSPPL_like: Protease-associated domain containing human signal peptide peptidase-like (hSPPL)-like. This group contains various PA domain-containing proteins similar to hSPPL2a and 2b. These SPPLs are GxGD aspartic proteases. SPPL2a is sorted to the late endosomes, SPPL2b to the plasma membrane. In activated dendritic cells, hSPPL2a and 2b catalyze the intramembrane proteolysis of tumor necrosis factor alpha triggering IL-12 production. hSPPL2a and 2b may have a broad substrate spectrum. The significance of the PA domain to these SPPLs has not been ascertained. It may be a protein-protein interaction domain. At peptidase active sites, the PA domain may participate in substrate binding and/or promoting conformational changes, which influence the stability and accessibility of the site to substrate.
  • pfam PA 81aa 9e-10 in ref transcript
    • PA domain. The PA (Protease associated) domain is found as an insert domain in diverse proteases. The PA domain is also found in a plant vacuolar sorting receptor and members of the RZF family. It has been suggested that this domain forms a lid-like structure that covers the active site in active proteases, and is involved in protein recognition in vacuolar sorting receptors.
  • smart PSN 58aa 0.003 in ref transcript
    • Presenilin, signal peptide peptidase, family. Presenilin 1 and presenilin 2 are polytopic membrane proteins, whose genes are mutated in some individuals with Alzheimer's disease. Distant homologues, present in eukaryotes and archaea, also contain conserved aspartic acid residues which are predicted to contribute to catalysis. At least one member of this family has been shown to possess signal peptide peptidase activity.

SPRR3

  • rs.SPRR3.F1 rs.SPRR3.R1 107 191
  • NCBIGene 36.3 6707
  • Single exon skipping, size difference: 84
  • Exclusion in 5'UTR
  • Reference transcript: NM_005416

  • pfam Cornifin 137aa 1e-18 in ref transcript
    • Cornifin (SPRR) family. SPRR genes (formerly SPR) encode a novel class of polypeptides (small proline rich proteins) that are strongly induced during differentiation of human epidermal keratinocytes in vitro and in vivo.The most characteristic feature of the SPRR gene family resides in the structure of the central segments of the encoded polypeptides that are built up from tandemly repeated units of either eight (SPRR1 and SPRR3) or nine (SPRR2) amino acids with the general consensus XKXPEPXX where X is any amino acid.

SRGAP3

  • rs.SRGAP3.F1 rs.SRGAP3.R1 96 168
  • NCBIGene 36.3 9901
  • Mutually exclusive exon skipping, size difference: 72
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_014850

  • Changed! cd RhoGAP_srGAP 188aa 6e-99 in ref transcript
    • RhoGAP_srGAP: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain present in srGAPs. srGAPs are components of the intracellular part of Slit-Robo signalling pathway that is important for axon guidance and cell migration. srGAPs contain an N-terminal FCH domain, a central RhoGAP domain and a C-terminal SH3 domain; this SH3 domain interacts with the intracellular proline-rich-tail of the Roundabout receptor (Robo). This interaction with Robo then activates the rhoGAP domain which in turn inhibits Cdc42 activity. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
  • cd SH3 50aa 2e-09 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • smart RhoGAP 172aa 1e-51 in ref transcript
    • GTPase-activator protein for Rho-like GTPases. GTPase activator proteins towards Rho/Rac/Cdc42-like small GTPases. etter domain limits and outliers.
  • pfam FCH 94aa 4e-10 in ref transcript
    • Fes/CIP4 homology domain. Alignment extended from. Highly alpha-helical.
  • smart SH3 50aa 1e-09 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! cd RhoGAP_srGAP 177aa 1e-94 in modified transcript

SSBP3

  • rs.SSBP3.F1 rs.SSBP3.R1 364 424
  • NCBIGene 36.3 23648
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145716

  • Changed! pfam SSDP 144aa 2e-06 in ref transcript
    • Single-stranded DNA binding protein, SSDP. This is a family of eukaryotic single-stranded DNA binding proteins with specificity to a pyrimidine-rich element found in the promoter region of the alpha2(I) collagen gene.
  • Changed! pfam SSDP 138aa 2e-07 in modified transcript

SSX5

  • rs.SSX5.F1 rs.SSX5.R1 330 453
  • NCBIGene 36.3 6758
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_021015

  • Changed! smart KRAB 59aa 6e-11 in ref transcript
    • krueppel associated box.
  • pfam SSXRD 34aa 5e-08 in ref transcript
    • SSXRD motif. SSX1 can repress transcription, and this has been attributed to a putative Kruppel associated box (KRAB) repression domain at the N-terminus. However, from the analysis of these deletion constructs further repression activity was found at the C-terminus of SSX1. Which has been called the SSXRD (SSX Repression Domain). The potent repression exerted by full-length SSX1 appears to localise to this region.
  • Changed! smart KRAB 60aa 6e-11 in modified transcript

STAG2

  • rs.STAG2.F1 rs.STAG2.R1 357 422
  • NCBIGene 36.3 10735
  • Single exon skipping, size difference: 65
  • Exclusion in 5'UTR
  • Reference transcript: NM_001042750

  • pfam STAG 107aa 4e-44 in ref transcript
    • STAG domain. STAG domain proteins are subunits of cohesin complex - a protein complex required for sister chromatid cohesion in eukaryotes. The STAG domain is present in Schizosaccharomyces pombe mitotic cohesin Psc3, and the meiosis specific cohesin Rec11. Many organisms express a meiosis-specific STAG protein, for example, mice and humans have a meiosis specific variant called STAG3, although budding yeast does not have a meiosis specific version.
  • COG IRR1 294aa 2e-18 in ref transcript
    • Cohesin [Cell division and chromosome partitioning].

STAG2

  • rs.STAG2.F2 rs.STAG2.R2 208 319
  • NCBIGene 36.3 10735
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042749

  • pfam STAG 107aa 4e-44 in ref transcript
    • STAG domain. STAG domain proteins are subunits of cohesin complex - a protein complex required for sister chromatid cohesion in eukaryotes. The STAG domain is present in Schizosaccharomyces pombe mitotic cohesin Psc3, and the meiosis specific cohesin Rec11. Many organisms express a meiosis-specific STAG protein, for example, mice and humans have a meiosis specific variant called STAG3, although budding yeast does not have a meiosis specific version.
  • COG IRR1 294aa 2e-18 in ref transcript
    • Cohesin [Cell division and chromosome partitioning].

STAG3L1

  • rs.STAG3L1.F1 rs.STAG3L1.R1 108 232
  • NCBIGene 36.3 54441
  • Single exon skipping, size difference: 124
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_018991

  • Changed! COG IRR1 114aa 1e-08 in ref transcript
    • Cohesin [Cell division and chromosome partitioning].

STAT3

  • rs.STAT3.F1 rs.STAT3.R1 105 127
  • NCBIGene 36.3 6774
  • Alternative 3-prime, size difference: 22
  • Exclusion in 5'UTR
  • Reference transcript: NM_139276

  • cd SH2 92aa 4e-06 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • pfam STAT_bind 254aa 1e-123 in ref transcript
    • STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain pfam00017.
  • pfam STAT_alpha 182aa 4e-63 in ref transcript
    • STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
  • pfam STAT_int 115aa 4e-46 in ref transcript
    • STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
  • pfam SH2 91aa 4e-09 in ref transcript
    • SH2 domain.

STAT3

  • rs.STAT3.F2 rs.STAT3.R2 102 152
  • NCBIGene 36.3 6774
  • Alternative 3-prime, size difference: 50
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_139276

  • cd SH2 92aa 4e-06 in ref transcript
    • Src homology 2 domains; Signal transduction, involved in recognition of phosphorylated tyrosine (pTyr). SH2 domains typically bind pTyr-containing ligands via two surface pockets, a pTyr and hydrophobic binding pocket, allowing proteins with SH2 domains to localize to tyrosine phosphorylated sites.
  • pfam STAT_bind 254aa 1e-123 in ref transcript
    • STAT protein, DNA binding domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. This family represents the DNA binding domain of STAT, which has an ig-like fold. STAT proteins also include an SH2 domain pfam00017.
  • pfam STAT_alpha 182aa 4e-63 in ref transcript
    • STAT protein, all-alpha domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
  • pfam STAT_int 115aa 4e-46 in ref transcript
    • STAT protein, protein interaction domain. STAT proteins (Signal Transducers and Activators of Transcription) are a family of transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors. STAT proteins also include an SH2 domain pfam00017.
  • pfam SH2 91aa 4e-09 in ref transcript
    • SH2 domain.

STAU1

  • rs.STAU1.F1 rs.STAU1.R1 177 300
  • NCBIGene 36.3 6780
  • Single exon skipping, size difference: 123
  • Inclusion in 5'UTR
  • Reference transcript: NM_017453

  • cd DSRM 67aa 1e-12 in ref transcript
    • Double-stranded RNA binding motif. Binding is not sequence specific but is highly specific for double stranded RNA. Found in a variety of proteins including dsRNA dependent protein kinase PKR, RNA helicases, Drosophila staufen protein, E. coli RNase III, RNases H1, and dsRNA dependent adenosine deaminases.
  • smart DSRM 67aa 3e-14 in ref transcript
    • Double-stranded RNA binding motif.
  • smart DSRM 32aa 0.001 in ref transcript
  • PRK rnc 72aa 1e-10 in ref transcript
    • ribonuclease III; Reviewed.

STEAP2

  • rs.STEAP2.F1 rs.STEAP2.R1 303 353
  • NCBIGene 36.3 261729
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 50
  • Inclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_001040665

  • pfam F420_oxidored 175aa 3e-37 in ref transcript
    • NADP oxidoreductase coenzyme F420-dependent.
  • COG COG2085 186aa 4e-26 in ref transcript
    • Predicted dinucleotide-binding enzymes [General function prediction only].
  • COG MgtA 151aa 0.001 in ref transcript
    • Cation transport ATPase [Inorganic ion transport and metabolism].

STK19

  • rs.STK19.F1 rs.STK19.R1 100 112
  • NCBIGene 36.3 8859
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032454

  • Changed! pfam Stk19 240aa 4e-83 in ref transcript
    • Serine-threonine protein kinase 19. This serine-threonine protein kinase number 19 is expressed from the MHC and predominantly in the nucleus. Protein kinases are involved in signal transduction pathways and play fundamental roles in the regulation of cell functions. This is a novel Ser/Thr protein kinase, that has Mn2+-dependent protein kinase activity that phosphorylates alpha -casein at Ser/Thr residues and histone at Ser residues. It can be covalently modified by the reactive ATP analogue 5'-p-fluorosulfonylbenzoyladenosine in the absence of ATP, and this modification is prevented in the presence of 1 mM ATP, indicating that the kinase domain of is capable of binding ATP.
  • Changed! pfam Stk19 236aa 2e-83 in modified transcript

STRN3

  • rs.STRN3.F1 rs.STRN3.R1 147 399
  • NCBIGene 36.3 29966
  • Multiple exon skipping, size difference: 252
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001083893

  • cd WD40 322aa 9e-49 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • pfam Striatin 135aa 1e-41 in ref transcript
    • Striatin family. Striatin is an intracellular protein which has a caveolin-binding motif, a coiled-coil structure, a calmodulin-binding site, and a WD (pfam00400) repeat domain. It acts as a scaffold protein and is involved in signalling pathways.
  • smart WD40 40aa 2e-06 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 40aa 5e-06 in ref transcript
  • smart WD40 34aa 1e-05 in ref transcript
  • smart WD40 38aa 2e-04 in ref transcript
  • COG COG2319 318aa 1e-20 in ref transcript
    • FOG: WD40 repeat [General function prediction only].

SUMF2

  • rs.SUMF2.F1 rs.SUMF2.R1 157 202
  • NCBIGene 36.3 25870
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042468

  • Changed! pfam DUF323 269aa 9e-65 in ref transcript
    • Domain of unknown function (DUF323). This presumed domain is found in bacterial and eukaryotic proteins. In some cases these proteins also contain a protein kinase domain. The function of this domain is unknown. The domain has also been found in eukaryotic proteins required for post-translational sulphatase modification (SUMF1). These proteins are associated with the rare disorder multiple sulphatase deficiency (MSD).
  • Changed! COG COG1262 275aa 1e-36 in ref transcript
    • Uncharacterized conserved protein [Function unknown].
  • Changed! pfam DUF323 254aa 2e-63 in modified transcript
  • Changed! COG COG1262 260aa 1e-37 in modified transcript

SUMO2

  • rs.SUMO2.F1 rs.SUMO2.R1 250 322
  • NCBIGene 36.3 6613
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006937

  • Changed! cd Sumo 87aa 6e-31 in ref transcript
    • Small ubiquitin-related modifier (SUMO) proteins are conjugated to numerous intracellular targets and serve to modulate protein interaction, localization, activity or stability. SUMO (also known as "Smt3" and "sentrin" in other organisms) is linked to several different pathways, including nucleocytoplasmic transport. Attachment of SUMO to targets proteins is stimulated by PIAS (Protein inhibitor of activated STATs) proteins which serve as E3-like ligases.
  • Changed! smart UBQ 65aa 4e-08 in ref transcript
    • Ubiquitin homologues. Ubiquitin-mediated proteolysis is involved in the regulated turnover of proteins required for controlling cell cycle progression.
  • Changed! COG SMT3 91aa 8e-18 in ref transcript
    • Ubiquitin-like protein (sentrin) [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd Sumo 63aa 1e-16 in modified transcript
  • Changed! COG SMT3 67aa 3e-08 in modified transcript

SYN1

  • rs.SYN1.F1 rs.SYN1.R1 123 161
  • NCBIGene 36.3 6853
  • Alternative 3-prime, size difference: 38
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006950

  • pfam Synapsin_C 203aa 1e-121 in ref transcript
    • Synapsin, ATP binding domain. Ca dependent ATP binding in this ATP grasp fold. Function unknown.
  • pfam Synapsin 98aa 7e-52 in ref transcript
    • Synapsin, N-terminal domain.
  • pfam Synapsin_N 27aa 2e-10 in ref transcript
    • Synapsin N-terminal. This highly conserved domain of synapsin proteins has a serine at position 9 or 10 which is a phosphorylation site. The domain appears to be the part of the molecule that binds to calmodulin.
  • COG RimK 298aa 4e-06 in ref transcript
    • Glutathione synthase/Ribosomal protein S6 modification enzyme (glutaminyl transferase) [Coenzyme metabolism / Translation, ribosomal structure and biogenesis].

SYNE2

  • rs.SYNE2.F2 rs.SYNE2.R2 132 174
  • NCBIGene 36.3 23224
  • Alternative 3-prime, size difference: 42
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_182914

  • cd SPEC 214aa 1e-12 in ref transcript
    • Spectrin repeats, found in several proteins involved in cytoskeletal structure; family members include spectrin, alpha-actinin and dystrophin; the spectrin repeat forms a three helix bundle with the second helix interrupted by proline in some sequences; the repeats are independent folding units; tandem repeats are found in differing numbers and arrange in an antiparallel manner to form dimers; the repeats are defined by a characteristic tryptophan (W) residue in helix A and a leucine (L) at the carboxyl end of helix C and separated by a linker of 5 residues; two copies of the repeat are present here.
  • cd CH 103aa 5e-12 in ref transcript
    • Calponin homology domain; actin-binding domain which may be present as a single copy or in tandem repeats (which increases binding affinity). The CH domain is found in cytoskeletal and signal transduction proteins, including actin-binding proteins like spectrin, alpha-actinin, dystrophin, utrophin, and fimbrin, proteins essential for regulation of cell shape (cortexillins), and signaling proteins (Vav).
  • cd SPEC 215aa 1e-10 in ref transcript
  • cd CH 103aa 6e-10 in ref transcript
  • cd SPEC 214aa 4e-09 in ref transcript
  • cd SPEC 221aa 8e-08 in ref transcript
  • Changed! cd SPEC 216aa 1e-07 in ref transcript
  • cd SPEC 207aa 2e-05 in ref transcript
  • pfam KASH 60aa 7e-14 in ref transcript
    • Nuclear envelope localisation domain. The KASH (for Klarsicht/ANC-1/Syne-1 homology) or KLS domain is a highly hydrophobic nuclear envelope localisation domain of approximately 60 amino acids comprising an 20-amino-acid transmembrane region and a 30-35-residue C-terminal region that lies between the inner and the outer nuclear membranes.
  • pfam CH 99aa 2e-13 in ref transcript
    • Calponin homology (CH) domain. The CH domain is found in both cytoskeletal proteins and signal transduction proteins. The CH domain is involved in actin binding in some members of the family. However in calponins there is evidence that the CH domain is not involved in its actin binding activity. Most proteins have two copies of the CH domain, however some proteins such as calponin have only a single copy.
  • smart CH 102aa 2e-13 in ref transcript
    • Calponin homology domain. Actin binding domains present in duplicate at the N-termini of spectrin-like proteins (including dystrophin, alpha-actinin). These domains cross-link actin filaments into bundles and networks. A calponin homology domain is predicted in yeasst Cdc24p.
  • pfam SMC_N 302aa 1e-06 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • TIGR SMC_prok_B 305aa 4e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • smart SPEC 102aa 9e-05 in ref transcript
    • Spectrin repeats.
  • smart SPEC 104aa 3e-04 in ref transcript
  • TIGR SMC_prok_A 303aa 3e-04 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
  • TIGR SMC_prok_A 375aa 4e-04 in ref transcript
  • pfam SMC_N 281aa 0.002 in ref transcript
  • COG SAC6 255aa 3e-22 in ref transcript
    • Ca2+-binding actin-bundling protein fimbrin/plastin (EF-Hand superfamily) [Cytoskeleton].
  • COG Smc 266aa 3e-04 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • Changed! cd SPEC 230aa 1e-05 in modified transcript

SYTL2

  • rs.SYTL2.F1 rs.SYTL2.R1 110 158
  • NCBIGene 36.3 54843
  • Single exon skipping, size difference: 48
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_206927

  • cd C2_1 127aa 9e-17 in ref transcript
    • Protein kinase C conserved region 2, subgroup 1; C2 Ca2+-binding motif present in phospholipases, protein kinases C, and synaptotagmins (amongst others); some PKCs lack calcium dependence. Particular C2s appear to bind phospholipids, inositol polyphosphates,and intracellular proteins. Two distinct C2 topologies generated by permutation of the sequence with respect to the N- and C-terminal beta strands are seen. In this subgroup, containing synaptotagmins, specific protein kinases C (PKC) subtypes and other proteins, the N-terminal beta strand occupies the position of what is the C-terminal strand in subgroup 2.
  • cd C2_1 128aa 4e-15 in ref transcript
  • pfam C2 90aa 2e-10 in ref transcript
    • C2 domain.
  • pfam C2 88aa 7e-10 in ref transcript

TAC4

  • rs.TAC4.F1 rs.TAC4.R1 102 120
  • NCBIGene 36.3 255061
  • Alternative 5-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170685

TAC4

  • rs.TAC4.F2 rs.TAC4.R2 139 172
  • NCBIGene 36.3 255061
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170685

TAC4

  • rs.TAC4.F3 rs.TAC4.R3 164 224
  • NCBIGene 36.3 255061
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_170685

TAX1BP1

  • rs.TAX1BP1.F1 rs.TAX1BP1.R1 200 326
  • NCBIGene 36.3 8887
  • Alternative 5-prime, size difference: 126
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_006024

  • pfam CALCOCO1 454aa 3e-51 in ref transcript
    • Calcium binding and coiled-coil domain (CALCOCO1) like. Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein coexpressed by Mus musculus (CoCoA/CALCOCO1). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1, and thus enhances transcriptional activation by a number of nuclear receptors. CALCOCO1 has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region.
  • TIGR SMC_prok_B 249aa 2e-05 in ref transcript
    • SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
  • COG Smc 316aa 2e-10 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].

TBCE

  • rs.TBCE.F1 rs.TBCE.R1 154 202
  • NCBIGene 36.3 6905
  • Alternative 5-prime, size difference: 48
  • Exclusion in 5'UTR
  • Reference transcript: NM_001079515

  • cd LRR_RI 159aa 7e-05 in ref transcript
    • Leucine-rich repeats (LRRs), ribonuclease inhibitor (RI)-like subfamily. LRRs are 20-29 residue sequence motifs present in many proteins that participate in protein-protein interactions and have different functions and cellular locations. LRRs correspond to structural units consisting of a beta strand (LxxLxLxxN/CxL conserved pattern) and an alpha helix. This alignment contains 12 strands corresponding to 11 full repeats, consistent with the extent observed in the subfamily acting as Ran GTPase Activating Proteins (RanGAP1).
  • pfam CAP_GLY 67aa 9e-20 in ref transcript
    • CAP-Gly domain. Cytoskeleton-associated proteins (CAPs) are involved in the organisation of microtubules and transportation of vesicles and organelles along the cytoskeletal network. A conserved motif, CAP-Gly, has been identified in a number of CAPs, including CLIP-170 and dynactins. The crystal structure of Caenorhabditis elegans F53F4.3 protein CAP-Gly domain was recently solved. The domain contains three beta-strands. The most conserved sequence, GKNDG, is located in two consecutive sharp turns on the surface, forming the entrance to a groove.
  • COG NIP100 67aa 1e-07 in ref transcript
    • Dynactin complex subunit involved in mitotic spindle partitioning in anaphase B [Cell division and chromosome partitioning].
  • COG COG4886 217aa 4e-07 in ref transcript
    • Leucine-rich repeat (LRR) protein [Function unknown].

TBX3

  • rs.TBX3.F1 rs.TBX3.R1 283 343
  • NCBIGene 36.3 6926
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_016569

  • Changed! cd TBOX 202aa 1e-85 in ref transcript
    • T-box DNA binding domain of the T-box family of transcriptional regulators. The T-box family is an ancient group that appears to play a critical role in development in all animal species. These genes were uncovered on the basis of similarity to the DNA binding domain of murine Brachyury (T) gene product, the defining feature of the family. Common features shared by T-box family members are DNA-binding and transcriptional regulatory activity, a role in development and conserved expression patterns, most of the known genes in all species being expressed in mesoderm or mesoderm precursors.
  • Changed! pfam T-box 200aa 1e-97 in ref transcript
    • T-box. The T-box encodes a 180 amino acid domain that binds to DNA. Genes encoding T-box proteins are found in a wide range of animals, but not in other kingdoms such as plants. Family members are all thought to bind to the DNA consensus sequence TCACACCT. they are found exclusively in the nucleus, and perform DNA-binding and transcriptional activation/repression roles. They are generally required for development of the specific tissues they are expressed in, and mutations in T-box genes are implicated in human conditions such as DiGeorge syndrome and X-linked cleft palate, which feature malformations.
  • Changed! cd TBOX 182aa 4e-89 in modified transcript
  • Changed! pfam T-box 180aa 1e-101 in modified transcript

TCEAL1

  • rs.TCEAL1.F1 rs.TCEAL1.R1 100 112
  • NCBIGene 36.3 9338
  • Alternative 5-prime, size difference: 12
  • Exclusion in 5'UTR
  • Reference transcript: NM_004780

  • pfam TFA 86aa 3e-12 in ref transcript
    • Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.

TCEAL3

  • rs.TCEAL3.F1 rs.TCEAL3.R1 99 111
  • NCBIGene 36.3 85012
  • Alternative 5-prime, size difference: 12
  • Exclusion in 5'UTR
  • Reference transcript: NM_001006933

  • pfam TFA 133aa 2e-11 in ref transcript
    • Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.

TCEAL4

  • rs.TCEAL4.F1 rs.TCEAL4.R1 178 237
  • NCBIGene 36.3 79921
  • Alternative 5-prime, size difference: 59
  • Exclusion in 5'UTR
  • Reference transcript: NM_024863

  • pfam TFA 79aa 5e-08 in ref transcript
    • Transcription elongation factor A, SII-related family. The function of this family is unclear, but two members from Homo sapiesn are described as transcription elongation factor A, SII-like proteins.

TCF12

  • rs.TCF12.F1 rs.TCF12.R1 90 100
  • NCBIGene 36.3 6938
  • Alternative 3-prime, size difference: 10
  • Inclusion in 5'UTR
  • Reference transcript: NM_207036

  • cd HLH 61aa 1e-06 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • smart HLH 54aa 7e-10 in ref transcript
    • helix loop helix domain.

TCF19

  • rs.TCF19.F1 rs.TCF19.R1 124 370
  • NCBIGene 36.3 6941
  • Alternative 3-prime, size difference: 246
  • Inclusion in 5'UTR
  • Reference transcript: NM_001077511

  • cd FHA 98aa 2e-08 in ref transcript
    • Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
  • pfam FHA 72aa 4e-06 in ref transcript
    • FHA domain. The FHA (Forkhead-associated) domain is a phosphopeptide binding motif.
  • pfam PHD 36aa 2e-05 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.

TCF4

  • rs.TCF4.F1 rs.TCF4.R1 101 113
  • NCBIGene 36.3 6925
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001083962

  • cd HLH 61aa 2e-07 in ref transcript
    • Helix-loop-helix domain, found in specific DNA- binding proteins that act as transcription factors; 60-100 amino acids long. A DNA-binding basic region is followed by two alpha-helices separated by a variable loop region; HLH forms homo- and heterodimers, dimerization creates a parallel, left-handed, four helix bundle; the basic region N-terminal to the first amphipathic helix mediates high-affinity DNA-binding; there are several groups of HLH proteins: those (E12/E47) which bind specific hexanucleotide sequences such as E-box (5-CANNTG-3) or StRE 5-ATCACCCCAC-3), those lacking the basic domain (Emc, Id) function as negative regulators since they fail to bind DNA, those (hairy, E(spl), deadpan) which repress transcription although they can bind specific hexanucleotide sequences such as N-box (5-CACGc/aG-3), those which have a COE domain (Collier/Olf-1/EBF) which is involved in both in dimerization and in DNA binding, and those which bind pentanucleotides ACGTG or GCGTG and have a PAS domain which allows the dimerization between PAS proteins, the binding of small molecules (e.g., dioxin), and interactions with non-PAS proteins.
  • smart HLH 54aa 2e-10 in ref transcript
    • helix loop helix domain.

TCL6

  • rs.TCL6.F1 rs.TCL6.R1 352 547
  • NCBIGene 36.3 27004
  • Mutually exclusive exon skipping, size difference: 195
  • Inclusion in 5'UTR, Exclusion of the protein initiation site
  • Reference transcript: NM_020554

TCOF1

  • rs.TCOF1.F1 rs.TCOF1.R1 135 366
  • NCBIGene 36.3 6949
  • Single exon skipping, size difference: 231
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001008656

  • pfam Treacle 321aa 9e-36 in ref transcript
    • Treacher Collins syndrome protein Treacle.

TCP11

  • rs.TCP11.F1 rs.TCP11.R1 413 507
  • NCBIGene 36.3 6954
  • Mutually exclusive exon skipping, size difference: 94
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001093728

  • Changed! pfam Tcp11 433aa 1e-130 in ref transcript
    • T-complex protein 11. This family consists of several eukaryotic T-complex protein 11 (Tcp11) related sequences. Tcp11 is only expressed in fertile adult mammalian testes and is thought to be important in sperm function and fertility. The family also contains the yeast Sok1 protein which is known to suppress cyclic AMP-dependent protein kinase mutants.

TCTN1

  • rs.TCTN1.F1 rs.TCTN1.R1 225 267
  • NCBIGene 36.3 79600
  • Mutually exclusive exon skipping, size difference: 42
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001082538

  • Changed! pfam DUF1619 310aa 1e-106 in ref transcript
    • Protein of unknown function (DUF1619). This is a family of sequences derived from hypothetical eukaryotic proteins. The region in question is approximately 330 residues long and has a cysteine rich amino-terminus.
  • Changed! pfam DUF1619 296aa 8e-88 in modified transcript

TCTN1

  • rs.TCTN1.F2 rs.TCTN1.R2 100 115
  • NCBIGene 36.3 79600
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001082538

  • pfam DUF1619 310aa 1e-106 in ref transcript
    • Protein of unknown function (DUF1619). This is a family of sequences derived from hypothetical eukaryotic proteins. The region in question is approximately 330 residues long and has a cysteine rich amino-terminus.

TDRKH

  • rs.TDRKH.F1 rs.TDRKH.R1 143 278
  • NCBIGene 36.3 11022
  • Alternative 5-prime and 3-prime, size difference: 135
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_006862

  • Changed! cd KH-I 63aa 6e-12 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • cd TUDOR 48aa 2e-08 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • pfam TUDOR 120aa 8e-30 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein.
  • Changed! pfam KH_1 62aa 4e-13 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • Changed! PRK PRK11824 65aa 0.001 in ref transcript
    • polynucleotide phosphorylase/polyadenylase; Provisional.
  • Changed! cd KH-I 59aa 1e-09 in modified transcript
  • Changed! pfam KH_1 59aa 6e-11 in modified transcript
  • Changed! PRK PRK11824 33aa 0.004 in modified transcript

TDRKH

  • rs.TDRKH.F2 rs.TDRKH.R2 141 166
  • NCBIGene 36.3 11022
  • Alternative 5-prime, size difference: 25
  • Exclusion in 5'UTR
  • Reference transcript: NM_006862

  • cd KH-I 63aa 6e-12 in ref transcript
    • K homology RNA-binding domain, type I. KH binds single-stranded RNA or DNA. It is found in a wide variety of proteins including ribosomal proteins, transcription factors and post-transcriptional modifiers of mRNA. There are two different KH domains that belong to different protein folds, but they share a single KH motif. The KH motif is folded into a beta alpha alpha beta unit. In addition to the core, type II KH domains (e.g. ribosomal protein S3) include N-terminal extension and type I KH domains (e.g. hnRNP K) contain C-terminal extension.
  • cd TUDOR 48aa 2e-08 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • pfam TUDOR 120aa 8e-30 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. copies in the Drosophila Tudor protein.
  • pfam KH_1 62aa 4e-13 in ref transcript
    • KH domain. KH motifs can bind RNA in vitro. Autoantibodies to Nova, a KH domain protein, cause paraneoplastic opsoclonus ataxia.
  • PRK PRK11824 65aa 0.001 in ref transcript
    • polynucleotide phosphorylase/polyadenylase; Provisional.

TERF1

  • rs.TERF1.F1 rs.TERF1.R1 104 164
  • NCBIGene 36.3 7013
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_017489

  • cd TRFH 188aa 5e-76 in ref transcript
    • Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
  • cd SANT 47aa 2e-05 in ref transcript
    • 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
  • pfam TRF 181aa 1e-78 in ref transcript
    • Telomere repeat binding factor (TRF). Telomere repeat binding factor (TRF) family proteins are important for the regulation of telomere stability. The two related human TRF proteins hTRF1 and hTRF2 form homodimers and bind directly to telomeric TTAGGG repeats via the myb DNA binding domain pfam00249 at the carboxy terminus. TRF1 is implicated in telomere length regulation and TRF2 in telomere protection. Other telomere complex associated proteins are recruited through their interaction with either TRF1 or TRF2. The fission yeast protein Taz1p (telomere-associated in Schizosaccharomyces pombe) has similarity to both hTRF1 and hTRF2 and may perform the dual functions of TRF1 and TRF2 at fission yeast telomeres. This domain is composed of multiple alpha helices arranged in a solenoid conformation similar to TPR repeats.
  • pfam Myb_DNA-binding 49aa 1e-07 in ref transcript
    • Myb-like DNA-binding domain. This family contains the DNA binding domains from Myb proteins, as well as the SANT domain family.

TERT

  • rs.TERT.F1 rs.TERT.R1 208 244
  • NCBIGene 36.3 7015
  • Alternative 3-prime, size difference: 36
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_198253

  • cd TERT 112aa 8e-23 in ref transcript
    • TERT: Telomerase reverse transcriptase (TERT). Telomerase is a ribonucleoprotein (RNP) that synthesizes telomeric DNA repeats. The telomerase RNA subunit provides the template for synthesis of these repeats. The catalytic subunit of RNP is known as telomerase reverse transcriptase (TERT). The reverse transcriptase (RT) domain is located in the C-terminal region of the TERT polypeptide. Single amino acid substitutions in this region lead to telomere shortening and senescence. Telomerase is an enzyme that, in certain cells, maintains the physical ends of chromosomes (telomeres) during replication. In somatic cells, replication of the lagging strand requires the continual presence of an RNA primer approximately 200 nucleotides upstream, which is complementary to the template strand. Since there is a region of DNA less than 200 base pairs from the end of the chromosome where this is not possible, the chromosome is continually shortened. However, a surplus of repetitive DNA at the chromosome ends protects against the erosion of gene-encoding DNA. Telomerase is not normally expressed in somatic cells. It has been suggested that exogenous TERT may extend the lifespan of, or even immortalize, the cell. However, recent studies have shown that telomerase activity can be induced by a number of oncogenes. Conversely, the oncogene c-myc can be activated in human TERT immortalized cells. Sequence comparisons place the telomerase proteins in the RT family but reveal hallmarks that distinguish them from retroviral and retrotransposon relatives.

TH

  • rs.TH.F1 rs.TH.R1 100 112
  • NCBIGene 36.3 7054
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199292

  • cd eu_TyrOH 298aa 1e-166 in ref transcript
    • Eukaryotic tyrosine hydroxylase (TyrOH); a member of the biopterin-dependent aromatic amino acid hydroxylase family of non-heme, iron(II)-dependent enzymes that also includes prokaryotic and eukaryotic phenylalanine-4-hydroxylase (PheOH) and eukaryotic tryptophan hydroxylase (TrpOH). TyrOH catalyzes the conversion of tyrosine to L-dihydroxyphenylalanine (L-DOPA), the rate-limiting step in the biosynthesis of the catecholamines dopamine, noradrenaline, and adrenaline.
  • cd ACT_TH 122aa 6e-30 in ref transcript
    • ACT domain of the nonheme iron-dependent aromatic amino acid hydroxylase, tyrosine hydroxylases (TH). TH catalyses the hydroxylation of L-Tyr to 3,4-dihydroxyphenylalanine, the rate limiting step in the biosynthesis of catecholamines (dopamine, noradrenaline and adrenaline), functioning as hormones and neurotransmitters. The enzyme is not regulated by its amino acid substrate, but instead by phosphorylation at several serine residues located N-terminal of the ACT domain, and by feedback inhibition by catecholamines at the active site. Members of this CD belong to the superfamily of ACT regulatory domains.
  • TIGR Tyr_3_monoox 457aa 0.0 in ref transcript
    • This model describes tyrosine 3-monooxygenase, a member of the family of tetrameric, biopterin-dependent aromatic amino acid hydroxylases found in metazoans. It is closely related to tetrameric phenylalanine-4-hydroxylase and tryptophan 5-monooxygenase, and more distantly related to the monomeric phenylalanine-4-hydroxylase found in some Gram-negative bacteria.
  • PRK phhA 238aa 5e-61 in ref transcript
    • phenylalanine 4-monooxygenase; Reviewed.

THADA

  • rs.THADA.F1 rs.THADA.R1 116 400
  • NCBIGene 36.3 63892
  • Alternative 5-prime, size difference: 284
  • Exclusion in 5'UTR
  • Reference transcript: NM_001083953

  • pfam DUF2428 304aa 3e-68 in ref transcript
    • Putative death-receptor fusion protein (DUF2428). This is a family of proteins conserved from plants to humans. The function is not known. Several members have been annotated as being HEAT repeat-containing proteins while others are designated as death-receptor interacting proteins, but neither of these could be confirmed.
  • COG COG5543 475aa 5e-37 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

TIA1

  • rs.TIA1.F1 rs.TIA1.R1 100 133
  • NCBIGene 36.3 7072
  • Single exon skipping, size difference: 33
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022173

  • cd RRM 74aa 3e-19 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 73aa 1e-15 in ref transcript
  • cd RRM 69aa 6e-14 in ref transcript
  • Changed! TIGR PABP-1234 269aa 2e-24 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • Changed! COG COG0724 177aa 1e-18 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! COG COG0724 175aa 7e-12 in ref transcript
  • Changed! TIGR PABP-1234 258aa 2e-27 in modified transcript
  • Changed! COG COG0724 200aa 4e-19 in modified transcript

TIRAP

  • rs.TIRAP.F1 rs.TIRAP.R1 100 116
  • NCBIGene 36.3 114609
  • Alternative 5-prime, size difference: 16
  • Exclusion in 5'UTR
  • Reference transcript: NM_148910

  • smart TIR 83aa 3e-07 in ref transcript
    • Toll - interleukin 1 - resistance.

TJP2

  • rs.TJP2.F1 rs.TJP2.R1 101 542
  • NCBIGene 36.3 9414
  • Multiple exon skipping, size difference: 441
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004817

  • cd PDZ_signaling 86aa 9e-15 in ref transcript
    • PDZ domain found in a variety of Eumetazoan signaling molecules, often in tandem arrangements. May be responsible for specific protein-protein interactions, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of PDZ domains an N-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in proteases.
  • cd PDZ_signaling 80aa 1e-09 in ref transcript
  • cd PDZ_signaling 75aa 1e-07 in ref transcript
  • smart GuKc 188aa 3e-33 in ref transcript
    • Guanylate kinase homologues. Active enzymes catalyze ATP-dependent phosphorylation of GMP to GDP. Structure resembles that of adenylate kinase. So-called membrane-associated guanylate kinase homologues (MAGUKs) do not possess guanylate kinase activities; instead at least some possess protein-binding functions.
  • smart PDZ 90aa 6e-14 in ref transcript
    • Domain present in PSD-95, Dlg, and ZO-1/2. Also called DHR (Dlg homologous region) or GLGF (relatively well conserved tetrapeptide in these domains). Some PDZs have been shown to bind C-terminal polypeptides; others appear to bind internal (non-C-terminal) polypeptides. Different PDZs possess different binding specificities.
  • smart PDZ 72aa 1e-10 in ref transcript
  • smart PDZ 78aa 3e-10 in ref transcript
  • pfam SH3_2 59aa 0.001 in ref transcript
    • Variant SH3 domain. SH3 (Src homology 3) domains are often indicative of a protein involved in signal transduction related to cytoskeletal organisation. First described in the Src cytoplasmic tyrosine kinase. The structure is a partly opened beta barrel.
  • COG Prc 88aa 1e-05 in ref transcript
    • Periplasmic protease [Cell envelope biogenesis, outer membrane].

TMC5

  • rs.TMC5.F1 rs.TMC5.R1 100 274
  • NCBIGene 36.3 79838
  • Single exon skipping, size difference: 174
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001105248

  • pfam TMC 107aa 3e-48 in ref transcript
    • TMC domain. These sequences are similar to a region conserved amongst various protein products of the transmembrane channel-like (TMC) gene family, such as Transmembrane channel-like protein 3 and EVIN2 - this region is termed the TMC domain. Mutations in these genes are implicated in a number of human conditions, such as deafness and epidermodysplasia verruciformis. TMC proteins are thought to have important cellular roles, and may be modifiers of ion channels or transporters.

TMEM134

  • rs.TMEM134.F1 rs.TMEM134.R1 112 157
  • NCBIGene 36.3 80194
  • Single exon skipping, size difference: 45
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025124

  • Changed! pfam DUF872 81aa 7e-19 in ref transcript
    • Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterised eukaryotic proteins. The function of this family is unknown.
  • Changed! pfam DUF872 38aa 8e-13 in modified transcript

TMEM134

  • rs.TMEM134.F2 rs.TMEM134.R2 120 147
  • NCBIGene 36.3 80194
  • Alternative 3-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_025124

  • pfam DUF872 81aa 7e-19 in ref transcript
    • Eukaryotic protein of unknown function (DUF872). This family consists of several uncharacterised eukaryotic proteins. The function of this family is unknown.

TMEM176B

  • rs.TMEM176B.F1 rs.TMEM176B.R1 341 452
  • NCBIGene 36.3 28959
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014020

  • Changed! pfam LR8 205aa 2e-75 in ref transcript
    • LR8 protein. This family consists of several LR8 like proteins from humans, mice and rats. The function of the human LR8 protein is unknown although it is known to be strongly expressed in the lung fibroblasts.
  • Changed! pfam LR8 168aa 2e-61 in modified transcript

TMEM177

  • rs.TMEM177.F1 rs.TMEM177.R1 105 446
  • NCBIGene 36.3 80775
  • Alternative 5-prime, size difference: 341
  • Exclusion in 5'UTR
  • Reference transcript: NM_001105198

TMEM22

  • rs.TMEM22.F1 rs.TMEM22.R1 132 397
  • NCBIGene 36.3 80723
  • Alternative 5-prime, size difference: 265
  • Exclusion in 5'UTR
  • Reference transcript: NM_025246

  • TIGR 2A78 258aa 1e-08 in ref transcript
  • COG RhaT 288aa 2e-10 in ref transcript
    • Permeases of the drug/metabolite transporter (DMT) superfamily [Carbohydrate transport and metabolism / Amino acid transport and metabolism / General function prediction only].

TMEM55B

  • rs.TMEM55B.F1 rs.TMEM55B.R1 106 127
  • NCBIGene 36.3 90809
  • Alternative 5-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100814

  • Changed! pfam Tmemb_55A 277aa 3e-85 in ref transcript
    • Transmembrane protein 55A. Members of this family catalyse the hydrolysis of the 4-position phosphate of phosphatidylinositol 4,5-bisphosphate, in the reaction: 1-phosphatidyl-myo-inositol 4,5-bisphosphate + H(2)O = 1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
  • Changed! pfam Tmemb_55A 270aa 3e-84 in modified transcript

TMEM70

  • rs.TMEM70.F1 rs.TMEM70.R1 102 116
  • NCBIGene 36.3 54968
  • Alternative 5-prime, size difference: 14
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_017866

  • Changed! pfam DUF1301 135aa 6e-56 in ref transcript
    • Protein of unknown function (DUF1301). This family contains a number of eukaryotic proteins of unknown function that are approximately 160 residues long.

TMEM80

  • rs.TMEM80.F1 rs.TMEM80.R1 136 160
  • NCBIGene 36.3 283232
  • Alternative 5-prime, size difference: 24
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001042463

  • pfam Transmemb_17 68aa 1e-05 in ref transcript
    • Predicted membrane protein. This is a 100 amino acid region of a family of proteins conserved from nematodes to humans. It is predicted to be a transmembrane region but its function is not known.

TMEM91

  • rs.TMEM91.F1 rs.TMEM91.R1 222 376
  • NCBIGene 36.3 641649
  • Alternative 3-prime, size difference: 154
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001098821

  • Changed! pfam CD225 49aa 1e-08 in ref transcript
    • Interferon-induced transmembrane protein. This family includes the human leukocyte antigen CD225, which is an interferon inducible transmembrane protein, and is associated with interferon induced cell growth suppression.
  • Changed! pfam CD225 29aa 0.003 in modified transcript

TMPO

  • rs.TMPO.F1 rs.TMPO.R1 150 477
  • NCBIGene 36.3 7112
  • Multiple exon skipping, size difference: 327
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001032283

  • pfam Thymopoietin 49aa 4e-19 in ref transcript
    • Thymopoietin protein. Short protein of 49 amino acid isolated from bovine spleen cells. Thymopoietins (TMPOs) are a group of ubiquitously expressed nuclear proteins. They are suggested to play an important role in nuclear envelope organisation and cell cycle control.
  • pfam LEM 28aa 1e-04 in ref transcript
    • LEM domain. The LEM domain is 50 residues long and is composed of two parallel alpha helices. This domain is found in inner nuclear membrane proteins. It is called the LEM domain after LAP2, Emerin and Man1.

TMPRSS4

  • rs.TMPRSS4.F1 rs.TMPRSS4.R1 104 119
  • NCBIGene 36.3 56649
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_019894

  • cd Tryp_SPc 228aa 2e-68 in ref transcript
    • Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.
  • cd LDLa 35aa 0.002 in ref transcript
    • Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure.
  • smart Tryp_SPc 226aa 2e-77 in ref transcript
    • Trypsin-like serine protease. Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.
  • Changed! smart SR 91aa 9e-07 in ref transcript
    • Scavenger receptor Cys-rich. The sea ucrhin egg peptide speract contains 4 repeats of SR domains that contain 6 conserved cysteines. May bind bacterial antigens in the protein MARCO.
  • pfam Ldl_recept_a 22aa 0.005 in ref transcript
    • Low-density lipoprotein receptor domain class A.
  • COG COG5640 228aa 8e-26 in ref transcript
    • Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones].
  • Changed! smart SR 86aa 4e-07 in modified transcript

TMTC4

  • rs.TMTC4.F1 rs.TMTC4.R1 143 353
  • NCBIGene 36.3 84899
  • Single exon skipping, size difference: 210
  • Exclusion of the protein initiation site
  • Reference transcript: NM_032813

  • cd TPR 100aa 3e-12 in ref transcript
    • Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]- X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here.
  • cd TPR 99aa 4e-11 in ref transcript
  • cd TPR 99aa 2e-07 in ref transcript
  • cd TPR 85aa 1e-06 in ref transcript
  • pfam DUF1736 80aa 5e-33 in ref transcript
    • Domain of unknown function (DUF1736). This domain of unknown function is found in various hypothetical metazoan proteins.
  • TIGR PEP_TPR_lipo 231aa 1e-12 in ref transcript
    • This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
  • COG TadD 202aa 5e-08 in ref transcript
    • Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion].

TMUB2

  • rs.TMUB2.F1 rs.TMUB2.R1 133 220
  • NCBIGene 36.3 79089
  • Alternative 3-prime, size difference: 87
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001076674

  • Changed! cd UBL 63aa 9e-06 in ref transcript
    • UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
  • Changed! pfam ubiquitin 51aa 6e-08 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.

TMUB2

  • rs.TMUB2.F2 rs.TMUB2.R1 186 254
  • NCBIGene 36.3 79089
  • Single exon skipping, size difference: 68
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001076674

  • cd UBL 63aa 9e-06 in ref transcript
    • UBLs function by remodeling the surface of their target proteins, changing their target's half-life, enzymatic activity, protein-protein interactions, subcellular localization or other properties. At least 10 different ubiquitin-like modifications exist in mammals, and attachment of different ubls to a target leads to different biological consequences. Ubl-conjugation cascades are initiated by activating enzymes, which also coordinate the ubls with their downstream pathways.
  • pfam ubiquitin 51aa 6e-08 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.

TNRC15

  • rs.TNRC15.F1 rs.TNRC15.R1 116 186
  • NCBIGene 36.3 26058
  • Single exon skipping, size difference: 70
  • Inclusion in 5'UTR
  • Reference transcript: NM_001103147

  • cd GYF 56aa 5e-16 in ref transcript
    • GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
  • pfam GYF 56aa 4e-18 in ref transcript
    • GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.

TNRC15

  • rs.TNRC15.F2 rs.TNRC15.R2 100 118
  • NCBIGene 36.3 26058
  • Alternative 3-prime, size difference: 18
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001103147

  • cd GYF 56aa 5e-16 in ref transcript
    • GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
  • pfam GYF 56aa 4e-18 in ref transcript
    • GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.

TNRC15

  • rs.TNRC15.F3 rs.TNRC15.R3 158 224
  • NCBIGene 36.3 26058
  • Single exon skipping, size difference: 66
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001103147

  • cd GYF 56aa 5e-16 in ref transcript
    • GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
  • pfam GYF 56aa 4e-18 in ref transcript
    • GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.

TNRC15

  • rs.TNRC15.F4 rs.TNRC15.R4 106 176
  • NCBIGene 36.3 26058
  • Single exon skipping, size difference: 70
  • Exclusion in 5'UTR
  • Reference transcript: NM_001103147

  • cd GYF 56aa 5e-16 in ref transcript
    • GYF domain: contains conserved Gly-Tyr-Phe residues; Proline-binding domain in CD2-binding and other proteins. Involved in signaling lymphocyte activity. Also present in other unrelated proteins (mainly unknown) derived from diverse eukaryotic species.
  • pfam GYF 56aa 4e-18 in ref transcript
    • GYF domain. The GYF domain is named because of the presence of Gly-Tyr-Phe residues. The GYF domain is a proline-binding domain in CD2-binding protein.

TOM1L2

  • rs.TOM1L2.F1 rs.TOM1L2.R1 159 309
  • NCBIGene 36.3 146691
  • Single exon skipping, size difference: 150
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001082968

  • Changed! cd VHS_Tom1 141aa 2e-74 in ref transcript
    • VHS domain family, Tom1 subfamily; The VHS domain is an essential part of Tom1 (Target of myb1 - retroviral oncogene) protein. The VHS domain has a superhelical structure similar to the structure of the ARM repeats and is present at the very N-termini of proteins. It is a right-handed superhelix of eight alpha helices. The VHS domain has been found in a number of proteins, some of which have been implicated in intracellular trafficking and sorting. The VHS domain of the Tom1 protein is essential for the negative regulation of Interleukin-1 and Tumor Necrosis Factor-induced signaling pathways.
  • Changed! smart VHS 136aa 2e-46 in ref transcript
    • Domain present in VPS-27, Hrs and STAM. Unpublished observations. Domain of unknown function.
  • pfam GAT 93aa 5e-21 in ref transcript
    • GAT domain. The GAT domain is responsible for binding of GGA proteins to several members of the ARF family including ARF1 and ARF3. The GAT domain stabilises membrane bound ARF1 in its GTP bound state, by interfering with GAP proteins.
  • Changed! cd VHS_Tom1 91aa 2e-39 in modified transcript
  • Changed! smart VHS 86aa 1e-21 in modified transcript

TOX2

  • rs.TOX2.F1 rs.TOX2.R1 272 353
  • NCBIGene 36.3 84969
  • Single exon skipping, size difference: 81
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098797

  • Changed! cd HMG-box 48aa 9e-10 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • Changed! pfam HMG_box 49aa 4e-07 in ref transcript
    • HMG (high mobility group) box.
  • Changed! COG NHP6B 48aa 3e-04 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].
  • Changed! cd HMG-box 43aa 1e-08 in modified transcript
  • Changed! pfam HMG_box 44aa 3e-06 in modified transcript
  • Changed! COG NHP6B 43aa 0.002 in modified transcript

TOX2

  • rs.TOX2.F2 rs.TOX2.R2 103 228
  • NCBIGene 36.3 84969
  • Single exon skipping, size difference: 125
  • Exclusion in 5'UTR
  • Reference transcript: NM_032883

  • cd HMG-box 48aa 1e-09 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • pfam HMG_box 49aa 4e-07 in ref transcript
    • HMG (high mobility group) box.
  • COG NHP6B 48aa 5e-04 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

TP53BP2

  • rs.TP53BP2.F1 rs.TP53BP2.R1 304 436
  • NCBIGene 36.3 7159
  • Single exon skipping, size difference: 132
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001031685

  • Changed! cd ANK 118aa 4e-20 in ref transcript
    • ankyrin repeats; ankyrin repeats mediate protein-protein interactions in very diverse families of proteins. The number of ANK repeats in a protein can range from 2 to over 20 (ankyrins, for example). ANK repeats may occur in combinations with other types of domains. The structural repeat unit contains two antiparallel helices and a beta-hairpin, repeats are stacked in a superhelical arrangement; this alignment contains 4 consecutive repeats.
  • Changed! cd SH3 51aa 2e-09 in ref transcript
    • Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
  • Changed! smart SH3 49aa 2e-10 in ref transcript
    • Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
  • Changed! pfam Ank 32aa 2e-06 in ref transcript
    • Ankyrin repeat. There's no clear separation between noise and signal on the HMM search Ankyrin repeats generally consist of a beta, alpha, alpha, beta order of secondary structures. The repeats associate to form a higher order structure.
  • Changed! pfam Ank 32aa 2e-05 in ref transcript
  • Changed! pfam SMC_N 146aa 2e-05 in ref transcript
    • RecF/RecN/SMC N terminal domain. This domain is found at the N terminus of SMC proteins. The SMC (structural maintenance of chromosomes) superfamily proteins have ATP-binding domains at the N- and C-termini, and two extended coiled-coil domains separated by a hinge in the middle. The eukaryotic SMC proteins form two kind of heterodimers: the SMC1/SMC3 and the SMC2/SMC4 types. These heterodimers constitute an essential part of higher order complexes, which are involved in chromatin and DNA dynamics. This family also includes the RecF and RecN proteins that are involved in DNA metabolism and recombination.
  • Changed! COG Arp 95aa 6e-05 in ref transcript
    • FOG: Ankyrin repeat [General function prediction only].
  • Changed! COG SbcC 144aa 1e-04 in ref transcript
    • ATPase involved in DNA repair [DNA replication, recombination, and repair].

TP53I11

  • rs.TP53I11.F1 rs.TP53I11.R1 250 380
  • NCBIGene 36.3 9537
  • Single exon skipping, size difference: 130
  • Exclusion in 5'UTR
  • Reference transcript: NM_001076787

TPD52

  • rs.TPD52.F1 rs.TPD52.R1 101 170
  • NCBIGene 36.3 7163
  • Multiple exon skipping, size difference: 69
  • Inclusion in the protein (no stop codon or frameshift), Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001025252

  • Changed! pfam TPD52 159aa 1e-39 in ref transcript
    • Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
  • Changed! pfam TPD52 182aa 7e-37 in modified transcript

TPD52L2

  • rs.TPD52L2.F1 rs.TPD52L2.R1 105 165
  • NCBIGene 36.3 7165
  • Single exon skipping, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199360

  • Changed! pfam TPD52 163aa 4e-26 in ref transcript
    • Tumour protein D52 family. The hD52 gene was originally identified through its elevated expression level in human breast carcinoma. Cloning of D52 homologues from other species has indicated that D52 may play roles in calcium-mediated signal transduction and cell proliferation. Two human homologues of hD52, hD53 and hD54, have also been identified, demonstrating the existence of a novel gene/protein family. These proteins have an amino terminal coiled-coil that allows members to form homo- and heterodimers with each other.
  • Changed! pfam TPD52 143aa 2e-29 in modified transcript

TPK1

  • rs.TPK1.F1 rs.TPK1.R1 312 459
  • NCBIGene 36.3 27010
  • Single exon skipping, size difference: 147
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_022445

  • Changed! TIGR thi_PPkinase 214aa 8e-40 in ref transcript
    • This model has been revised. Originally, it described strictly eukaryotic thiamine pyrophosphokinase. However, it is now expanded to include also homologous enzymes, apparently functionally equivalent, from species that rely on thiamine pyrophosphokinase rather than thiamine-monophosphate kinase (TIGR01379) to produce the active TPP cofactor. This includes the thiamine pyrophosphokinase from Bacillus subtilis, previously designated YloS.
  • Changed! COG THI80 222aa 2e-27 in ref transcript
    • Thiamine pyrophosphokinase [Coenzyme metabolism].
  • Changed! pfam TPK_catalytic 90aa 2e-27 in modified transcript
    • Thiamin pyrophosphokinase, catalytic domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyses the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TPP). Thus, TPK is important for the formation of a coenzyme required for central metabolic functions. The structure of thiamin pyrophosphokinase suggest that the enzyme may operate by a mechanism of pyrophosphoryl transfer similar to those described for pyrophosphokinases functioning in nucleotide biosynthesis.
  • Changed! pfam TPK_B1_binding 70aa 6e-21 in modified transcript
    • Thiamin pyrophosphokinase, vitamin B1 binding domain. Family of thiamin pyrophosphokinase (EC:2.7.6.2). Thiamin pyrophosphokinase (TPK) catalyses the transfer of a pyrophosphate group from ATP to vitamin B1 (thiamin) to form the coenzyme thiamin pyrophosphate (TPP). Thus, TPK is important for the formation of a coenzyme required for central metabolic functions. The structure of thiamin pyrophosphokinase suggest that the enzyme may operate by a mechanism of pyrophosphoryl transfer similar to those described for pyrophosphokinases functioning in nucleotide biosynthesis.
  • Changed! COG THI80 173aa 4e-16 in modified transcript

TPTE2

  • rs.TPTE2.F1 rs.TPTE2.R1 171 291
  • NCBIGene 36.3 93492
  • Single exon skipping, size difference: 120
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199254

  • cd PTPc 89aa 8e-05 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • pfam PTEN_C2 131aa 4e-30 in ref transcript
    • C2 domain of PTEN tumour-suppressor protein. This is the C2 domain-like domain, in greek key form, of the PTEN protein, phosphatidyl-inositol triphosphate phosphatase, and it is the C-terminus. This domain may well include a CBR3 loop which means it plays a central role in membrane binding. This domain associates across an extensive interface with the N-terminal phosphatase domain DSPc (pfam00782) suggesting that the C2 domain productively positions the catalytic part of the protein on the membrane 1].
  • smart PTPc_motif 59aa 2e-04 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.
  • pfam Y_phosphatase 108aa 4e-04 in ref transcript
    • Protein-tyrosine phosphatase.
  • COG CDC14 126aa 8e-07 in ref transcript
    • Predicted protein-tyrosine phosphatase [Signal transduction mechanisms].

TPTE2

  • rs.TPTE2.F2 rs.TPTE2.R2 374 485
  • NCBIGene 36.3 93492
  • Multiple exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift), Exclusion in the protein (no frameshift)
  • Reference transcript: NM_199254

  • cd PTPc 89aa 8e-05 in ref transcript
    • Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
  • pfam PTEN_C2 131aa 4e-30 in ref transcript
    • C2 domain of PTEN tumour-suppressor protein. This is the C2 domain-like domain, in greek key form, of the PTEN protein, phosphatidyl-inositol triphosphate phosphatase, and it is the C-terminus. This domain may well include a CBR3 loop which means it plays a central role in membrane binding. This domain associates across an extensive interface with the N-terminal phosphatase domain DSPc (pfam00782) suggesting that the C2 domain productively positions the catalytic part of the protein on the membrane 1].
  • smart PTPc_motif 59aa 2e-04 in ref transcript
    • Protein tyrosine phosphatase, catalytic domain motif.
  • pfam Y_phosphatase 108aa 4e-04 in ref transcript
    • Protein-tyrosine phosphatase.
  • COG CDC14 126aa 8e-07 in ref transcript
    • Predicted protein-tyrosine phosphatase [Signal transduction mechanisms].

TRAF3

  • rs.TRAF3.F1 rs.TRAF3.R1 201 340
  • NCBIGene 36.3 7187
  • Single exon skipping, size difference: 139
  • Exclusion in 5'UTR
  • Reference transcript: NM_145725

  • cd MATH_TRAF3 186aa 1e-106 in ref transcript
    • Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF3 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF3 was first described as a molecule that binds the cytoplasmic tail of CD40. However, it is not required for CD40 signaling. More recently, TRAF3 has been identified as a key regulator of type I interferon (IFN) production and the mammalian innate antiviral immunity. It mediates IFN responses in Toll-like receptor (TLR)-dependent as well as TLR-independent viral recognition pathways. It is also a key element in immunological homeostasis through its regulation of the anti-inflammatory cytokine interleukin-10. TRAF3 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
  • pfam MATH 126aa 4e-18 in ref transcript
    • MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
  • pfam zf-TRAF 57aa 1e-11 in ref transcript
    • TRAF-type zinc finger.
  • pfam Cast 176aa 2e-05 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • smart RING 36aa 0.002 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • TIGR rad18 203aa 0.005 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • COG Smc 161aa 5e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG COG5222 59aa 0.005 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].

TRAF3

  • rs.TRAF3.F2 rs.TRAF3.R2 252 327
  • NCBIGene 36.3 7187
  • Single exon skipping, size difference: 75
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_145725

  • cd MATH_TRAF3 186aa 1e-106 in ref transcript
    • Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF3 subfamily, TRAF domain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF3 was first described as a molecule that binds the cytoplasmic tail of CD40. However, it is not required for CD40 signaling. More recently, TRAF3 has been identified as a key regulator of type I interferon (IFN) production and the mammalian innate antiviral immunity. It mediates IFN responses in Toll-like receptor (TLR)-dependent as well as TLR-independent viral recognition pathways. It is also a key element in immunological homeostasis through its regulation of the anti-inflammatory cytokine interleukin-10. TRAF3 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more divergent N-terminal alpha helical region (TRAF-N), and a highly conserved C-terminal MATH subdomain (TRAF-C) with an eight-stranded beta-sandwich structure. TRAF-N mediates trimerization while TRAF-C interacts with receptors.
  • pfam MATH 126aa 4e-18 in ref transcript
    • MATH domain. This motif has been called the Meprin And TRAF-Homology (MATH) domain. This domain is hugely expanded in the nematode Caenorhabditis elegans.
  • pfam zf-TRAF 57aa 1e-11 in ref transcript
    • TRAF-type zinc finger.
  • Changed! pfam Cast 176aa 2e-05 in ref transcript
    • RIM-binding protein of the cytomatrix active zone. This is a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion. The C-terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). The family also contains four coiled-coil domains.
  • smart RING 36aa 0.002 in ref transcript
    • Ring finger. E3 ubiquitin-protein ligase activity is intrinsic to the RING domain of c-Cbl and is likely to be a general function of this domain; Various RING fingers exhibit binding activity towards E2 ubiquitin-conjugating enzymes (Ubc' s).
  • Changed! TIGR rad18 203aa 0.005 in ref transcript
    • This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • COG Smc 161aa 5e-07 in ref transcript
    • Chromosome segregation ATPases [Cell division and chromosome partitioning].
  • COG COG5222 59aa 0.005 in ref transcript
    • Uncharacterized conserved protein, contains RING Zn-finger [General function prediction only].
  • Changed! pfam Cast 155aa 6e-05 in modified transcript

TRAPPC5

  • rs.TRAPPC5.F1 rs.TRAPPC5.R1 123 146
  • NCBIGene 36.3 126003
  • Alternative 5-prime, size difference: 23
  • Exclusion in 5'UTR
  • Reference transcript: NM_174894

  • pfam TRAPP 139aa 5e-40 in ref transcript
    • Transport protein particle (TRAPP) component. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is a large multimeric protein that contains at least 10 subunits. This family contains many TRAPP family proteins. The Bet3 subunit is one of the better characterised TRAPP proteins and has a dimeric structure with hydrophobic channels. The channel entrances are located on a putative membrane-interacting surface that is distinctively flat, wide and decorated with positively charged residues. Bet3 is proposed to localise TRAPP to the Golgi.
  • COG COG5128 148aa 5e-29 in ref transcript
    • Transport protein particle (TRAPP) complex subunit [Intracellular trafficking and secretion].

TRAPPC6B

  • rs.TRAPPC6B.F1 rs.TRAPPC6B.R1 142 226
  • NCBIGene 36.3 122553
  • Single exon skipping, size difference: 84
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001079537

  • Changed! pfam TRAPP 153aa 4e-48 in ref transcript
    • Transport protein particle (TRAPP) component. TRAPP plays a key role in the targeting and/or fusion of ER-to-Golgi transport vesicles with their acceptor compartment. TRAPP is a large multimeric protein that contains at least 10 subunits. This family contains many TRAPP family proteins. The Bet3 subunit is one of the better characterised TRAPP proteins and has a dimeric structure with hydrophobic channels. The channel entrances are located on a putative membrane-interacting surface that is distinctively flat, wide and decorated with positively charged residues. Bet3 is proposed to localise TRAPP to the Golgi.
  • Changed! COG COG5128 141aa 1e-05 in ref transcript
    • Transport protein particle (TRAPP) complex subunit [Intracellular trafficking and secretion].
  • Changed! pfam TRAPP 125aa 1e-34 in modified transcript
  • Changed! COG COG5128 113aa 0.001 in modified transcript

TRDMT1

  • rs.TRDMT1.F1 rs.TRDMT1.R1 169 241
  • NCBIGene 36.3 1787
  • Single exon skipping, size difference: 72
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_004412

  • Changed! cd Cyt_C5_DNA_methylase 176aa 4e-46 in ref transcript
    • Cytosine-C5 specific DNA methylases; Methyl transfer reactions play an important role in many aspects of biology. Cytosine-specific DNA methylases are found both in prokaryotes and eukaryotes. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the mammalian genome. These effects include transcriptional repression via inhibition of transcription factor binding or the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability.
  • cd Cyt_C5_DNA_methylase 132aa 4e-14 in ref transcript
  • Changed! TIGR dcm 382aa 1e-21 in ref transcript
    • All proteins in this family for which functions are known are DNA-cytosine methyltransferases. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
  • Changed! COG Dcm 167aa 2e-33 in ref transcript
    • Site-specific DNA methylase [DNA replication, recombination, and repair].
  • PRK PRK10458 47aa 0.002 in ref transcript
    • DNA cytosine methylase; Provisional.
  • Changed! cd Cyt_C5_DNA_methylase 152aa 4e-36 in modified transcript
  • Changed! TIGR dcm 358aa 5e-17 in modified transcript
  • Changed! COG Dcm 143aa 2e-26 in modified transcript

TRO

  • rs.TRO.F1 rs.TRO.R1 103 118
  • NCBIGene 36.3 7216
  • Alternative 3-prime, size difference: 15
  • Exclusion in 5'UTR
  • Reference transcript: NM_001039705

  • pfam MAGE 170aa 9e-79 in ref transcript
    • MAGE family. The MAGE (melanoma antigen-encoding gene) family are expressed in a wide variety of tumours but not in normal cells, with the exception of the male germ cells, placenta, and, possibly, cells of the developing embryo. The cellular function of this family is unknown. This family also contains the yeast protein, Nse3. The Nse3 protein is part of the Smc5-6 complex. Nse3 has been demonstrated to be important for meiosis.

TRSPAP1

  • rs.TRSPAP1.F1 rs.TRSPAP1.R1 298 446
  • NCBIGene 36.3 54952
  • Single exon skipping, size difference: 148
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_017846

  • Changed! cd RRM 75aa 1e-09 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 69aa 3e-09 in ref transcript
  • Changed! TIGR PABP-1234 182aa 1e-18 in ref transcript
    • There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed, and of unknown tissue range.
  • Changed! COG COG0724 97aa 2e-08 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • Changed! COG COG0724 88aa 6e-07 in ref transcript
  • Changed! smart RRM 66aa 1e-10 in modified transcript
    • RNA recognition motif.
  • Changed! COG COG0724 70aa 3e-08 in modified transcript

TSC2

  • rs.TSC2.F1 rs.TSC2.R1 210 279
  • NCBIGene 36.3 7249
  • Single exon skipping, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000548

  • pfam Rap_GAP 184aa 2e-82 in ref transcript
    • Rap/ran-GAP.
  • pfam Tuberin 347aa 8e-53 in ref transcript
    • Tuberin. Tuberous sclerosis complex (TSC) is an autosomal dominant disorder and is characterised by the presence of hamartomas in many organs, such as brain, skin, heart, lung, and kidney. It is caused by mutation either TSC1 or TSC2 tumour suppressor gene. The TSC2 gene codes for tuberin and interacts with hamartin pfam04388, containing two coiled-coil regions, which have been shown to mediate binding to tuberin. These two proteins function within the same pathway(s) regulating cell cycle, cell growth, adhesion, and vesicular trafficking.

TSPAN4

  • rs.TSPAN4.F1 rs.TSPAN4.R1 163 243
  • NCBIGene 36.3 7106
  • Single exon skipping, size difference: 80
  • Exclusion of the protein initiation site
  • Reference transcript: NM_003271

  • cd NET-5_like_LEL 98aa 1e-40 in ref transcript
    • Tetraspanin, extracellular domain or large extracellular loop (LEL), NET-5_like family. Tetraspanins are trans-membrane proteins with 4 trans-membrane segments. Both the N- and C-termini lie on the intracellular side of the membrane. This alignment model spans the extracellular domain between the 3rd and 4th trans-membrane segment. Tetraspanins are involved in diverse processes and their various functions may relate to their ability to act as molecular facilitators. Tetraspanins associate laterally with one another and cluster dynamically with numerous parnter domains in membrane microdomains, forming a network of multimolecular complexes, the "tetraspanin web". This sub-family contains proteins similar to human tetraspan NET-5.
  • Changed! pfam Tetraspannin 222aa 3e-35 in ref transcript
    • Tetraspanin family.
  • Changed! pfam Tetraspannin 166aa 3e-24 in modified transcript

TTC23

  • rs.TTC23.F1 rs.TTC23.R1 229 351
  • NCBIGene 36.3 64927
  • Single exon skipping, size difference: 122
  • Exclusion in 5'UTR
  • Reference transcript: NM_001040655

TTLL3

  • rs.TTLL3.F1 rs.TTLL3.R1 360 486
  • NCBIGene 36.3 26140
  • Single exon skipping, size difference: 129
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001025930

  • pfam TTL 292aa 7e-90 in ref transcript
    • Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.

TTLL3

  • rs.TTLL3.F2 rs.TTLL3.R2 169 298
  • NCBIGene 36.3 26140
  • Single exon skipping, size difference: 129
  • Inclusion in the protein (no stop codon or frameshift)
  • Reference transcript: NM_001025930

  • pfam TTL 292aa 7e-90 in ref transcript
    • Tubulin-tyrosine ligase family. Tubulins and microtubules are subjected to several post-translational modifications of which the reversible detyrosination/tyrosination of the carboxy-terminal end of most alpha-tubulins has been extensively analysed. This modification cycle involves a specific carboxypeptidase and the activity of the tubulin-tyrosine ligase (TTL). The true physiological function of TTL has so far not been established. Tubulin-tyrosine ligase (TTL) catalyses the ATP-dependent post-translational addition of a tyrosine to the carboxy terminal end of detyrosinated alpha-tubulin. In normally cycling cells, the tyrosinated form of tubulin predominates. However, in breast cancer cells, the detyrosinated form frequently predominates, with a correlation to tumour aggressiveness. On the other hand, 3-nitrotyrosine has been shown to be incorporated, by TTL, into the carboxy terminal end of detyrosinated alpha-tubulin. This reaction is not reversible by the carboxypeptidase enzyme. Cells cultured in 3-nitrotyrosine rich medium showed evidence of altered microtubule structure and function, including altered cell morphology, epithelial barrier dysfunction, and apoptosis.

TTN

  • rs.TTN.F1 rs.TTN.R1 163 301
  • NCBIGene 36.3 7273
  • Single exon skipping, size difference: 138
  • Exclusion in 5'UTR
  • Reference transcript: NM_133378

  • cd S_TKc 256aa 3e-62 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases of the serine or threonine-specific kinase subfamily. The enzymatic activity of these protein kinases is controlled by phosphorylation of specific residues in the activation segment of the catalytic domain, sometimes combined with reversible conformational changes in the C-terminal autoregulatory tail.
  • cd FN3 91aa 3e-16 in ref transcript
    • Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
  • cd FN3 93aa 1e-15 in ref transcript
  • cd FN3 87aa 3e-15 in ref transcript
  • cd FN3 93aa 4e-15 in ref transcript
  • cd FN3 94aa 7e-15 in ref transcript
  • cd FN3 89aa 1e-14 in ref transcript
  • cd FN3 94aa 2e-14 in ref transcript
  • cd FN3 93aa 3e-14 in ref transcript
  • cd FN3 90aa 4e-14 in ref transcript
  • cd FN3 90aa 5e-14 in ref transcript
  • cd FN3 89aa 6e-14 in ref transcript
  • cd FN3 90aa 6e-14 in ref transcript
  • cd FN3 93aa 7e-14 in ref transcript
  • cd FN3 94aa 1e-13 in ref transcript
  • cd FN3 94aa 1e-13 in ref transcript
  • cd FN3 94aa 1e-13 in ref transcript
  • cd FN3 90aa 2e-13 in ref transcript
  • cd FN3 86aa 2e-13 in ref transcript
  • cd FN3 89aa 2e-13 in ref transcript
  • cd FN3 92aa 2e-13 in ref transcript
  • cd FN3 92aa 3e-13 in ref transcript
  • cd FN3 93aa 3e-13 in ref transcript
  • cd FN3 91aa 4e-13 in ref transcript
  • cd FN3 94aa 4e-13 in ref transcript
  • cd FN3 91aa 4e-13 in ref transcript
  • cd FN3 91aa 4e-13 in ref transcript
  • cd FN3 94aa 5e-13 in ref transcript
  • cd FN3 92aa 6e-13 in ref transcript
  • cd FN3 88aa 6e-13 in ref transcript
  • cd FN3 94aa 1e-12 in ref transcript
  • cd FN3 92aa 1e-12 in ref transcript
  • cd FN3 93aa 1e-12 in ref transcript
  • cd FN3 88aa 2e-12 in ref transcript
  • cd FN3 86aa 2e-12 in ref transcript
  • cd FN3 90aa 2e-12 in ref transcript
  • cd FN3 84aa 2e-12 in ref transcript
  • cd FN3 84aa 2e-12 in ref transcript
  • cd FN3 84aa 2e-12 in ref transcript
  • cd FN3 90aa 3e-12 in ref transcript
  • cd FN3 90aa 4e-12 in ref transcript
  • cd FN3 94aa 4e-12 in ref transcript
  • cd FN3 84aa 4e-12 in ref transcript
  • cd FN3 84aa 5e-12 in ref transcript
  • cd FN3 94aa 5e-12 in ref transcript
  • cd FN3 84aa 5e-12 in ref transcript
  • cd FN3 92aa 6e-12 in ref transcript
  • cd FN3 93aa 6e-12 in ref transcript
  • cd FN3 90aa 6e-12 in ref transcript
  • cd FN3 94aa 6e-12 in ref transcript
  • cd FN3 90aa 7e-12 in ref transcript
  • cd FN3 92aa 8e-12 in ref transcript
  • cd FN3 79aa 8e-12 in ref transcript
  • cd FN3 94aa 1e-11 in ref transcript
  • cd FN3 92aa 1e-11 in ref transcript
  • cd FN3 94aa 1e-11 in ref transcript
  • cd FN3 94aa 2e-11 in ref transcript
  • cd FN3 93aa 2e-11 in ref transcript
  • cd FN3 91aa 2e-11 in ref transcript
  • cd FN3 92aa 3e-11 in ref transcript
  • cd FN3 93aa 3e-11 in ref transcript
  • cd FN3 92aa 3e-11 in ref transcript
  • cd IGcam 87aa 3e-11 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 82aa 4e-11 in ref transcript
  • cd FN3 89aa 4e-11 in ref transcript
  • cd FN3 99aa 5e-11 in ref transcript
  • cd FN3 89aa 5e-11 in ref transcript
  • cd FN3 88aa 5e-11 in ref transcript
  • cd FN3 95aa 5e-11 in ref transcript
  • cd FN3 82aa 6e-11 in ref transcript
  • cd FN3 93aa 6e-11 in ref transcript
  • cd FN3 92aa 7e-11 in ref transcript
  • cd FN3 89aa 7e-11 in ref transcript
  • cd FN3 88aa 8e-11 in ref transcript
  • cd FN3 94aa 1e-10 in ref transcript
  • cd FN3 94aa 1e-10 in ref transcript
  • cd FN3 93aa 1e-10 in ref transcript
  • cd FN3 92aa 2e-10 in ref transcript
  • cd FN3 89aa 3e-10 in ref transcript
  • cd FN3 85aa 3e-10 in ref transcript
  • cd FN3 94aa 4e-10 in ref transcript
  • cd FN3 80aa 4e-10 in ref transcript
  • cd FN3 91aa 4e-10 in ref transcript
  • cd FN3 94aa 4e-10 in ref transcript
  • cd IGcam 89aa 4e-10 in ref transcript
  • cd FN3 95aa 5e-10 in ref transcript
  • cd FN3 94aa 5e-10 in ref transcript
  • cd FN3 92aa 5e-10 in ref transcript
  • cd FN3 82aa 6e-10 in ref transcript
  • cd FN3 94aa 1e-09 in ref transcript
  • cd FN3 89aa 1e-09 in ref transcript
  • cd IGcam 86aa 1e-09 in ref transcript
  • cd FN3 91aa 2e-09 in ref transcript
  • cd FN3 94aa 2e-09 in ref transcript
  • cd FN3 92aa 2e-09 in ref transcript
  • cd FN3 97aa 2e-09 in ref transcript
  • cd FN3 94aa 2e-09 in ref transcript
  • cd IGcam 88aa 2e-09 in ref transcript
  • cd FN3 90aa 3e-09 in ref transcript
  • cd IGcam 77aa 5e-09 in ref transcript
  • cd FN3 86aa 1e-08 in ref transcript
  • cd FN3 94aa 1e-08 in ref transcript
  • cd FN3 92aa 1e-08 in ref transcript
  • cd IGcam 87aa 2e-08 in ref transcript
  • cd IGcam 87aa 6e-08 in ref transcript
  • cd FN3 92aa 8e-08 in ref transcript
  • cd FN3 94aa 8e-08 in ref transcript
  • cd FN3 84aa 8e-08 in ref transcript
  • cd FN3 87aa 1e-07 in ref transcript
  • cd IGcam 91aa 1e-07 in ref transcript
  • cd IGcam 77aa 1e-07 in ref transcript
  • cd FN3 84aa 3e-07 in ref transcript
  • cd IGcam 92aa 3e-07 in ref transcript
  • cd IGcam 82aa 3e-07 in ref transcript
  • cd IGcam 79aa 3e-07 in ref transcript
  • cd IGcam 88aa 4e-07 in ref transcript
  • cd IGcam 80aa 1e-06 in ref transcript
  • cd IGcam 88aa 1e-06 in ref transcript
  • cd FN3 94aa 2e-06 in ref transcript
  • cd IGcam 92aa 2e-06 in ref transcript
  • cd IGcam 86aa 2e-06 in ref transcript
  • cd IGcam 77aa 4e-06 in ref transcript
  • cd IGcam 88aa 9e-06 in ref transcript
  • cd IGcam 80aa 1e-05 in ref transcript
  • cd IGcam 93aa 1e-05 in ref transcript
  • cd IGcam 83aa 2e-05 in ref transcript
  • cd FN3 94aa 3e-05 in ref transcript
  • cd IGcam 92aa 4e-05 in ref transcript
  • cd IGcam 92aa 4e-05 in ref transcript
  • cd IGcam 79aa 4e-05 in ref transcript
  • cd IGcam 82aa 4e-05 in ref transcript
  • cd IGcam 78aa 5e-05 in ref transcript
  • cd IGcam 91aa 6e-05 in ref transcript
  • cd IGcam 83aa 6e-05 in ref transcript
  • cd IGcam 85aa 8e-05 in ref transcript
  • cd IGcam 86aa 9e-05 in ref transcript
  • cd IGcam 91aa 1e-04 in ref transcript
  • cd IG 72aa 1e-04 in ref transcript
    • Immunoglobulin domain family; members are components of immunoglobulins, neuroglia, cell surface glycoproteins, such as, T-cell receptors, CD2, CD4, CD8, and membrane glycoproteins, such as, butyrophilin and chondroitin sulfate proteoglycan core protein. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd FN3 90aa 2e-04 in ref transcript
  • cd IGcam 79aa 2e-04 in ref transcript
  • cd IGcam 81aa 2e-04 in ref transcript
  • cd IGcam 75aa 3e-04 in ref transcript
  • cd IGcam 78aa 4e-04 in ref transcript
  • cd IGcam 74aa 4e-04 in ref transcript
  • cd IGcam 82aa 6e-04 in ref transcript
  • cd IGcam 92aa 0.001 in ref transcript
  • cd IGcam 85aa 0.001 in ref transcript
  • cd IGcam 72aa 0.001 in ref transcript
  • cd IGcam 71aa 0.004 in ref transcript
  • cd IGcam 92aa 0.007 in ref transcript
  • cd IGcam 84aa 0.008 in ref transcript
  • cd IGcam 77aa 0.009 in ref transcript
  • smart S_TKc 244aa 2e-63 in ref transcript
    • Serine/Threonine protein kinases, catalytic domain. Phosphotransferases. Serine or threonine-specific kinase subfamily.
  • pfam I-set 89aa 2e-23 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 88aa 4e-19 in ref transcript
  • pfam I-set 90aa 5e-19 in ref transcript
  • pfam I-set 90aa 1e-18 in ref transcript
  • pfam I-set 93aa 2e-18 in ref transcript
  • smart FN3 83aa 8e-17 in ref transcript
    • Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
  • pfam I-set 91aa 6e-16 in ref transcript
  • pfam I-set 92aa 6e-16 in ref transcript
  • smart FN3 84aa 1e-14 in ref transcript
  • smart FN3 83aa 1e-14 in ref transcript
  • pfam I-set 79aa 3e-14 in ref transcript
  • pfam I-set 81aa 3e-14 in ref transcript
  • smart FN3 86aa 3e-14 in ref transcript
  • pfam I-set 92aa 5e-14 in ref transcript
  • pfam I-set 81aa 8e-14 in ref transcript
  • smart FN3 83aa 2e-13 in ref transcript
  • smart FN3 84aa 4e-13 in ref transcript
  • pfam I-set 90aa 8e-13 in ref transcript
  • smart FN3 83aa 8e-13 in ref transcript
  • pfam I-set 76aa 1e-12 in ref transcript
  • smart FN3 80aa 1e-12 in ref transcript
  • smart FN3 84aa 1e-12 in ref transcript
  • smart FN3 81aa 1e-12 in ref transcript
  • smart FN3 83aa 1e-12 in ref transcript
  • smart FN3 85aa 1e-12 in ref transcript
  • smart FN3 83aa 2e-12 in ref transcript
  • pfam fn3 86aa 2e-12 in ref transcript
    • Fibronectin type III domain.
  • smart FN3 84aa 3e-12 in ref transcript
  • smart FN3 84aa 3e-12 in ref transcript
  • smart FN3 82aa 3e-12 in ref transcript
  • smart FN3 82aa 3e-12 in ref transcript
  • smart FN3 84aa 4e-12 in ref transcript
  • pfam I-set 87aa 5e-12 in ref transcript
  • smart FN3 84aa 5e-12 in ref transcript
  • smart FN3 82aa 6e-12 in ref transcript
  • smart FN3 82aa 6e-12 in ref transcript
  • pfam I-set 84aa 7e-12 in ref transcript
  • pfam I-set 81aa 7e-12 in ref transcript
  • smart FN3 83aa 7e-12 in ref transcript
  • pfam I-set 81aa 8e-12 in ref transcript
  • smart FN3 83aa 8e-12 in ref transcript
  • smart FN3 85aa 9e-12 in ref transcript
  • smart FN3 82aa 1e-11 in ref transcript
  • smart FN3 82aa 1e-11 in ref transcript
  • smart FN3 82aa 1e-11 in ref transcript
  • pfam I-set 93aa 2e-11 in ref transcript
  • smart FN3 83aa 2e-11 in ref transcript
  • smart FN3 90aa 2e-11 in ref transcript
  • smart FN3 81aa 2e-11 in ref transcript
  • smart FN3 84aa 3e-11 in ref transcript
  • smart FN3 83aa 3e-11 in ref transcript
  • smart FN3 85aa 3e-11 in ref transcript
  • smart FN3 84aa 3e-11 in ref transcript
  • smart FN3 84aa 3e-11 in ref transcript
  • smart FN3 84aa 3e-11 in ref transcript
  • smart FN3 84aa 3e-11 in ref transcript
  • pfam I-set 77aa 4e-11 in ref transcript
  • smart FN3 84aa 4e-11 in ref transcript
  • smart FN3 84aa 4e-11 in ref transcript
  • smart FN3 84aa 4e-11 in ref transcript
  • smart FN3 84aa 4e-11 in ref transcript
  • smart FN3 84aa 4e-11 in ref transcript
  • smart FN3 82aa 4e-11 in ref transcript
  • smart FN3 84aa 4e-11 in ref transcript
  • smart FN3 85aa 4e-11 in ref transcript
  • smart FN3 83aa 4e-11 in ref transcript
  • smart FN3 81aa 5e-11 in ref transcript
  • smart FN3 81aa 5e-11 in ref transcript
  • pfam I-set 81aa 6e-11 in ref transcript
  • smart FN3 84aa 6e-11 in ref transcript
  • smart FN3 82aa 6e-11 in ref transcript
  • smart FN3 84aa 6e-11 in ref transcript
  • pfam I-set 82aa 7e-11 in ref transcript
  • smart FN3 81aa 7e-11 in ref transcript
  • smart FN3 85aa 7e-11 in ref transcript
  • smart FN3 83aa 7e-11 in ref transcript
  • smart FN3 84aa 7e-11 in ref transcript
  • smart FN3 83aa 7e-11 in ref transcript
  • smart FN3 74aa 9e-11 in ref transcript
  • smart FN3 84aa 9e-11 in ref transcript
  • pfam I-set 84aa 1e-10 in ref transcript
  • pfam I-set 72aa 1e-10 in ref transcript
  • smart FN3 84aa 1e-10 in ref transcript
  • smart FN3 81aa 1e-10 in ref transcript
  • pfam I-set 90aa 2e-10 in ref transcript
  • smart FN3 84aa 2e-10 in ref transcript
  • smart FN3 83aa 2e-10 in ref transcript
  • smart FN3 84aa 2e-10 in ref transcript
  • smart FN3 84aa 2e-10 in ref transcript
  • pfam I-set 81aa 4e-10 in ref transcript
  • smart FN3 79aa 4e-10 in ref transcript
  • pfam I-set 81aa 5e-10 in ref transcript
  • smart FN3 84aa 5e-10 in ref transcript
  • smart FN3 84aa 5e-10 in ref transcript
  • smart FN3 84aa 6e-10 in ref transcript
  • smart FN3 83aa 6e-10 in ref transcript
  • smart FN3 83aa 6e-10 in ref transcript
  • pfam fn3 80aa 6e-10 in ref transcript
  • smart FN3 85aa 7e-10 in ref transcript
  • pfam fn3 86aa 9e-10 in ref transcript
  • smart FN3 82aa 1e-09 in ref transcript
  • smart FN3 84aa 1e-09 in ref transcript
  • smart FN3 85aa 1e-09 in ref transcript
  • smart FN3 77aa 1e-09 in ref transcript
  • pfam I-set 78aa 2e-09 in ref transcript
  • smart FN3 84aa 2e-09 in ref transcript
  • smart FN3 82aa 2e-09 in ref transcript
  • smart FN3 81aa 2e-09 in ref transcript
  • smart FN3 80aa 2e-09 in ref transcript
  • smart FN3 83aa 2e-09 in ref transcript
  • smart FN3 84aa 3e-09 in ref transcript
  • pfam I-set 90aa 4e-09 in ref transcript
  • smart FN3 85aa 4e-09 in ref transcript
  • pfam I-set 72aa 5e-09 in ref transcript
  • smart FN3 81aa 5e-09 in ref transcript
  • smart FN3 84aa 5e-09 in ref transcript
  • pfam fn3 87aa 5e-09 in ref transcript
  • pfam I-set 88aa 6e-09 in ref transcript
  • smart FN3 76aa 6e-09 in ref transcript
  • pfam I-set 70aa 7e-09 in ref transcript
  • smart FN3 84aa 7e-09 in ref transcript
  • pfam I-set 90aa 8e-09 in ref transcript
  • pfam I-set 82aa 9e-09 in ref transcript
  • pfam I-set 93aa 1e-08 in ref transcript
  • smart FN3 85aa 2e-08 in ref transcript
  • pfam I-set 81aa 4e-08 in ref transcript
  • smart FN3 84aa 5e-08 in ref transcript
  • pfam I-set 77aa 6e-08 in ref transcript
  • pfam I-set 81aa 7e-08 in ref transcript
  • smart FN3 72aa 8e-08 in ref transcript
  • pfam I-set 83aa 9e-08 in ref transcript
  • pfam I-set 79aa 9e-08 in ref transcript
  • smart FN3 84aa 9e-08 in ref transcript
  • smart FN3 82aa 9e-08 in ref transcript
  • pfam I-set 81aa 1e-07 in ref transcript
  • pfam I-set 78aa 2e-07 in ref transcript
  • smart FN3 84aa 2e-07 in ref transcript
  • smart FN3 70aa 2e-07 in ref transcript
  • smart FN3 84aa 2e-07 in ref transcript
  • pfam I-set 60aa 3e-07 in ref transcript
  • pfam I-set 83aa 3e-07 in ref transcript
  • smart FN3 80aa 3e-07 in ref transcript
  • pfam I-set 81aa 5e-07 in ref transcript
  • smart FN3 85aa 5e-07 in ref transcript
  • smart FN3 77aa 6e-07 in ref transcript
  • pfam I-set 81aa 1e-06 in ref transcript
  • pfam I-set 79aa 1e-06 in ref transcript
  • smart FN3 84aa 3e-06 in ref transcript
  • smart FN3 84aa 4e-06 in ref transcript
  • pfam I-set 81aa 1e-05 in ref transcript
  • pfam I-set 81aa 1e-05 in ref transcript
  • smart FN3 77aa 2e-05 in ref transcript
  • smart FN3 84aa 2e-05 in ref transcript
  • pfam I-set 73aa 4e-05 in ref transcript
  • pfam I-set 81aa 5e-05 in ref transcript
  • pfam I-set 85aa 5e-05 in ref transcript
  • pfam I-set 78aa 8e-05 in ref transcript
  • COG SPS1 286aa 2e-28 in ref transcript
    • Serine/threonine protein kinase [General function prediction only / Signal transduction mechanisms / Transcription / DNA replication, recombination, and repair].
  • COG COG4733 244aa 2e-04 in ref transcript
    • Phage-related protein, tail component [Function unknown].
  • COG COG4733 213aa 2e-04 in ref transcript
  • COG COG4733 184aa 0.004 in ref transcript

TXNDC2

  • rs.TXNDC2.F1 rs.TXNDC2.R1 167 370
  • NCBIGene 36.3 84203
  • Alternative 5-prime, size difference: 203
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001098529

  • cd TRX_family 94aa 1e-19 in ref transcript
    • TRX family; composed of two groups: Group I, which includes proteins that exclusively encode a TRX domain; and Group II, which are composed of fusion proteins of TRX and additional domains. Group I TRX is a small ancient protein that alter the redox state of target proteins via the reversible oxidation of an active site dithiol, present in a CXXC motif, partially exposed at the protein's surface. TRX reduces protein disulfide bonds, resulting in a disulfide bond at its active site. Oxidized TRX is converted to the active form by TRX reductase, using reducing equivalents derived from either NADPH or ferredoxins. By altering their redox state, TRX regulates the functions of at least 30 target proteins, some of which are enzymes and transcription factors. It also plays an important role in the defense against oxidative stress by directly reducing hydrogen peroxide and certain radicals, and by serving as a reductant for peroxiredoxins. At least two major types of functional TRXs have been reported in most organisms; in eukaryotes, they are located in the cytoplasm and the mitochondria. Higher plants contain more types (at least 20 TRX genes have been detected in the genome of Arabidopsis thaliana), two of which (types f amd m) are located in the same compartment, the chloroplast. Also included in the alignment are TRX-like domains which show sequence homology to TRX but do not contain the redox active CXXC motif. Group II proteins, in addition to either a redox active TRX or a TRX-like domain, also contain additional domains, which may or may not possess homology to known proteins.
  • pfam Thioredoxin 103aa 4e-18 in ref transcript
    • Thioredoxin. Thioredoxins are small enzymes that participate in redox reactions, via the reversible oxidation of an active centre disulfide bond. Some members with only the active site are not separated from the noise.
  • pfam Glutenin_hmw 360aa 9e-13 in ref transcript
    • High molecular weight glutenin subunit. Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterised by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.
  • PTZ PTZ00051 99aa 2e-18 in ref transcript
    • thioredoxin; Provisional.
  • PRK PRK07764 177aa 0.002 in ref transcript
    • DNA polymerase III subunits gamma and tau; Validated.

U2AF2

  • rs.U2AF2.F1 rs.U2AF2.R1 104 116
  • NCBIGene 36.3 11338
  • Alternative 5-prime, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007279

  • cd RRM 75aa 1e-12 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • cd RRM 79aa 4e-08 in ref transcript
  • cd RRM 58aa 2e-05 in ref transcript
  • Changed! TIGR U2AF_lg 412aa 1e-160 in ref transcript
    • Members of this subfamily are found in plants, metazoa and fungi.
  • COG COG0724 82aa 2e-09 in ref transcript
    • RNA-binding proteins (RRM domain) [General function prediction only].
  • COG COG0724 189aa 8e-06 in ref transcript
  • Changed! TIGR U2AF_lg 408aa 1e-161 in modified transcript

UBE2C

  • rs.UBE2C.F1 rs.UBE2C.R1 218 305
  • NCBIGene 36.3 11065
  • Single exon skipping, size difference: 87
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007019

  • Changed! cd UBCc 138aa 6e-45 in ref transcript
    • Ubiquitin-conjugating enzyme E2, catalytic (UBCc) domain. This is part of the ubiquitin-mediated protein degradation pathway in which a thiol-ester linkage forms between a conserved cysteine and the C-terminus of ubiquitin and complexes with ubiquitin protein ligase enzymes, E3. This pathway regulates many fundamental cellular processes. There are also other E2s which form thiol-ester linkages without the use of E3s as well as several UBC homologs (TSG101, Mms2, Croc-1 and similar proteins) which lack the active site cysteine essential for ubiquitination and appear to function in DNA repair pathways which were omitted from the scope of this CD.
  • Changed! pfam UQ_con 137aa 5e-53 in ref transcript
    • Ubiquitin-conjugating enzyme. Proteins destined for proteasome-mediated degradation may be ubiquitinated. Ubiquitination follows conjugation of ubiquitin to a conserved cysteine residue of UBC homologues. TSG101 is one of several UBC homologues that lacks this active site cysteine.
  • Changed! COG COG5078 140aa 3e-43 in ref transcript
    • Ubiquitin-protein ligase [Posttranslational modification, protein turnover, chaperones].
  • Changed! cd UBCc 98aa 1e-30 in modified transcript
  • Changed! pfam UQ_con 98aa 3e-37 in modified transcript
  • Changed! COG COG5078 100aa 3e-32 in modified transcript

UBE4B

  • rs.UBE4B.F1 rs.UBE4B.R1 102 489
  • NCBIGene 36.3 10277
  • Single exon skipping, size difference: 387
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001105562

  • pfam Ufd2P_core 623aa 0.0 in ref transcript
    • Ubiquitin elongating factor core. This is the most conserved part of the core region of Ufd2P ubiquitin elongating factor or E4, running from helix alpha-11 to alpha-38. It consists of 31 helices of variable length connected by loops of variable size forming a compact unit; the helical packing pattern of the compact unit consists of five structural repeats that resemble tandem Armadillo (ARM) repeats. This domain is involved in ubiquitination as it binds Cdc48p and escorts ubiquitinated proteins from Cdc48p to the proteasome for degradation. The core is structurally similar to the nuclear transporter protein importin-alpha. The core is associated with the U-box at the C-terminus, pfam04564, which has ligase activity.
  • pfam U-box 74aa 2e-27 in ref transcript
    • U-box domain. This domain is related to the Ring finger pfam00097 but lacks the zinc binding residues.
  • COG UFD2 804aa 1e-107 in ref transcript
    • Ubiquitin fusion degradation protein 2 [Posttranslational modification, protein turnover, chaperones].

UBL5

  • rs.UBL5.F1 rs.UBL5.R1 109 189
  • NCBIGene 36.3 59286
  • Alternative 5-prime, size difference: 80
  • Exclusion in 5'UTR
  • Reference transcript: NM_024292

  • cd Ubl5 73aa 3e-34 in ref transcript
    • UBL5 (also known as HUB1) is a ubiquitin-like modifier that is both widely expressed and highly phylogenetically conserved. At the C-terminal end of the ubiquitin-like fold of UBL5 is a di-tyrosine motif followed by a single variable residue instead of the characteristic di-glycine found in all other ubiquitin-like modifiers. ULB5 interacts with a cyclin-like kinase called CLK4 but not with other cyclin-like kinase family members.
  • pfam ubiquitin 61aa 0.004 in ref transcript
    • Ubiquitin family. This family contains a number of ubiquitin-like proteins: SUMO (smt3 homologue), Nedd8, Elongin B, Rub1, and Parkin. A number of them are thought to carry a distinctive five-residue motif termed the proteasome-interacting motif (PIM), which may have a biologically significant role in protein delivery to proteasomes and recruitment of proteasomes to transcription sites.

UBTF

  • rs.UBTF.F1 rs.UBTF.R1 326 437
  • NCBIGene 36.3 7343
  • Single exon skipping, size difference: 111
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_014233

  • Changed! cd HMGB-UBF_HMG-box 66aa 1e-06 in ref transcript
    • HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • cd HMGB-UBF_HMG-box 60aa 4e-05 in ref transcript
  • cd HMGB-UBF_HMG-box 63aa 2e-04 in ref transcript
  • cd HMG-box 44aa 2e-04 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • Changed! smart HMG 68aa 9e-08 in ref transcript
    • high mobility group.
  • smart HMG 67aa 2e-06 in ref transcript
  • smart HMG 62aa 4e-06 in ref transcript
  • pfam HMG_box 44aa 4e-05 in ref transcript
    • HMG (high mobility group) box.
  • pfam HMG_box 65aa 2e-04 in ref transcript
  • smart HMG 65aa 3e-04 in ref transcript
  • COG NHP6B 103aa 0.003 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

UBXD5

  • rs.UBXD5.F1 rs.UBXD5.R1 133 493
  • NCBIGene 36.3 91544
  • Multiple exon skipping, size difference: 360
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_183008

  • pfam SEP 72aa 6e-19 in ref transcript
    • SEP domain. The SEP domain is named after Saccharomyces cerevisiae Shp1, Drosophila melanogaster eyes closed gene (eyc), and vertebrate p47. In p47, the SEP domain has been shown to bind to and inhibit the cysteine protease cathepsin L. Most SEP domains are succeeded closely by a UBX domain.
  • Changed! PRK bchH 138aa 0.007 in ref transcript
    • magnesium chelatase subunit H; Provisional.

UEVLD

  • rs.UEVLD.F1 rs.UEVLD.R1 193 317
  • NCBIGene 36.3 55293
  • Single exon skipping, size difference: 124
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001040697

  • Changed! cd LDH_1 289aa 1e-117 in ref transcript
    • A subgroup of L-lactate dehydrogenases. L-lactate dehydrogenases (LDH) are tetrameric enzymes catalyzing the last step of glycolysis in which pyruvate is converted to L-lactate. This subgroup is composed of eukaryotic LDHs. Vertebrate LDHs are non-allosteric. This is in contrast to some bacterial LDHs that are activated by an allosteric effector such as fructose-1,6-bisphosphate. LDHs are part of the NAD(P)-binding Rossmann fold superfamily, which includes a wide variety of protein families including the NAD(P)-binding domains of alcohol dehydrogenases, tyrosine-dependent oxidoreductases, glyceraldehyde-3-phosphate dehydrogenases, formate/glycerate dehydrogenases, siroheme synthases, 6-phosphogluconate dehydrogenases, aminoacid dehydrogenases, repressor rex, and NAD-binding potassium channel domains, among others.
  • Changed! TIGR L-LDH-NAD 276aa 3e-52 in ref transcript
    • This model represents the NAD-dependent L-lactate dehydrogenases from bacteria and eukaryotes. This enzyme function as as the final step in anaerobic glycolysis. Although lactate dehydrogenases have in some cases been mistaken for malate dehydrogenases due to the similarity of these two substrates and the apparent ease with which evolution can toggle these activities, critical residues have been identified which can discriminate between the two activities. At the time of the creation of this model no hits above the trusted cutoff contained critical residues typical of malate dehydrogenases.
  • pfam UEV 120aa 4e-52 in ref transcript
    • UEV domain. This family includes the eukaryotic tumour susceptibility gene 101 protein (TSG101). Altered transcripts of this gene have been detected in sporadic breast cancers and many other human malignancies. However, the involvement of this gene in neoplastic transformation and tumorigenesis is still elusive. TSG101 is required for normal cell function of embryonic and adult tissues but that this gene is not a tumour suppressor for sporadic forms of breast cancer. This family is related to the ubiquitin conjugating enzymes.
  • Changed! COG Mdh 288aa 1e-44 in ref transcript
    • Malate/lactate dehydrogenases [Energy production and conversion].
  • Changed! cd LDH_1 182aa 1e-75 in modified transcript
  • Changed! TIGR L-LDH-NAD 173aa 2e-31 in modified transcript
  • Changed! COG Mdh 176aa 5e-29 in modified transcript

UMOD

  • rs.UMOD.F1 rs.UMOD.R1 107 170
  • NCBIGene 36.3 7369
  • Alternative 3-prime, size difference: 63
  • Inclusion in 5'UTR
  • Reference transcript: NM_001008389

  • cd EGF_CA 42aa 2e-04 in ref transcript
    • Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
  • cd EGF_CA 36aa 4e-04 in ref transcript
  • smart ZP 250aa 1e-55 in ref transcript
    • Zona pellucida (ZP) domain. ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).
  • pfam EGF_CA 39aa 1e-06 in ref transcript
    • Calcium binding EGF domain.
  • smart EGF_CA 42aa 1e-05 in ref transcript
    • Calcium-binding EGF-like domain.

UPF3B

  • rs.UPF3B.F1 rs.UPF3B.R1 176 215
  • NCBIGene 36.3 65109
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_080632

  • pfam Smg4_UPF3 163aa 1e-34 in ref transcript
    • Smg-4/UPF3 family. This family contains proteins that are involved in nonsense mediated mRNA decay. A process that is triggered by premature stop codons in mRNA. The family includes Smg-4 and UPF3.

URG4

  • rs.URG4.F1 rs.URG4.R1 100 127
  • NCBIGene 36.3 55665
  • Single exon skipping, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001077663

  • COG COG2433 120aa 4e-05 in ref transcript
    • Uncharacterized conserved protein [Function unknown].

USP5

  • rs.USP5.F1 rs.USP5.R1 180 249
  • NCBIGene 36.3 8078
  • Alternative 5-prime, size difference: 69
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001098536

  • cd Peptidase_C19B 272aa 6e-91 in ref transcript
    • A subfamily of Peptidase C19. Peptidase C19 contains ubiquitinyl hydrolases. They are intracellular peptidases that remove ubiquitin molecules from polyubiquinated peptides by cleavage of isopeptide bonds. They hydrolyze bonds involving the carboxyl group of the C-terminal Gly residue of ubiquitin. The purpose of the de-ubiquitination is thought to be editing of the ubiquitin conjugates, which could rescue them from degradation, as well as recycling of the ubiquitin. The ubiquitin/proteasome system is responsible for most protein turnover in the mammalian cell, and with over 50 members, family C19 is one of the largest families of peptidases in the human genome.
  • cd Peptidase_C19B 58aa 8e-25 in ref transcript
  • cd UBA 34aa 1e-07 in ref transcript
    • Ubiquitin Associated domain. The UBA domain is a commonly occurring sequence motif in some members of the ubiquitination pathway, UV excision repair proteins, and certain protein kinases. Although its specific role is so far unknown, it has been suggested that UBA domains are involved in conferring protein target specificity. The domain, a compact three helix bundle, has a conserved GFP-loop and the proline is thought to be critical for binding. The UBA domain is distinct from the conserved three helical domain seen in the N-terminus of EF-TS and eukaryotic NAC proteins.
  • cd UBA 39aa 6e-06 in ref transcript
  • pfam UCH 290aa 1e-44 in ref transcript
    • Ubiquitin carboxyl-terminal hydrolase.
  • pfam zf-UBP 76aa 3e-20 in ref transcript
    • Zn-finger in ubiquitin-hydrolases and other protein.
  • pfam UCH 70aa 2e-15 in ref transcript
  • pfam UBA 35aa 5e-08 in ref transcript
    • UBA/TS-N domain. This small domain is composed of three alpha helices. This family includes the previously defined UBA and TS-N domains. The UBA-domain (ubiquitin associated domain) is a novel sequence motif found in several proteins having connections to ubiquitin and the ubiquitination pathway. The structure of the UBA domain consists of a compact three helix bundle. This domain is found at the N terminus of EF-TS hence the name TS-N. The structure of EF-TS is known and this domain is implicated in its interaction with EF-TU. The domain has been found in non EF-TS proteins such as alpha-NAC and MJ0280.
  • pfam UBA 36aa 3e-05 in ref transcript
  • Changed! COG UBP14 838aa 1e-101 in ref transcript
    • Isopeptidase T [Posttranslational modification, protein turnover, chaperones].
  • Changed! COG UBP14 815aa 1e-105 in modified transcript

VCAM1

  • rs.VCAM1.F1 rs.VCAM1.R1 110 386
  • NCBIGene 36.3 7412
  • Single exon skipping, size difference: 276
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001078

  • cd IGcam 76aa 2e-08 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • cd IGcam 76aa 1e-07 in ref transcript
  • cd IGcam 86aa 1e-05 in ref transcript
  • Changed! cd IGcam 86aa 4e-05 in ref transcript
  • cd IGcam 81aa 0.003 in ref transcript
  • pfam C2-set 89aa 7e-15 in ref transcript
    • Immunoglobulin C2-set domain.
  • pfam C2-set 89aa 6e-12 in ref transcript
  • smart IGc2 61aa 6e-12 in ref transcript
    • Immunoglobulin C-2 Type.
  • Changed! pfam I-set 87aa 2e-11 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam I-set 84aa 3e-11 in ref transcript
  • smart IG_like 79aa 3e-09 in ref transcript
    • Immunoglobulin like. IG domains that cannot be classified into one of IGv1, IGc1, IGc2, IG.
  • smart IG_like 78aa 9e-06 in ref transcript

VDR

  • rs.VDR.F1 rs.VDR.R1 118 240
  • NCBIGene 36.3 7421
  • Single exon skipping, size difference: 122
  • Exclusion in 5'UTR
  • Reference transcript: NM_001017535

  • cd NR_LBD_VDR 204aa 1e-117 in ref transcript
    • The ligand binding domain of vitamin D receptors, a member of the nuclear receptor superfamily. The ligand binding domain of vitamin D receptors (VDR): VDR is a member of the nuclear receptor (NR) superfamily that functions as classical endocrine receptors. VDR controls a wide range of biological activities including calcium metabolism, cell proliferation and differentiation, and immunomodulation. VDR is a high affinity receptor for the biologically most active Vitamin D metabolite, 1alpha,25-dihydroxyvitamin D3 (1alpha,25(OH)2D3). The binding of the ligand to the receptor induces a conformational change of the ligand binding domain (LBD) with consequent dissociation of corepressors. Upon ligand binding, VDR forms heterodimer with the retinoid X receptor (RXR) that binds to vitamin D response elements (VDREs), recruits coactivators. This leads to the expression of a large number of genes. Approximately 200 human genes are considered to be primary targets of VDR and even more genes are regulated indirectly. Like other members of the nuclear receptor (NR) superfamily of ligand-activated transcription factors, VDR has a central well conserved DNA binding domain (DBD), a variable N-terminal domain, a flexible hinge and a C-terminal ligand binding domain (LBD).
  • cd NR_LBD_VDR 36aa 8e-14 in ref transcript
  • pfam Hormone_recep 187aa 5e-34 in ref transcript
    • Ligand-binding domain of nuclear hormone receptor. This all helical domain is involved in binding the hormone in these receptors.
  • pfam zf-C4 76aa 1e-33 in ref transcript
    • Zinc finger, C4 type (two domains). In nearly all cases, this is the DNA binding domain of a nuclear hormone receptor. The alignment contains two Zinc finger domains that are too dissimilar to be aligned with each other.

VEGFA

  • rs.VEGFA.F1 rs.VEGFA.R1 106 141
  • NCBIGene 36.3 7422
  • Alternative 5-prime, size difference: 35
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001025366

  • cd PDGF 84aa 2e-24 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
  • smart PDGF 83aa 2e-33 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.

VEGFA

  • rs.VEGFA.F2 rs.VEGFA.R2 338 404
  • NCBIGene 36.3 7422
  • Alternative 3-prime, size difference: 66
  • Exclusion of the stop codon
  • Reference transcript: NM_001025366

  • cd PDGF 84aa 2e-24 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
  • smart PDGF 83aa 2e-33 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.

VEGFA

  • rs.VEGFA.F3 rs.VEGFA.R3 127 178
  • NCBIGene 36.3 7422
  • Alternative 5-prime, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001025366

  • cd PDGF 84aa 2e-24 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family domain; PDGF is a potent activator for cells of mesenchymal origin; PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer; VEGF is a potent mitogen in embryonic and somatic angiogenesis with a unique specificity for vascular endothelial cells; VEGF forms homodimers and exists in 4 different isoforms; overall, the VEGF monomer resembles that of PDGF, but its N-terminal segment is helical rather than extended; the cysteine knot motif is a common feature of this domain.
  • smart PDGF 83aa 2e-33 in ref transcript
    • Platelet-derived and vascular endothelial growth factors (PDGF, VEGF) family. Platelet-derived growth factor is a potent activator for cells of mesenchymal origin. PDGF-A and PDGF-B form AA and BB homodimers and an AB heterodimer. Members of the VEGF family are homologues of PDGF.

VHL

  • rs.VHL.F1 rs.VHL.R1 304 427
  • NCBIGene 36.3 7428
  • Single exon skipping, size difference: 123
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_000551

  • Changed! pfam VHL 156aa 2e-81 in ref transcript
    • von Hippel-Lindau disease tumour suppressor protein. VHL forms a ternary complex with the elonginB and elonginC proteins. This complex binds Cul2, which then is involved in regulation of vascular endothelial growth factor mRNA.
  • Changed! pfam VHL 115aa 1e-49 in modified transcript

VPS13B

  • rs.VPS13B.F1 rs.VPS13B.R1 314 389
  • NCBIGene 36.3 157680
  • Mutually exclusive exon skipping, size difference: 75
  • Inclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017890

  • COG MRS6 199aa 7e-17 in ref transcript
    • Vacuolar protein sorting-associated protein [Intracellular trafficking and secretion].
  • COG MRS6 83aa 8e-11 in ref transcript

VPS16

  • rs.VPS16.F1 rs.VPS16.R1 104 536
  • NCBIGene 36.3 64601
  • Multiple exon skipping, size difference: 432
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift), Exclusion in the protein causing a frameshift
  • Reference transcript: NM_022575

  • Changed! pfam Vps16_N 417aa 1e-167 in ref transcript
    • Vps16, N-terminal region. This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.
  • pfam Vps16_C 319aa 1e-115 in ref transcript
    • Vps16, C-terminal region. This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.
  • Changed! pfam Vps16_N 298aa 1e-108 in modified transcript

VPS26A

  • rs.VPS26A.F1 rs.VPS26A.R1 133 276
  • NCBIGene 36.3 9559
  • Single exon skipping, size difference: 143
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_004896

  • Changed! pfam Vps26 276aa 1e-135 in ref transcript
    • Vacuolar protein sorting-associated protein 26. Vacuolar protein sorting-associated protein (Vps) 26 is one of around 50 proteins involved in protein trafficking. In particular, Vps26 assembles into a retromer complex with at least four other proteins Vps5, Vps17, Vps29 and Vps35. This family also contains Down syndrome critical region 3/A.
  • Changed! pfam Vps26 236aa 1e-110 in modified transcript

VPS29

  • rs.VPS29.F1 rs.VPS29.R1 100 112
  • NCBIGene 36.3 51699
  • Single exon skipping, size difference: 12
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_057180

  • Changed! TIGR yfcE 139aa 5e-23 in ref transcript
    • Members of this largely uncharacterized family share a motif approximating DXH(X25)GDXXD(X25)GNHD as found in several phosphoesterases, including the nucleases SbcD and Mre11, and a family of uncharacterized archaeal putative phosphoesterases described by TIGR00024. In this family, the His residue in GNHD portion of the motif is not conserved. The member MJ0936, one of two from Methanococcus jannaschii, was shown (PMID:15128743) to act on model phosphodiesterase substrates; a divalent cation was required.
  • Changed! COG COG0622 173aa 3e-24 in ref transcript
    • Predicted phosphoesterase [General function prediction only].
  • Changed! TIGR yfcE 140aa 5e-24 in modified transcript
  • Changed! COG COG0622 174aa 4e-25 in modified transcript

VSIG4

  • rs.VSIG4.F1 rs.VSIG4.R1 102 384
  • NCBIGene 36.3 11326
  • Single exon skipping, size difference: 282
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_007268

  • Changed! cd IGcam 70aa 1e-06 in ref transcript
    • Immunoglobulin domain cell adhesion molecule (cam) subfamily; members are components of neural cell adhesion molecules (N-CAM L1), Fasciclin II and the insect immune protein Hemolin. The subfamily also includes receptor domains such as as the extracelluar ligand binding domain of Fibroblast Growth Factor Receptor 2. Members are phylogenetically diverse, occuring throughout metazoa, and are not components of the adaptive immune system molecules found in jawed vertebrates. A predominant feature of most Ig domains is a disulfide bridge connecting 2 beta-sheets with a Trp packing against the disulfide bond.
  • Changed! pfam I-set 76aa 1e-06 in ref transcript
    • Immunoglobulin I-set domain.
  • pfam V-set 115aa 6e-05 in ref transcript
    • Immunoglobulin V-set domain. This domain is found in antibodies as well as neural protein P0 and CTL4 amongst others.

WDR1

  • rs.WDR1.F1 rs.WDR1.R1 100 520
  • NCBIGene 36.3 9948
  • Multiple exon skipping, size difference: 420
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017491

  • Changed! cd WD40 340aa 3e-43 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! cd WD40 291aa 5e-25 in ref transcript
  • Changed! smart WD40 40aa 4e-06 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • pfam WD40 32aa 5e-05 in ref transcript
    • WD domain, G-beta repeat.
  • Changed! smart WD40 34aa 1e-04 in ref transcript
  • smart WD40 42aa 3e-04 in ref transcript
  • Changed! pfam eIF2A 97aa 0.004 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • Changed! COG COG2319 427aa 1e-22 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG COG2319 223aa 1e-10 in ref transcript
  • Changed! cd WD40 378aa 2e-31 in modified transcript
  • Changed! cd WD40 78aa 1e-05 in modified transcript
  • Changed! pfam WD40 39aa 1e-05 in modified transcript
  • Changed! pfam WD40 33aa 2e-04 in modified transcript
  • Changed! pfam eIF2A 89aa 0.006 in modified transcript
  • Changed! COG COG2319 420aa 1e-21 in modified transcript

WDR16

  • rs.WDR16.F1 rs.WDR16.R1 198 402
  • NCBIGene 36.3 146845
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 204
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_145054

  • cd WD40 285aa 9e-34 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • Changed! cd WD40 343aa 3e-16 in ref transcript
  • smart WD40 40aa 0.005 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 42aa 0.008 in ref transcript
  • COG COG2319 286aa 9e-21 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! COG COG2319 353aa 3e-14 in ref transcript
  • Changed! cd WD40 343aa 5e-14 in modified transcript
  • Changed! COG COG2319 315aa 8e-11 in modified transcript

WDR62

  • rs.WDR62.F1 rs.WDR62.R1 99 114
  • NCBIGene 36.3 284403
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001083961

  • cd WD40 253aa 2e-25 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd WD40 138aa 8e-12 in ref transcript
  • cd WD40 200aa 2e-10 in ref transcript
  • cd WD40 294aa 1e-09 in ref transcript
  • smart WD40 40aa 0.001 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • COG COG2319 307aa 3e-18 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • COG COG2319 350aa 5e-11 in ref transcript

WHSC1

  • rs.WHSC1.F1 rs.WHSC1.R1 172 352
  • NCBIGene 36.3 7468
  • Single exon skipping, size difference: 180
  • Exclusion in 5'UTR
  • Reference transcript: NM_133330

  • cd MSH6_like 120aa 2e-43 in ref transcript
    • The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS. The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain. Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis. In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • cd WHSC1_related 95aa 2e-40 in ref transcript
    • The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS). When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
  • cd HMG-box 50aa 2e-06 in ref transcript
    • High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
  • pfam SET 126aa 1e-36 in ref transcript
    • SET domain. SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.
  • smart PWWP 63aa 3e-18 in ref transcript
    • domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues.
  • smart AWS 51aa 2e-10 in ref transcript
    • associated with SET domains. subdomain of PRESET.
  • smart PWWP 63aa 2e-09 in ref transcript
  • smart HMG 63aa 1e-06 in ref transcript
    • high mobility group.
  • pfam PHD 43aa 2e-06 in ref transcript
    • PHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains. Several PHD fingers have been identified as binding modules of methylated histone H3.
  • pfam PHD 45aa 0.004 in ref transcript
  • COG COG2940 245aa 5e-19 in ref transcript
    • Proteins containing SET domain [General function prediction only].
  • COG NHP6B 66aa 0.003 in ref transcript
    • Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics].

WISP1

  • rs.WISP1.F1 rs.WISP1.R1 125 386
  • NCBIGene 36.3 8840
  • Single exon skipping, size difference: 261
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_003882

  • Changed! pfam VWC 58aa 5e-04 in ref transcript
    • von Willebrand factor type C domain. The high cutoff was used to prevent overlap with pfam00094.
  • pfam TSP_1 40aa 5e-04 in ref transcript
    • Thrombospondin type 1 domain.

WSB1

  • rs.WSB1.F1 rs.WSB1.R1 106 544
  • NCBIGene 36.3 26118
  • Multiple exon skipping, size difference: 438
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_015626

  • Changed! cd WD40 244aa 9e-44 in ref transcript
    • WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
  • cd SOCS_WSB1_SWIP1 40aa 7e-17 in ref transcript
    • SOCS (suppressors of cytokine signaling) box of WSB1/SWiP1-like proteins. This subfamily contains WSB-1 (SOCS-box-containing WD-40 protein), part of an E3 ubiquitin ligase for the thyroid-hormone-activating type 2 iodothyronine deiodinase (D2) and SWiP-1 (SOCS box and WD-repeats in Protein), a WD40-containing protein that is expressed in embryonic structures of chickens and regulated by Sonic Hedgehog (Shh). The general function of the SOCS box is the recruitment of the ubiquitin-transferase system. The SOCS box interacts with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
  • smart SOCS 43aa 2e-12 in ref transcript
    • suppressors of cytokine signalling. suppressors of cytokine signalling.
  • pfam WD40 36aa 7e-06 in ref transcript
    • WD domain, G-beta repeat.
  • Changed! smart WD40 41aa 6e-05 in ref transcript
    • WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
  • smart WD40 40aa 7e-05 in ref transcript
  • pfam eIF2A 77aa 0.003 in ref transcript
    • Eukaryotic translation initiation factor eIF2A. This is a family of eukaryotic translation initiation factors.
  • Changed! COG COG2319 345aa 2e-30 in ref transcript
    • FOG: WD40 repeat [General function prediction only].
  • Changed! cd WD40 227aa 4e-38 in modified transcript
  • Changed! pfam WD40 40aa 8e-05 in modified transcript
  • Changed! COG COG2319 230aa 8e-26 in modified transcript

WT1

  • rs.WT1.F1 rs.WT1.R1 344 395
  • NCBIGene 36.3 7490
  • Single exon skipping, size difference: 51
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_024426

  • Changed! pfam WT1 321aa 1e-143 in ref transcript
    • Wilm's tumour protein.
  • COG COG5048 129aa 2e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! pfam WT1 304aa 1e-132 in modified transcript

XAGE1D

  • rs.XAGE1D.F1 rs.XAGE1D.R1 111 431
  • NCBIGene 36.3 9503
  • Alternative 5-prime, size difference: 320
  • Exclusion of the protein initiation site
  • Reference transcript: NM_133431

  • Changed! pfam GAGE 27aa 0.003 in modified transcript
    • GAGE protein. This family consists of several GAGE and XAGE proteins which are found exclusively in humans. The function of this family is unknown although they have been implicated in human cancers.

XCR1

  • rs.XCR1.F1 rs.XCR1.R1 181 303
  • NCBIGene 36.3 2829
  • Single exon skipping, size difference: 122
  • Exclusion in 5'UTR
  • Reference transcript: NM_005283

  • pfam 7tm_1 234aa 2e-26 in ref transcript
    • 7 transmembrane receptor (rhodopsin family). This family contains, amongst other G-protein-coupled receptors (GCPRs), members of the opsin family, which have been considered to be typical members of the rhodopsin superfamily. They share several motifs, mainly the seven transmembrane helices, GCPRs of the rhodopsin superfamily. All opsins bind a chromophore, such as 11-cis-retinal. The function of most opsins other than the photoisomerases is split into two steps: light absorption and G-protein activation. Photoisomerases, on the other hand, are not coupled to G-proteins - they are thought to generate and supply the chromophore that is used by visual opsins.

XPO7

  • rs.XPO7.F1 rs.XPO7.R1 191 218
  • NCBIGene 36.3 23039
  • Alternative 5-prime, size difference: 27
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001100161

  • pfam IBN_N 67aa 6e-08 in ref transcript
    • Importin-beta N-terminal domain.
  • Changed! COG CRM1 286aa 2e-05 in ref transcript
    • Importin beta-related nuclear transport receptor [Nuclear structure / Intracellular trafficking and secretion].
  • Changed! COG CRM1 277aa 1e-05 in modified transcript

XRCC3

  • rs.XRCC3.F1 rs.XRCC3.R1 103 140
  • NCBIGene 36.3 7517
  • Alternative 3-prime, size difference: 37
  • Inclusion in 5'UTR
  • Reference transcript: NM_005432

  • cd Rad51_DMC1_radA 257aa 2e-64 in ref transcript
    • Rad51_DMC1_radA,B. This group of recombinases includes the eukaryotic proteins RAD51, RAD55/57 and the meiosis-specific protein DMC1, and the archaeal proteins radA and radB. They are closely related to the bacterial RecA group. Rad51 proteins catalyze a similiar recombination reaction as RecA, using ATP-dependent DNA binding activity and a DNA-dependent ATPase. However, this reaction is less efficient and requires accessory proteins such as RAD55/57.
  • TIGR recomb_radA 258aa 2e-26 in ref transcript
    • This family consists exclusively of archaeal RadA protein, a homolog of bacterial RecA (TIGR02012), eukaryotic RAD51 (TIGR02239), and archaeal RadB (TIGR02237). This protein is involved in DNA repair and recombination. The member from Pyrococcus horikoshii contains an intein.
  • PRK radA 275aa 3e-27 in ref transcript
    • DNA repair and recombination protein RadA; Validated.

XRN1

  • rs.XRN1.F1 rs.XRN1.R1 187 226
  • NCBIGene 36.3 54464
  • Single exon skipping, size difference: 39
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_019001

  • pfam XRN_N 228aa 1e-113 in ref transcript
    • XRN 5'-3' exonuclease N-terminus. This family aligns residues towards the N-terminus of several proteins with multiple functions. The members of this family all appear to possess 5'-3' exonuclease activity EC:3.1.11.-. Thus, the aligned region may be necessary for 5'->3' exonuclease function. The family also contains several Xrn1 and Xrn2 proteins. The 5'-3' exoribonucleases Xrn1p and Xrn2p/Rat1p function in the degradation and processing of several classes of RNA in Saccharomyces cerevisiae. Xrn1p is the main enzyme catalysing cytoplasmic mRNA degradation in multiple decay pathways, whereas Xrn2p/Rat1p functions in the processing of rRNAs and small nucleolar RNAs (snoRNAs) in the nucleus.
  • COG XRN1 396aa 1e-119 in ref transcript
    • 5'-3' exonuclease [DNA replication, recombination, and repair / Cell division and chromosome partitioning / Translation].
  • COG XRN1 304aa 6e-66 in ref transcript

YIPF5

  • rs.YIPF5.F1 rs.YIPF5.R1 133 384
  • NCBIGene 36.3 81555
  • Alternative 5-prime, size difference: 251
  • Exclusion in 5'UTR
  • Reference transcript: NM_001024947

  • pfam Yip1 145aa 1e-32 in ref transcript
    • Yip1 domain. The Yip1 integral membrane domain contains four transmembrane alpha helices. The domain is characterised by the motifs DLYGP and GY. The Yip1 protein is a golgi protein involved in vesicular transport that interacts with GTPases.
  • COG YIP1 172aa 2e-36 in ref transcript
    • Rab GTPase interacting factor, Golgi membrane protein [Intracellular trafficking and secretion].

ZBTB40

  • rs.ZBTB40.F1 rs.ZBTB40.R1 110 398
  • NCBIGene 36.3 9923
  • Single exon skipping, size difference: 288
  • Exclusion in 5'UTR
  • Reference transcript: NM_001083621

  • pfam BTB 95aa 5e-17 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.
  • COG SFP1 80aa 0.002 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].
  • COG COG5048 182aa 0.002 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZFAND5

  • rs.ZFAND5.F1 rs.ZFAND5.R1 263 400
  • NCBIGene 36.3 7763
  • Single exon skipping, size difference: 137
  • Exclusion in 5'UTR
  • Reference transcript: NM_001102420

  • pfam zf-AN1 40aa 7e-13 in ref transcript
    • AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis. The following pattern describes the zinc finger. C-X2-C-X(9-12)-C-X(1-2)-C-X4-C-X2-H-X5-H-X-C Where X can be any amino acid, and numbers in brackets indicate the number of residues.
  • smart ZnF_A20 25aa 0.001 in ref transcript
    • A20-like zinc fingers. A20- (an inhibitor of cell death)-like zinc fingers. The zinc finger mediates self-association in A20. These fingers also mediate IL-1-induced NF-kappaB activation.

ZFYVE16

  • rs.ZFYVE16.F1 rs.ZFYVE16.R1 238 366
  • NCBIGene 36.3 9765
  • Single exon skipping, size difference: 128
  • Exclusion in 5'UTR
  • Reference transcript: NM_001105251

  • cd FYVE 54aa 4e-17 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • pfam FYVE 65aa 2e-24 in ref transcript
    • FYVE zinc finger. The FYVE zinc finger is named after four proteins that it has been found in: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn++ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. We have included members which do not conserve these histidine residues but are clearly related.
  • PTZ PTZ00303 93aa 0.003 in ref transcript
    • phosphatidylinositol kinase; Provisional.

ZFYVE27

  • rs.ZFYVE27.F1 rs.ZFYVE27.R1 117 138
  • NCBIGene 36.3 118813
  • Single exon skipping, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001002261

  • cd FYVE 59aa 3e-07 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • smart FYVE 63aa 5e-09 in ref transcript
    • Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PH D finger and the RI NG finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.

ZFYVE27

  • rs.ZFYVE27.F2 rs.ZFYVE27.R2 107 122
  • NCBIGene 36.3 118813
  • Alternative 3-prime, size difference: 15
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001002261

  • cd FYVE 59aa 3e-07 in ref transcript
    • FYVE domain; Zinc-binding domain; targets proteins to membrane lipids via interaction with phosphatidylinositol-3-phosphate, PI3P; present in Fab1, YOTB, Vac1, and EEA1;.
  • smart FYVE 63aa 5e-09 in ref transcript
    • Protein present in Fab1, YOTB, Vac1, and EEA1. The FYVE zinc finger is named after four proteins where it was first found: Fab1, YOTB/ZK632.12, Vac1, and EEA1. The FYVE finger has been shown to bind two Zn2+ ions. The FYVE finger has eight potential zinc coordinating cysteine positions. The FYVE finger is structurally related to the PH D finger and the RI NG finger. Many members of this family also include two histidines in a motif R+HHC+XCG, where + represents a charged residue and X any residue. The FYVE finger functions in the membrane recruitment of cytosolic proteins by binding to phosphatidylinositol 3-phosphate (PI3P), which is prominent on endosomes. The R+HHC+XCG motif is critical for PI3P binding.

ZGPAT

  • rs.ZGPAT.F1 rs.ZGPAT.R1 172 232
  • NCBIGene 36.3 84619
  • Alternative 3-prime, size difference: 60
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_032527

  • cd TUDOR 31aa 0.005 in ref transcript
    • Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
  • pfam G-patch 42aa 2e-08 in ref transcript
    • G-patch domain. This domain is found in a number of RNA binding proteins, and is also found in proteins that contain RNA binding domains. This suggests that this domain may have an RNA binding function. This domain has seven highly conserved glycines.
  • smart ZnF_C3H1 22aa 6e-04 in ref transcript
    • zinc finger.
  • smart TUDOR 36aa 0.006 in ref transcript
    • Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).

ZMYM5

  • rs.ZMYM5.F1 rs.ZMYM5.R1 150 436
  • NCBIGene 36.3 9205
  • Single exon skipping, size difference: 286
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001039650

  • Changed! pfam zf-FCS 36aa 2e-06 in ref transcript
    • MYM-type Zinc finger with FCS sequence motif. MYM-type zinc fingers were identified in MYM family proteins. Human zinc finger protein 261 is involved in a chromosomal translocation and may be responsible for X-linked retardation in XQ13.1. Human zinc finger protein 198 is also involved in disease. In myeloproliferative disorders it is fused to FGF receptor 1; in atypical myeloproliferative disorders it is rearranged. Members of the family generally are involved in development. This Zn-finger domain functions as a transcriptional trans-activator of late vaccinia viral genes, and orthologues are also found in all nucleocytoplasmic large DNA viruses, NCLDV. This domain is also found fused to the C termini of recombinases from certain prokaryotic transposons.

ZNF133

  • rs.ZNF133.F1 rs.ZNF133.R1 138 550
  • NCBIGene 36.3 7692
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 412
  • Inclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR, Exclusion in 5'UTR
  • Reference transcript: NM_003434

  • smart KRAB 60aa 6e-28 in ref transcript
    • krueppel associated box.
  • COG COG5048 410aa 2e-11 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF142

  • rs.ZNF142.F1 rs.ZNF142.R1 284 406
  • NCBIGene 36.3 7701
  • Single exon skipping, size difference: 122
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_001105537

ZNF142

  • rs.ZNF142.F2 rs.ZNF142.R2 252 347
  • NCBIGene 36.3 7701
  • Alternative 3-prime, size difference: 95
  • Inclusion in 5'UTR
  • Reference transcript: NM_001105537

ZNF146

  • rs.ZNF146.F1 rs.ZNF146.R1 111 185
  • NCBIGene 36.3 7705
  • Single exon skipping, size difference: 74
  • Exclusion in 5'UTR
  • Reference transcript: NM_007145

  • COG COG5048 254aa 5e-04 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF155

  • rs.ZNF155.F1 rs.ZNF155.R1 193 270
  • NCBIGene 36.3 7711
  • Alternative 5-prime, size difference: 77
  • Exclusion in 5'UTR
  • Reference transcript: NM_003445

  • pfam KRAB 41aa 4e-19 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • COG COG5048 285aa 5e-06 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF187

  • rs.ZNF187.F1 rs.ZNF187.R1 102 506
  • NCBIGene 36.3 7741
  • Alternative 5-prime, size difference: 404
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_152736

  • Changed! COG COG5048 154aa 9e-04 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! smart SCAN 27aa 1e-07 in modified transcript
    • leucine rich region.

ZNF205

  • rs.ZNF205.F1 rs.ZNF205.R1 181 249
  • NCBIGene 36.3 7755
  • Alternative 3-prime, size difference: 68
  • Inclusion in 5'UTR
  • Reference transcript: NM_001042428

  • pfam KRAB 39aa 4e-13 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • pfam zf-C2H2 23aa 0.004 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
  • pfam zf-C2H2 23aa 0.006 in ref transcript
  • COG COG5048 162aa 6e-08 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 67aa 4e-04 in ref transcript

ZNF226

  • rs.ZNF226.F1 rs.ZNF226.R1 98 267
  • NCBIGene 36.3 7769
  • Alternative 3-prime, size difference: 169
  • Inclusion in 5'UTR
  • Reference transcript: NM_001032373

  • smart KRAB 60aa 7e-19 in ref transcript
    • krueppel associated box.
  • COG COG5048 407aa 4e-13 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF228

  • rs.ZNF228.F1 rs.ZNF228.R1 170 219
  • NCBIGene 36.3 7771
  • Single exon skipping, size difference: 49
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001083335

  • smart KRAB 61aa 3e-20 in ref transcript
    • krueppel associated box.
  • COG COG5048 393aa 3e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 388aa 3e-04 in ref transcript

ZNF239

  • rs.ZNF239.F1 rs.ZNF239.R1 199 240
  • NCBIGene 36.3 8187
  • Single exon skipping, size difference: 41
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099282

  • COG COG5048 201aa 0.003 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF273

  • rs.ZNF273.F1 rs.ZNF273.R1 185 274
  • NCBIGene 36.3 10793
  • Single exon skipping, size difference: 89
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_021148

  • Changed! smart KRAB 61aa 1e-26 in ref transcript
    • krueppel associated box.
  • Changed! COG COG5048 369aa 9e-06 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF280D

  • rs.ZNF280D.F1 rs.ZNF280D.R1 406 475
  • NCBIGene 36.3 54816
  • Single exon skipping, size difference: 69
  • Exclusion of the protein initiation site
  • Reference transcript: NM_017661

ZNF295

  • rs.ZNF295.F1 rs.ZNF295.R1 108 173
  • NCBIGene 36.3 49854
  • Single exon skipping, size difference: 65
  • Exclusion in 5'UTR
  • Reference transcript: NM_001098402

  • pfam BTB 101aa 1e-21 in ref transcript
    • BTB/POZ domain. The BTB (for BR-C, ttk and bab) or POZ (for Pox virus and Zinc finger) domain is present near the N-terminus of a fraction of zinc finger (pfam00096) proteins and in proteins that contain the pfam01344 motif such as Kelch and a family of pox virus proteins. The BTB/POZ domain mediates homomeric dimerisation and in some instances heteromeric dimerisation. The structure of the dimerised PLZF BTB/POZ domain has been solved and consists of a tightly intertwined homodimer. The central scaffolding of the protein is made up of a cluster of alpha-helices flanked by short beta-sheets at both the top and bottom of the molecule. POZ domains from several zinc finger proteins have been shown to mediate transcriptional repression and to interact with components of histone deacetylase co-repressor complexes including N-CoR and SMRT. The POZ or BTB domain is also known as BR-C/Ttk or ZiN.

ZNF30

  • rs.ZNF30.F1 rs.ZNF30.R1 238 304
  • NCBIGene 36.3 90075
  • Alternative 5-prime, size difference: 66
  • Exclusion in 5'UTR
  • Reference transcript: NM_001099438

  • smart KRAB 62aa 3e-23 in ref transcript
    • krueppel associated box.
  • COG COG5048 366aa 2e-10 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 217aa 5e-05 in ref transcript

ZNF331

  • rs.ZNF331.F1 rs.ZNF331.R1 113 143
  • NCBIGene 36.3 55422
  • Single exon skipping, size difference: 30
  • Exclusion in 5'UTR
  • Reference transcript: NM_001079907

  • pfam KRAB 41aa 7e-21 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • COG COG5048 315aa 1e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF334

  • rs.ZNF334.F1 rs.ZNF334.R1 163 315
  • NCBIGene 36.3 55713
  • Single exon skipping, size difference: 152
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_018102

  • Changed! smart KRAB 61aa 2e-29 in ref transcript
    • krueppel associated box.
  • Changed! COG COG5048 441aa 2e-04 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF419

  • rs.ZNF419.F2 rs.ZNF419.R2 256 355
  • NCBIGene 36.3 79744
  • Exon skipping and alternative 3-prime or 5-prime, size difference: 99
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_001098491

  • Changed! smart KRAB 61aa 8e-22 in ref transcript
    • krueppel associated box.
  • COG COG5048 156aa 1e-06 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 331aa 2e-04 in ref transcript
  • Changed! pfam KRAB 41aa 1e-18 in modified transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.

ZNF468

  • rs.ZNF468.F1 rs.ZNF468.R1 126 244
  • NCBIGene 36.3 90333
  • Single exon skipping, size difference: 118
  • Inclusion in the protein causing a frameshift
  • Reference transcript: NM_001008801

  • Changed! pfam KRAB 41aa 2e-21 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • Changed! COG COG5048 196aa 4e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! smart KRAB 42aa 9e-20 in modified transcript
    • krueppel associated box.

ZNF493

  • rs.ZNF493.F1 rs.ZNF493.R1 117 340
  • NCBIGene 36.3 284443
  • Multiple exon skipping, size difference: 223
  • Exclusion in the protein causing a frameshift, Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001076678

  • Changed! smart KRAB 61aa 2e-26 in ref transcript
    • krueppel associated box.
  • Changed! COG COG5048 415aa 8e-12 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! COG SFP1 80aa 0.004 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].

ZNF557

  • rs.ZNF557.F1 rs.ZNF557.R1 107 119
  • NCBIGene 36.3 79230
  • Alternative 5-prime, size difference: 12
  • Exclusion in 5'UTR
  • Reference transcript: NM_024341

  • smart KRAB 58aa 4e-28 in ref transcript
    • krueppel associated box.
  • pfam zf-C2H2 23aa 0.006 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
  • COG COG5048 286aa 3e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF557

  • rs.ZNF557.F2 rs.ZNF557.R2 119 140
  • NCBIGene 36.3 79230
  • Alternative 3-prime, size difference: 21
  • Exclusion in the protein (no frameshift)
  • Reference transcript: NM_001044387

  • smart KRAB 58aa 4e-28 in ref transcript
    • krueppel associated box.
  • pfam zf-C2H2 23aa 0.006 in ref transcript
    • Zinc finger, C2H2 type. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger. #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter.
  • COG COG5048 286aa 3e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF586

  • rs.ZNF586.F1 rs.ZNF586.R1 181 308
  • NCBIGene 36.3 54807
  • Single exon skipping, size difference: 127
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_017652

  • Changed! pfam KRAB 41aa 3e-16 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.
  • Changed! COG COG5048 154aa 2e-04 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF596

  • rs.ZNF596.F1 rs.ZNF596.R1 217 287
  • NCBIGene 36.3 169270
  • Alternative 5-prime, size difference: 70
  • Exclusion in 5'UTR
  • Reference transcript: NM_001042416

  • smart KRAB 59aa 2e-20 in ref transcript
    • krueppel associated box.
  • COG COG5048 142aa 2e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • COG COG5048 398aa 8e-05 in ref transcript

ZNF613

  • rs.ZNF613.F1 rs.ZNF613.R1 330 438
  • NCBIGene 36.3 79898
  • Single exon skipping, size difference: 208
  • Exclusion of the protein initiation site
  • Reference transcript: NM_001031721

  • Changed! smart KRAB 60aa 3e-27 in ref transcript
    • krueppel associated box.
  • COG COG5048 339aa 6e-04 in ref transcript
    • FOG: Zn-finger [General function prediction only].
  • Changed! smart KRAB 31aa 5e-09 in modified transcript

ZNF621

  • rs.ZNF621.F1 rs.ZNF621.R1 108 344
  • NCBIGene 36.3 285268
  • Alternative 5-prime, size difference: 236
  • Exclusion in 5'UTR
  • Reference transcript: NM_198484

  • smart KRAB 61aa 2e-22 in ref transcript
    • krueppel associated box.
  • COG SFP1 84aa 0.004 in ref transcript
    • Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning].
  • COG COG4049 54aa 0.005 in ref transcript
    • Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only].

ZNF638

  • rs.ZNF638.F1 rs.ZNF638.R1 90 100
  • NCBIGene 36.3 27332
  • Alternative 5-prime, size difference: 10
  • Exclusion in 5'UTR
  • Reference transcript: NM_014497

  • cd RRM 70aa 3e-04 in ref transcript
    • RRM (RNA recognition motif), also known as RBD (RNA binding domain) or RNP (ribonucleoprotein domain), is a highly abundant domain in eukaryotes found in proteins involved in post-transcriptional gene expression processes including mRNA and rRNA processing, RNA export, and RNA stability. This domain is 90 amino acids in length and consists of a four-stranded beta-sheet packed against two alpha-helices. RRM usually interacts with ssRNA, but is also known to interact with ssDNA as well as proteins. RRM binds a variable number of nucleotides, ranging from two to eight. The active site includes three aromatic side-chains located within the conserved RNP1 and RNP2 motifs of the domain. The RRM domain is found in a variety heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs).
  • smart ZnF_U1 33aa 5e-05 in ref transcript
    • U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
  • TIGR hnRNP-L_PTB 68aa 7e-04 in ref transcript
    • Included in this family of heterogeneous ribonucleoproteins are PTB (polypyrimidine tract binding protein ) and hnRNP-L. These proteins contain four RNA recognition motifs (rrm: pfam00067).
  • TIGR hnRNP-L_PTB 67aa 0.003 in ref transcript

ZNF655

  • rs.ZNF655.F1 rs.ZNF655.R1 182 395
  • NCBIGene 36.3 79027
  • Multiple exon skipping, size difference: 213
  • Exclusion in the protein causing a frameshift, Exclusion in the protein causing a frameshift
  • Reference transcript: NM_024061

  • Changed! pfam KRAB 41aa 4e-19 in ref transcript
    • KRAB box. The KRAB domain (or Kruppel-associated box) is present in about a third of zinc finger proteins containing C2H2 fingers. The KRAB domain is found to be involved in protein-protein interactions. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B. The A box plays an important role in repression by binding to corepressors, while the B box is thought to enhance this repression brought about by the A box. KRAB-containing proteins are thought to have critical functions in cell proliferation and differentiation, apoptosis and neoplastic transformation.

ZNF706

  • rs.ZNF706.F1 rs.ZNF706.R1 246 361
  • NCBIGene 36.3 51123
  • Single exon skipping, size difference: 115
  • Exclusion in 5'UTR
  • Reference transcript: NM_001042510

  • pfam DUF1909 33aa 9e-06 in ref transcript
    • Domain of unknown function (DUF1909). This domain is found in a set of hypothetical eukaryotic proteins.

ZNF707

  • rs.ZNF707.F1 rs.ZNF707.R1 169 243
  • NCBIGene 36.3 286075
  • Single exon skipping, size difference: 74
  • Exclusion in 5'UTR
  • Reference transcript: NM_173831

  • smart KRAB 61aa 6e-26 in ref transcript
    • krueppel associated box.
  • COG COG5048 162aa 1e-07 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF74

  • rs.ZNF74.F1 rs.ZNF74.R1 263 349
  • NCBIGene 36.3 7625
  • Single exon skipping, size difference: 86
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_003426

  • Changed! smart KRAB 61aa 1e-27 in ref transcript
    • krueppel associated box.
  • Changed! COG COG5048 157aa 2e-05 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF807

  • rs.ZNF807.F1 rs.ZNF807.R1 235 289
  • NCBIGene 36.3 643719
  • Alternative 3-prime, size difference: 54
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: XM_001718828

ZNF83

  • rs.ZNF83.F1 rs.ZNF83.R1 316 406
  • NCBIGene 36.3 55769
  • Single exon skipping, size difference: 90
  • Exclusion in 5'UTR
  • Reference transcript: NM_001105549

  • COG COG5048 391aa 8e-07 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZNF92

  • rs.ZNF92.F1 rs.ZNF92.R1 183 310
  • NCBIGene 36.3 168374
  • Single exon skipping, size difference: 127
  • Exclusion in the protein causing a frameshift
  • Reference transcript: NM_152626

  • Changed! smart KRAB 61aa 2e-27 in ref transcript
    • krueppel associated box.
  • Changed! COG COG5048 381aa 3e-08 in ref transcript
    • FOG: Zn-finger [General function prediction only].

ZWILCH

  • rs.ZWILCH.F1 rs.ZWILCH.R1 306 401
  • NCBIGene 36.3 55055
  • Alternative 3-prime, size difference: 95
  • Inclusion in the protein causing a new stop codon
  • Reference transcript: NM_017975

  • Changed! pfam DUF2352 556aa 0.0 in ref transcript
    • Uncharacterized conserved protein (DUF2352). Members of this family of uncharacterised proteins have no known function.

Copyright ©2008,2017, all rights reserved.