LLPS-Hos-0719
CELF1

▼ OVERVIEW


Status: Reviewed
Protein Name: CUGBP Elav-like family member 1
Gene Name: CELF1, BRUNOL2, CUGBP, CUGBP1, NAB50
Ensembl Gene: ENSG00000149187.18
Ensembl Protein: ENSP00000433986.1
Organism: Homo sapiens
Taxa ID: 9606
LLPS Type: Client
PDB: 3NNC (A, B) More


▼ PROPERTY



——— Disorder propensity (calculated by IUPred2A)

▼ Classification


Condensates:
CondensateDescriptionTissue/CellPMIDs
P-body
"...Analysis of correlated patterns between endogenous preys uncovers the spatial organization of RNA regulatory structures and enables the definition of 144 core components of SGs and PBs."
N/A29395067
Stress granule
"...Together, our APEX and IF screening approaches identified 260 SG proteins, including ~150 candidates that had not previously been associated with SGs."
Neural progenitor cells29373831
Stress granule
"...Analysis of correlated patterns between endogenous preys uncovers the spatial organization of RNA regulatory structures and enables the definition of 144 core components of SGs and PBs."
N/A29395067
Stress granule
"...A list of candidate SG proteins identified by mass spectrometry in NPCs, including statistical analysis."
Neural progenitor cells29373831
Stress granule
"...By adding known stress granule components not identified by mass spectrometry to this list, we created the first stress granule proteome for a mammalian cell."
U2OS cells26777405
Stress granule
"...283 proteins identified as APEX hits in HEK293T cells or previously shown to associate with SGs."
N/A29373831
Nuclear speckle, Perinucleolar compartmentPredicted from orthologsN/A(View)

▼ FUNCTION


RNA-binding protein implicated in the regulation of several post-transcriptional events. Involved in pre-mRNA alternative splicing, mRNA translation and stability. Mediates exon inclusion and/or exclusion in pre-mRNA that are subject to tissue-specific and developmentally regulated alternative splicing. Specifically activates exon 5 inclusion of cardiac isoforms of TNNT2 during heart remodeling at the juvenile to adult transition. Acts as both an activator and repressor of a pair of coregulated exons: promotes inclusion of the smooth muscle (SM) exon but exclusion of the non-muscle (NM) exon in actinin pre-mRNAs. Activates SM exon 5 inclusion by antagonizing the repressive effect of PTB. Promotes exclusion of exon 11 of the INSR pre-mRNA. Inhibits, together with HNRNPH1, insulin receptor (IR) pre-mRNA exon 11 inclusion in myoblast. Increases translation and controls the choice of translation initiation codon of CEBPB mRNA. Increases mRNA translation of CEBPB in aging liver (By similarity). Increases translation of CDKN1A mRNA by antagonizing the repressive effect of CALR3. Mediates rapid cytoplasmic mRNA deadenylation. Recruits the deadenylase PARN to the poly(A) tail of EDEN-containing mRNAs to promote their deadenylation. Required for completion of spermatogenesis (By similarity). Binds to (CUG)n triplet repeats in the 3'-UTR of transcripts such as DMPK and to Bruno response elements (BREs). Binds to muscle-specific splicing enhancer (MSE) intronic sites flanking the alternative exon 5 of TNNT2 pre-mRNA. Binds to AU-rich sequences (AREs or EDEN-like) localized in the 3'-UTR of JUN and FOS mRNAs. Binds to the IR RNA. Binds to the 5'-region of CDKN1A and CEBPB mRNAs. Binds with the 5'-region of CEBPB mRNA in aging liver. May be a specific regulator of miRNA biogenesis. Binds to primary microRNA pri-MIR140 and, with CELF2, negatively regulates the processing to mature miRNA (PubMed:28431233).

▼ CROSS REFERENCE


DatabaseNucleotide IDProtein ID
EnsemblENST00000530151.5ENSP00000433986.1
EnsemblENST00000525841.5ENSP00000436191.1
EnsemblENST00000358597.7ENSP00000351409.3
EnsemblENST00000532048.5ENSP00000435926.1
EnsemblENST00000543178.5ENSP00000444825.1
EnsemblENST00000531165.5ENSP00000436864.1
EnsemblENST00000526419.1ENSP00000435423.1
EnsemblENST00000395290.6ENSP00000378705.2
EnsemblENST00000528538.5ENSP00000442415.1
EnsemblENST00000361904.7ENSP00000354639.3
EnsemblENST00000310513.10ENSP00000308386.5
EnsemblENST00000535982.1ENSP00000438044.1
EnsemblENST00000395292.6ENSP00000378706.2
EnsemblENST00000528434.5ENSP00000435320.1
EnsemblENST00000526277.1ENSP00000438862.1
UniProtQ92879, CELF1_HUMAN, B4E2U5, D3DQS0, F8W940, Q4LE52, Q9NP83, Q9NR06
GeneBankU63289, BC031079, AJ007988, AC090559, AK304430, AF267533, AF248648, CH471064, AF267534, AB210019EAW67909.1, BAG65257.1, AAF78956.1, AAF86230.1, EAW67912.1, AAC50895.1, AAH31079.1, CAC20566.1, BAE06101.1, AAF78955.1
RefSeqNM_001025596.2, NM_001172639.1, NM_001172640.1, NM_006560.3, NM_198700.2, XM_011519859.1, XM_017017124.1, XM_017017125.1, XM_017017126.1, XM_017017127.1, XM_017017128.1, XM_017017129.1, XM_017017130.1, XM_017017131.1, XM_017017132.1, XM_017017133.1, XM_017017134.1, XM_017017135.1NP_001020767.1, NP_001166110.1, NP_001166111.1, NP_006551.1, NP_941989.1, XP_011518161.1, XP_016872613.1, XP_016872614.1, XP_016872615.1, XP_016872616.1, XP_016872617.1, XP_016872618.1, XP_016872619.1, XP_016872620.1, XP_016872621.1, XP_016872622.1, XP_016872623.1, XP_016872624.1
Entrez10658

▼ SEQUENCE


Protein Sequence (FASTA)
1     MAAFKLDFLP  EMMVDHCSLN  SSPVSKKMNG  TLDHPDQPDL  DAIKMFVGQV  PRTWSEKDLR  60
61    ELFEQYGAVY  EINVLRDRSQ  NPPQSKGCCF  VTFYTRKAAL  EAQNALHNMK  VLPGMHHPIQ  120
121   MKPADSEKNN  AVEDRKLFIG  MISKKCTEND  IRVMFSSFGQ  IEECRILRGP  DGLSRGCAFV  180
181   TFTTRAMAQT  AIKAMHQAQT  MEGCSSPMVV  KFADTQKDKE  QKRMAQQLQQ  QMQQISAASV  240
241   WGNLAGLNTL  GPQYLALYLQ  LLQQTASSGN  LNTLSSLHPM  GGLNAMQLQN  LAALAAAASA  300
301   AQNTPSGTNA  LTTSSSPLSV  LTSSAGSSPS  SSSSNSVNPI  ASLGALQTLA  GATAGLNVGS  360
361   LAGMAALNGG  LGSSGLSNGT  GSTMEALTQA  YSGIQQYAAA  ALPTLYNQNL  LTQQSIGAAG  420
421   SQKEGPEGAN  LFIYHLPQEF  GDQDLLQMFM  PFGNVVSAKV  FIDKQTNLSK  CFGFVSYDNP  480
481   VSAQAAIQSM  NGFQIGMKRL  KVQLKRSKND  SKPY  514
Nucleotide CDS Sequence (FASTA)
1     ATGAACGGCA  CCCTGGACCA  CCCAGACCAA  CCAGATCTTG  ATGCTATCAA  GATGTTTGTG  60
61    GGCCAGGTTC  CAAGGACCTG  GTCTGAAAAG  GACTTGCGGG  AACTCTTCGA  ACAGTATGGT  120
121   GCTGTGTATG  AAATCAACGT  CCTAAGGGAT  AGGAGCCAAA  ACCCGCCTCA  GAGCAAAGGG  180
181   TGCTGTTTTG  TTACATTTTA  CACCCGTAAA  GCTGCATTAG  AAGCTCAGAA  TGCTCTTCAC  240
241   AACATGAAAG  TCCTCCCAGG  GATGCATCAC  CCTATACAGA  TGAAACCTGC  TGACAGTGAG  300
301   AAGAACAATG  CAGTGGAAGA  CAGGAAGCTG  TTTATTGGTA  TGATTTCCAA  GAAGTGCACT  360
361   GAAAATGACA  TCCGAGTCAT  GTTCTCTTCG  TTTGGACAGA  TTGAAGAATG  CCGGATATTG  420
421   CGGGGACCTG  ATGGCCTGAG  CCGAGGTTGT  GCATTTGTGA  CTTTTACAAC  AAGAGCCATG  480
481   GCACAGACGG  CTATCAAGGC  AATGCACCAA  GCACAGACCA  TGGAGGGTTG  CTCATCACCC  540
541   ATGGTGGTAA  AATTTGCTGA  TACACAGAAG  GACAAAGAAC  AGAAGAGAAT  GGCCCAGCAG  600
601   CTCCAGCAGC  AGATGCAGCA  AATCAGCGCA  GCATCTGTGT  GGGGAAACCT  TGCTGGTCTA  660
661   AATACTCTTG  GACCCCAGTA  TTTAGCACTC  CTTCAGCAGA  CTGCCTCCTC  TGGGAACCTC  720
721   AACACCCTGA  GCAGCCTCCA  CCCAATGGGA  GGGTTGAATG  CAATGCAGTT  ACAGAATTTG  780
781   GCTGCACTAG  CTGCTGCAGC  TAGTGCAGCT  CAGAACACAC  CAAGTGGTAC  CAATGCTCTC  840
841   ACTACATCCA  GCAGTCCCCT  CAGCGTGCTC  ACTAGTTCAG  GGTCCTCACC  TAGCTCTAGC  900
901   AGCAGTAATT  CTGTCAACCC  CATAGCCTCA  CTTGGAGCCC  TGCAGACATT  AGCTGGAGCA  960
961   ACGGCTGGCC  TCAATGTTGG  CTCTTTGGCA  GGAATGGCTG  CTTTAAATGG  TGGCCTGGGC  1020
1021  AGCAGTGGCC  TTTCCAATGG  CACCGGGAGC  ACCATGGAGG  CCCTCACTCA  GGCCTACTCG  1080
1081  GGTATCCAGC  AATATGCTGC  TGCTGCGCTC  CCCACTCTGT  ACAACCAGAA  TCTTCTGACA  1140
1141  CAGCAGAGTA  TTGGTGCTGC  TGGAAGCCAG  AAGGAAGGTC  CAGAGGGAGC  CAACCTGTTC  1200
1201  ATCTACCACC  TGCCCCAGGA  GTTTGGTGAT  CAGGACCTGC  TGCAGATGTT  TATGCCCTTT  1260
1261  GGGAATGTCG  TGTCTGCCAA  GGTTTTCATA  GACAAGCAGA  CAAACCTGAG  CAAGTGTTTT  1320
1321  GGTTTTGTAA  GTTACGACAA  TCCTGTTTCG  GCCCAAGCTG  CCATCCAGTC  CATGAACGGC  1380
1381  TTTCAGATTG  GCATGAAGCG  GCTTAAAGTG  CAGCTCAAAC  GTTCGAAGAA  TGACAGCAAG  1440
1441  CCCTACTGA  1449

▼ KEYWORD


ID
Family
3D-structure
Acetylation
Activator
Alternative splicing
Complete proteome
Cytoplasm
Isopeptide bond
mRNA processing
mRNA splicing
Nucleus
Phosphoprotein
Reference proteome
Repeat
RNA-binding
Ubl conjugation

▼ GENE ONTOLOGY


ID
Classification
Description
Cellular Component
Cytoplasm
Cellular Component
Membrane
Cellular Component
Nucleoplasm
Cellular Component
Nucleus
Cellular Component
Ribonucleoprotein complex
Molecular Function
BRE binding
Molecular Function
MRNA binding
Molecular Function
Pre-mRNA binding
Molecular Function
RNA binding
Molecular Function
Translation repressor activity, mRNA regulatory element binding
Biological Process
Embryo development ending in birth or egg hatching
Biological Process
Germ cell development
Biological Process
MRNA processing
Biological Process
MRNA splice site selection
Biological Process
Regulation of alternative mRNA splicing, via spliceosome
Biological Process
Regulation of RNA splicing
Biological Process
RNA interference

▼ KEGG



▼ ORTHOLOGY


DrLLPS IDOrganismIdentityE-valueScore
LLPS-Ict-3774Ictidomys tridecemlineatus100.06e-82 270
LLPS-Nol-1370Nomascus leucogenys100.05e-82 271
LLPS-Cas-3628Carlito syrichta100.05e-82 271
LLPS-Mam-3537Macaca mulatta100.05e-82 271
LLPS-Caf-2086Canis familiaris100.00.0 555
LLPS-Eqc-0269Equus caballus100.01e-81 271
LLPS-Orc-0215Oryctolagus cuniculus100.04e-82 271
LLPS-Rhb-1232Rhinopithecus bieti100.00.0 799
LLPS-Man-1644Macaca nemestrina100.00.0 799
LLPS-Loa-3546Loxodonta africana100.05e-82 271
LLPS-Urm-1754Ursus maritimus100.00.0 555
LLPS-Pap-2191Pan paniscus100.00.0 743
LLPS-Pat-3306Pan troglodytes100.00.0 799
LLPS-Mod-3398Monodelphis domestica100.03e-82 270
LLPS-Fec-3444Felis catus100.01e-81 271
LLPS-Paa-2588Papio anubis100.00.0 799
LLPS-Mal-1954Mandrillus leucophaeus100.00.0 555
LLPS-Mup-1927Mustela putorius furo100.06e-82 271
LLPS-Aim-2132Ailuropoda melanoleuca100.04e-82 271
LLPS-Dio-3878Dipodomys ordii100.01e-82 270
LLPS-Tut-0980Tursiops truncatus100.04e-171 499
LLPS-Caj-1129Callithrix jacchus100.00.0 799
LLPS-Gog-4163Gorilla gorilla100.00.0 799
LLPS-Cea-3334Cercocebus atys100.00.0 743
LLPS-Aon-2102Aotus nancymaae100.01e-172 505
LLPS-Sus-1659Sus scrofa100.00.0 555
LLPS-Mea-3825Mesocricetus auratus100.05e-82 271
LLPS-Mum-2611Mus musculus100.05e-82 271
LLPS-Maf-1746Macaca fascicularis100.00.0 743
LLPS-Chs-3401Chlorocebus sabaeus100.05e-82 271
LLPS-Cap-2621Cavia porcellus99.790.0 744
LLPS-Poa-0536Pongo abelii99.680.0 555
LLPS-Ran-3018Rattus norvegicus99.380.0 740
LLPS-Anc-1809Anolis carolinensis99.233e-81 268
LLPS-Tag-0026Taeniopygia guttata99.233e-81 268
LLPS-Meg-0349Meleagris gallopavo99.233e-81 268
LLPS-Bot-3365Bos taurus99.232e-81 269
LLPS-Fud-2632Fukomys damarensis99.232e-81 269
LLPS-Otg-2348Otolemur garnettii99.030.0 791
LLPS-Myl-1913Myotis lucifugus98.840.0 793
LLPS-Sah-1598Sarcophilus harrisii98.560.0 736
LLPS-Ova-2784Ovis aries98.450.0 791
LLPS-Ora-1276Ornithorhynchus anatinus97.860.0 783
LLPS-Anp-2032Anas platyrhynchos96.710.0 778
LLPS-Leo-2395Lepisosteus oculatus96.153e-79 264
LLPS-Xet-0939Xenopus tropicalis95.385e-79 262
LLPS-Gaga-1736Gallus gallus94.940.0 777
LLPS-Asm-2201Astyanax mexicanus94.661e-75 256
LLPS-Xim-1971Xiphophorus maculatus94.621e-75 254
LLPS-Fia-2220Ficedula albicollis94.550.0 775
LLPS-Pes-2241Pelodiscus sinensis94.380.0 754
LLPS-Icp-0421Ictalurus punctatus93.893e-76 255
LLPS-Ten-3059Tetraodon nigroviridis93.852e-75 253
LLPS-Orn-2099Oreochromis niloticus93.854e-75 253
LLPS-Dar-0059Danio rerio93.854e-75 252
LLPS-Lac-2156Latimeria chalumnae93.857e-75 252
LLPS-Tar-1738Takifugu rubripes93.854e-75 253
LLPS-Orl-0786Oryzias latipes93.853e-75 253
LLPS-Scf-3739Scleropages formosus93.089e-75 251
LLPS-Scm-1701Scophthalmus maximus90.082e-69 238
LLPS-Gaa-3616Gasterosteus aculeatus89.471e-69 238
LLPS-Pof-2122Poecilia formosa88.641e-68 236
LLPS-Drm-0131Drosophila melanogaster77.891e-40 160
LLPS-Cii-1650Ciona intestinalis70.345e-46 173
LLPS-Cae-1757Caenorhabditis elegans70.331e-33 137
LLPS-Cis-1057Ciona savignyi64.896e-44 167
LLPS-Sem-1280Selaginella moellendorffii55.472e-36 144