LLPS-Bot-0448
DGCR8
Integrated Annotations
▼ OVERVIEW
Status: | Unreviewed |
Protein Name: | Microprocessor complex subunit DGCR8; DiGeorge syndrome critical region 8 homolog |
Gene Name: | DGCR8 |
Ensembl Gene: | ENSBTAG00000019869.5 |
Ensembl Protein: | ENSBTAP00000026474.4 |
Organism: | Bos taurus |
Taxa ID: | 9913 |
LLPS Type: | Others |
▼ Classification
Condensates:
Condensate | Evidence | Orthologs |
---|---|---|
Nuclear speckle, Nucleolus, P-body | Predicted from orthologs | (View) |
▼ FUNCTION
Component of the microprocessor complex that acts as an RNA- and heme-binding protein that is involved in the initial step of microRNA (miRNA) biogenesis. Component of the microprocessor complex that is required to process primary miRNA transcripts (pri-miRNAs) to release precursor miRNA (pre-miRNA) in the nucleus. Within the microprocessor complex, DGCR8 functions as a molecular anchor necessary for the recognition of pri-miRNA at the dsRNA-ssRNA junction and directs DROSHA to cleave 11 bp away from the junction to release hairpin-shaped pre-miRNAs that are subsequently cut by the cytoplasmic DICER to generate mature miRNAs. The heme-bound DGCR8 dimer binds pri-miRNAs as a cooperative trimer (of dimers) and is active in triggering pri-miRNA cleavage, whereas the heme-free DGCR8 monomer binds pri-miRNAs as a dimer and is much less active. Both double-stranded and single-stranded regions of a pri-miRNA are required for its binding. Specifically recognizes and binds N6-methyladenosine (m6A)-containing pri-miRNAs, a modification required for pri-miRNA processing (By similarity). Involved in the silencing of embryonic stem cell self-renewal (By similarity). |
▼ CROSS REFERENCE
Database | Nucleotide ID | Protein ID |
---|---|---|
Ensembl | ENSBTAT00000026474.5 | ENSBTAP00000026474.4 |
Ensembl | ENSBTAT00000085166.1 | ENSBTAP00000057457.1 |
UniProt | A6QR44, DGCR8_BOVIN | |
GenBank | BC150109 | AAI50110.1 |
RefSeq | NM_001101204.1 | NP_001094674.1 |
▼ SEQUENCE
Protein Sequence (FASTA) |
---|
1 METCGSPSPL PREPAGGVAM EDRARPLRAL PRGQSPPPPL QTSSDAEVMD VGSGGDGQAE 60 61 PPAEDPLNFY GASLLSKGSS SKARLLVDPN CSGHSPRTAR HAPAVRKFSP DLKLLKDVKI 120 121 SVSFTESCRS EDRKVLYTGA ERDVRAECGL ALSPVIGDVH AGPFGGSVGN GVGAGGESAG 180 181 KRDEEHELDQ EKRVEYAVLD ELEDFTDNLE LDEEGAGGFT AKAIVQRDRV DEEALNFSYE 240 241 DDFDNDVDAL LEEGLCAPKK RRMEEKYGGD SDHPSDGETS VQPMMTKIKT VLKSRGRPPT 300 301 EPLPDGWIMT FHNSGVPVYL HRESRVVTWS RPYFLGTGSI RKHGPPLTSI PCLHYRKMKD 360 361 SEERERAAGI APPEPELPPD EPDPLGTDAG PPDEKDPLGA EAAPGALGQV KAKVEVCKDE 420 421 SVDLEEFRNY LEKRFDFEQV TVKKFRTWAE RRQFNREMKR KQAESERPIL PANQKLITLS 480 481 VQDAPTKKEF VINPNGKSEV CILHEYMQRV LKVRPVYSFF ECENPSEPFG ASVTIDGVTY 540 541 GSGTASSKKL AKNKAARATL EILIPDFVKQ TSEEKPRDSE ELEYFNHISI EDSRVYELTS 600 601 KAGLLSPYQI LHECLKRNHG MGDTSIKFEV VPGKNQKSEY VMACGKHTVR GWCKNKRVGK 660 661 QLASQKILQL LHPHVKNWGS LLRMYGRESS KMVKQETSDK SVIELQQFAR KNKPNLHILS 720 721 KLQEEMRRLA EERVCVGSEP LGAATLTLLL GRASRLTPRK HQCVVLCAGL APSPGPHCAW 780 781 AVVPAAPPDL SVLPGGDAQE AQDVHRGFRT ARRRAPVHRG RVRAAGAGLP AHWGPPHTPC 840 841 HPGLWPPPPP HARRQATQLP WRPPTVRAAV SRPPPTCRTG RHGQLGDRCE ALVVWTDPEG 900 901 FHEFYM 906 |
Nucleotide CDS Sequence (FASTA) |
1 ATGGAGACAT GTGGGAGCCC CTCTCCTCTC CCGCGCGAGC CCGCAGGAGG AGTGGCGATG 60 61 GAGGACCGAG CTCGCCCCCT CCGTGCGCTG CCCCGTGGAC AGTCTCCACC ACCTCCCCTG 120 121 CAAACGTCCA GTGATGCAGA GGTAATGGAC GTTGGCTCTG GTGGTGATGG ACAGGCCGAA 180 181 CCCCCTGCCG AGGACCCGCT CAACTTCTAC GGAGCTTCTC TTCTCTCCAA AGGATCCTCC 240 241 TCTAAGGCCC GCCTCCTCGT AGACCCGAAC TGTAGTGGCC ACAGCCCGCG CACAGCGCGG 300 301 CATGCACCTG CGGTCCGGAA GTTCTCCCCT GACCTTAAGT TGCTTAAGGA TGTAAAGATT 360 361 AGCGTGAGCT TTACGGAGAG CTGCAGGAGT GAGGACAGGA AGGTGCTGTA CACGGGAGCG 420 421 GAGCGCGACG TGCGGGCAGA GTGTGGCCTG GCCCTCAGCC CTGTCATTGG GGACGTGCAT 480 481 GCTGGTCCCT TTGGCGGGAG CGTGGGGAAC GGGGTAGGCG CAGGGGGTGA GAGTGCGGGT 540 541 AAGAGGGATG AGGAACATGA GCTGGATCAG GAAAAGAGAG TGGAGTATGC AGTGCTCGAT 600 601 GAGTTAGAAG ATTTTACTGA CAATTTGGAG CTAGATGAAG AAGGCGCAGG CGGGTTCACG 660 661 GCTAAAGCGA TCGTGCAGAG AGACAGAGTG GACGAAGAGG CCTTGAATTT CTCCTACGAG 720 721 GATGATTTTG ACAACGATGT TGATGCCCTT CTGGAAGAGG GCCTCTGTGC TCCCAAAAAG 780 781 AGGCGAATGG AGGAGAAATA CGGAGGAGAC AGCGACCACC CGTCGGATGG GGAGACAAGC 840 841 GTGCAGCCAA TGATGACCAA GATTAAAACC GTTCTCAAAA GCCGTGGCCG CCCGCCGACG 900 901 GAGCCGCTGC CCGATGGCTG GATCATGACG TTCCACAATT CCGGAGTGCC CGTGTACCTG 960 961 CACCGGGAGT CGCGGGTGGT CACCTGGTCC AGGCCCTACT TCCTGGGCAC GGGGAGCATC 1020 1021 CGGAAACACG GCCCTCCCCT GACCAGCATC CCCTGCCTGC ACTACCGGAA GATGAAGGAC 1080 1081 AGTGAGGAGC GGGAGCGGGC CGCGGGGATA GCCCCCCCCG AGCCGGAGCT GCCCCCGGAC 1140 1141 GAGCCCGACC CGCTGGGCAC CGACGCGGGG CCCCCGGACG AGAAGGACCC GCTGGGGGCT 1200 1201 GAGGCGGCAC CCGGGGCCCT GGGGCAGGTG AAGGCCAAGG TGGAGGTGTG CAAGGACGAG 1260 1261 TCGGTCGACC TTGAGGAGTT CCGGAATTAC CTGGAGAAGC GCTTTGACTT TGAGCAAGTG 1320 1321 ACCGTGAAGA AGTTCAGGAC GTGGGCTGAG CGTCGGCAGT TCAACCGAGA AATGAAGCGA 1380 1381 AAACAGGCCG AGTCCGAGAG GCCCATCCTG CCCGCCAACC AGAAGCTCAT CACGCTGTCT 1440 1441 GTGCAGGACG CGCCCACGAA GAAAGAGTTT GTCATCAACC CCAACGGGAA GTCTGAGGTG 1500 1501 TGCATCCTGC ACGAGTACAT GCAGCGCGTG CTCAAGGTCC GCCCCGTGTA TAGCTTCTTT 1560 1561 GAGTGCGAGA ACCCAAGTGA GCCTTTTGGT GCCTCGGTGA CCATTGACGG TGTGACCTAT 1620 1621 GGATCTGGAA CGGCGAGCAG CAAAAAACTG GCCAAGAACA AAGCTGCCCG CGCCACGCTG 1680 1681 GAGATCCTCA TCCCCGACTT CGTCAAGCAG ACGTCGGAGG AGAAGCCCAG AGACAGCGAG 1740 1741 GAGCTGGAGT ATTTCAACCA CATCAGTATC GAGGACTCGC GGGTCTACGA GCTGACCAGC 1800 1801 AAGGCCGGGC TGCTCTCCCC CTACCAGATC CTCCACGAGT GCCTTAAAAG AAACCACGGG 1860 1861 ATGGGAGATA CCTCCATCAA GTTTGAAGTG GTTCCTGGTA AAAACCAGAA GAGCGAATAC 1920 1921 GTCATGGCGT GCGGCAAGCA CACAGTGCGC GGCTGGTGCA AGAACAAGCG CGTTGGGAAG 1980 1981 CAGTTGGCGT CTCAGAAGAT CCTGCAACTG CTGCACCCGC ATGTCAAGAA CTGGGGGTCC 2040 2041 TTGCTGCGCA TGTATGGCCG TGAGAGCAGC AAGATGGTCA AGCAGGAGAC CTCGGACAAG 2100 2101 AGCGTGATCG AGCTGCAGCA GTTCGCTCGC AAGAACAAGC CCAACCTGCA CATCCTGAGC 2160 2161 AAGCTGCAGG AGGAGATGCG GCGGCTGGCG GAGGAGCGGG AGGAGACGCG CAAGAAGCCC 2220 2221 AAGATGTCCA TCGTGGCTTC CGCACAGCCC GGCGGCGAGC CCCTGTGCAC CGTGGACGTG 2280 2281 TGA 2283 |
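The sequences in this section are printed in blocks of ten letters with running position counters, so they cannot be pasted directly into tools that expect a plain residue string. The following is a minimal Python sketch of one way to strip the counters and run two basic sanity checks. The function names (`strip_position_numbers`, `last_counter`, `cds_length_matches`) are hypothetical helpers introduced here for illustration, not part of any DrLLPS tooling, and only the first rows of each sequence are used as sample input.

```python
import re


def strip_position_numbers(block: str) -> str:
    """Drop position counters, spaces and pipes; keep only the sequence letters."""
    return "".join(re.findall(r"[A-Za-z]+", block))


def last_counter(block: str) -> int:
    """Return the final position counter printed in the block."""
    return int(re.findall(r"\d+", block)[-1])


def cds_length_matches(n_nucleotides: int, n_residues: int) -> bool:
    """True if a CDS of this length encodes n_residues plus one stop codon."""
    return n_nucleotides == 3 * (n_residues + 1)


# Sample input: the first 120 residues and first 60 nucleotides of this record.
protein_block = (
    "1 METCGSPSPL PREPAGGVAM EDRARPLRAL PRGQSPPPPL QTSSDAEVMD VGSGGDGQAE 60 "
    "61 PPAEDPLNFY GASLLSKGSS SKARLLVDPN CSGHSPRTAR HAPAVRKFSP DLKLLKDVKI 120"
)
cds_block = (
    "1 ATGGAGACAT GTGGGAGCCC CTCTCCTCTC CCGCGCGAGC CCGCAGGAGG AGTGGCGATG 60"
)

protein = strip_position_numbers(protein_block)
cds = strip_position_numbers(cds_block)

# The number of letters recovered should equal the last printed counter.
protein_ok = len(protein) == last_counter(protein_block)
cds_ok = len(cds) == last_counter(cds_block)
```

Applied to the full record, the same length check can be run on the final counters shown above (906 residues, 2283 nucleotides). `cds_length_matches(2283, 906)` is false, which may indicate that the two listed sequences come from different transcripts; the cross-reference table lists two Ensembl transcripts for this gene.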
▼ KEYWORD
▼ GENE ONTOLOGY
Classification | Description |
---|---|
Cellular Component | Microprocessor complex |
Cellular Component | Nucleolus |
Molecular Function | Double-stranded RNA binding |
Molecular Function | Heme binding |
Molecular Function | Identical protein binding |
Molecular Function | Metal ion binding |
Molecular Function | Primary miRNA binding |
Biological Process | Primary miRNA processing |
Biological Process | RNA phosphodiester bond hydrolysis, endonucleolytic |
▼ ORTHOLOGY
DrLLPS ID | Organism | Identity | E-value | Score |
---|---|---|---|---|
LLPS-Ova-2611 | Ovis aries | 96.05 | 0.0 | 1277 |
LLPS-Tag-2544 | Taeniopygia guttata | 92.73 | 3e-168 | 499 |
LLPS-Mup-0488 | Mustela putorius furo | 92.3 | 0.0 | 1238 |
LLPS-Urm-1598 | Ursus maritimus | 92.09 | 0.0 | 1263 |
LLPS-Caf-3389 | Canis familiaris | 92.09 | 0.0 | 1264 |
LLPS-Scm-0618 | Scophthalmus maximus | 92.0 | 0.0 | 588 |
LLPS-Aim-2595 | Ailuropoda melanoleuca | 91.96 | 0.0 | 1261 |
LLPS-Fec-1983 | Felis catus | 91.96 | 0.0 | 1264 |
LLPS-Chs-1856 | Chlorocebus sabaeus | 91.15 | 0.0 | 1246 |
LLPS-Cea-2961 | Cercocebus atys | 91.15 | 0.0 | 1246 |
LLPS-Paa-2160 | Papio anubis | 91.15 | 0.0 | 1246 |
LLPS-Mal-3764 | Mandrillus leucophaeus | 91.15 | 0.0 | 1246 |
LLPS-Rhb-2190 | Rhinopithecus bieti | 91.15 | 0.0 | 1246 |
LLPS-Maf-1314 | Macaca fascicularis | 91.02 | 0.0 | 1246 |
LLPS-Pat-3117 | Pan troglodytes | 91.02 | 0.0 | 1244 |
LLPS-Pap-2467 | Pan paniscus | 91.02 | 0.0 | 1244 |
LLPS-Hos-0701 | Homo sapiens | 91.02 | 0.0 | 1244 |
LLPS-Mam-2041 | Macaca mulatta | 91.02 | 0.0 | 1246 |
LLPS-Gog-1085 | Gorilla gorilla | 90.88 | 0.0 | 1244 |
LLPS-Ict-2509 | Ictidomys tridecemlineatus | 90.88 | 0.0 | 1248 |
LLPS-Caj-2468 | Callithrix jacchus | 90.75 | 0.0 | 1244 |
LLPS-Eqc-1250 | Equus caballus | 90.57 | 0.0 | 1259 |
LLPS-Aon-0950 | Aotus nancymaae | 90.48 | 0.0 | 1239 |
LLPS-Nol-3650 | Nomascus leucogenys | 90.35 | 0.0 | 1244 |
LLPS-Poa-2604 | Pongo abelii | 90.35 | 0.0 | 1244 |
LLPS-Ora-1476 | Ornithorhynchus anatinus | 90.22 | 3e-51 | 181 |
LLPS-Fud-4341 | Fukomys damarensis | 90.21 | 0.0 | 1255 |
LLPS-Cap-1599 | Cavia porcellus | 90.21 | 0.0 | 1245 |
LLPS-Dio-1446 | Dipodomys ordii | 90.08 | 0.0 | 1248 |
LLPS-Mum-2304 | Mus musculus | 89.95 | 0.0 | 1234 |
LLPS-Otg-1036 | Otolemur garnettii | 89.95 | 0.0 | 1243 |
LLPS-Loa-3333 | Loxodonta africana | 89.95 | 0.0 | 1247 |
LLPS-Man-3687 | Macaca nemestrina | 89.73 | 0.0 | 1239 |
LLPS-Mea-0875 | Mesocricetus auratus | 89.54 | 0.0 | 1229 |
LLPS-Orc-3533 | Oryctolagus cuniculus | 89.41 | 0.0 | 1231 |
LLPS-Ran-0088 | Rattus norvegicus | 89.28 | 0.0 | 1225 |
LLPS-Sus-3690 | Sus scrofa | 88.46 | 0.0 | 1245 |
LLPS-Myl-2663 | Myotis lucifugus | 88.34 | 0.0 | 1206 |
LLPS-Cas-0548 | Carlito syrichta | 87.52 | 0.0 | 1202 |
LLPS-Anp-1287 | Anas platyrhynchos | 80.37 | 0.0 | 1082 |
LLPS-Gaga-1813 | Gallus gallus | 80.37 | 0.0 | 1085 |
LLPS-Fia-1135 | Ficedula albicollis | 79.71 | 0.0 | 1083 |
LLPS-Pes-0860 | Pelodiscus sinensis | 79.01 | 0.0 | 1044 |
LLPS-Anc-1302 | Anolis carolinensis | 78.91 | 0.0 | 1082 |
LLPS-Sah-1273 | Sarcophilus harrisii | 78.85 | 0.0 | 1088 |
LLPS-Mod-2008 | Monodelphis domestica | 78.8 | 0.0 | 1083 |
LLPS-Meg-1038 | Meleagris gallopavo | 72.9 | 0.0 | 957 |
LLPS-Scf-0763 | Scleropages formosus | 72.89 | 0.0 | 968 |
LLPS-Gaa-0246 | Gasterosteus aculeatus | 72.35 | 0.0 | 948 |
LLPS-Xet-1404 | Xenopus tropicalis | 72.33 | 0.0 | 1009 |
LLPS-Ten-1043 | Tetraodon nigroviridis | 72.22 | 0.0 | 929 |
LLPS-Dar-2328 | Danio rerio | 71.43 | 0.0 | 927 |
LLPS-Leo-2390 | Lepisosteus oculatus | 71.28 | 0.0 | 980 |
LLPS-Tar-1438 | Takifugu rubripes | 70.24 | 0.0 | 941 |
LLPS-Asm-2148 | Astyanax mexicanus | 70.07 | 0.0 | 935 |
LLPS-Orn-0136 | Oreochromis niloticus | 69.95 | 0.0 | 939 |
LLPS-Pof-1398 | Poecilia formosa | 69.17 | 0.0 | 923 |
LLPS-Icp-2833 | Ictalurus punctatus | 68.72 | 0.0 | 913 |
LLPS-Xim-3805 | Xiphophorus maculatus | 68.66 | 0.0 | 912 |
LLPS-Orl-1127 | Oryzias latipes | 68.18 | 0.0 | 902 |
LLPS-Cii-0227 | Ciona intestinalis | 47.44 | 2e-37 | 146 |
LLPS-Cis-0155 | Ciona savignyi | 44.3 | 9e-53 | 192 |
LLPS-Drm-1613 | Drosophila melanogaster | 36.7 | 1e-94 | 318 |
LLPS-Cae-0932 | Caenorhabditis elegans | 24.86 | 6e-31 | 134 |
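The table is ordered by identity, but a few rows pair a high identity with a low score and a weaker E-value (for example Ornithorhynchus anatinus at 90.22 with a score of 181), which may reflect a much shorter aligned region. The sketch below shows one way to parse the pipe-delimited rows in Python and filter on identity and score together. The `Ortholog` type, the `parse_ortholog_row` helper and the cutoffs (identity >= 90, score >= 1000) are assumptions chosen for illustration, not thresholds used by the database.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    drllps_id: str
    organism: str
    identity: float  # percent identity, as printed in the table
    e_value: float
    score: float


def parse_ortholog_row(row: str) -> Ortholog:
    """Split one 'ID | Organism | Identity | E-value | Score |' row into fields."""
    cells = [cell.strip() for cell in row.strip().strip("|").split("|")]
    drllps_id, organism, identity, e_value, score = cells
    return Ortholog(drllps_id, organism, float(identity), float(e_value), float(score))


# A few rows copied from the table above.
rows = [
    "LLPS-Ova-2611 | Ovis aries | 96.05 | 0.0 | 1277 |",
    "LLPS-Hos-0701 | Homo sapiens | 91.02 | 0.0 | 1244 |",
    "LLPS-Ora-1476 | Ornithorhynchus anatinus | 90.22 | 3e-51 | 181 |",
    "LLPS-Cae-0932 | Caenorhabditis elegans | 24.86 | 6e-31 | 134 |",
]

orthologs = [parse_ortholog_row(row) for row in rows]

# Illustrative cutoffs (assumed): require both high identity and a high score.
strong_hits = [o.organism for o in orthologs if o.identity >= 90 and o.score >= 1000]
```

With these sample rows, the filter keeps Ovis aries and Homo sapiens and drops the Ornithorhynchus anatinus hit despite its identity above 90.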