LLPS-Drm-2363
Sfmbt

▼ OVERVIEW


Status: Unreviewed
Protein Name: Polycomb protein Sfmbt; Scm-like with four MBT domain-containing protein 1; dSfmbt
Gene Name: Sfmbt, CG16975
Ensembl Gene: FBgn0032475
Ensembl Protein: FBpp0080069
Organism: Drosophila melanogaster
Taxa ID: 7227
LLPS Type: Regulator
PDB: 5J8Y (A, C) More


▼ PROPERTY



▼ Classification


Condensates:
CondensateEvidenceOrthologs
P-bodyPredicted from orthologs(View)

▼ FUNCTION


Polycomb group (PcG) protein that binds to the Polycomb response elements (PREs) found in the regulatory regions of many genes. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development. PcG proteins are not required to initiate repression, but to maintain it during later stages of development. They probably act via the methylation of histones, rendering chromatin heritably changed in its expressibility. Necessary but not sufficient to recruit a functional PcG repressive complex that represses target genes, suggesting that the recruitment of the distinct PRC1 complex is also required to allow a subsequent repression.

▼ SEQUENCE


Protein Sequence (FASTA)
1     MNPSELRMMW  MSSQYNSERI  TLEDAATLLG  HPTVGLSVME  DLSAHQPTLD  MNPMMSLMGG  60
61    DFTGQAAATA  AALGVQPGTL  IATNSNNLYG  FAHMGGLQQQ  LLQQSAAAAV  FQNYAEAMDN  120
121   DVENGMVGMA  MEAVVDDDDQ  VYGQRDNNFD  DNGSELEPKQ  EIINIDDFVM  MNEDNNSYDG  180
181   TDFMTSSDKD  ISQSSSSCMA  QMPGSLGVPG  VEHDLLVPLP  DGLLHHKLLG  TTLVPAMGTL  240
241   NGNAFGNIMV  STENTSSKQM  QRTYSTAKGA  NSTATTATCS  ASTSSALRSQ  RKTRKIEPVN  300
301   RPGLVLKTPI  AYRGNIDPSV  IPIQKDGMAV  CKRCGAIGVK  HTFYTKSRRF  CSMACARGEL  360
361   YSLVLNTKME  GDQATTSSPD  PGAGSESADL  PGDQQQSQSD  IELDLHAAHI  KNANYRFRIT  420
421   DQSKITQLNS  FGEPMSMGGD  AAANNVQMAA  DETIAALNGG  AVGDATAPGS  TEEGASTPNS  480
481   YLSAAPTPKA  LRLFKDIYPQ  DDLPQIPKYE  RLPVPCPQME  KIISIRRRMY  DPTHSYDWLP  540
541   RLSKENFNAA  PVTCFPHAPG  CEVWDNLGVG  MKVEVENTDC  DSIEVIQPGQ  TPTSFWVATI  600
601   LEIKGYKALM  SYEGFDTDSH  DFWVNLCNAE  VHSVGWCATR  GKPLIPPRTI  EHKYKDWKDF  660
661   LVGRLSGART  LPSNFYNKIN  DSLQSRFRLG  LNLECVDKDR  ISQVRLATVT  KIVGKRLFLR  720
721   YFDSDDGFWC  HEDSPIIHPV  GWATTVGHNL  AAPQDYLERM  LAGREAMIEV  HEDDATIELF  780
781   KMNFTFDEYY  SDGKTNSFVE  GMKLEAVDPL  NLSSICPATV  MAVLKFGYMM  IRIDSYQPDA  840
841   SGSDWFCYHE  KSPCIFPAGF  CSVNNISVTP  PNGYDSRTFT  WEGYLRDTGA  VAAGQHLFHR  900
901   IIPDHGFEVG  MSLECADLMD  PRLVCVATVA  RVVGRLLKVH  FDGWTDEYDQ  WLDCESADIY  960
961   PVGWCVLVNH  KLEGPPRVAH  QQAPKPAPKP  KIQRKRKPKK  GAAGGKTPTD  NNTQSVKSRT  1020
1021  IALKTTPHLP  KLSIKLELKP  EHHNAAFYEN  NQPEEEGDEE  DPDADGDGDG  STSHISEQST  1080
1081  TQSSSDLIAG  SGSGSGSASL  VTLATGSNKT  VKKSTTKSPA  PPTVGRKATS  YIANSSATNN  1140
1141  KYIPRLADID  SSEPHLELVP  DTWNVYDVSQ  FLRVNDCTAH  CDTFSRNKID  GKRLLQLTKD  1200
1201  DIMPLLGMKV  GPALKISDLI  AQLKCKVNPG  RARSHKTNKS  PFL  1243
Nucleotide CDS Sequence (FASTA)
1     ATGAACCCAT  CCGAGCTGCG  CATGATGTGG  ATGAGTAGTC  AGTACAACTC  GGAGCGCATT  60
61    ACACTGGAGG  ATGCGGCCAC  TCTGCTGGGT  CATCCCACTG  TTGGACTCTC  TGTCATGGAG  120
121   GACCTCTCCG  CCCATCAGCC  AACTTTGGAC  ATGAATCCGA  TGATGAGCCT  AATGGGGGGA  180
181   GATTTCACTG  GTCAAGCGGC  GGCGACTGCT  GCGGCACTGG  GTGTACAGCC  GGGCACCCTG  240
241   ATTGCCACCA  ACTCGAACAA  CCTGTACGGA  TTTGCCCACA  TGGGCGGCCT  GCAGCAGCAA  300
301   CTGCTCCAGC  AGTCGGCGGC  GGCGGCAGTC  TTCCAGAACT  ATGCGGAGGC  AATGGATAAC  360
361   GATGTGGAGA  ACGGCATGGT  TGGCATGGCC  ATGGAGGCGG  TCGTGGACGA  TGACGACCAA  420
421   GTGTACGGTC  AGAGGGACAA  TAACTTTGAT  GATAATGGAT  CCGAGTTGGA  GCCCAAGCAA  480
481   GAGATTATCA  ACATCGACGA  CTTTGTGATG  ATGAACGAGG  ACAATAATTC  GTACGATGGC  540
541   ACCGACTTTA  TGACCTCCTC  CGATAAGGAC  ATTTCGCAGT  CCTCCTCCTC  CTGCATGGCA  600
601   CAGATGCCCG  GAAGCTTGGG  CGTCCCTGGA  GTCGAGCACG  ATCTTTTGGT  TCCACTGCCC  660
661   GATGGCCTGC  TGCACCACAA  GCTGCTGGGC  ACAACTCTGG  TCCCTGCGAT  GGGTACGCTA  720
721   AATGGCAATG  CCTTTGGCAA  CATAATGGTC  AGCACTGAAA  ACACATCCAG  CAAGCAAATG  780
781   CAGCGCACCT  ATAGTACGGC  CAAGGGAGCC  AATTCCACCG  CAACCACCGC  CACCTGCAGT  840
841   GCCTCCACGT  CCTCCGCATT  GCGATCACAG  CGCAAGACGC  GCAAGATCGA  GCCCGTTAAC  900
901   AGGCCAGGAC  TGGTGCTGAA  GACACCCATC  GCCTACAGGG  GCAACATTGA  CCCTTCGGTG  960
961   ATCCCCATAC  AGAAAGATGG  CATGGCTGTC  TGTAAACGTT  GTGGCGCCAT  TGGAGTGAAG  1020
1021  CACACATTCT  ACACAAAATC  GCGACGTTTT  TGCAGCATGG  CCTGTGCCCG  AGGCGAACTA  1080
1081  TATTCGTTGG  TACTTAACAC  TAAGATGGAG  GGAGACCAGG  CAACCACTAG  TTCCCCGGAT  1140
1141  CCTGGAGCGG  GTTCGGAGTC  AGCCGATTTG  CCTGGCGATC  AGCAACAGTC  GCAGTCGGAT  1200
1201  ATTGAGCTCG  ATCTGCATGC  GGCGCACATT  AAGAATGCGA  ATTATCGTTT  TCGCATTACG  1260
1261  GATCAATCAA  AGATTACCCA  GTTGAATAGC  TTCGGCGAAC  CCATGTCAAT  GGGCGGCGAC  1320
1321  GCAGCTGCCA  ATAATGTACA  AATGGCAGCA  GATGAAACAA  TTGCTGCGTT  AAACGGCGGA  1380
1381  GCAGTGGGCG  ATGCCACTGC  TCCGGGCAGC  ACCGAGGAGG  GTGCATCCAC  ACCCAATTCT  1440
1441  TATCTGAGCG  CAGCACCCAC  GCCCAAGGCC  TTGCGACTTT  TTAAGGACAT  CTACCCACAG  1500
1501  GATGACCTGC  CGCAAATACC  CAAATACGAG  CGCCTTCCCG  TACCGTGTCC  GCAAATGGAG  1560
1561  AAGATAATCA  GCATTCGGCG  GCGCATGTAT  GATCCCACAC  ACTCCTACGA  CTGGTTGCCC  1620
1621  CGCCTTAGCA  AGGAGAACTT  CAATGCGGCA  CCCGTCACTT  GTTTTCCTCA  TGCACCCGGT  1680
1681  TGCGAGGTAT  GGGACAATTT  GGGCGTGGGC  ATGAAGGTTG  AAGTGGAAAA  TACGGATTGC  1740
1741  GATAGCATCG  AAGTGATCCA  ACCGGGTCAG  ACTCCTACCT  CGTTTTGGGT  GGCCACCATC  1800
1801  CTGGAAATCA  AAGGCTATAA  GGCCTTAATG  AGCTACGAGG  GTTTCGATAC  GGACTCGCAC  1860
1861  GACTTCTGGG  TGAACCTCTG  CAATGCCGAG  GTGCATTCGG  TGGGTTGGTG  CGCCACTCGG  1920
1921  GGCAAGCCAT  TAATTCCGCC  CCGCACCATC  GAGCACAAGT  ACAAGGACTG  GAAGGACTTT  1980
1981  CTGGTGGGAC  GTTTATCCGG  AGCCCGCACC  CTTCCCTCCA  ACTTTTACAA  CAAAATCAAC  2040
2041  GACAGCCTCC  AGTCGCGCTT  CCGCCTTGGC  CTGAATCTCG  AGTGCGTGGA  CAAGGATCGC  2100
2101  ATTTCGCAGG  TGCGCCTGGC  CACCGTCACC  AAAATCGTGG  GAAAGCGCCT  CTTCCTGCGC  2160
2161  TACTTCGATT  CCGACGACGG  CTTTTGGTGT  CACGAGGACT  CGCCCATCAT  CCATCCAGTT  2220
2221  GGCTGGGCAA  CCACAGTAGG  CCATAATCTG  GCTGCACCGC  AGGACTATCT  GGAGCGCATG  2280
2281  TTAGCTGGTC  GCGAAGCCAT  GATTGAGGTT  CATGAGGACG  ATGCCACAAT  CGAGTTGTTT  2340
2341  AAGATGAACT  TCACCTTCGA  CGAATACTAC  AGTGACGGCA  AAACCAATAG  CTTTGTGGAG  2400
2401  GGCATGAAGC  TGGAAGCGGT  GGATCCACTC  AACCTTTCAT  CCATATGCCC  GGCTACAGTA  2460
2461  ATGGCGGTTC  TTAAGTTCGG  ATACATGATG  ATACGCATTG  ATTCCTACCA  ACCGGATGCC  2520
2521  TCAGGGTCGG  ATTGGTTCTG  TTACCATGAA  AAGAGTCCGT  GTATCTTTCC  GGCTGGATTC  2580
2581  TGTTCCGTCA  ACAACATTTC  GGTTACCCCA  CCGAACGGCT  ACGACTCTCG  TACATTCACC  2640
2641  TGGGAGGGTT  ACCTCCGCGA  CACGGGAGCC  GTAGCCGCTG  GCCAGCATCT  ATTTCATCGG  2700
2701  ATTATTCCCG  ATCATGGATT  TGAGGTGGGT  ATGAGTCTGG  AGTGTGCAGA  TCTCATGGAT  2760
2761  CCCCGACTCG  TTTGCGTGGC  CACGGTGGCG  CGAGTGGTTG  GTCGACTACT  CAAGGTTCAC  2820
2821  TTTGACGGAT  GGACGGATGA  GTACGACCAG  TGGTTGGATT  GCGAATCAGC  CGATATATAT  2880
2881  CCAGTCGGAT  GGTGTGTACT  GGTCAACCAT  AAGCTAGAGG  GCCCACCGAG  AGTAGCACAT  2940
2941  CAGCAGGCCC  CGAAACCGGC  ACCAAAGCCC  AAAATACAGC  GAAAGCGAAA  GCCCAAAAAG  3000
3001  GGAGCAGCGG  GAGGCAAAAC  TCCAACCGAT  AATAATACTC  AGTCGGTCAA  ATCGCGTACA  3060
3061  ATTGCGCTCA  AGACCACGCC  GCACTTACCC  AAGCTGAGCA  TCAAGCTGGA  GCTAAAGCCG  3120
3121  GAGCATCACA  ATGCTGCCTT  TTACGAGAAC  AATCAGCCCG  AGGAGGAAGG  CGACGAGGAG  3180
3181  GATCCCGATG  CGGATGGCGA  TGGAGACGGA  AGCACCAGCC  ACATCTCCGA  GCAGTCAACG  3240
3241  ACACAGTCGT  CCAGTGATCT  GATCGCTGGA  TCGGGCAGTG  GAAGTGGGTC  CGCCTCGCTG  3300
3301  GTAACTCTCG  CCACGGGCAG  TAACAAAACG  AACTCCTCTG  CGACGAATAA  TAAGTACATT  3360
3361  CCGCGTCTGG  CCGACATCGA  TTCGAGTGAA  CCTCACTTGG  AGCTGGTGCC  GGATACATGG  3420
3421  AACGTGTACG  ACGTATCCCA  GTTCTTGCGG  GTGAACGATT  GCACAGCCCA  CTGTGACACG  3480
3481  TTCAGCCGGA  ACAAGATCGA  CGGAAAGCGA  CTCCTGCAGC  TGACCAAGGA  CGATATCATG  3540
3541  CCACTGTTGG  GCATGAAGGT  GGGCCCAGCA  CTGAAAATTT  CCGACCTTAT  TGCACAGCTT  3600
3601  AAGTGCAAGG  TTAATCCGGG  CAGAGCCAGA  TCCCACAAGA  CCAACAAATC  ACCGTTTTTA  3660

▼ KEYWORD


ID
Family
3D-structure
Alternative splicing
Chromatin regulator
Complete proteome
DNA-binding
Metal-binding
Nucleus
Reference proteome
Repeat
Repressor
Transcription
Transcription regulation
Zinc
Zinc-finger

▼ GENE ONTOLOGY


ID
Classification
Description
Cellular Component
Nucleus
Cellular Component
PcG protein complex
Molecular Function
Chromatin binding
Molecular Function
DNA binding
Molecular Function
Methylated histone binding
Molecular Function
Zinc ion binding
Biological Process
Chromatin silencing
Biological Process
Imaginal disc growth
Biological Process
Negative regulation of gene expression
Biological Process
Oogenesis

▼ KEGG



▼ ORTHOLOGY


DrLLPS IDOrganismIdentityE-valueScore
LLPS-Cii-2295Ciona intestinalis59.341e-64 231
LLPS-Lac-3581Latimeria chalumnae51.466e-149 473
LLPS-Bot-1244Bos taurus50.99e-99 341
LLPS-Cis-2053Ciona savignyi50.221e-138 437
LLPS-Cap-0015Cavia porcellus49.891e-139 448
LLPS-Man-4061Macaca nemestrina49.661e-138 445
LLPS-Pat-0675Pan troglodytes49.669e-139 446
LLPS-Mam-2337Macaca mulatta49.661e-138 445
LLPS-Poa-2761Pongo abelii49.661e-138 446
LLPS-Aon-1384Aotus nancymaae49.665e-139 446
LLPS-Cea-3567Cercocebus atys49.661e-138 445
LLPS-Chs-2721Chlorocebus sabaeus49.661e-138 445
LLPS-Mum-0464Mus musculus49.668e-138 444
LLPS-Maf-3639Macaca fascicularis49.662e-138 445
LLPS-Mal-0395Mandrillus leucophaeus49.662e-139 445
LLPS-Leo-1129Lepisosteus oculatus49.556e-144 460
LLPS-Rhb-1895Rhinopithecus bieti49.443e-139 447
LLPS-Fia-2862Ficedula albicollis49.442e-140 451
LLPS-Hos-3052Homo sapiens49.446e-138 444
LLPS-Pap-2485Pan paniscus49.441e-137 443
LLPS-Myl-2141Myotis lucifugus49.447e-138 443
LLPS-Ran-1718Rattus norvegicus49.445e-138 444
LLPS-Tag-1154Taeniopygia guttata49.341e-146 464
LLPS-Gaga-4027Gallus gallus49.344e-147 468
LLPS-Orn-1142Oreochromis niloticus49.334e-146 466
LLPS-Gog-2120Gorilla gorilla49.322e-137 442
LLPS-Ova-3309Ovis aries49.35e-130 422
LLPS-Anp-1580Anas platyrhynchos49.237e-147 464
LLPS-Orc-3333Oryctolagus cuniculus49.215e-138 444
LLPS-Ict-2884Ictidomys tridecemlineatus49.214e-138 444
LLPS-Cas-2475Carlito syrichta49.215e-138 444
LLPS-Caj-1491Callithrix jacchus49.218e-138 443
LLPS-Fud-0417Fukomys damarensis49.216e-138 444
LLPS-Mea-2140Mesocricetus auratus49.215e-137 441
LLPS-Paa-1346Papio anubis49.219e-138 443
LLPS-Meg-2258Meleagris gallopavo49.122e-146 463
LLPS-Dio-2615Dipodomys ordii49.119e-138 441
LLPS-Sus-2414Sus scrofa49.14e-139 447
LLPS-Anc-2354Anolis carolinensis49.013e-148 468
LLPS-Xet-2506Xenopus tropicalis48.998e-145 460
LLPS-Loa-2541Loxodonta africana48.983e-136 439
LLPS-Ora-2356Ornithorhynchus anatinus48.983e-136 440
LLPS-Caf-3271Canis familiaris48.883e-138 444
LLPS-Mod-4172Monodelphis domestica48.84e-89 311
LLPS-Scf-3126Scleropages formosus48.784e-143 457
LLPS-Orl-0975Oryzias latipes48.776e-141 451
LLPS-Xim-1036Xiphophorus maculatus48.772e-142 454
LLPS-Otg-2930Otolemur garnettii48.768e-138 443
LLPS-Pes-2906Pelodiscus sinensis48.681e-143 457
LLPS-Fec-3949Felis catus48.659e-137 441
LLPS-Urm-0963Ursus maritimus48.652e-137 442
LLPS-Eqc-0462Equus caballus48.657e-137 441
LLPS-Aim-3869Ailuropoda melanoleuca48.651e-137 443
LLPS-Mup-2226Mustela putorius furo48.655e-138 444
LLPS-Scm-0344Scophthalmus maximus48.559e-143 456
LLPS-Tut-2330Tursiops truncatus48.446e-142 452
LLPS-Sah-0990Sarcophilus harrisii48.214e-142 452
LLPS-Dar-0374Danio rerio48.111e-141 453
LLPS-Tar-3275Takifugu rubripes48.112e-141 453
LLPS-Nol-3218Nomascus leucogenys48.04e-141 449
LLPS-Pof-3828Poecilia formosa47.993e-137 435
LLPS-Ten-2746Tetraodon nigroviridis47.873e-142 452
LLPS-Asm-0725Astyanax mexicanus47.587e-140 449
LLPS-Gaa-3578Gasterosteus aculeatus47.342e-120 387
LLPS-Icp-3009Ictalurus punctatus47.124e-139 446
LLPS-Ere-0072Erinaceus europaeus46.272e-0759.3
LLPS-Cae-0331Caenorhabditis elegans27.633e-38 157