LLPS-Mum-4741
Herc2
▼ OVERVIEW
Status: | Unreviewed |
Protein Name: | E3 ubiquitin-protein ligase HERC2; HECT domain and RCC1-like domain-containing protein 2; HECT-type E3 ubiquitin transferase HERC2 |
Gene Name: | Herc2, Jdf2, Kiaa0393, Rjs |
Ensembl Gene: | ENSMUSG00000030451.16 |
Ensembl Protein: | ENSMUSP00000145997.1 |
Organism: | Mus musculus |
Taxa ID: | 10090 |
LLPS Type: | Others |
▼ PROPERTY
▼ Classification
Condensates:
Condensate | Evidence | Orthologs |
---|---|---|
Centrosome/Spindle pole body | Predicted from orthologs | (View) |
▼ FUNCTION
E3 ubiquitin-protein ligase that regulates ubiquitin-dependent retention of repair proteins on damaged chromosomes. Recruited to sites of DNA damage in response to ionizing radiation (IR) and facilitates the assembly of UBE2N and RNF8 promoting DNA damage-induced formation of 'Lys-63'-linked ubiquitin chains. Acts as a mediator of binding specificity between UBE2N and RNF8. Involved in the maintenance of RNF168 levels. E3 ubiquitin-protein ligase that promotes the ubiquitination and proteasomal degradation of XPA which influences the circadian oscillation of DNA excision repair activity. By controlling the steady-state expression of the IGF1R receptor, indirectly regulates the insulin-like growth factor receptor signaling pathway. |
▼ CROSS REFERENCE
Database | Nucleotide ID | Protein ID |
---|---|---|
Ensembl | ENSMUST00000205303.1 | ENSMUSP00000145997.1 |
Ensembl | ENSMUST00000205653.1 | ENSMUSP00000145771.1 |
Ensembl | ENSMUST00000205678.1 | ENSMUSP00000145560.1 |
Ensembl | ENSMUST00000164095.2 | ENSMUSP00000131573.1 |
Ensembl | ENSMUST00000206537.1 | ENSMUSP00000146182.1 |
Ensembl | ENSMUST00000076226.12 | ENSMUSP00000075579.6 |
UniProt | Q4U2R1, HERC2_MOUSE, E9PZT6, O88473, Q3TRJ8, Q3TS47, Q3TST2, Q3UFQ6, Q3URH7, Q5DU32, Q7TPR5, Q80VV7, Q9QYT1, Q9Z168, Q9Z171 | |
GeneBank | AF071173, AK148361, AK141515, AF061529, AK161826, AC102121, BC044667, AC102150, BC054829, AK220338, AK162270, AK162708 | AAD08658.1, BAE24711.1, AAH54829.1, BAE36593.1, BAE37031.1, BAE36828.1, AAH44667.1, AAC31431.1, BAE28504.1, BAD90404.1 |
RefSeq | NM_010418.2, XM_006540637.3, XM_006540638.2, XM_006540639.1 | NP_034548.2, XP_006540700.1, XP_006540701.1, XP_006540702.1 |
Entrez | 15204 |
▼ SEQUENCE
Protein Sequence (FASTA) |
---|
1 MPSESFCLAA QSRLDSKWLK TDIQLAFTRD GLCGLWNEMV KDGEIVYTGT ELAQNRELPL 60 61 RKDDGVDAQS GTKKEDLNDK EKKEEEETPA PVYRAKSILE SWVWGRQPDV NELKECLSVL 120 121 VKEQQALAVQ SATTTLSALR LKQRLVILER YFIALNRTVF QENVKVKWKS SSISVPPTEK 180 181 KSARPTGRGV EGLARVGSRA ALSFAFAFLR RAWRSGEDAD LCSELLQESL DALRALPEAS 240 241 LFDESTVSSV WLEVVERATR FLRSVVTGDV HGTPGTKGPG GVPLQDQHLA LAILLELAVQ 300 301 RGTLSQMLSA ILLLLQLWDS GAQETDNERS AQGTSAPLLP LLQRFQSIIC SKDVPHTESD 360 361 MHLLSGPLSP NESFLRYLTL PQDNELAIDL RQTAVVVMAH LDRLATPCMP PLCSSPTSHK 420 421 GSLQEVIGWG LIGWKYYANV IGPIQCEGLA SLGVMQVACA EKRFLILSRN GRVYTQAYNS 480 481 DMLAPQLVQG LASRNIVKIA AHSDGHHYLA LAATGEVYSW GCGDGGRLGH GDTVPLEEPK 540 541 VISAFSGKQA GKHVVHIACG STYSAAITAE GELYTWGRGN YGRLGHGSSE DEAIPMLVAG 600 601 LKGLKVIDVA CGSGDAQTLA VTENGQVWSW GDGDYGKLGR GGSDGCKTPK LIEKLQDLDV 660 661 IKVRCGSQFS IALTKDGQVY SWGKGDNQRL GHGTEEHVRY PKLLEGLQGK KVIDVAAGST 720 721 HCLALTEDSE VHSWGSNDQC QHFDTLRVTK PEPTALPGLD SKHIVGIACG PAQSFAWSSC 780 781 SEWSIGLRVP FVVDICSMTF EQLDLLLRQV SEGMDGTADW PPPQEKECMA VATLNLLRLQ 840 841 LHAAISHQVD PEFLGLGLGS VLLNSLKQTV VTLASSAGVL STVQSAAQAV LQSGWSVLLP 900 901 TAEERARALS ALLPCTVSGN EVNISPGRRF MIDLLVGSLM ADGGLESALN AAITAEIQDI 960 961 EAKKEAQKEK EIDEQEASAS TFHRSRTPLD KDLINTGIYE SSGKQCLPLV QLIQQLLRNI 1020 1021 ASQTVARLKD VARRISSCLD FEQQSCERSA SLDLLLRFQR LLISKLYPGE KIGPISDTSS 1080 1081 PELMGVGSLL KKYTALVCTH IGDILPVAAS IASSSWQHFA EVACVMEGDF TGVLLPELVV 1140 1141 SIVLLLSKNA SLMQEAGAIP LLGGLLEHLD RFNHLAPGKE RDDHEELAWP GIMESFFTGQ 1200 1201 NCRNNEEVTL IRKADLENHN KDGGFWTVID GKVYGIKDFQ TQSLTGNSIL AQFAGEDPVV 1260 1261 ALEAALQFED TQESMHAFCV GQYLEPDQEV VTIPDLGSLS SPLIDTERNL GLLLGLHASY 1320 1321 LAMSTPLSPV EVECAKWLQS SIFSGGLQTS QIHYSYNEEK DEDHCSSPGG TPISKSRLCS 1380 1381 HRWALGDHSQ AFLQAIADNN IQDYNVKDFL CQIERYCRQC HLTTPITFPP EHPVEEVGRL 1440 1441 LLCCLLKHED LGHVALSLVH VGTLGIEQVK HRTLPKSVVD VCRVVYQAKC SLIKTHQEQG 1500 1501 RSYKEVCAPV IERLRFLFNE LRPAVCSDLS IMSKFKLLGS LPRWRRIAQK IIRERRKKRV 1560 1561 PKKPESIDSE EKIGNEESDL EEACVLPHSP INVDKRPISM KSPKDKWQPL LNTVTGVHKY 1620 1621 KWLKQNVQGL YPQSALLNTI VEFALKEEPV DVEKMRKCLL KQLERAEVRL EGIDTILKLA 1680 1681 AKSFLLPSVQ YAMFCGWQRL IPEGIDIGEP LTDCLRDVDL IPPFNRMLLE VTFGKLYAWA 1740 1741 VQNIRSVLMD ASARFKELGI QPVPLQTITN ENPAGPSLGT IPQARFLLVM LSMLTLQHGA 1800 1801 NNLDLLLNSG TLALTQTALR LIGPTCDSVE DDMNASARGA SATVLEETRK ETAPVQLPVS 1860 1861 GPELAAMMKI GTRVMRGVDW KWGDQDGPPP GLGRVIGELG EDGWIRVQWD TGSTNSYRMG 1920 1921 KEGKYDLKLV ELPVSSQPSA EDSDTEDDSE AEQGERNIHP TAMMLTSVIN LLQTLCLSVG 1980 1981 VHADIMQSEA TKTLCGLLRM LVESGTTDKP APPDRLVARE QHRSWCTLGF VRSIALTPQA 2040 2041 CGALSSPRWI TLLMKVVEGH APFTAASLQR QILAVHLLQA VLPSWDKTER ARDMKCLVEK 2100 2101 LFGFLGSLLT TCSSDVPLLR ESTLRKRRAR PQASLTATHS STLAEEVVGL LRTLHSLTQW 2160 2161 NGLINKYINS QLCSVTQSYA GKTSERAQLE DYFPDSENLE VGGLMAVLAV IGGIDGRLRL 2220 2221 GGQVMHDEFG EGTVTRITPK GRITVQFCDM RMCRVCPLNQ LKPLPAVAFS VNNLPFTEPM 2280 2281 LSVWAELVNL AGSKLEKHKT KKSAKPAFAG QVDLDLLRSQ QLKLYILKAG RALLSHQDKL 2340 2341 RQILSQPAVQ GTGTLQTDDG AAASPDLGDM SPEGPQPPMI LLQQLLSSAT QPSPVKAIFD 2400 2401 KQELEAAALA LCQCLAVEST HPSSPGCEDC SSSEATTPVS VQHIHLARAK KRRQSPAPAL 2460 2461 PIVVQLMEMG FPRKNIEFAL KSLTGTSGNA SGLPGVEALV GWLLDHSDVQ VTEFSDAETL 2520 2521 SDEYSDEEVV EDVDDTPYPV AAGAVVTESQ TYKKRADFLS NDDYAVYVRE NVQVGMMVRC 2580 2581 CRTYEEVCEG DVGKVIKLDR DGLHDLNVQC DWQQKGGTYW VRYIHVELIG YPPPSSSSHI 2640 2641 KIGDKVRVKA SVTTPKYKWG SVTHQSVGLV KAFSANGKDI IVDFPQQSHW TGLLSEMELV 2700 2701 PSIHPGVTCD GCQTFPINGS RFKCRNCDDF DFCETCFKTK KHNTRHTFGR INEPGQSAVF 2760 2761 CGRSGKQLKR CHSSQPGMLL DSWSRMVKSL NVSSSVNQAS RLIDGSEPCW QSSGSQGKHW 2820 2821 IRLEIFPDVL VHRLKMIVDP ADSSYMPSLV VVSGGNSLNN LIELKTININ QTDTTVPLLS 2880 2881 DCAEYHRYIE IAIKQCRSSG IDCKIHGLIL LGRIRAEEED LAAVPFLASD NEEEEDDKGS 2940 2941 TGSLIRKKTP GLESTATIRT KVFVWGLNDK DQLGGLKGSK IKVPSFSETL SALNVVQVAG 3000 3001 GSKSLFAVTV EGKVYSCGEA TNGRLGLGMS SGTVPIPRQI TALSSYVVKK VAVHSGGRHA 3060 3061 TALTVDGKVF SWGEGDDGKL GHFSRMNCDK PRLIEALKTK RIRDIACGSS HSAALTSSGE 3120 3121 LYTWGLGEYG RLGHGDNTTQ LKPKMVKVLL GHRVIQVACG SRDAQTLALT DEGLVFSWGD 3180 3181 GDFGKLGRGG SEGCNIPQNI ERLNGQGVCQ IECGAQFSLA LTKSGVVWTW GKGDYFRLGH 3240 3241 GSDVHVRKPQ VVEGLRGKKI VHVAVGALHC LAVTDSGQVY AWGDNDHGQQ GNGTTTVNRK 3300 3301 PTLVQGLEGQ KITRVACGSS HSVAWTTVDV ATPSVHEPVL FQTARDPLGA SYLGVPSDAD 3360 3361 SSSSSNKISG ANNCKPNRPS LAKILLSLEG NLAKQQALSH ILTALQIMYA RDAVVGALMP 3420 3421 AGMLAPVECP SFSSSAPASD VSAMASPMHM EDSTLAADLE DRLSPNLWQE KREIVSSEDA 3480 3481 VTPSAVTPSA PSASSRPFIP VTDDPGAASI IAETMTKTKE DVESQNKTSG PEPQSLDEFT 3540 3541 SLLIPDDTRV VVELLKLSVC SRAGDKGREV LSAVLSGMGT AYPQVADMLL ELCVTELEDV 3600 3601 ATDSQSGRLS SQPVVVESSH PYTDDTSTSG TVKIPGAEGL RVEFDRQCST ERRHDPLTVM 3660 3661 DGVNRIVSVR SGREWSDWSS ELRIPGDELK WKFISDGSVN GWGWRFTVYP IMPAAGPKDL 3720 3721 LSDRCVLSCP SMDLVTCLLD FRLNLTSNRS IVPRLAASLA ACAQLSALAA SHRMWALQRL 3780 3781 RRLLTTEFGQ SININRLLGE NDGESRALSF TGSALAALVK GLPEALQRQF EYEDPIVRGG 3840 3841 KQLLHSPFFK VLVALACDLE LDTLPCCAET HKWAWFRRYC MASRVAVALD KRTPLPRLFL 3900 3901 DEVAKKIREL MADSESMDVL HESHSIFKRE QDEQLVQWMN RRPDDWTLSA GGSGTIYGWG 3960 3961 HNHRGQLGGI EGAKVKVPTP CEALATLRPV QLIGGEQTLF AVTADGKLYA TGYGAGGRLG 4020 4021 IGGTESVSTP TLLESIQHVF IKKVAVNSGG KHCLALSSEG EVYSWGEAED GKLGHGNRSP 4080 4081 CDRPRVIESL RGIEVVDVAA GGAHSACVTA AGDLYTWGKG RYGRLGHSDS EDQLKPKLVE 4140 4141 ALQGHRVIDI ACGSGDAQTL CLTDDDTVWS WGDGDYGKLG RGGSDGCKVP MKIDSLTGLG 4200 4201 VVKVECGSQF SVALTKSGAV YTWGKGDYHR LGHGSDDHVR RPRQVQGLQG KKVIAIATGS 4260 4261 LHCVCCTEDG EVYTWGDNDE GQLGDGTTNA IQRPRLVAAL QGKKVNRVAC GSAHTLAWST 4320 4321 SKPASAGKLP AQVPMEYNHL QEIPIIALRN RLLLLHHISE LFCPCIPMFD LEGSLDETGL 4380 4381 GPSVGFDTLR GILISQGKEA AFRKVVQATM VRDRQHGPVV ELNRIQVKRS RSKGGLAGPD 4440 4441 GTKSVFGQMC AKMSSFSPDS LLLPHRVWKV KFVGESVDDC GGGYSESIAE ICEELQNGLT 4500 4501 PLLIVTPNGR DESGANRDCY LLNPATRAPV HCSMFRFLGV LLGIAIRTGS PLSLNLAEPV 4560 4561 WKQLAGMSLT IADLSEVDKD FIPGLMYIRD NEATSEEFEA MSLPFTVPSA SGQDIQLSSK 4620 4621 HTHITLDNRA EYVRLAINYR LHEFDEQVAA VREGMARVVP VPLLSLFTGY ELETMVCGSP 4680 4681 DIPLHLLKSV ATYKGIEPSA SLVQWFWEVM ESFSNTERSL FLRFVWGRTR LPRTIADFRG 4740 4741 RDFVIQVLDK YNPPDHFLPE SYTCFFLLKL PRYSCKQVLE EKLKYAIHFC KSIDTDDYAR 4800 4801 IALTGEPAAD DSSEDSDNED ADSFASDSTQ DYLTGH 4836 |
Nucleotide CDS Sequence (FASTA) |
1 ATGCCTTCTG AGTCTTTCTG TTTGGCTGCC CAGTCTCGCC TTGACTCCAA ATGGTTGAAA 60 61 ACAGATATCC AGCTTGCATT CACAAGAGAT GGACTCTGTG GTCTGTGGAA CGAAATGGTT 120 121 AAAGATGGGG AAATTGTGTA CACTGGAACA GAATTAGCAC AAAATCGAGA ACTCCCTTTG 180 181 AGAAAAGATG ATGGTGTCGA TGCTCAGAGT GGAACCAAGA AAGAAGATCT TAATGACAAA 240 241 GAGAAGAAGG AAGAAGAAGA GACACCTGCA CCTGTGTATA GAGCCAAGTC AATCCTGGAA 300 301 AGCTGGGTGT GGGGCAGGCA GCCAGATGTG AATGAACTGA AGGAGTGTCT TTCTGTGCTG 360 361 GTTAAAGAAC AGCAAGCCCT GGCTGTCCAG TCAGCTACCA CTACACTCTC AGCCTTGAGG 420 421 TTGAAGCAGA GGCTGGTGAT CCTCGAGCGC TACTTCATTG CCTTGAACAG AACTGTTTTC 480 481 CAGGAGAATG TCAAAGTTAA GTGGAAAAGC AGCAGCATTT CTGTGCCTCC CACGGAGAAG 540 541 AAGAGTGCCC GTCCCACAGG CAGAGGGGTA GAAGGTCTTG CCCGAGTCGG ATCCCGAGCA 600 601 GCACTCTCCT TCGCCTTTGC TTTCTTACGG AGGGCCTGGC GTTCAGGTGA GGATGCGGAT 660 661 CTCTGCAGTG AGCTCTTGCA GGAGTCCCTG GATGCCCTTC GAGCTCTTCC TGAAGCCTCC 720 721 CTCTTTGATG AGAGCACAGT ATCCTCTGTG TGGTTGGAGG TGGTAGAGAG AGCAACGAGG 780 781 TTCCTCAGGT CTGTCGTGAC TGGAGATGTT CATGGAACAC CAGGGACCAA GGGGCCAGGA 840 841 GGAGTTCCTC TACAGGATCA GCACTTGGCT TTGGCCATTC TACTGGAGTT GGCTGTCCAG 900 901 AGAGGCACGC TTAGCCAAAT GTTGTCTGCC ATCCTGTTGT TGCTTCAGCT GTGGGACAGT 960 961 GGGGCACAGG AAACAGATAA TGAGCGCTCT GCCCAGGGCA CCAGTGCTCC TCTTTTGCCT 1020 1021 CTGCTGCAGA GGTTTCAGAG CATCATCTGC AGTAAGGACG TGCCACACAC AGAGAGCGAC 1080 1081 ATGCATCTTT TATCTGGCCC TTTAAGCCCC AATGAGAGTT TCCTAAGGTA CCTTACTCTT 1140 1141 CCACAAGACA ATGAACTTGC CATTGATTTA CGACAAACTG CAGTTGTTGT CATGGCTCAT 1200 1201 CTAGACCGTC TGGCTACACC CTGTATGCCT CCTCTGTGTA GCTCTCCAAC ATCTCATAAG 1260 1261 GGATCATTGC AGGAGGTCAT AGGTTGGGGG TTAATAGGCT GGAAATACTA TGCGAATGTG 1320 1321 ATTGGTCCGA TCCAGTGTGA AGGCCTTGCC AGTCTGGGGG TAATGCAGGT CGCCTGTGCA 1380 1381 GAAAAGCGGT TCTTGATCTT GTCCCGCAAT GGTCGTGTAT ACACACAGGC TTACAACAGT 1440 1441 GACATGCTGG CTCCACAGCT GGTCCAGGGT CTTGCCTCCA GAAACATTGT GAAAATTGCT 1500 1501 GCCCATTCTG ATGGTCACCA CTATCTGGCT TTGGCTGCCA CGGGGGAGGT GTACTCCTGG 1560 1561 GGCTGTGGAG ATGGCGGAAG GCTGGGCCAT GGGGACACAG TGCCTCTGGA GGAGCCCAAG 1620 1621 GTGATCTCGG CTTTCTCAGG AAAGCAGGCT GGGAAGCACG TGGTGCACAT AGCATGTGGG 1680 1681 AGCACGTACA GTGCGGCCAT CACTGCCGAG GGGGAGCTGT ACACTTGGGG CCGTGGGAAC 1740 1741 TACGGCCGCT TGGGCCATGG CTCTAGTGAA GATGAGGCCA TTCCAATGCT GGTAGCTGGA 1800 1801 CTCAAAGGTC TAAAGGTCAT TGATGTTGCC TGTGGGAGTG GAGATGCTCA AACCCTGGCC 1860 1861 GTGACTGAGA ACGGCCAAGT GTGGTCTTGG GGAGATGGCG ACTATGGGAA GTTGGGCAGA 1920 1921 GGTGGAAGTG ATGGCTGCAA AACCCCAAAG CTGATTGAAA AACTTCAAGA CTTAGATGTC 1980 1981 ATCAAAGTGC GCTGTGGAAG TCAGTTTTCT ATTGCCTTGA CAAAAGATGG CCAAGTTTAT 2040 2041 TCATGGGGGA AAGGTGACAA TCAGAGGCTT GGTCATGGAA CAGAAGAACA TGTCCGTTAT 2100 2101 CCCAAACTCT TGGAAGGTTT GCAAGGGAAG AAGGTGATTG ATGTGGCTGC AGGCTCCACG 2160 2161 CACTGCCTTG CTCTGACAGA AGACAGTGAG GTGCACAGCT GGGGGAGCAA TGATCAGTGC 2220 2221 CAGCACTTTG ACACTTTGCG TGTGACCAAG CCAGAACCGA CTGCACTCCC AGGACTGGAC 2280 2281 AGCAAACACA TAGTTGGAAT CGCTTGTGGG CCTGCCCAGA GCTTTGCGTG GTCATCTTGT 2340 2341 TCTGAGTGGT CCATTGGCCT TCGTGTCCCT TTTGTAGTGG ACATCTGCTC AATGACGTTT 2400 2401 GAGCAGCTGG ATCTCCTGCT GCGTCAGGTC AGTGAGGGGA TGGACGGCAC TGCTGACTGG 2460 2461 CCCCCACCAC AAGAGAAGGA GTGCATGGCT GTGGCAACGC TGAACCTTCT GCGACTGCAG 2520 2521 TTGCATGCTG CCATTAGTCA CCAGGTTGAC CCGGAATTCC TTGGTTTAGG TCTGGGCAGT 2580 2581 GTGCTCCTGA ACAGCCTGAA GCAGACCGTG GTGACCCTGG CCAGCAGTGC AGGAGTGCTG 2640 2641 AGCACTGTGC AGTCAGCTGC CCAGGCTGTA CTACAGAGCG GCTGGTCTGT GCTGCTGCCC 2700 2701 ACAGCTGAGG AGCGGGCCAG GGCACTCTCA GCACTCTTGC CCTGCACAGT TTCAGGCAAT 2760 2761 GAAGTGAATA TAAGTCCAGG TCGTCGATTC ATGATTGATC TTTTAGTTGG CAGCTTGATG 2820 2821 GCTGATGGAG GATTGGAGTC AGCTTTAAAT GCAGCCATTA CTGCAGAAAT TCAGGATATA 2880 2881 GAAGCCAAAA AAGAAGCACA GAAGGAAAAA GAAATTGATG AACAAGAAGC CAGTGCCTCC 2940 2941 ACATTTCATA GAAGTAGGAC TCCTCTGGAT AAAGACCTAA TTAATACTGG GATCTATGAA 3000 3001 TCATCTGGCA AACAGTGCTT GCCTCTAGTT CAACTCATAC AGCAGCTTCT TAGAAACATT 3060 3061 GCTTCTCAGA CTGTAGCCAG ACTGAAAGAT GTTGCTCGCC GCATTTCCTC ATGTCTGGAC 3120 3121 TTTGAGCAGC AGAGTTGTGA AAGATCGGCT TCACTGGATC TGTTGCTGCG CTTTCAACGC 3180 3181 CTGCTCATTA GTAAACTTTA CCCAGGAGAG AAGATTGGTC CAATCTCAGA TACTTCCAGT 3240 3241 CCTGAGCTAA TGGGTGTTGG CTCTTTGCTG AAAAAGTACA CAGCCCTTGT GTGTACACAC 3300 3301 ATTGGAGACA TATTGCCTGT GGCTGCCAGC ATCGCTTCTA GCAGCTGGCA GCACTTTGCA 3360 3361 GAAGTGGCCT GCGTAATGGA AGGGGACTTT ACAGGTGTAC TCCTTCCGGA GCTAGTGGTG 3420 3421 TCCATCGTGC TTCTGCTCAG TAAGAACGCC AGCCTAATGC AAGAGGCTGG AGCTATACCT 3480 3481 CTGCTTGGAG GCCTCCTGGA GCACCTGGAT CGTTTCAACC ATTTAGCACC AGGGAAGGAG 3540 3541 CGGGATGATC ATGAAGAGCT GGCCTGGCCT GGGATTATGG AGTCATTTTT CACTGGTCAG 3600 3601 AACTGTAGAA ATAATGAGGA GGTAACACTG ATACGCAAAG CTGATTTGGA GAACCATAAT 3660 3661 AAAGATGGTG GCTTCTGGAC TGTAATTGAT GGAAAAGTAT ATGGTATAAA AGACTTTCAG 3720 3721 ACACAATCCT TAACAGGAAA TAGTATACTG GCTCAGTTTG CAGGGGAAGA TCCAGTGGTC 3780 3781 GCTCTGGAGG CAGCTTTGCA GTTTGAAGAC ACACAGGAAT CCATGCATGC CTTCTGTGTT 3840 3841 GGCCAGTATT TGGAGCCTGA CCAAGAAGTT GTTACCATAC CAGATCTTGG AAGTTTGTCT 3900 3901 TCACCACTGA TTGACACAGA GAGGAACCTG GGCCTACTTC TTGGATTGCA TGCTTCTTAT 3960 3961 TTAGCAATGA GCACACCACT ATCTCCTGTG GAGGTTGAAT GCGCCAAGTG GCTGCAGTCA 4020 4021 TCGATTTTTT CTGGGGGTCT GCAGACCAGC CAGATTCACT ACAGCTACAA TGAAGAGAAA 4080 4081 GATGAGGACC ACTGTAGCTC TCCTGGGGGC ACACCCATCA GCAAGTCCCG ACTGTGTTCT 4140 4141 CACAGATGGG CCTTGGGTGA TCATTCTCAG GCTTTCCTGC AAGCCATTGC AGACAATAAC 4200 4201 ATCCAGGACT ACAATGTGAA GGACTTTCTG TGTCAAATTG AAAGGTACTG TAGACAGTGC 4260 4261 CACTTGACTA CACCCATCAC GTTTCCTCCT GAGCATCCTG TGGAAGAGGT TGGCCGTCTT 4320 4321 CTGTTATGCT GCCTCTTAAA ACATGAAGAT TTAGGTCATG TGGCATTATC TTTAGTTCAT 4380 4381 GTAGGTACAC TTGGTATTGA GCAAGTAAAG CACAGAACAT TGCCTAAATC TGTTGTGGAT 4440 4441 GTGTGTAGAG TTGTCTACCA AGCAAAATGC TCACTCATTA AGACTCATCA AGAACAGGGC 4500 4501 CGTTCCTACA AGGAGGTGTG TGCTCCTGTC ATTGAACGCT TGAGATTTCT CTTCAATGAA 4560 4561 TTGAGGCCTG CTGTTTGTAG TGACCTCTCT ATAATGTCTA AGTTTAAGCT GTTAGGTTCA 4620 4621 TTACCACGTT GGAGGAGGAT AGCTCAGAAG ATCATTCGAG AAAGAAGGAA AAAGAGAGTT 4680 4681 CCTAAGAAAC CAGAATCTAT TGATAGTGAA GAAAAAATTG GAAATGAAGA GAGTGATTTA 4740 4741 GAAGAAGCTT GTGTTTTGCC TCATAGTCCT ATAAATGTAG ACAAAAGACC TATTTCTATG 4800 4801 AAGTCTCCCA AGGATAAATG GCAACCATTG TTGAATACAG TTACAGGGGT TCATAAATAT 4860 4861 AAATGGTTAA AGCAAAATGT ACAGGGCCTT TACCCACAGT CTGCACTCCT CAATACAATT 4920 4921 GTTGAATTTG CCCTTAAAGA AGAGCCAGTA GATGTGGAAA AAATGCGGAA GTGCCTCCTA 4980 4981 AAACAGTTGG AAAGAGCAGA GGTTCGTCTG GAAGGGATAG ATACAATTCT GAAACTGGCA 5040 5041 GCCAAAAGTT TTTTACTTCC TTCTGTGCAG TATGCTATGT TTTGTGGATG GCAAAGACTT 5100 5101 ATTCCTGAAG GAATTGATAT AGGGGAACCA CTTACAGACT GTTTGAGGGA TGTGGATTTG 5160 5161 ATCCCACCAT TTAATCGGAT GCTGCTGGAA GTGACTTTTG GCAAGCTGTA TGCTTGGGCA 5220 5221 GTTCAGAATA TTCGATCTGT TCTGATGGAT GCAAGTGCCA GGTTTAAAGA GCTGGGTATC 5280 5281 CAGCCTGTTC CCCTACAAAC CATTACCAAT GAGAACCCGG CGGGACCAAG TCTGGGTACC 5340 5341 ATCCCACAAG CTCGATTTCT TCTGGTGATG CTCAGCATGC TCACCCTGCA GCATGGTGCA 5400 5401 AACAACCTTG ACCTCCTGCT CAACTCAGGC ACGCTGGCAC TCACTCAGAC AGCCCTAAGG 5460 5461 CTGATCGGTC CTACTTGTGA CAGTGTTGAG GACGACATGA ATGCGTCTGC CCGGGGAGCC 5520 5521 TCTGCTACAG TGTTGGAAGA AACAAGGAAG GAAACGGCTC CTGTTCAGCT CCCTGTTTCA 5580 5581 GGGCCAGAAC TGGCTGCTAT GATGAAGATT GGAACCAGGG TCATGAGAGG GGTGGATTGG 5640 5641 AAGTGGGGTG ACCAGGATGG GCCTCCCCCA GGTCTAGGCC GGGTAATTGG TGAGCTGGGC 5700 5701 GAGGATGGCT GGATCAGGGT CCAGTGGGAC ACAGGCAGCA CTAACTCATA CAGGATGGGA 5760 5761 AAAGAAGGAA AATATGACCT CAAGCTGGTG GAGCTGCCTG TGTCATCACA GCCCTCTGCA 5820 5821 GAGGATTCAG ACACAGAGGA TGACTCCGAA GCAGAGCAAG GTGAGAGGAA TATTCACCCC 5880 5881 ACAGCGATGA TGCTGACCAG CGTTATTAAC TTACTGCAAA CCCTGTGTCT CTCTGTTGGA 5940 5941 GTCCATGCTG ACATCATGCA GAGCGAGGCC ACCAAGACTT TGTGTGGACT ACTTCGAATG 6000 6001 TTAGTAGAAA GTGGAACAAC AGATAAGCCA GCTCCTCCAG ACAGACTGGT GGCCCGGGAG 6060 6061 CAGCACCGGA GCTGGTGTAC ACTGGGCTTC GTCCGCAGCA TTGCCCTCAC CCCACAGGCC 6120 6121 TGTGGTGCTC TCAGCTCCCC AAGGTGGATC ACACTGCTCA TGAAGGTTGT GGAAGGCCAC 6180 6181 GCACCGTTCA CTGCTGCCTC CCTGCAGAGG CAGATCTTAG CTGTTCATTT GCTACAAGCA 6240 6241 GTTCTTCCAT CATGGGACAA GACAGAAAGG GCACGGGACA TGAAGTGTCT TGTGGAAAAG 6300 6301 CTGTTTGGCT TCTTGGGCAG TTTGCTTACT ACTTGTTCTT CTGATGTTCC GTTACTTAGA 6360 6361 GAATCCACGT TGAGAAAGCG GAGGGCCCGG CCCCAGGCTT CCTTGACTGC CACCCACAGC 6420 6421 AGCACGCTGG CTGAAGAAGT GGTGGGGCTG CTCCGCACTC TGCACTCCCT GACCCAGTGG 6480 6481 AATGGCCTCA TCAACAAATA CATTAACTCC CAGCTTTGCT CTGTCACACA AAGCTATGCC 6540 6541 GGAAAGACAT CAGAAAGGGC CCAGCTAGAA GATTACTTCC CAGACTCTGA GAACCTTGAA 6600 6601 GTGGGAGGCC TCATGGCAGT CCTCGCTGTC ATTGGAGGGA TTGATGGCCG TCTGCGCCTG 6660 6661 GGAGGCCAGG TCATGCATGA TGAGTTTGGA GAGGGCACAG TGACTCGCAT CACTCCGAAG 6720 6721 GGCAGAATCA CCGTGCAGTT CTGTGACATG CGGATGTGCC GAGTTTGCCC ACTGAATCAG 6780 6781 CTCAAACCGC TCCCTGCTGT GGCCTTTAGT GTGAACAATT TGCCTTTCAC GGAGCCTATG 6840 6841 CTTTCTGTCT GGGCTGAGTT GGTGAATCTT GCTGGAAGCA AATTGGAAAA GCACAAAACA 6900 6901 AAGAAGTCAG CAAAACCGGC CTTTGCAGGG CAAGTAGACT TGGACCTGCT GCGGAGCCAG 6960 6961 CAGTTAAAGC TGTACATCCT GAAAGCAGGG CGGGCACTGC TTTCCCACCA AGACAAGCTG 7020 7021 AGGCAGATAC TATCTCAGCC CGCGGTTCAG GGGACTGGAA CTCTTCAGAC AGATGATGGA 7080 7081 GCAGCTGCAT CCCCTGACCT TGGGGATATG TCACCTGAAG GGCCACAGCC TCCCATGATC 7140 7141 CTCCTTCAGC AGTTACTGTC CTCAGCCACT CAGCCATCTC CTGTGAAGGC AATATTTGAT 7200 7201 AAACAGGAAC TTGAGGCTGC TGCTTTGGCT CTCTGTCAGT GTTTGGCTGT GGAATCCACA 7260 7261 CATCCTTCAA GTCCAGGGTG TGAAGACTGC AGCTCCAGTG AGGCCACAAC CCCAGTCTCT 7320 7321 GTACAGCACA TCCATCTTGC CAGGGCGAAG AAGCGCAGAC AGTCCCCAGC CCCTGCTCTG 7380 7381 CCTATTGTAG TGCAGCTCAT GGAAATGGGA TTTCCAAGGA AAAATATAGA GTTTGCTCTG 7440 7441 AAGTCGCTCA CCGGCACCTC TGGGAATGCA TCTGGTTTGC CTGGTGTGGA AGCCTTGGTT 7500 7501 GGGTGGCTGC TAGACCACTC TGATGTCCAG GTTACAGAGT TCTCGGATGC AGAAACACTA 7560 7561 TCTGACGAGT ATTCAGACGA GGAGGTGGTA GAAGATGTGG ATGATACCCC TTACCCTGTG 7620 7621 GCTGCGGGTG CTGTTGTGAC TGAGAGCCAG ACTTACAAGA AGCGTGCTGA TTTCCTGAGC 7680 7681 AATGATGATT ATGCCGTGTA TGTGAGAGAG AACGTTCAGG TGGGAATGAT GGTGAGATGC 7740 7741 TGTCGAACAT ATGAAGAAGT ATGTGAAGGT GATGTGGGCA AAGTCATCAA ACTGGATAGA 7800 7801 GATGGATTAC ATGACCTTAA TGTTCAGTGT GATTGGCAGC AGAAAGGCGG CACTTACTGG 7860 7861 GTTAGGTATA TTCATGTTGA ACTTATAGGC TATCCTCCAC CAAGTTCTTC TTCTCACATC 7920 7921 AAGATCGGTG ATAAAGTACG GGTCAAAGCT TCTGTTACCA CACCAAAGTA CAAGTGGGGA 7980 7981 TCCGTCACTC ATCAAAGTGT GGGGCTTGTG AAAGCTTTCA GTGCCAATGG GAAAGACATC 8040 8041 ATTGTGGACT TCCCCCAGCA GTCCCACTGG ACTGGGTTGC TGTCAGAAAT GGAGTTGGTG 8100 8101 CCTAGTATTC ATCCAGGGGT TACGTGTGAC GGCTGTCAGA CTTTTCCTAT CAATGGATCC 8160 8161 AGATTCAAAT GTAGAAACTG TGATGACTTT GATTTTTGTG AAACATGTTT CAAGACAAAA 8220 8221 AAACACAACA CCAGGCATAC TTTCGGCAGG ATTAATGAGC CAGGGCAGTC TGCAGTGTTT 8280 8281 TGTGGCCGTT CTGGAAAGCA GCTGAAACGA TGTCACAGCA GTCAGCCTGG AATGCTCCTG 8340 8341 GACAGCTGGT CCCGAATGGT GAAAAGCCTG AACGTTTCAT CCTCTGTAAA CCAGGCTTCT 8400 8401 CGTCTCATTG ATGGCAGTGA GCCCTGCTGG CAGTCATCTG GGTCACAAGG AAAGCATTGG 8460 8461 ATTCGTTTGG AGATTTTTCC AGATGTTCTT GTTCATAGAT TAAAAATGAT CGTGGATCCA 8520 8521 GCGGACAGTA GCTACATGCC ATCCCTGGTC GTGGTGTCAG GTGGAAATTC CTTAAATAAC 8580 8581 CTAATTGAAC TAAAGACAAT TAATATCAAC CAAACTGACA CCACTGTACC CCTGCTAAGT 8640 8641 GACTGTGCAG AGTATCACAG GTACATTGAG ATTGCAATAA AACAGTGCAG GAGCTCAGGA 8700 8701 ATCGACTGCA AAATCCACGG TCTCATCCTG CTTGGCCGCA TCCGTGCTGA AGAGGAGGAC 8760 8761 CTGGCTGCAG TTCCTTTTTT GGCTTCAGAC AATGAGGAGG AGGAGGATGA TAAGGGCAGC 8820 8821 ACTGGGAGTC TTATTAGAAA GAAGACTCCA GGGTTGGAAT CAACAGCTAC AATAAGAACC 8880 8881 AAAGTGTTTG TGTGGGGACT GAATGACAAG GACCAGCTTG GTGGCCTTAA AGGCTCGAAG 8940 8941 ATAAAGGTTC CTTCATTCTC TGAGACTCTG TCTGCCCTGA ATGTGGTTCA GGTGGCTGGT 9000 9001 GGTTCTAAGA GCTTGTTTGC AGTGACTGTG GAAGGGAAGG TGTATTCTTG TGGAGAAGCC 9060 9061 ACAAATGGCC GCCTAGGTCT GGGTATGTCC AGTGGGACGG TTCCTATTCC TCGGCAGATT 9120 9121 ACAGCCCTCA GCAGCTACGT GGTCAAGAAG GTAGCAGTTC ACTCAGGTGG CCGGCATGCT 9180 9181 ACAGCCTTAA CCGTGGATGG GAAGGTGTTT TCATGGGGTG AAGGTGATGA TGGGAAACTT 9240 9241 GGACACTTCA GCAGAATGAA CTGTGACAAG CCAAGGCTCA TAGAGGCTCT GAAAACCAAG 9300 9301 CGTATCAGGG ACATAGCCTG CGGGAGTTCT CATAGTGCAG CCCTGACATC CAGTGGGGAA 9360 9361 CTGTACACTT GGGGCCTTGG AGAGTATGGC CGATTGGGAC ATGGGGATAA TACAACACAG 9420 9421 CTGAAGCCCA AAATGGTAAA AGTCCTTCTT GGTCACCGAG TAATCCAGGT TGCGTGTGGA 9480 9481 AGCAGAGATG CACAGACACT GGCTCTGACC GATGAAGGTT TGGTGTTTTC TTGGGGTGAT 9540 9541 GGTGATTTTG GAAAATTGGG CCGAGGAGGA AGTGAAGGCT GTAATATTCC CCAGAACATT 9600 9601 GAGAGACTAA ATGGCCAGGG GGTGTGCCAG ATTGAGTGTG GAGCTCAGTT TTCCTTGGCA 9660 9661 CTCACCAAGT CAGGAGTGGT GTGGACGTGG GGGAAGGGCG ACTACTTCAG ACTGGGCCAT 9720 9721 GGCTCTGATG TCCATGTGCG CAAGCCACAG GTGGTGGAAG GACTGAGAGG GAAAAAGATT 9780 9781 GTGCATGTGG CAGTTGGAGC CCTGCATTGC CTAGCAGTCA CAGACTCAGG GCAGGTGTAT 9840 9841 GCTTGGGGGG ACAATGACCA TGGCCAACAA GGCAATGGCA CAACCACAGT TAACAGAAAG 9900 9901 CCAACCCTTG TGCAAGGCCT GGAAGGCCAG AAGATCACGC GAGTGGCTTG TGGGTCATCA 9960 9961 CATAGTGTAG CATGGACAAC AGTGGATGTA GCTACACCAT CTGTCCATGA GCCTGTCCTC 10020 10021 TTCCAGACTG CAAGAGACCC GCTAGGTGCT TCTTACTTAG GTGTCCCTTC GGATGCTGAT 10080 10081 TCTTCTTCTT CCAGCAATAA GATTAGTGGT GCAAATAACT GTAAGCCTAA TCGCCCTTCT 10140 10141 CTTGCCAAGA TCCTCCTGTC ACTGGAAGGA AACCTGGCCA AGCAGCAGGC CTTATCTCAC 10200 10201 ATCCTTACAG CACTGCAGAT CATGTATGCC AGAGATGCAG TGGTTGGAGC CTTGATGCCA 10260 10261 GCCGGCATGC TCGCCCCAGT GGAATGCCCC TCATTCTCCT CGTCAGCGCC TGCCTCTGAT 10320 10321 GTGTCAGCCA TGGCCAGTCC CATGCATATG GAGGACAGCA CGCTAGCTGC TGACCTTGAG 10380 10381 GACAGACTGA GTCCAAATCT GTGGCAAGAA AAGAGAGAGA TCGTCTCCTC TGAGGATGCT 10440 10441 GTGACCCCCT CTGCAGTGAC ACCGTCTGCT CCCTCAGCCT CCTCTCGCCC ATTTATCCCA 10500 10501 GTGACAGATG ACCCAGGAGC TGCCAGCATC ATTGCAGAAA CCATGACTAA AACCAAAGAG 10560 10561 GATGTTGAAA GCCAAAATAA AACATCAGGC CCAGAGCCCC AGTCCTTGGA TGAGTTCACC 10620 10621 AGTCTGCTGA TCCCTGATGA CACCCGGGTG GTGGTGGAGC TGCTCAAGCT GTCAGTGTGC 10680 10681 AGCAGGGCTG GAGACAAGGG CAGGGAAGTG CTTTCTGCTG TGCTGTCCGG CATGGGCACT 10740 10741 GCCTACCCAC AGGTGGCAGA CATGCTGTTG GAGCTCTGTG TCACCGAGTT AGAAGATGTG 10800 10801 GCCACAGACT CACAGAGTGG CCGTCTTTCC TCCCAGCCTG TGGTGGTAGA GAGTAGCCAT 10860 10861 CCCTATACAG ACGACACCTC CACCAGTGGT ACAGTGAAGA TACCAGGTGC AGAAGGACTC 10920 10921 AGGGTGGAGT TTGACCGGCA GTGCTCCACA GAGAGGCGCC ACGATCCTCT CACTGTCATG 10980 10981 GATGGTGTCA ACAGGATTGT GTCGGTGCGG TCAGGTCGAG AATGGTCTGA CTGGTCGAGT 11040 11041 GAGCTTCGCA TCCCTGGAGA TGAGCTGAAA TGGAAGTTCA TCAGTGATGG GTCTGTGAAT 11100 11101 GGGTGGGGCT GGCGCTTCAC AGTTTATCCC ATCATGCCAG CTGCAGGCCC TAAGGACCTC 11160 11161 CTCTCGGATC GCTGTGTCCT CTCATGTCCA TCCATGGACC TGGTGACCTG TCTCTTGGAC 11220 11221 TTCCGACTCA ATCTCACCTC CAACAGAAGC ATCGTCCCTC GCCTTGCTGC CTCTCTGGCT 11280 11281 GCGTGTGCTC AGCTGAGCGC CCTAGCTGCC AGCCACAGAA TGTGGGCTCT TCAGAGACTG 11340 11341 AGGAGACTGC TCACCACAGA ATTTGGGCAG TCAATTAACA TAAACAGGCT GCTGGGAGAA 11400 11401 AATGATGGGG AATCCAGAGC TTTGAGTTTT ACAGGCAGTG CCCTTGCTGC TTTGGTGAAA 11460 11461 GGCCTTCCAG AAGCTTTGCA GAGGCAGTTT GAGTATGAAG ATCCTATTGT GAGAGGTGGC 11520 11521 AAACAGCTGC TCCATAGTCC ATTCTTTAAG GTGCTAGTAG CTCTTGCTTG TGACCTGGAA 11580 11581 TTAGACACAC TCCCTTGCTG CGCAGAGACC CATAAATGGG CTTGGTTCCG AAGATACTGC 11640 11641 ATGGCCTCCC GAGTTGCTGT GGCCCTGGAC AAAAGAACAC CATTGCCTCG TCTGTTTCTT 11700 11701 GATGAGGTTG CTAAGAAGAT TCGTGAATTA ATGGCAGACA GTGAAAGTAT GGATGTTCTC 11760 11761 CATGAAAGCC ACAGCATCTT TAAAAGAGAG CAAGATGAGC AACTTGTGCA GTGGATGAAC 11820 11821 AGGCGACCAG ATGACTGGAC TCTTTCTGCT GGTGGTAGTG GAACCATTTA TGGGTGGGGG 11880 11881 CATAATCATA GGGGACAGCT TGGAGGAATT GAAGGTGCAA AAGTAAAAGT TCCCACTCCA 11940 11941 TGTGAAGCAC TTGCGACTCT CAGACCTGTG CAGTTAATTG GAGGAGAGCA GACACTATTT 12000 12001 GCTGTGACAG CTGATGGGAA GCTCTATGCT ACTGGCTATG GTGCTGGTGG AAGGCTGGGG 12060 12061 ATTGGAGGGA CAGAGTCAGT GTCTACTCCA ACGTTACTGG AGTCCATTCA GCATGTGTTC 12120 12121 ATTAAGAAAG TTGCTGTGAA CTCAGGAGGA AAGCATTGCC TTGCTCTCTC TTCAGAAGGC 12180 12181 GAAGTTTACT CTTGGGGCGA GGCAGAAGAT GGGAAGTTGG GACATGGCAA CAGAAGTCCA 12240 12241 TGTGACCGCC CTCGTGTCAT TGAATCCCTG AGAGGAATTG AAGTGGTTGA TGTTGCTGCT 12300 12301 GGTGGAGCCC ATAGTGCCTG TGTCACAGCA GCTGGAGACC TCTACACATG GGGCAAAGGC 12360 12361 CGATATGGCC GCCTGGGGCA CAGTGACAGT GAGGACCAGT TAAAGCCAAA ACTGGTGGAA 12420 12421 GCACTACAGG GCCATCGTGT CATAGACATT GCGTGTGGCA GTGGGGATGC CCAGACACTG 12480 12481 TGCCTCACTG ATGACGATAC TGTCTGGTCC TGGGGAGATG GAGACTATGG CAAGCTTGGC 12540 12541 AGAGGAGGCA GTGATGGTTG CAAAGTGCCC ATGAAGATTG ATTCTCTCAC TGGCCTTGGA 12600 12601 GTAGTTAAAG TGGAATGTGG ATCTCAGTTT TCTGTCGCCC TTACCAAGTC TGGAGCTGTT 12660 12661 TATACATGGG GTAAAGGTGA TTATCACCGG CTGGGCCATG GATCCGATGA CCACGTTCGA 12720 12721 AGACCTCGGC AGGTTCAAGG ACTGCAGGGG AAGAAAGTCA TTGCTATTGC CACTGGCTCC 12780 12781 CTGCACTGTG TGTGTTGCAC AGAGGATGGT GAAGTTTACA CATGGGGTGA CAATGATGAG 12840 12841 GGGCAGCTAG GAGATGGAAC AACAAATGCT ATTCAGAGAC CACGATTGGT AGCTGCCCTT 12900 12901 CAGGGTAAGA AGGTTAATCG AGTGGCCTGT GGTTCTGCAC ACACCCTTGC CTGGTCAACC 12960 12961 AGCAAGCCAG CGAGCGCCGG CAAGCTCCCT GCACAGGTAC CCATGGAGTA CAACCATTTA 13020 13021 CAAGAGATCC CCATCATTGC TCTACGGAAC CGACTGCTGC TGCTGCATCA CATCTCAGAG 13080 13081 CTCTTTTGTC CCTGCATCCC CATGTTTGAC CTAGAGGGCT CTCTTGATGA AACTGGACTC 13140 13141 GGGCCTTCAG TTGGGTTTGA CACACTCCGT GGGATCTTAA TATCTCAGGG CAAGGAAGCA 13200 13201 GCTTTCAGGA AGGTAGTACA AGCAACTATG GTGCGAGATC GCCAACATGG GCCTGTGGTG 13260 13261 GAGCTGAACC GCATCCAGGT TAAACGCTCC CGGAGTAAAG GTGGGTTAGC TGGCCCTGAT 13320 13321 GGCACCAAGT CTGTCTTTGG GCAGATGTGT GCTAAGATGA GTTCCTTCAG TCCTGACAGC 13380 13381 CTTCTCCTTC CTCACCGAGT CTGGAAAGTC AAGTTTGTGG GTGAATCCGT GGACGACTGT 13440 13441 GGTGGTGGCT ACAGTGAGTC TATAGCAGAG ATTTGTGAGG AGCTGCAAAA CGGACTCACA 13500 13501 CCTCTTCTGA TTGTGACTCC CAATGGCAGG GATGAGTCTG GTGCCAACAG AGACTGTTAC 13560 13561 CTATTAAACC CTGCCACCCG TGCACCCGTG CACTGCAGCA TGTTCCGATT CCTAGGGGTG 13620 13621 TTATTGGGCA TTGCCATCCG AACTGGGAGT CCACTAAGTT TGAATCTTGC TGAACCTGTA 13680 13681 TGGAAGCAGC TGGCTGGGAT GAGCCTCACC ATTGCAGACC TGAGTGAGGT AGATAAAGAT 13740 13741 TTTATTCCTG GGCTTATGTA CATTCGTGAC AATGAAGCCA CTTCAGAGGA ATTTGAGGCT 13800 13801 ATGAGCCTGC CCTTCACCGT GCCAAGTGCC AGTGGCCAGG ATATCCAGCT GAGCTCTAAG 13860 13861 CACACGCACA TCACCCTGGA CAACCGTGCA GAATATGTCC GGCTGGCAAT AAACTACAGA 13920 13921 CTCCATGAAT TTGATGAACA AGTAGCTGCT GTCCGGGAAG GGATGGCTCG AGTTGTACCT 13980 13981 GTACCCCTTC TCTCTCTTTT CACTGGATAC GAACTGGAGA CAATGGTCTG TGGGAGCCCA 14040 14041 GATATCCCAC TGCATCTTTT AAAGTCAGTG GCAACGTATA AAGGGATTGA GCCTTCTGCA 14100 14101 TCCTTGGTCC AGTGGTTCTG GGAAGTGATG GAATCTTTCT CTAACACGGA ACGCTCCCTC 14160 14161 TTTCTTCGCT TTGTGTGGGG CAGGACAAGG CTGCCCAGGA CCATTGCTGA CTTCCGGGGC 14220 14221 AGAGACTTTG TCATCCAGGT CTTGGATAAA TACAACCCTC CTGATCACTT CCTCCCAGAA 14280 14281 TCCTACACAT GCTTCTTTCT GCTCAAGCTG CCCAGGTATT CCTGTAAGCA GGTGCTTGAG 14340 14341 GAAAAGCTGA AGTATGCCAT CCACTTTTGC AAATCCATAG ACACAGATGA CTATGCTCGC 14400 14401 ATAGCTCTCA CTGGAGAGCC CGCTGCCGAT GACAGCAGTG AGGACTCAGA TAATGAGGAC 14460 14461 GCCGACTCTT TTGCTTCAGA CTCTACACAA GACTACTTGA CAGGACACTA A 14511 |
▼ KEYWORD
ID | Family |
Alternative splicing | |
Coiled coil | |
Complete proteome | |
Cytoplasm | |
Cytoskeleton | |
DNA damage | |
DNA repair | |
Metal-binding | |
Nucleus | |
Phosphoprotein | |
Reference proteome | |
Repeat | |
Transferase | |
Ubl conjugation | |
Ubl conjugation pathway | |
Zinc | |
Zinc-finger
|
▼ GENE ONTOLOGY
ID | Classification | Description |
Cellular Component | Centriole | |
Cellular Component | Cytoplasm | |
Cellular Component | Mitochondrial inner membrane | |
Cellular Component | Nucleus | |
Molecular Function | SUMO binding | |
Molecular Function | Ubiquitin protein ligase binding | |
Molecular Function | Ubiquitin-protein transferase activity | |
Molecular Function | Zinc ion binding | |
Biological Process | Cellular response to DNA damage stimulus | |
Biological Process | DNA repair | |
Biological Process | Proteasome-mediated ubiquitin-dependent protein catabolic process | |
Biological Process | Protein ubiquitination | |
Biological Process | Spermatogenesis |
▼ ANNOTATION
Mutation | ||||
CGAP |
Physicochemical | ||||
Compute pI/Mw | AAindex |
Localization | ||||
COMPARTMENTS |
Expression | ||||
ArrayExpress | GXD | TISSUES |
▼ ORTHOLOGY
DrLLPS ID | Organism | Identity | E-value | Score |
---|---|---|---|---|
LLPS-Ran-0639 | Rattus norvegicus | 99.15 | 0.0 | 8934 |
LLPS-Dio-2822 | Dipodomys ordii | 98.1 | 0.0 | 3544 |
LLPS-Gog-3700 | Gorilla gorilla | 96.87 | 0.0 | 4267 |
LLPS-Mea-3435 | Mesocricetus auratus | 96.68 | 0.0 | 5565 |
LLPS-Ict-1714 | Ictidomys tridecemlineatus | 96.24 | 0.0 | 8705 |
LLPS-Pat-3694 | Pan troglodytes | 96.22 | 0.0 | 800 |
LLPS-Caj-0591 | Callithrix jacchus | 96.13 | 0.0 | 4272 |
LLPS-Paa-2386 | Papio anubis | 95.55 | 0.0 | 803 |
LLPS-Hos-2476 | Homo sapiens | 95.51 | 0.0 | 8647 |
LLPS-Chs-3991 | Chlorocebus sabaeus | 95.43 | 0.0 | 8629 |
LLPS-Mam-3420 | Macaca mulatta | 95.43 | 0.0 | 8633 |
LLPS-Man-3839 | Macaca nemestrina | 95.33 | 0.0 | 8621 |
LLPS-Orc-1795 | Oryctolagus cuniculus | 95.28 | 0.0 | 8615 |
LLPS-Caf-1768 | Canis familiaris | 95.18 | 0.0 | 8603 |
LLPS-Maf-0867 | Macaca fascicularis | 95.14 | 0.0 | 8592 |
LLPS-Aim-4081 | Ailuropoda melanoleuca | 95.11 | 0.0 | 7025 |
LLPS-Aon-2485 | Aotus nancymaae | 95.1 | 0.0 | 8598 |
LLPS-Mup-0844 | Mustela putorius furo | 95.0 | 0.0 | 8567 |
LLPS-Fec-1448 | Felis catus | 94.96 | 0.0 | 8586 |
LLPS-Eqc-1335 | Equus caballus | 94.91 | 0.0 | 8595 |
LLPS-Fud-1290 | Fukomys damarensis | 94.87 | 0.0 | 8607 |
LLPS-Otg-1496 | Otolemur garnettii | 94.78 | 0.0 | 8519 |
LLPS-Sus-2816 | Sus scrofa | 94.58 | 0.0 | 8596 |
LLPS-Cas-4044 | Carlito syrichta | 94.54 | 0.0 | 8521 |
LLPS-Bot-0470 | Bos taurus | 94.48 | 0.0 | 8563 |
LLPS-Ova-2197 | Ovis aries | 94.4 | 0.0 | 8564 |
LLPS-Cap-0412 | Cavia porcellus | 94.15 | 0.0 | 5665 |
LLPS-Cea-1825 | Cercocebus atys | 94.13 | 0.0 | 8494 |
LLPS-Loa-4159 | Loxodonta africana | 93.99 | 0.0 | 8515 |
LLPS-Mal-1896 | Mandrillus leucophaeus | 93.2 | 0.0 | 8418 |
LLPS-Pap-2302 | Pan paniscus | 92.84 | 0.0 | 8066 |
LLPS-Poa-4150 | Pongo abelii | 91.88 | 0.0 | 2585 |
LLPS-Pes-1161 | Pelodiscus sinensis | 90.55 | 0.0 | 7886 |
LLPS-Anp-1044 | Anas platyrhynchos | 90.2 | 0.0 | 8218 |
LLPS-Gaga-3563 | Gallus gallus | 90.08 | 0.0 | 8217 |
LLPS-Fia-0791 | Ficedula albicollis | 90.02 | 0.0 | 8202 |
LLPS-Tag-2564 | Taeniopygia guttata | 89.94 | 0.0 | 8193 |
LLPS-Rhb-3595 | Rhinopithecus bieti | 89.93 | 0.0 | 747 |
LLPS-Anc-2849 | Anolis carolinensis | 89.82 | 0.0 | 8213 |
LLPS-Myl-3228 | Myotis lucifugus | 88.69 | 0.0 | 4303 |
LLPS-Mod-2291 | Monodelphis domestica | 88.63 | 0.0 | 8012 |
LLPS-Lac-3583 | Latimeria chalumnae | 86.97 | 0.0 | 3334 |
LLPS-Xet-0865 | Xenopus tropicalis | 85.33 | 0.0 | 7751 |
LLPS-Ora-1080 | Ornithorhynchus anatinus | 84.59 | 0.0 | 1233 |
LLPS-Leo-2734 | Lepisosteus oculatus | 84.32 | 0.0 | 7623 |
LLPS-Asm-0360 | Astyanax mexicanus | 83.7 | 0.0 | 7389 |
LLPS-Scf-3381 | Scleropages formosus | 83.39 | 0.0 | 7325 |
LLPS-Scm-1087 | Scophthalmus maximus | 83.14 | 0.0 | 7356 |
LLPS-Orn-0379 | Oreochromis niloticus | 82.54 | 0.0 | 7442 |
LLPS-Tar-3910 | Takifugu rubripes | 82.03 | 0.0 | 7380 |
LLPS-Dar-3953 | Danio rerio | 81.88 | 0.0 | 7382 |
LLPS-Xim-1748 | Xiphophorus maculatus | 81.22 | 0.0 | 7110 |
LLPS-Icp-1669 | Ictalurus punctatus | 80.8 | 0.0 | 2769 |
LLPS-Orl-2562 | Oryzias latipes | 80.51 | 0.0 | 7049 |
LLPS-Pof-3452 | Poecilia formosa | 79.95 | 0.0 | 6016 |
LLPS-Sah-0909 | Sarcophilus harrisii | 79.9 | 0.0 | 6804 |
LLPS-Gaa-3910 | Gasterosteus aculeatus | 71.62 | 0.0 | 1029 |
LLPS-Orm-1818 | Oryza meridionalis | 39.23 | 2e-12 | 77.0 |
LLPS-Mae-1975 | Manihot esculenta | 38.92 | 1e-23 | 112 |
LLPS-Sol-1716 | Solanum lycopersicum | 38.83 | 6e-38 | 162 |
LLPS-Dac-0973 | Daucus carota | 38.8 | 6e-22 | 106 |
LLPS-Thc-2288 | Theobroma cacao | 38.77 | 5e-28 | 126 |
LLPS-Nia-0353 | Nicotiana attenuata | 38.67 | 1e-38 | 164 |
LLPS-Cus-2234 | Cucumis sativus | 38.64 | 4e-20 | 100 |
LLPS-Art-2726 | Arabidopsis thaliana | 38.5 | 2e-28 | 127 |
LLPS-Drm-2256 | Drosophila melanogaster | 38.36 | 6e-22 | 109 |
LLPS-Zem-2489 | Zea mays | 38.33 | 5e-12 | 75.9 |
LLPS-Sob-2022 | Sorghum bicolor | 38.33 | 6e-12 | 75.9 |
LLPS-Prp-1112 | Prunus persica | 38.15 | 3e-20 | 101 |
LLPS-Amt-1412 | Amborella trichopoda | 38.01 | 1e-21 | 108 |
LLPS-Viv-0380 | Vitis vinifera | 37.89 | 2e-31 | 137 |
LLPS-Orbr-1117 | Oryza brachyantha | 37.82 | 8e-38 | 161 |
LLPS-Brd-1375 | Brachypodium distachyon | 37.79 | 5e-18 | 94.7 |
LLPS-Nol-4381 | Nomascus leucogenys | 37.74 | 6e-36 | 155 |
LLPS-Meg-0260 | Meleagris gallopavo | 37.74 | 3e-35 | 152 |
LLPS-Urm-2134 | Ursus maritimus | 37.74 | 2e-35 | 153 |
LLPS-Vir-1610 | Vigna radiata | 37.62 | 2e-34 | 150 |
LLPS-Sei-0110 | Setaria italica | 37.5 | 8e-12 | 75.5 |
LLPS-Tra-2082 | Triticum aestivum | 37.44 | 5e-30 | 132 |
LLPS-Gas-1265 | Galdieria sulphuraria | 37.4 | 3e-16 | 89.0 |
LLPS-Coc-1146 | Corchorus capsularis | 37.39 | 8e-27 | 122 |
LLPS-Lep-1041 | Leersia perrieri | 37.37 | 8e-39 | 159 |
LLPS-Phv-1003 | Phaseolus vulgaris | 37.29 | 1e-32 | 145 |
LLPS-Mua-2482 | Musa acuminata | 37.28 | 1e-30 | 134 |
LLPS-Bro-2902 | Brassica oleracea | 37.17 | 3e-25 | 117 |
LLPS-Brr-2023 | Brassica rapa | 37.17 | 1e-25 | 118 |
LLPS-Orp-1921 | Oryza punctata | 37.12 | 2e-10 | 70.5 |
LLPS-Via-1846 | Vigna angularis | 37.11 | 5e-41 | 166 |
LLPS-Glm-1099 | Glycine max | 37.11 | 9e-42 | 169 |
LLPS-Hov-0479 | Hordeum vulgare | 37.0 | 8e-30 | 131 |
LLPS-Arl-1629 | Arabidopsis lyrata | 36.95 | 2e-38 | 158 |
LLPS-Brn-1463 | Brassica napus | 36.77 | 5e-21 | 106 |
LLPS-Orgl-1973 | Oryza glumaepatula | 36.61 | 3e-40 | 164 |
LLPS-Orni-2312 | Oryza nivara | 36.61 | 3e-40 | 164 |
LLPS-Orb-0597 | Oryza barthii | 36.61 | 4e-40 | 163 |
LLPS-Org-1310 | Oryza glaberrima | 36.61 | 4e-40 | 163 |
LLPS-Tru-1457 | Triticum urartu | 36.54 | 1e-16 | 92.4 |
LLPS-Ten-1994 | Tetraodon nigroviridis | 36.4 | 1e-35 | 154 |
LLPS-Hea-1521 | Helianthus annuus | 36.33 | 2e-40 | 164 |
LLPS-Ori-1213 | Oryza indica | 36.27 | 8e-40 | 162 |
LLPS-Orr-1091 | Oryza rufipogon | 36.27 | 8e-40 | 162 |
LLPS-Ors-0355 | Oryza sativa | 36.27 | 8e-40 | 162 |
LLPS-Met-2113 | Medicago truncatula | 36.23 | 2e-36 | 152 |
LLPS-Gor-0345 | Gossypium raimondii | 36.05 | 9e-29 | 128 |
LLPS-Pot-1563 | Populus trichocarpa | 35.96 | 2e-27 | 124 |
LLPS-Php-1416 | Physcomitrella patens | 35.65 | 7e-34 | 148 |