LLPS-Caf-2080
UBR4
Integrated Annotations
▼ OVERVIEW
Status: | Unreviewed |
Protein Name: | Ubiquitin protein ligase E3 component n-recognin 4 |
Gene Name: | UBR4 |
Ensembl Gene: | ENSCAFG00000015314.4 |
Ensembl Protein: | ENSCAFP00000022600.4 |
Organism: | Canis familiaris |
Taxa ID: | 9615 |
LLPS Type: | Others |
▼ PROPERTY
▼ Classification
Condensates:
Condensate | Evidence | Orthologs |
---|---|---|
Nucleolus, Postsynaptic density | Predicted from orthologs | (View) |
▼ CROSS REFERENCE
Database | Nucleotide ID | Protein ID |
---|---|---|
Ensembl | ENSCAFT00000024347.4 | ENSCAFP00000022600.4 |
UniProt | E2RF02, E2RF02_CANLF | |
GeneBank | AAEX03001814, AAEX03001815 | |
RefSeq | XM_848002.4 | XP_853095.1 |
▼ SEQUENCE
Protein Sequence (FASTA) |
---|
1 MATSGGEEAA AAAPAPGASA TGADTTPGWE VAVRPLLSAS YSAFEMKELP QLVASVIESE 60 61 SEILHHEKQY EPFYSSFVAL STHYITTVCS LIPRNQLQSV AAACKVLIEF SLLRLENPDE 120 121 ACAVSQKHLI LLIKGLCTGC SRLDRTEMIT FTAMMKSAKL PQTVKTLSDV EDQKELASPV 180 181 SPELRQKEVQ MNFLNQLTSV FNPRTVASPP GGPHTPAEGE NDEQSSTDQA SAIKTKNVFI 240 241 AQNVASLQEL GGSEKLLRVC LNLPYFLRYI NRFQDAVSAN SFFIMPATVA DATAVRNGFH 300 301 SLVIDVTMAL DTLSLPVLEP LNPSRLQDVT VLSLSCLYAG VSVATCMAIL HVGSAQQVRT 360 361 GSTSSKEEDY ESDAATIVQK CLEIYDMIGQ AISNSRRAGG EHYQNFQLLG AWCLLNSLFL 420 421 ILNLSPTALA DKGKEKDPLA ALRVRDILSR TKEGVGSPKL GPGKGHQGFG VLSVILANHA 480 481 VKLLTSLFQD LQVEALHKAW ETDGPPAVLN IMAQSTSIQR IQRLIDSVPL TNLLLTLLST 540 541 SYRKACVLQR QRKGSMSSDA SASTDSNTYY EDDFSSTEED SSQDDDSEPI LGQWFEETIS 600 601 PSKEKVAPPP PPPPPPLESS PRVKSPSKQA PGEKGNILAS RKDPELFLGL ASNILNFITS 660 661 SMLNSRNNFI RNYLSVSLSE HHMATLASII KEVDKDGLKG SSDEEFAAAL YHFNHSLVTS 720 721 DLQSPNLQNT LLQQLGVAPF SEGPWPLYIH PQSLSVLSRL LLIWQHKAGT QGDPDVPECL 780 781 KVWDRFLSTM KQNALQGVVP NETEDLNVEH LQLLLLIFHN FTEKGRRAIL TLFVQIIQEL 840 841 SANVDAQARS VPLILARLLL IFDYLLHQYS KAPVYLFEQV QHNLLSPPFG WANGSQDSNS 900 901 RRVTTPLYHG FKEVEENWSK HFSSDAVPQP RFYCVLSPEA SEDDLNRLDS VACDVLFSKL 960 961 VKYDELYTAL TCLLAAGSQL DTVRRKENKN VTALEACALQ YYFLILWRIL GILPPSKTYM 1020 1021 NQLAMNSPEM SECDILHTLR WSSRLRISSY VNWIKDHLIK QGMKPEHAGS LLDLASTKCS 1080 1081 SVKYDVEIVE EYFARQISSF CSIDCTTILQ LHEIPSLQSI YTLDAAISKV QVSLDEHFSK 1140 1141 MAAETDPHKS SEITKNLLPA TLQLIDTYAS FTRAYLLQNF NEEGSTEKPS QEKLHGFAAV 1200 1201 LAIGSSRCKA NTLGPTLVQN LPSWVQAVCE SWNNISTNEF PNIGSWRNAF ANDTIPSESY 1260 1261 ISAVQAAHLG TLCGQSLPLA ASLKHTLLSL VRLTGDLIVW SDEMNPPQVI RTLLPLLLES 1320 1321 STESVAEISS NSLERILGPA ESEEFLARVY EKLITGCYNI LANHADPNSG LDESILEECL 1380 1381 QYLEKQLESS QARKAMEEFF SDSGELVQIM MATANENLSA KFCNRVLKFF TKLFQLTEKS 1440 1441 PNPSLLHLCG SLAQLACVEP TRLQAWLTRM TASPPKDSDQ LDVIQENRQL LQLLTTYIVR 1500 1501 ENSQVGEGVC AVLLGTLIPM ATEMLANGDG TGFPELMVVM ATLASAGQGA GHLQLHNAAV 1560 1561 DWLSRCKKYL SQKNVVEKLN ANVMHGKHVI VLECTCHIMS YLADVTNALS QSNGQGPSHL 1620 1621 SVDGEERAIE VDSDWVEELA VEEEDSQAED SDEDSLCNKL CTFTITQKEF MNQHWYHCHT 1680 1681 CKMVDGVGVC TVCAKVCHKD HEISYAKYGS FFCDCGAKED GSCLALVKRT PSSGMSSTMK 1740 1741 ESAFQSEPRV SESLVRHTST SPADKAKVTI SDGKVADEEK PKKSSLCRTV EGCREELQNQ 1800 1801 ANFSFAPLVL DMLNFLMDAI QTNFQQASAV GSSSRAQQAL RELHTVDKVV EMTDQLMVPT 1860 1861 LGSQEGAFEN VRMNYSGDQG QTIRQLISAH VLRRVAMCVL SSPHGRRQHL AVSHEKGKIT 1920 1921 VLQLSALLKQ ADSSKRKLTL TRLASAPVPF TVLSLTGNPC KEDYLAVCGL KDCHVLTFSS 1980 1981 SGSVSDHLVL HPQLATGNFI IKAVWLPGSQ TELAIVTADF VKIYDLSVDA LSPTFYFLLP 2040 2041 SSKIRDVTFL FNEEGKNIIV IMSSAGYIYT QLMEEASSAQ QGPFYVTNVL EINHEDLKDS 2100 2101 NSQVAGGGVS VYYSHVLQML FFSYCQGRSF AATVSRTTLE VLQLFSINIK SSNGGSKTSP 2160 2161 ALCQWSEVMN HPGLVCCVQQ TTGVPLVVMV KPDTFLIQEI KTLPAKAKIQ DMVAIRHTAC 2220 2221 NEQQRTTMIL LCEDGSLRIY MANVENTSYW LQPSLQPSSV ISIMKPVRKR KTATITTRTS 2280 2281 SQVTFPIDFF EHNQQLTDVE FGGNDLLQVY NAQQIKHRLN STGMYVANTK PGGFTIEISN 2340 2341 NNSTMVMTGM RIQIGTQAIE RAPSYIEIFG RTMQLNLSRS RWFDFPFTRE EALQADKKLN 2400 2401 LFIGASVDPA GVTMIDAVKI YGKTKEQFGW PDEPPEEFPS ASVSNICPSN LNQSNGTGDS 2460 2461 DPAAPATTSG TVLERLVVSS LEALESCFAV GPIIEKERNK NAAQELATLL LSLPAPASVQ 2520 2521 QQSKSLLASL HTSRSAYHSH KDQALLSKAV QCLNTSSREG KDLDPEVFQR LVITARSIAI 2580 2581 MRPNNLVHFT ESKLPPMETE GVDEGREPQK QLEGDCCSFI TQLVNHFWKL HASKPKNAFL 2640 2641 APACLPGLTH IEATVNALVD IIHGYCTCEL DCINTASKIY MQMLLCPDPA VSFSCKQALI 2700 2701 RVLRPRNKRR HVTLPSSPRS NTPMGDKDDD DDDDADDKMQ SSGIPNGGHI RQESQEQSEV 2760 2761 DHGDFEMVSE SMVLETAENV NNGNPSPLEA LLAGAEGFPP MLDIPPDADD ETMVELAIAL 2820 2821 SLQQDQQGSS SSALGLQSLG LSGQAPSSSS LDAGTLSDTT ASAPASDDEG STAATDGSTL 2880 2881 RTSPADHGGS VGSESGGSAV DSVAGEHSVS GRSSAYGDAT VEGHPAGPGS VSSSTGAIST 2940 2941 TTGHQEGDGS EGEGEGEGEG DVHTSNRLHM VRLMLLERLL QTLPQLRNVG GVRAIPYMQV 3000 3001 ILMLTTDLDG EDEKDKGALD NLLSQLIAEL GMDKKDVSRK NERSALNEVH LVVMRLLSVF 3060 3061 MSRTKSGSKS SICESSSLIS SATAAALLSS GAVDYCLHVL KSLLEYWKSQ QNDEEPVATS 3120 3121 QLLKPHTTSS PPDMSPFFLR QYVKGHAADV FEAYTQLLTE MVLRLPYQIK KIADTNSRIP 3180 3181 PPVFDHSWFY FLSEYLMIQQ TPFVRRQVRK LLLFICGSKE KYRQLRDLHT LDSHVRGIKK 3240 3241 LLEEQGIFLR ASVVTASSGS ALQYDTLISL MEHLKACAEI AAQRTVNWQK FCIKDDSVLY 3300 3301 FLLQVSFLVD EGVSPVLLQL LSCALCGSKV LAALAASAGS SSASSSTAPV AASSGQATTQ 3360 3361 SKSSTKKSKK EEKEKEKEGE SSGSQEDQLC TALVNQLNKF ADKETLVQFL RCFLLESNSS 3420 3421 SVRWQAHCLT LHIYRNSSKS QQELLLDLMW SIWPELPAYG RKAAQFVDLL GYFSLKTPQT 3480 3481 EKKLKEYSQK AVEILRTQNH ILTNHPNSNI YNTLSGLVEF DGYYLESDPC LVCNNPEVPF 3540 3541 CYIKLSSIKV DTRYTTTQQV VKLIGSHTIS KVTVKIGDLK RTKMVRTINL YYNNRTVQAI 3600 3601 VELKNKPARW HKAKKVQLTP GQTEVKIDLP LPIVASNLMI EFADFYENYQ ASTETLQCPR 3660 3661 CSASVPANPG VCGNCGENVY QCHKCRSINY DEKDPFLCNA CGFCKYARFD FMLYAKPCCA 3720 3721 VDPIENEEDR KKAVSNINTL LDKADRVYHQ LMGHRPQLEN LLCKVNEAAP EKPQDDSGTA 3780 3781 GGISSTSASV NRYILQLAQE YCGDCKNSFD ELSKIIQKVF ASRKELLEYD LQQREAATKS 3840 3841 SRTSVQPTFT ASQYRALSVL GCGHTSSTKC YGCASAVTEH CITLLRALAT NPALRHILVS 3900 3901 QGLIRELFDY NLRRGSAGMR EEVRQLMCLL TRDNPEATQQ MNDLIIGKVS TALKGHWANP 3960 3961 DLASSLQYEM LLLTDSISKE DSCWELRLRC ALSLFLMAVN IKTPVVVENI TLMCLRILQK 4020 4021 LIKPPAPTSK KNKDVPVEAL TTVKPYCNEI HAQAQLWLKR DPKASYESWK KCLPIRGIDG 4080 4081 NGKSPSKSEL RHLYLTEKYV WRWKQFLSRR GKRTPPLDLK LGHNNWLRQV LFTPATQAAR 4140 4141 QAACTIVEAL ATIPSRKQQV LDLLTSYLDE LSIAGECAAE YLALYQKLIT SAHWKVYLAA 4200 4201 RGVLPYVGNL ITKEIARLLA LEEATLSTDL QQGYALKSLT GLLSSFVEVE SIKRHFKSRL 4260 4261 VGTVLNGYLC LRKLVVQRTK LIDETQDMLL EMLEDMTTGT ESETKAFMAV CIETAKRYNL 4320 4321 DDYRTPVFIF ERLCSIIYPE ENEVTEFFVT LEKDPQQEDF LQGRMPGNPY SSNEPGIGPL 4380 4381 MRDIKNKICQ DCDLVALLED DSGMELLVNN KIISLDLPVA EVYKKVWCTT NEGEPMRIVY 4440 4441 RMRGLLGDAT EEFIESLDST TDEEEDEEEV YKMAGVMAQC GGLECMLNRL AGIKDFKQGR 4500 4501 HLLTVLLKLF SYCVKVKVNR QQLVKLEMNT LNVMLGTLNL ALVAEQESKD SGGATVAEQV 4560 4561 LSIMEIILDE SNAEPLSEDK GNLLLTGDKD QLVMLLDQIN STFVRSNPSV LQGLLRIIPY 4620 4621 LSFGELEKMQ ILVERFKPYC SFDKYDEDHS GDDKVFLDCF CKIAAGIKNS SNGHQLKDLI 4680 4681 LQKGITQSAL DYMKKHIPSA KNLDADIWKK FLSRPALPFI LRLLRGLAVQ HPATQVLIGT 4740 4741 DSITNLHKLE QVSSDEGIGT LAENLLEALR EHPDVNKKID AARRETRAEK KRMAMAMRQK 4800 4801 ALGTLGMTTN EKGQVVTKTA LLKQMEELIE EPGLTCCICR EGYKFQPTKV LGIYTFTKRV 4860 4861 ALEEMENKPR KQQGYSTVSH FNIVHYDCHL AAVRLARGRE EWESAALQNA NTKCNGLLPV 4920 4921 WGPHVPESAF ATCLARHNTY LQECTGQREP TYQLNIHDIK LLFLRFAMEQ SFSADTGGGG 4980 4981 RESNIHLIPY IIHTVLYVLN TTRATSREEK NLQGFLEQPK EKWVESAFEV DGPHYFTVLA 5040 5041 LHILPPEKWR ATRVEILRRL LVTSQARAVA PGGATRLTDK AVKDYSAYRS SLLFWALVDL 5100 5101 IYNMFKKVPT SNTEGGWSCS LAEYIRHNDM PIYEAADKAL KTFQEEFMPV ETFSEFLDAA 5160 5161 GLLSEVTDPE NFLKDLLNSI P 5181 |
Nucleotide CDS Sequence (FASTA) |
1 ATGGCGACGA GCGGCGGCGA AGAGGCGGCG GCGGCGGCGC CGGCGCCCGG GGCCTCGGCG 60 61 ACGGGGGCGG ACACAACCCC GGGCTGGGAG GTGGCGGTGC GGCCCCTGCT GTCCGCGTCC 120 121 TATTCCGCCT TCGAGATGAA GGAGTTGCCG CAGCTGGTGG CCTCAGTCAT CGAGAGTGAA 180 181 TCCGAAATCC TGCACCACGA GAAGCAGTAT GAGCCCTTCT ACTCCTCTTT TGTTGCACTT 240 241 TCCACACACT ATATTACAAC AGTTTGCAGC CTCATTCCCC GGAACCAGCT TCAGTCGGTG 300 301 GCAGCAGCCT GTAAAGTCCT GATCGAGTTT TCTCTCCTGC GTCTGGAGAA TCCAGATGAG 360 361 GCTTGTGCTG TGTCCCAGAA ACACTTGATT CTCCTCATCA AGGGCCTGTG TACTGGCTGT 420 421 AGCCGACTAG ACAGGACTGA AATGATAACG TTCACGGCAA TGATGAAATC AGCCAAGCTG 480 481 CCACAGACAG TGAAGACGCT CTCAGATGTG GAAGACCAGA AAGAGCTGGC CTCACCAGTA 540 541 AGCCCAGAAC TGAGGCAGAA GGAGGTCCAG ATGAATTTTT TGAACCAGCT GACATCAGTT 600 601 TTCAACCCTA GAACTGTAGC ATCACCGCCT GGAGGTCCAC ACACTCCAGC CGAAGGAGAA 660 661 AATGATGAGC AGTCATCCAC AGATCAAGCC TCTGCTATCA AAACCAAGAA TGTGTTTATA 720 721 GCTCAGAATG TGGCTAGTCT TCAAGAGCTT GGTGGCTCAG AGAAGCTACT GCGCGTGTGT 780 781 TTGAACCTGC CCTATTTCCT GCGCTACATC AATCGGTTTC AGGATGCAGT GTCGGCCAAT 840 841 TCTTTCTTCA TCATGCCGGC CACAGTAGCA GATGCCACTG CTGTTCGCAA TGGTTTTCAT 900 901 TCACTGGTGA TTGATGTAAC CATGGCGTTG GATACTCTGT CTCTGCCTGT GTTGGAACCC 960 961 CTCAATCCTT CTCGTCTGCA GGATGTGACA GTTCTCAGCC TAAGTTGTCT GTATGCAGGT 1020 1021 GTGAGTGTGG CAACTTGCAT GGCCATCCTG CATGTGGGTA GTGCCCAGCA AGTACGGACA 1080 1081 GGGTCCACAA GCTCCAAAGA AGAAGACTAT GAGAGTGATG CAGCTACGAT TGTCCAGAAA 1140 1141 TGTCTTGAAA TCTACGACAT GATTGGACAA GCAATCAGCA ATTCCCGTCG GGCTGGTGGA 1200 1201 GAGCATTATC AGAACTTCCA ATTGCTGGGT GCCTGGTGTT TGTTAAACAG CCTGTTCCTC 1260 1261 ATATTGAACC TCAGTCCTAC TGCCTTGGCT GATAAGGGGA AAGAGAAAGA CCCTCTGGCT 1320 1321 GCACTCCGAG TTAGAGACAT CCTTTCTCGT ACAAAAGAGG GAGTGGGCTC TCCTAAACTG 1380 1381 GGACCTGGGA AAGGGCACCA GGGATTTGGG GTACTCTCAG TAATACTGGC AAACCATGCT 1440 1441 GTCAAGCTGT TAACCTCTCT CTTTCAAGAC CTTCAAGTGG AAGCTCTGCA CAAGGCCTGG 1500 1501 GAGACAGATG GTCCCCCTGC AGTCCTGAAC ATTATGGCCC AGAGCACCTC CATCCAGAGG 1560 1561 ATCCAACGGC TGATTGACTC AGTCCCACTG ACGAACCTGC TGTTGACATT ACTCTCAACC 1620 1621 TCCTACAGAA AGGCCTGTGT CCTGCAACGT CAGAGGAAGG GCTCCATGAG CAGCGATGCC 1680 1681 AGCGCCTCGA CTGACTCAAA CACTTACTAT GAGGATGATT TCAGCAGCAC GGAGGAGGAC 1740 1741 AGCAGCCAAG ATGATGACAG TGAGCCTATT TTGGGGCAGT GGTTTGAGGA GACCATCTCT 1800 1801 CCCAGTAAAG AGAAAGTGGC ACCTCCGCCT CCTCCCCCGC CACCTCCGCT AGAGAGTTCT 1860 1861 CCTCGAGTTA AAAGCCCCAG TAAACAGGCT CCCGGTGAAA AGGGCAACAT TTTGGCTAGT 1920 1921 CGCAAAGATC CTGAGTTGTT CTTAGGTCTG GCTTCCAACA TTTTGAACTT CATCACTTCT 1980 1981 TCCATGCTGA ACTCACGGAA CAATTTTATC CGAAACTACC TGAGTGTGTC TCTTTCAGAA 2040 2041 CACCATATGG CCACCCTGGC CAGCATCATT AAGGAGGTGG ACAAAGATGG ACTCAAGGGT 2100 2101 TCATCAGATG AAGAATTTGC TGCTGCTCTC TATCACTTCA ACCACTCTCT GGTAACTTCT 2160 2161 GACCTTCAGT CACCCAACCT GCAGAACACG CTGTTGCAGC AGCTGGGAGT AGCTCCTTTC 2220 2221 TCTGAGGGCC CTTGGCCCTT GTACATTCAC CCTCAAAGCC TCTCTGTGCT TTCACGCCTT 2280 2281 TTGCTCATCT GGCAACATAA AGCCGGCACT CAGGGTGATC CCGATGTCCC TGAATGCCTG 2340 2341 AAAGTTTGGG ACAGATTTTT GTCGACAATG AAGCAGAATG CCCTGCAAGG GGTGGTGCCC 2400 2401 AATGAGACGG AAGATTTGAA TGTAGAACAC CTACAGCTGC TCCTCCTCAT TTTCCACAAC 2460 2461 TTCACTGAGA AGGGCCGGCG GGCCATACTG ACCCTTTTTG TGCAGATCAT CCAGGAATTG 2520 2521 AGTGCCAACG TGGATGCGCA GGCGCGCTCC GTGCCTCTCA TCCTGGCTCG CCTCCTTTTG 2580 2581 ATCTTTGATT ATTTGCTTCA TCAGTACTCC AAAGCACCTG TGTATCTGTT CGAGCAGGTG 2640 2641 CAGCATAACT TGCTGAGTCC TCCTTTTGGG TGGGCAAATG GATCTCAGGA CAGCAACAGC 2700 2701 CGCCGGGTGA CCACCCCTCT CTATCATGGG TTCAAAGAAG TAGAAGAGAA CTGGTCTAAG 2760 2761 CATTTTTCAT CAGATGCTGT TCCACAACCC AGATTCTACT GTGTCCTGTC CCCAGAAGCC 2820 2821 TCAGAGGATG ATTTGAACCG ACTCGATTCT GTGGCATGTG ATGTTCTTTT CTCTAAGCTT 2880 2881 GTCAAGTATG ATGAGCTTTA CACTGCACTG ACTTGCCTGC TTGCAGCTGG GTCCCAGCTT 2940 2941 GATACAGTCA GGAGGAAGGA AAACAAGAAC GTAACAGCCT TGGAGGCCTG TGCCCTTCAG 3000 3001 TATTACTTCT TGATACTGTG GAGGATCCTA GGGATTTTGC CGCCATCAAA GACTTACATG 3060 3061 AACCAGCTGG CCATGAACTC ACCTGAGATG AGTGAATGTG ACATCCTACA CACGCTGCGG 3120 3121 TGGTCTTCCC GCCTCCGCAT CAGCTCCTAT GTCAACTGGA TAAAGGATCA CCTTATCAAA 3180 3181 CAGGGAATGA AGCCGGAACA TGCTGGTTCT CTTCTAGACC TGGCATCCAC CAAGTGCAGC 3240 3241 TCGGTGAAAT ACGACGTCGA AATTGTAGAG GAATACTTTG CTCGACAGAT CTCATCATTC 3300 3301 TGTAGCATCG ACTGTACTAC CATCTTGCAA CTCCATGAAA TTCCAAGTCT GCAGTCTATC 3360 3361 TACACCCTTG ACGCTGCCAT CTCGAAGGTT CAGGTCTCCC TGGATGAGCA TTTTTCCAAG 3420 3421 ATGGCCGCCG AGACCGATCC TCATAAGTCA TCTGAGATCA CCAAGAACCT GCTTCCAGCC 3480 3481 ACACTACAAC TCATTGACAC CTATGCATCA TTCACTAGAG CCTATTTGCT ACAGAACTTT 3540 3541 AATGAAGAAG GATCCACTGA AAAACCTTCC CAGGAGAAAT TGCATGGTTT TGCTGCTGTT 3600 3601 TTGGCTATTG GCTCTAGTAG GTGCAAGGCA AACACTCTGG GTCCCACCCT CGTTCAGAAT 3660 3661 TTGCCATCCT GGGTCCAGGC TGTATGTGAA TCCTGGAATA ACATTAGCAC CAATGAGTTT 3720 3721 CCCAACATCG GATCCTGGCG CAATGCCTTT GCTAATGACA CCATCCCTTC AGAGAGTTAC 3780 3781 ATCAGTGCTG TGCAGGCTGC TCACTTGGGG ACTCTTTGTG GGCAAAGTCT GCCTCTGGCT 3840 3841 GCTTCCCTGA AGCACACCCT CCTCTCACTG GTCAGGCTGA CTGGAGATCT TATTGTTTGG 3900 3901 TCCGATGAGA TGAACCCGCC ACAGGTAATT CGGACCCTGC TTCCCCTTCT TTTGGAATCA 3960 3961 AGCACTGAGA GCGTTGCTGA GATCAGTAGC AACTCCCTGG AGCGCATCCT GGGTCCTGCC 4020 4021 GAATCTGAGG AGTTCTTGGC TCGAGTATAT GAGAAGCTGA TCACTGGTTG TTACAACATT 4080 4081 CTGGCCAATC ATGCAGATCC CAATAGTGGA CTGGATGAAT CCATCTTGGA GGAATGTCTC 4140 4141 CAGTATTTGG AAAAGCAGCT GGAAAGCAGC CAGGCTCGTA AAGCTATGGA GGAATTTTTC 4200 4201 TCTGACAGTG GAGAACTCGT GCAGATCATG ATGGCAACAG CCAATGAGAA CCTCTCTGCT 4260 4261 AAATTTTGTA ACCGAGTTCT GAAATTCTTC ACCAAACTCT TCCAGCTGAC TGAGAAGAGC 4320 4321 CCTAACCCGA GCCTTCTGCA TCTCTGTGGC TCTCTGGCAC AGCTGGCTTG TGTGGAGCCT 4380 4381 ACACGCCTCC AGGCCTGGCT CACCCGCATG ACCGCATCAC CCCCAAAAGA CTCCGACCAA 4440 4441 CTGGATGTGA TTCAGGAGAA CCGGCAGCTG TTGCAGCTAT TGACTACGTA CATTGTTCGG 4500 4501 GAAAACAGCC AAGTTGGGGA AGGTGTGTGT GCTGTGCTGC TGGGTACCCT GATCCCCATG 4560 4561 GCGACAGAGA TGTTGGCCAA CGGTGATGGG ACTGGTTTCC CGGAACTCAT GGTTGTGATG 4620 4621 GCCACCCTGG CCAGTGCAGG TCAAGGTGCT GGCCATCTTC AGCTCCACAA TGCTGCTGTG 4680 4681 GACTGGCTGA GCAGATGCAA GAAATACCTA TCACAGAAGA ACGTGGTTGA AAAACTGAAT 4740 4741 GCCAATGTGA TGCATGGAAA GCACGTGATA GTCCTGGAGT GCACGTGCCA TATTATGTCT 4800 4801 TACTTGGCTG ATGTCACCAA TGCCCTGAGC CAGAGTAATG GTCAAGGCCC AAGTCACCTC 4860 4861 TCAGTGGATG GGGAGGAGCG GGCCATTGAG GTGGACTCGG ACTGGGTGGA GGAGTTGGCA 4920 4921 GTGGAAGAGG AAGACTCCCA GGCTGAAGAT TCAGATGAAG ATTCTCTTTG TAACAAACTC 4980 4981 TGCACTTTCA CAATCACTCA GAAAGAATTC ATGAACCAGC ATTGGTACCA CTGTCACACC 5040 5041 TGCAAAATGG TGGATGGAGT AGGTGTGTGC ACAGTTTGCG CCAAGGTGTG CCACAAGGAT 5100 5101 CATGAGATTT CCTATGCCAA ATACGGGTCC TTCTTCTGTG ACTGTGGAGC CAAGGAAGAT 5160 5161 GGCAGCTGTT TGGCGCTGGT GAAGCGAACT CCTAGCAGTG GCATGAGCTC CACTATGAAG 5220 5221 GAGTCAGCCT TTCAGAGTGA ACCCAGGGTT TCTGAGAGTC TGGTGCGCCA CACTAGCACC 5280 5281 TCTCCAGCTG ACAAGGCCAA AGTCACCATC AGTGATGGAA AGGTTGCTGA CGAAGAGAAG 5340 5341 CCAAAGAAGA GCAGCCTATG CCGCACAGTA GAGGGCTGCC GGGAGGAGCT GCAGAACCAG 5400 5401 GCCAATTTCT CCTTCGCTCC TCTCGTGTTA GACATGCTCA ATTTCCTCAT GGATGCCATT 5460 5461 CAGACCAACT TTCAGCAGGC TTCAGCTGTG GGGAGCAGCA GCCGGGCTCA GCAAGCCCTC 5520 5521 CGGGAGCTGC ATACCGTGGA CAAGGTGGTT GAGATGACAG ATCAGCTGAT GGTTCCCACC 5580 5581 TTAGGCTCCC AGGAAGGTGC CTTTGAAAAT GTTCGGATGA ATTACAGCGG AGACCAGGGC 5640 5641 CAGACTATTC GACAGCTAAT TAGTGCCCAT GTGCTCAGGC GGGTGGCTAT GTGTGTGCTC 5700 5701 TCTTCCCCGC ATGGGCGCCG CCAGCATTTG GCAGTTAGCC ATGAGAAAGG GAAGATCACT 5760 5761 GTTCTGCAGC TCTCTGCACT CTTGAAGCAA GCGGATTCCA GCAAAAGAAA GCTGACTCTG 5820 5821 ACCCGCCTGG CTTCTGCTCC CGTCCCTTTT ACTGTATTGA GCCTCACTGG GAATCCCTGT 5880 5881 AAGGAGGACT ACCTTGCTGT GTGTGGGCTG AAGGACTGCC ACGTGTTGAC CTTCAGTAGC 5940 5941 TCAGGCTCTG TTTCGGATCA CTTGGTGCTG CACCCTCAGT TGGCAACAGG CAACTTCATC 6000 6001 ATCAAAGCTG TGTGGTTACC TGGTTCGCAG ACCGAGTTAG CGATTGTCAC TGCGGACTTT 6060 6061 GTCAAGATTT ATGACCTGTC TGTTGATGCC TTGAGCCCGA CCTTCTACTT TCTCCTGCCA 6120 6121 AGCTCAAAGA TAAGAGATGT TACCTTCCTC TTCAACGAGG AGGGAAAGAA CATCATTGTT 6180 6181 ATAATGTCTT CAGCTGGTTA CATCTATACC CAGCTCATGG AAGAGGCCAG CAGTGCCCAG 6240 6241 CAGGGACCCT TTTACGTCAC TAATGTGCTG GAAATCAACC ACGAGGACTT GAAGGACAGT 6300 6301 AACAGCCAGG TGGCAGGTGG CGGTGTGTCC GTCTACTACT CCCATGTGTT GCAGATGTTG 6360 6361 TTCTTCAGCT ATTGTCAAGG CAGATCATTC GCAGCCACCG TTAGCAGGAC AACTCTGGAA 6420 6421 GTATTGCAGC TCTTCTCCAT CAACATCAAA AGTTCCAATG GTGGCAGTAA GACATCTCCT 6480 6481 GCCCTTTGCC AGTGGTCCGA GGTGATGAAC CACCCTGGCC TGGTATGTTG TGTCCAGCAA 6540 6541 ACTACAGGTG TGCCATTGGT AGTTATGGTG AAGCCGGACA CTTTCCTTAT TCAAGAGATT 6600 6601 AAGACGCTGC CTGCGAAAGC AAAGATCCAA GACATGGTTG CCATTCGGCA CACGGCCTGC 6660 6661 AACGAGCAGC AGCGGACCAC GATGATCCTG CTGTGTGAGG ATGGCAGCCT GCGCATTTAC 6720 6721 ATGGCCAACG TGGAGAACAC CTCCTATTGG CTGCAGCCGT CCCTGCAGCC CAGCAGTGTT 6780 6781 ATCAGTATCA TGAAGCCTGT GCGAAAGCGC AAGACCGCTA CCATCACGAC CCGCACATCT 6840 6841 AGTCAGGTGA CCTTCCCCAT CGACTTCTTT GAACACAACC AGCAGCTGAC GGACGTGGAG 6900 6901 TTTGGTGGGA ACGACCTTCT GCAGGTGTAC AACGCGCAGC AGATAAAGCA CCGGCTGAAT 6960 6961 TCCACCGGCA TGTATGTGGC CAACACCAAG CCCGGTGGAT TCACCATTGA GATCAGTAAC 7020 7021 AACAACAGCA CTATGGTGAT GACAGGCATG CGGATCCAGA TTGGGACCCA GGCCATTGAA 7080 7081 CGGGCCCCAT CATACATTGA GATTTTTGGC AGGACCATGC AGCTTAACCT GAGTCGCTCC 7140 7141 CGCTGGTTTG ACTTCCCCTT CACCAGAGAA GAAGCCCTAC AGGCTGACAA GAAACTGAAC 7200 7201 CTCTTCATTG GGGCTTCGGT GGATCCAGCA GGCGTCACCA TGATTGACGC GGTAAAAATT 7260 7261 TACGGCAAAA CGAAAGAGCA GTTTGGCTGG CCCGATGAAC CCCCAGAAGA GTTCCCCTCT 7320 7321 GCCTCTGTCA GCAACATCTG TCCTTCCAAC CTGAACCAGA GCAATGGCAC TGGAGACAGC 7380 7381 GACCCTGCGG CTCCTGCCAC CACCAGCGGA ACTGTCCTGG AGAGGCTGGT TGTGAGTTCT 7440 7441 TTAGAAGCCC TGGAAAGCTG CTTTGCTGTT GGCCCAATCA TCGAGAAGGA GAGAAACAAG 7500 7501 AATGCAGCTC AGGAGCTGGC CACTCTGCTG CTGTCTCTGC CAGCGCCTGC CAGTGTTCAG 7560 7561 CAGCAGTCCA AGAGCCTTCT GGCCAGCTTG CACACCAGCC GCTCAGCCTA TCACAGCCAC 7620 7621 AAGGATCAGG CCTTGCTGAG CAAAGCTGTG CAGTGTCTCA ACACATCCAG CAGAGAGGGC 7680 7681 AAGGATTTGG ACCCCGAGGT CTTCCAGAGG CTGGTGATCA CAGCTCGTTC CATTGCCATC 7740 7741 ATGCGCCCTA ACAACCTTGT CCACTTTACG GAATCCAAGC TGCCCCCAAT GGAAACAGAA 7800 7801 GGAGTGGATG AGGGGAGAGA ACCTCAGAAG CAGCTGGAAG GAGACTGCTG TAGTTTCATC 7860 7861 ACCCAGCTGG TGAACCACTT CTGGAAACTC CATGCATCCA AACCCAAGAA TGCCTTCTTG 7920 7921 GCACCTGCCT GCCTGCCAGG CCTGACTCAT ATTGAAGCTA CTGTCAATGC CCTGGTAGAC 7980 7981 ATAATTCATG GCTACTGTAC CTGTGAGCTG GACTGTATTA ATACAGCATC CAAGATCTAC 8040 8041 ATGCAGATGC TACTGTGTCC TGATCCTGCT GTGAGCTTCT CTTGTAAACA AGCTCTAATT 8100 8101 CGAGTCCTAA GGCCCAGAAA CAAGCGGAGA CATGTGACAT TGCCCTCCTC CCCTCGAAGC 8160 8161 AACACTCCAA TGGGAGACAA GGATGATGAT GACGATGATG ATGCAGACGA CAAAATGCAG 8220 8221 TCATCAGGGA TTCCGAATGG TGGTCATATC CGTCAGGAAA GCCAGGAACA GAGTGAGGTG 8280 8281 GACCATGGAG ATTTTGAGAT GGTGTCTGAG TCGATGGTGC TGGAGACAGC TGAAAATGTC 8340 8341 AACAATGGCA ACCCCTCTCC CCTGGAGGCC CTGCTGGCAG GTGCAGAGGG CTTCCCCCCC 8400 8401 ATGCTGGACA TCCCACCTGA TGCAGATGAC GAGACCATGG TTGAACTAGC CATTGCCCTG 8460 8461 AGCCTGCAGC AGGACCAGCA AGGCAGCAGC AGCAGTGCCC TGGGCCTGCA GAGCCTGGGA 8520 8521 CTGTCCGGCC AGGCACCCAG CTCTTCCTCT CTGGACGCAG GAACCCTCTC TGACACCACA 8580 8581 GCATCAGCTC CAGCCTCGGA CGATGAGGGC AGCACAGCAG CTACTGACGG CTCCACCCTT 8640 8641 CGGACCTCCC CTGCTGACCA CGGCGGTAGT GTGGGCTCAG AGAGTGGGGG AAGCGCAGTG 8700 8701 GACTCGGTGG CTGGCGAACA CAGTGTATCT GGCCGGAGCA GTGCTTACGG TGATGCCACA 8760 8761 GTGGAGGGGC ACCCAGCTGG ACCAGGAAGT GTCAGCTCAA GCACTGGCGC CATCAGCACC 8820 8821 ACCACTGGAC ACCAGGAAGG AGATGGCTCC GAGGGAGAAG GAGAAGGGGA AGGGGAAGGC 8880 8881 GATGTCCACA CTAGCAACAG GCTGCACATG GTCCGTCTGA TGCTGTTGGA GAGACTACTG 8940 8941 CAAACCTTGC CCCAGTTACG AAATGTTGGA GGTGTCCGGG CCATCCCGTA CATGCAGGTC 9000 9001 ATCCTGATGC TCACTACAGA CCTGGATGGA GAAGATGAGA AAGACAAGGG GGCCCTGGAC 9060 9061 AACCTGCTCT CCCAGCTGAT TGCTGAACTG GGCATGGACA AAAAGGACGT CTCCAGGAAG 9120 9121 AATGAGCGCA GTGCCTTGAA TGAAGTCCAT CTGGTAGTAA TGAGACTCCT GAGTGTCTTC 9180 9181 ATGTCCCGGA CCAAATCCGG ATCCAAGTCT TCTATATGTG AGTCATCTTC CCTCATCTCC 9240 9241 AGCGCCACAG CAGCGGCCCT GCTGAGCTCT GGCGCCGTGG ACTACTGCCT GCATGTGCTC 9300 9301 AAATCCCTGC TAGAATACTG GAAGAGCCAG CAGAATGATG AGGAGCCGGT GGCTACTAGC 9360 9361 CAGTTGCTGA AACCACACAC AACTTCGTCC CCACCCGACA TGAGCCCGTT CTTCCTCCGC 9420 9421 CAGTATGTGA AGGGTCATGC TGCTGATGTG TTTGAGGCCT ACACTCAGCT TCTCACAGAG 9480 9481 ATGGTCCTGA GGCTTCCTTA CCAAATCAAG AAGATTGCCG ATACCAACTC TAGAATCCCG 9540 9541 CCTCCTGTCT TTGATCATTC ATGGTTTTAC TTTCTCTCAG AGTACCTGAT GATCCAGCAA 9600 9601 ACTCCCTTTG TGCGTCGTCA AGTCCGCAAA CTTCTGCTCT TCATCTGTGG GTCAAAGGAA 9660 9661 AAGTATCGCC AGCTCCGGGA TTTGCACACC CTGGACTCCC ACGTGCGTGG GATCAAGAAG 9720 9721 CTGCTGGAGG AACAAGGGAT ATTCCTCCGG GCAAGTGTGG TTACAGCCAG CTCAGGCTCT 9780 9781 GCCTTGCAGT ATGACACACT CATCAGCCTG ATGGAGCACC TGAAGGCTTG TGCAGAGATT 9840 9841 GCTGCCCAGC GAACCGTCAA CTGGCAGAAA TTCTGCATCA AAGACGATTC CGTCCTGTAC 9900 9901 TTCCTCCTCC AAGTCAGCTT CCTGGTGGAT GAGGGGGTGT CCCCAGTACT GCTGCAGCTG 9960 9961 CTCTCCTGTG CCCTGTGTGG CAGCAAAGTC CTTGCCGCGC TGGCAGCCTC GGCGGGCTCC 10020 10021 TCGAGTGCCT CCTCTTCCAC AGCCCCTGTG GCTGCCAGTT CTGGTCAGGC CACGACACAG 10080 10081 TCCAAGTCGT CTACAAAGAA AAGCAAGAAA GAAGAAAAGG AAAAGGAGAA AGAGGGTGAG 10140 10141 AGCTCAGGCA GCCAGGAGGA CCAGCTGTGC ACAGCTCTGG TGAACCAGCT GAACAAATTT 10200 10201 GCAGATAAGG AGACCTTGGT TCAGTTCCTG CGTTGTTTCC TGTTAGAGTC CAATTCTTCT 10260 10261 TCGGTGCGCT GGCAGGCCCA CTGTCTGACC CTGCATATCT ACAGAAACTC TAGTAAATCT 10320 10321 CAACAGGAAC TTCTGTTAGA TCTGATGTGG TCCATATGGC CAGAACTCCC AGCCTATGGT 10380 10381 CGTAAGGCTG CCCAGTTTGT GGACCTACTA GGATATTTCT CCTTGAAAAC CCCACAAACG 10440 10441 GAGAAGAAGT TGAAGGAGTA TTCACAGAAG GCTGTGGAAA TTCTGCGGAC CCAAAACCAC 10500 10501 ATTCTTACCA ACCATCCCAA CTCCAATATT TACAACACCT TGTCTGGCTT AGTGGAATTT 10560 10561 GATGGCTATT ACCTGGAGAG CGATCCCTGC CTGGTGTGTA ATAATCCAGA GGTGCCGTTC 10620 10621 TGTTATATCA AGCTGTCTTC CATTAAAGTG GACACGCGGT ACACCACAAC CCAGCAGGTT 10680 10681 GTGAAGCTCA TCGGCAGCCA CACCATCAGC AAAGTGACAG TGAAAATTGG AGATCTGAAA 10740 10741 CGGACAAAGA TGGTGCGGAC CATCAACCTC TATTACAACA ACCGGACTGT GCAGGCCATC 10800 10801 GTAGAGTTAA AAAACAAGCC GGCTCGCTGG CATAAGGCCA AGAAAGTTCA GCTGACCCCT 10860 10861 GGGCAGACAG AGGTGAAGAT CGACCTGCCC TTGCCCATTG TGGCCTCCAA CCTGATGATT 10920 10921 GAGTTTGCGG ACTTCTACGA AAACTATCAG GCCTCCACAG AGACCCTGCA GTGCCCCCGC 10980 10981 TGCAGTGCCT CCGTCCCTGC CAACCCAGGG GTCTGTGGCA ACTGTGGAGA GAATGTCTAC 11040 11041 CAGTGTCACA AATGCAGGTC CATCAACTAC GATGAAAAAG ATCCCTTCCT CTGCAATGCC 11100 11101 TGTGGTTTCT GTAAATACGC TCGCTTCGAT TTTATGCTCT ATGCCAAGCC TTGCTGTGCA 11160 11161 GTGGATCCCA TTGAGAATGA AGAAGACCGG AAGAAGGCTG TTTCCAACAT TAATACACTT 11220 11221 CTGGACAAAG CTGACCGAGT GTATCACCAG CTGATGGGAC ACCGGCCACA GCTGGAGAAC 11280 11281 CTGCTCTGCA AAGTGAACGA GGCAGCCCCT GAAAAGCCAC AGGATGACTC GGGGACAGCT 11340 11341 GGGGGCATCA GTTCCACTTC AGCCAGCGTG AATCGGTACA TCCTGCAGCT GGCTCAGGAG 11400 11401 TACTGTGGAG ATTGCAAGAA CTCGTTTGAT GAGCTTTCCA AAATCATCCA GAAAGTCTTT 11460 11461 GCTTCACGAA AAGAGTTGTT GGAATATGAC CTGCAACAGA GGGAAGCGGC CACCAAATCG 11520 11521 TCCCGAACCT CCGTGCAGCC CACTTTCACT GCCAGCCAGT ACCGCGCCTT ATCTGTCCTG 11580 11581 GGCTGTGGCC ACACATCCTC CACCAAGTGC TACGGCTGCG CCTCGGCTGT CACAGAACAT 11640 11641 TGTATCACAC TGCTCCGGGC CCTGGCCACC AATCCAGCCC TGAGGCACAT CCTCGTCTCT 11700 11701 CAGGGCCTCA TCCGGGAGCT CTTTGATTAC AATCTCCGTC GGGGCTCTGC AGGCATGCGG 11760 11761 GAGGAGGTCC GCCAGCTCAT GTGCCTGCTA ACTCGAGACA ATCCAGAAGC CACCCAGCAA 11820 11821 ATGAATGACT TGATTATTGG CAAAGTCTCC ACTGCCCTGA AGGGCCACTG GGCCAATCCC 11880 11881 GATCTGGCAA GCAGCCTTCA GTATGAAATG CTGTTGCTGA CAGATTCCAT CTCAAAGGAG 11940 11941 GACAGCTGCT GGGAACTCCG GTTACGCTGT GCTCTCAGCC TTTTCCTCAT GGCCGTGAAC 12000 12001 ATTAAGACTC CTGTGGTTGT GGAGAACATT ACCCTCATGT GCCTGCGGAT CTTGCAGAAG 12060 12061 TTGATTAAAC CACCTGCTCC AACCAGCAAG AAGAACAAGG ACGTCCCCGT TGAAGCCCTC 12120 12121 ACCACAGTCA AACCGTACTG CAATGAGATC CATGCCCAGG CTCAGCTGTG GCTCAAGAGA 12180 12181 GACCCCAAAG CATCCTATGA ATCCTGGAAG AAGTGTCTTC CTATCAGAGG GATAGATGGC 12240 12241 AATGGGAAGT CCCCCAGCAA GTCAGAGCTC CGCCACCTCT ATTTGACCGA GAAGTACGTG 12300 12301 TGGAGGTGGA AACAGTTCCT GAGTCGTCGT GGGAAGAGGA CGCCCCCCTT GGATCTCAAA 12360 12361 CTAGGCCATA ACAACTGGCT GCGGCAAGTG CTCTTCACTC CAGCAACGCA GGCAGCCAGG 12420 12421 CAGGCAGCCT GCACCATTGT GGAAGCTCTT GCCACCATTC CCAGCCGCAA ACAGCAGGTC 12480 12481 CTTGACCTCC TCACCAGCTA CCTGGACGAG CTGAGCATAG CTGGCGAGTG TGCTGCTGAG 12540 12541 TACTTGGCTC TCTACCAGAA GCTCATCACT TCGGCCCACT GGAAAGTCTA CCTTGCAGCT 12600 12601 CGGGGAGTCC TGCCCTACGT CGGCAACCTC ATCACCAAGG AGATTGCCCG CTTGCTGGCC 12660 12661 CTGGAGGAGG CCACCCTGAG CACTGACCTG CAGCAGGGTT ACGCCCTCAA AAGTCTCACA 12720 12721 GGCCTTCTTT CCTCCTTTGT TGAGGTGGAG TCCATCAAAA GACATTTTAA AAGTCGTTTG 12780 12781 GTGGGTACTG TGCTGAATGG ATACCTGTGC TTGCGAAAGC TGGTGGTGCA GAGGACCAAG 12840 12841 CTGATCGACG AGACCCAGGA CATGCTGCTT GAGATGCTAG AGGACATGAC CACAGGCACA 12900 12901 GAATCGGAAA CCAAGGCCTT CATGGCTGTG TGCATCGAGA CAGCCAAGCG CTACAACCTG 12960 12961 GATGACTACC GGACACCGGT CTTCATCTTT GAGAGGCTCT GCAGCATCAT TTATCCCGAG 13020 13021 GAGAATGAGG TCACTGAATT CTTTGTAACC CTGGAGAAGG ACCCCCAACA AGAAGACTTC 13080 13081 TTACAGGGCA GGATGCCCGG GAACCCATAT AGTAGCAATG AGCCAGGCAT TGGGCCACTT 13140 13141 ATGAGGGATA TAAAGAACAA GATTTGCCAG GACTGTGACT TGGTGGCTCT CCTGGAGGAT 13200 13201 GACAGCGGGA TGGAGCTTCT AGTGAACAAT AAGATCATTA GTTTGGACCT TCCTGTGGCT 13260 13261 GAGGTTTACA AGAAAGTCTG GTGTACCACG AATGAGGGAG AACCCATGAG GATTGTTTAC 13320 13321 CGCATGCGTG GGCTATTGGG TGATGCCACT GAGGAGTTTA TTGAGTCCCT GGACTCCACC 13380 13381 ACAGATGAAG AAGAAGATGA AGAGGAAGTA TATAAGATGG CTGGTGTGAT GGCCCAGTGT 13440 13441 GGGGGCCTGG AGTGCATGCT CAACAGACTG GCTGGGATCA AAGATTTCAA GCAGGGACGC 13500 13501 CACCTTCTAA CCGTACTCCT GAAATTGTTT AGTTACTGCG TAAAGGTGAA AGTCAACCGA 13560 13561 CAGCAGCTGG TTAAGCTGGA AATGAACACC TTGAATGTCA TGCTGGGGAC CCTAAACCTG 13620 13621 GCCCTGGTAG CTGAGCAGGA AAGCAAGGAC AGTGGTGGTG CAACTGTGGC TGAGCAGGTA 13680 13681 CTTAGCATCA TGGAGATCAT TCTGGATGAA TCCAACGCTG AGCCCCTGAG TGAGGACAAG 13740 13741 GGTAACCTTC TCTTGACGGG CGACAAAGAT CAGCTTGTGA TGCTCTTGGA CCAGATTAAT 13800 13801 AGCACCTTTG TTCGCTCCAA CCCTAGTGTG CTCCAGGGCC TGCTTCGTAT CATCCCATAC 13860 13861 CTATCCTTTG GCGAACTGGA GAAAATGCAG ATCTTGGTGG AGCGGTTCAA GCCATATTGT 13920 13921 AGTTTTGACA AGTATGATGA AGACCACAGT GGAGATGATA AAGTCTTCCT GGACTGCTTC 13980 13981 TGTAAGATTG CAGCTGGCAT CAAGAACAGC AGCAATGGGC ATCAGCTAAA GGATCTGATT 14040 14041 CTCCAGAAGG GAATCACCCA GAGTGCACTG GACTATATGA AAAAACACAT TCCCAGTGCC 14100 14101 AAGAATCTGG ATGCTGACAT CTGGAAAAAG TTCTTATCTC GCCCAGCCCT GCCATTCATC 14160 14161 TTGAGGCTTC TTCGAGGGTT GGCTGTCCAG CACCCTGCCA CCCAGGTGCT GATTGGAACA 14220 14221 GACTCCATCA CAAACCTACA TAAACTGGAA CAGGTGTCCA GTGACGAGGG CATTGGGACC 14280 14281 CTGGCAGAGA ACCTACTAGA GGCATTGCGG GAGCACCCTG ATGTAAACAA GAAGATCGAT 14340 14341 GCTGCCCGCA GGGAGACCCG CGCTGAGAAG AAGCGCATGG CCATGGCAAT GCGGCAGAAG 14400 14401 GCTCTGGGCA CCCTGGGCAT GACGACAAAC GAGAAGGGCC AGGTGGTAAC CAAGACCGCG 14460 14461 CTCCTGAAGC AGATGGAGGA GCTGATTGAG GAGCCAGGCC TCACGTGCTG CATCTGCAGG 14520 14521 GAAGGGTACA AGTTCCAGCC TACAAAAGTC CTGGGCATTT ATACTTTCAC CAAGCGGGTA 14580 14581 GCCTTGGAAG AGATGGAGAA TAAGCCCCGG AAACAGCAGG GCTACAGCAC CGTGTCCCAC 14640 14641 TTCAACATCG TGCACTACGA CTGCCACCTG GCTGCCGTCA GGCTGGCTCG TGGCCGAGAA 14700 14701 GAGTGGGAAA GTGCTGCCCT TCAGAATGCC AACACTAAGT GCAATGGACT CCTTCCGGTC 14760 14761 TGGGGGCCCC ATGTCCCTGA ATCAGCTTTT GCCACCTGCT TGGCAAGGCA CAACACTTAC 14820 14821 CTCCAGGAAT GTACGGGCCA GCGGGAGCCC ACGTACCAGC TCAACATCCA CGACATCAAG 14880 14881 CTGCTCTTCC TGCGCTTCGC CATGGAACAG TCGTTCAGCG CAGACACCGG CGGAGGCGGC 14940 14941 CGGGAGAGCA ACATCCACCT GATCCCGTAC ATCATTCACA CTGTGCTTTA CGTCCTGAAC 15000 15001 ACAACCCGAG CAACTTCCCG AGAGGAGAAG AACCTCCAGG GCTTTCTGGA GCAGCCCAAG 15060 15061 GAAAAGTGGG TGGAGAGTGC CTTTGAAGTG GACGGGCCCC ACTATTTCAC AGTCCTGGCC 15120 15121 CTCCACATCC TGCCCCCCGA GAAGTGGAGA GCCACACGTG TGGAAATCCT GCGGAGGCTG 15180 15181 CTGGTGACCT CTCAGGCCCG GGCAGTGGCT CCAGGCGGAG CCACCAGGCT GACAGACAAG 15240 15241 GCAGTGAAGG ACTACTCTGC CTACCGCTCT TCCCTTCTCT TCTGGGCCCT CGTCGATCTC 15300 15301 ATTTACAACA TGTTCAAGAA GGTGCCTACC AGTAACACGG AGGGAGGCTG GTCCTGCTCT 15360 15361 CTAGCGGAAT ACATCCGCCA CAATGACATG CCCATCTACG AAGCTGCTGA CAAAGCCCTG 15420 15421 AAGACCTTCC AGGAGGAGTT CATGCCAGTG GAGACCTTCT CGGAGTTCCT TGATGCGGCT 15480 15481 GGCCTGTTGT CAGAAGTCAC CGATCCCGAG AACTTCCTGA AGGACCTGTT GAACTCAATC 15540 15541 CCTTGA 15546 |
▼ ORTHOLOGY
DrLLPS ID | Organism | Identity | E-value | Score |
---|---|---|---|---|
LLPS-Mup-3008 | Mustela putorius furo | 99.11 | 0.0 | 9762 |
LLPS-Aim-4291 | Ailuropoda melanoleuca | 98.93 | 0.0 | 9720 |
LLPS-Fec-4506 | Felis catus | 98.78 | 0.0 | 9731 |
LLPS-Sus-0553 | Sus scrofa | 98.51 | 0.0 | 9726 |
LLPS-Myl-1985 | Myotis lucifugus | 98.43 | 0.0 | 6092 |
LLPS-Eqc-3758 | Equus caballus | 98.41 | 0.0 | 9725 |
LLPS-Urm-2369 | Ursus maritimus | 98.3 | 0.0 | 9646 |
LLPS-Mal-1872 | Mandrillus leucophaeus | 98.22 | 0.0 | 9659 |
LLPS-Man-0525 | Macaca nemestrina | 98.22 | 0.0 | 9660 |
LLPS-Chs-3726 | Chlorocebus sabaeus | 98.21 | 0.0 | 9618 |
LLPS-Cea-4173 | Cercocebus atys | 98.2 | 0.0 | 9655 |
LLPS-Mam-3971 | Macaca mulatta | 98.2 | 0.0 | 9656 |
LLPS-Gog-3570 | Gorilla gorilla | 98.18 | 0.0 | 9665 |
LLPS-Pat-3339 | Pan troglodytes | 98.18 | 0.0 | 9665 |
LLPS-Pap-4060 | Pan paniscus | 98.18 | 0.0 | 9665 |
LLPS-Rhb-1499 | Rhinopithecus bieti | 98.12 | 0.0 | 9649 |
LLPS-Hos-4232 | Homo sapiens | 98.08 | 0.0 | 9655 |
LLPS-Caj-0353 | Callithrix jacchus | 98.06 | 0.0 | 9651 |
LLPS-Paa-3033 | Papio anubis | 98.06 | 0.0 | 9661 |
LLPS-Maf-3969 | Macaca fascicularis | 98.01 | 0.0 | 9650 |
LLPS-Cas-2739 | Carlito syrichta | 97.91 | 0.0 | 9655 |
LLPS-Ict-4622 | Ictidomys tridecemlineatus | 97.85 | 0.0 | 9624 |
LLPS-Ran-1601 | Rattus norvegicus | 97.83 | 0.0 | 3626 |
LLPS-Aon-4727 | Aotus nancymaae | 97.79 | 0.0 | 9614 |
LLPS-Gaga-1077 | Gallus gallus | 97.67 | 0.0 | 627 |
LLPS-Bot-4448 | Bos taurus | 97.42 | 0.0 | 9621 |
LLPS-Mea-1814 | Mesocricetus auratus | 97.36 | 0.0 | 9637 |
LLPS-Dio-0461 | Dipodomys ordii | 97.29 | 0.0 | 846 |
LLPS-Orc-1845 | Oryctolagus cuniculus | 97.24 | 0.0 | 9634 |
LLPS-Fud-0563 | Fukomys damarensis | 97.13 | 0.0 | 9532 |
LLPS-Cap-3077 | Cavia porcellus | 97.1 | 0.0 | 9622 |
LLPS-Mod-3000 | Monodelphis domestica | 96.94 | 0.0 | 3587 |
LLPS-Poa-4264 | Pongo abelii | 96.94 | 0.0 | 9525 |
LLPS-Nol-4574 | Nomascus leucogenys | 96.9 | 0.0 | 9529 |
LLPS-Loa-3603 | Loxodonta africana | 96.81 | 0.0 | 9573 |
LLPS-Sah-1947 | Sarcophilus harrisii | 96.66 | 0.0 | 3581 |
LLPS-Mum-3834 | Mus musculus | 96.65 | 0.0 | 9575 |
LLPS-Ova-4191 | Ovis aries | 95.84 | 0.0 | 9416 |
LLPS-Anp-0761 | Anas platyrhynchos | 95.6 | 0.0 | 3541 |
LLPS-Meg-1948 | Meleagris gallopavo | 95.49 | 0.0 | 3541 |
LLPS-Otg-0425 | Otolemur garnettii | 95.48 | 0.0 | 1553 |
LLPS-Fia-3059 | Ficedula albicollis | 95.27 | 0.0 | 3525 |
LLPS-Pes-3569 | Pelodiscus sinensis | 94.38 | 0.0 | 3471 |
LLPS-Xet-0671 | Xenopus tropicalis | 93.96 | 4e-52 | 210 |
LLPS-Tag-2560 | Taeniopygia guttata | 92.98 | 0.0 | 3417 |
LLPS-Tar-3242 | Takifugu rubripes | 92.89 | 0.0 | 876 |
LLPS-Lac-0025 | Latimeria chalumnae | 91.54 | 0.0 | 3392 |
LLPS-Dar-0093 | Danio rerio | 91.03 | 0.0 | 3382 |
LLPS-Scf-0568 | Scleropages formosus | 90.99 | 0.0 | 3326 |
LLPS-Anc-1309 | Anolis carolinensis | 90.98 | 0.0 | 916 |
LLPS-Asm-3725 | Astyanax mexicanus | 90.92 | 0.0 | 3376 |
LLPS-Icp-1442 | Ictalurus punctatus | 90.31 | 0.0 | 3359 |
LLPS-Orn-2336 | Oreochromis niloticus | 89.82 | 0.0 | 3336 |
LLPS-Pof-2062 | Poecilia formosa | 89.72 | 0.0 | 3320 |
LLPS-Scm-3463 | Scophthalmus maximus | 89.71 | 0.0 | 3321 |
LLPS-Ten-1063 | Tetraodon nigroviridis | 89.32 | 0.0 | 3307 |
LLPS-Orl-3616 | Oryzias latipes | 89.27 | 0.0 | 3303 |
LLPS-Gaa-3918 | Gasterosteus aculeatus | 88.96 | 0.0 | 3304 |
LLPS-Xim-3182 | Xiphophorus maculatus | 85.29 | 0.0 | 3088 |
LLPS-Leo-1801 | Lepisosteus oculatus | 85.17 | 0.0 | 5295 |
LLPS-Ora-2443 | Ornithorhynchus anatinus | 83.75 | 0.0 | 1988 |
LLPS-Cis-0251 | Ciona savignyi | 58.02 | 0.0 | 1032 |
LLPS-Cii-1639 | Ciona intestinalis | 57.6 | 0.0 | 1016 |
LLPS-Cym-0986 | Cyanidioschyzon merolae | 54.39 | 4e-15 | 87.8 |
LLPS-Brd-1299 | Brachypodium distachyon | 53.23 | 1e-15 | 90.1 |
LLPS-Hov-0354 | Hordeum vulgare | 53.23 | 9e-16 | 90.1 |
LLPS-Chc-0366 | Chondrus crispus | 50.82 | 1e-15 | 89.7 |
LLPS-Ori-2397 | Oryza indica | 48.68 | 9e-16 | 90.1 |
LLPS-Orb-1973 | Oryza barthii | 48.68 | 1e-15 | 89.7 |
LLPS-Orni-2035 | Oryza nivara | 48.68 | 9e-16 | 90.1 |
LLPS-Orgl-1178 | Oryza glumaepatula | 48.68 | 9e-16 | 90.1 |
LLPS-Orp-0632 | Oryza punctata | 48.68 | 1e-15 | 90.1 |
LLPS-Orbr-2146 | Oryza brachyantha | 48.68 | 1e-15 | 89.7 |
LLPS-Sob-2100 | Sorghum bicolor | 48.68 | 8e-16 | 90.5 |
LLPS-Orr-1135 | Oryza rufipogon | 48.68 | 1e-15 | 89.7 |
LLPS-Orm-1165 | Oryza meridionalis | 48.68 | 8e-16 | 90.1 |
LLPS-Zem-1015 | Zea mays | 48.68 | 9e-16 | 90.1 |
LLPS-Drm-1781 | Drosophila melanogaster | 47.78 | 0.0 | 1697 |
LLPS-Osl-1290 | Ostreococcus lucimarinus | 47.62 | 2e-13 | 82.4 |
LLPS-Mua-0943 | Musa acuminata | 47.37 | 9e-16 | 90.1 |
LLPS-Lep-2151 | Leersia perrieri | 46.99 | 1e-15 | 90.1 |
LLPS-Tra-2237 | Triticum aestivum | 39.5 | 7e-16 | 90.5 |
LLPS-Tru-0498 | Triticum urartu | 39.5 | 8e-16 | 90.5 |
LLPS-Vir-1933 | Vigna radiata | 38.49 | 6e-122 | 441 |
LLPS-Sot-0187 | Solanum tuberosum | 35.71 | 3e-141 | 491 |
LLPS-Nia-2225 | Nicotiana attenuata | 35.39 | 0.0 | 879 |
LLPS-Cae-0925 | Caenorhabditis elegans | 35.14 | 4e-07 | 61.6 |
LLPS-Mae-2006 | Manihot esculenta | 35.07 | 0.0 | 855 |
LLPS-Amt-2056 | Amborella trichopoda | 35.02 | 4e-146 | 521 |
LLPS-Hea-2347 | Helianthus annuus | 34.81 | 0.0 | 861 |
LLPS-Php-1331 | Physcomitrella patens | 34.66 | 0.0 | 905 |
LLPS-Via-1875 | Vigna angularis | 34.64 | 0.0 | 851 |
LLPS-Coc-0068 | Corchorus capsularis | 34.63 | 0.0 | 884 |
LLPS-Met-1271 | Medicago truncatula | 34.34 | 0.0 | 875 |
LLPS-Viv-0323 | Vitis vinifera | 34.3 | 0.0 | 860 |
LLPS-Sei-2149 | Setaria italica | 34.25 | 0.0 | 894 |
LLPS-Glm-2522 | Glycine max | 34.2 | 0.0 | 858 |
LLPS-Phv-0724 | Phaseolus vulgaris | 34.18 | 0.0 | 894 |
LLPS-Pot-0365 | Populus trichocarpa | 34.15 | 0.0 | 844 |
LLPS-Thc-2452 | Theobroma cacao | 34.14 | 0.0 | 870 |
LLPS-Dac-2408 | Daucus carota | 34.05 | 0.0 | 825 |
LLPS-Art-0334 | Arabidopsis thaliana | 33.99 | 0.0 | 854 |
LLPS-Org-0689 | Oryza glaberrima | 33.96 | 6e-57 | 220 |
LLPS-Gor-2376 | Gossypium raimondii | 33.92 | 0.0 | 840 |
LLPS-Arl-2854 | Arabidopsis lyrata | 33.9 | 0.0 | 852 |
LLPS-Brr-0741 | Brassica rapa | 33.7 | 0.0 | 819 |
LLPS-Brn-0330 | Brassica napus | 33.51 | 0.0 | 868 |
LLPS-Bro-2363 | Brassica oleracea | 33.51 | 0.0 | 865 |
LLPS-Sem-1388 | Selaginella moellendorffii | 33.38 | 0.0 | 932 |
LLPS-Cus-2122 | Cucumis sativus | 32.74 | 0.0 | 881 |
LLPS-Sol-1251 | Solanum lycopersicum | 32.72 | 0.0 | 847 |
LLPS-Prp-0917 | Prunus persica | 32.35 | 0.0 | 847 |
LLPS-Ere-1067 | Erinaceus europaeus | 27.27 | 4e-06 | 57.8 |