LLPS-Art-0295
NDX

▼ OVERVIEW


Status: Reviewed
Protein Name: Nodulin homeobox; NDX1 homeobox protein homolog; AtNDX1
Gene Name: NDX, At4g03090, F4C21.1, T4I9.3
Ensembl Gene: AT4G03090
Ensembl Protein: AT4G03090.3
Organism: Arabidopsis thaliana
Taxa ID: 3702
LLPS Type: Client


▼ PROPERTY



——— Disorder propensity (calculated by IUPred2A)

▼ Classification


Condensates:
CondensateDescriptionTissue/CellPMIDs
Nucleolus
"...We identified 1602 proteins in the nucleolar and 2544 proteins in the nuclear fraction with an overlap of 1429 proteins."
Arabidopsis thaliana cells26980300

▼ FUNCTION


Regulates COOLAIR, a set of antisense transcripts originating from the 3' end of FLOWERING LOCUS C (FLC). Associates with single-stranded DNA that is part of an RNA-DNA hybrid, or R-loop, that covers the COOLAIR promoter. R-loop stabilization mediated by NDX inhibits COOLAIR transcription, which in turn modifies FLC expression.

▼ SEQUENCE


Protein Sequence (FASTA)
1     MVRLLQPKHM  VQAVNALHWR  NSVEFHKLLK  DNGDFSICFN  SEQVLPQKIS  VEKMVKMLPR  60
61    HLIAVVMTPN  KDGKSRYILC  GIRLLQTLCD  LTPRNAKLEQ  VLLDDVKLSA  QMIDLVILVI  120
121   IALGRNRKES  CNSNKESLLE  ATLVASCLHL  FHGFISPNSQ  DLVLVLLAHP  RVDVFIDSAF  180
181   GAVLNVVISL  KAKLLYRQTD  SPKKLGASSV  EEVNFHCQQA  EAALQFLHSL  CQHKPFRERV  240
241   AKNKELCGKG  GVLRLAQSIL  SLTITPEFVG  ATVTIASTSR  MKAKVLSILQ  HLFEAESVSF  300
301   LDEVANAGNL  HLAKTVASEV  LKLLRLGLSK  ASMATASPDY  PMGFVLLNAM  RLADVLTDDS  360
361   NFRSFFTEHF  SMVLSAVFCL  SHGDFLSMLC  SSDLSSREDD  ANVDYDLFKS  AGWILSVFSS  420
421   SGQSVTPQFK  LSLQNNLTMS  SYAHQRTSLF  IKMIANLHCF  VPNVCQEQDR  NRFIQNVMSG  480
481   LRKDPSSILI  KMLPGSSYTP  VAQRGTGVCR  NLGSLLRHAE  SLIPSSLNEE  DFLLLRVFCD  540
541   QLQPLIHSEF  EESQVQVKVK  KLFALLYIGF  TILWLICLVT  LIQDIEGRGG  NLSGKLKELL  600
601   NLNNEEASED  CDVRVEGVMT  KQGVNEEIDT  VERLKESDAD  ASNLETSGSD  TSSNRGKGLV  660
661   EEGELVQNMS  KRFKGSASGE  VKEDEKSETF  LVFEKQKKKR  KRSIMNADQM  GMIEKALAEE  720
721   PDLQRNSASR  QLWADKISQK  GSEVITSSQL  KNWLNNRKAK  LARANKQTGP  AHDNNSSGDL  780
781   PESPGDENTW  QQKPSTPIKD  QTVTETPKTG  ENLMRTSSSS  EEGIKQGQQV  RLMDERGDEI  840
841   GKGTVLRTDG  EWNGLSLETR  QICVVDVMEL  SESYDGSKKM  IPYGSDDVGR  TFTEANSRFG  900
901   VMRVAWDVNK  LQY  913
Nucleotide CDS Sequence (FASTA)
1     ATGGTTCGAT  TGTTGCAGCC  TAAACACATG  GTACAAGCTG  TGAACGCTTT  GCATTGGCGA  60
61    AACTCTGTGG  AATTTCATAA  GCTGCTTAAA  GATAATGGAG  ATTTCTCTAT  TTGCTTTAAC  120
121   TCTGAGCAAG  TGTTACCACA  AAAGATTAGT  GTTGAGAAGA  TGGTGAAAAT  GTTACCTCGG  180
181   CACCTCATTG  CGGTGGTTAT  GACTCCTAAT  AAAGATGGAA  AGTCTCGTTA  TATACTGTGT  240
241   GGGATCAGAC  TGTTGCAGAC  GTTGTGTGAC  TTAACACCTC  GTAATGCTAA  ACTCGAGCAG  300
301   GTCTTGCTTG  ACGATGTGAA  ATTATCAGCA  CAGATGATTG  ATCTGGTGAT  CCTTGTGATA  360
361   ATAGCTCTTG  GCCGTAACAG  AAAGGAAAGC  TGTAATTCGA  ATAAAGAATC  GTTACTAGAG  420
421   GCTACATTGG  TGGCTTCTTG  TCTCCACCTG  TTTCACGGGT  TTATCTCTCC  TAACTCCCAA  480
481   GATCTTGTTC  TCGTCTTGCT  TGCACACCCA  AGGGTTGATG  TGTTTATAGA  CAGTGCTTTT  540
541   GGAGCTGTTC  TCAATGTTGT  GATATCTTTG  AAAGCGAAGT  TGCTGTATAG  ACAAACTGAC  600
601   TCCCCAAAAA  AGTTAGGCGC  AAGTTCTGTA  GAGGAGGTTA  ACTTCCACTG  CCAACAAGCT  660
661   GAAGCTGCTT  TGCAGTTCCT  TCATTCTCTA  TGCCAACACA  AACCCTTTAG  AGAACGTGTC  720
721   GCTAAAAACA  AGGAGCTATG  TGGAAAAGGT  GGCGTTCTTA  GGCTAGCTCA  ATCCATACTA  780
781   TCACTAACTA  TTACACCTGA  ATTTGTTGGA  GCAACCGTAA  CTATAGCTTC  CACATCTAGA  840
841   ATGAAAGCAA  AAGTTCTTTC  AATTTTGCAG  CATCTGTTTG  AAGCGGAAAG  TGTCTCATTC  900
901   CTTGACGAGG  TTGCAAATGC  AGGAAACTTG  CATTTAGCCA  AAACTGTTGC  CTCAGAGGTT  960
961   CTTAAATTAT  TGAGGCTTGG  CCTTTCTAAA  GCTTCCATGG  CTACTGCTTC  TCCTGACTAC  1020
1021  CCGATGGGTT  TTGTGCTACT  TAACGCTATG  CGCTTGGCTG  ACGTGCTCAC  TGATGACTCA  1080
1081  AATTTTCGAT  CTTTTTTCAC  TGAACATTTT  AGCATGGTTC  TCAGCGCGGT  ATTTTGTCTC  1140
1141  TCTCATGGAG  ATTTCTTGTC  AATGTTGTGC  TCTTCTGATC  TTTCTTCAAG  GGAGGATGAT  1200
1201  GCGAATGTTG  ATTATGATCT  GTTTAAGTCA  GCTGGATGGA  TTCTAAGTGT  ATTTTCATCT  1260
1261  TCTGGGCAAT  CAGTCACACC  TCAATTCAAG  CTCAGTTTAC  AAAATAACCT  TACCATGTCT  1320
1321  TCATATGCAC  ATCAACGAAC  ATCCTTATTT  ATTAAAATGA  TTGCGAATCT  TCACTGTTTC  1380
1381  GTTCCCAACG  TGTGCCAAGA  ACAGGATAGG  AACCGTTTCA  TTCAGAATGT  TATGAGTGGA  1440
1441  TTGCGAAAAG  ATCCTTCAAG  CATATTGATT  AAGATGTTAC  CAGGCTCTTC  ATATACTCCT  1500
1501  GTGGCACAGA  GAGGCACTGG  TGTTTGCAGA  AACCTAGGTT  CTCTGTTGCG  CCATGCAGAA  1560
1561  TCCTTGATCC  CTAGTTCCCT  CAACGAGGAA  GATTTTCTGC  TTTTGAGGGT  GTTTTGTGAC  1620
1621  CAGTTACAGC  CGTTAATCCA  TTCCGAGTTT  GAGGAAAGTC  AAGTACAGGT  GAAGGATATT  1680
1681  GAAGGTAGGG  GCGGGAATTT  ATCTGGTAAG  CTAAAAGAGC  TTCTGAATCT  TAACAATGAG  1740
1741  GAAGCTTCAG  AGGATTGTGA  TGTCCGAGTT  GAAGGTGTGA  TGACAAAGCA  AGGCGTGAAC  1800
1801  GAGGAGATAG  ACACAGTTGA  AAGGTTGAAA  GAGAGCGATG  CAGATGCTAG  CAATCTTGAA  1860
1861  ACCAGTGGTT  CAGATACAAG  CTCTAACAGA  GGGAAGGGTC  TGGTTGAAGA  GGGAGAGTTG  1920
1921  GTTCAGAATA  TGAGCAAGCG  ATTTAAAGGC  AGTGCATCAG  GAGAGGTGAA  GGAGGATGAG  1980
1981  AAATCTGAAA  CCTTCCTTGT  CTTTGAGAAG  CAGAAGAAGA  AACGGAAGCG  TAGTATTATG  2040
2041  AATGCTGATC  AAATGGGGAT  GATTGAGAAG  GCGCTTGCTG  AAGAACCTGA  TTTGCAGCGG  2100
2101  AATTCAGCTT  CGAGACAGTT  ATGGGCTGAT  AAAATAAGTC  AAAAGGTGAG  TTCAAGAATT  2160
2161  CTACTCTTTT  TAATTTCCAA  CCACGTATTA  CGAACGATGC  GCTCGTTGAT  TACATGA  2217

▼ KEYWORD


ID
Family
Alternative promoter usage
Alternative splicing
Complete proteome
DNA-binding
Flowering
Homeobox
Nucleus
Reference proteome

▼ GENE ONTOLOGY


ID
Classification
Description
Cellular Component
Nucleolus
Cellular Component
Nucleus
Molecular Function
Single-stranded DNA binding
Biological Process
Flower development
Biological Process
Negative regulation of antisense RNA transcription

▼ KEGG



▼ ANNOTATION


Disorder
IUPred2A
Physicochemical
Compute pI/MwAAindex
Localization
COMPARTMENTSNLSdb

▼ ORTHOLOGY


DrLLPS IDOrganismIdentityE-valueScore
LLPS-Arl-0038Arabidopsis lyrata90.490.01636
LLPS-Brn-1202Brassica napus67.850.01205
LLPS-Brr-0973Brassica rapa67.710.01194
LLPS-Bro-2525Brassica oleracea67.710.01205
LLPS-Gor-2016Gossypium raimondii53.153e-172 532
LLPS-Pot-2430Populus trichocarpa52.830.0 557
LLPS-Coc-1031Corchorus capsularis51.582e-157 503
LLPS-Prp-1922Prunus persica50.093e-164 512
LLPS-Phv-1866Phaseolus vulgaris49.829e-155 486
LLPS-Vir-1530Vigna radiata48.648e-94 308
LLPS-Glm-2203Glycine max48.553e-152 480
LLPS-Met-2259Medicago truncatula47.853e-147 468
LLPS-Thc-2139Theobroma cacao47.490.0 717
LLPS-Mae-2059Manihot esculenta47.260.0 754
LLPS-Sol-2212Solanum lycopersicum45.141e-132 428
LLPS-Viv-2275Vitis vinifera44.970.0 696
LLPS-Hea-0009Helianthus annuus41.744e-27 122
LLPS-Ori-2272Oryza indica41.715e-74 266
LLPS-Zem-1556Zea mays41.586e-105 353
LLPS-Via-1510Vigna angularis41.350.0 590
LLPS-Brd-2206Brachypodium distachyon41.282e-101 343
LLPS-Cus-0153Cucumis sativus41.150.0 647
LLPS-Nia-1895Nicotiana attenuata41.060.0 577
LLPS-Sob-0889Sorghum bicolor41.011e-104 355
LLPS-Sei-1944Setaria italica40.882e-105 355
LLPS-Orbr-1784Oryza brachyantha40.72e-97 333
LLPS-Hov-0911Hordeum vulgare40.655e-103 348
LLPS-Org-1847Oryza glaberrima40.437e-98 335
LLPS-Tru-0734Triticum urartu40.244e-36 151
LLPS-Amt-0552Amborella trichopoda40.145e-114 381
LLPS-Lep-1481Leersia perrieri38.671e-87 305
LLPS-Orp-2038Oryza punctata38.355e-84 295
LLPS-Ors-2299Oryza sativa38.341e-22 107
LLPS-Orb-0064Oryza barthii38.274e-59 221
LLPS-Dac-0024Daucus carota38.175e-165 513
LLPS-Orr-1248Oryza rufipogon38.152e-84 296
LLPS-Orgl-0542Oryza glumaepatula38.041e-82 291
LLPS-Orni-1545Oryza nivara37.863e-82 290
LLPS-Tra-0878Triticum aestivum35.931e-150 475
LLPS-Orm-1848Oryza meridionalis35.891e-49 192
LLPS-Mua-1243Musa acuminata35.016e-123 399
LLPS-Sem-1110Selaginella moellendorffii34.82e-22 107
LLPS-Php-1533Physcomitrella patens31.822e-23 110