Gene Id: HORVU.MOREX.r2.5HG0417900
Gene ID in V1: HORVU5Hr1G086970
Gene ID in V3: HORVU.MOREX.r3.5HG0503060
exon number: 5
Chromosome number: chr5H
Start Position: 519324180
End Position: 519328854
Gene length: 4674 bp
Strand positive: +
Protein Id: HORVU.MOREX.r2.5HG0417900.1
Protein length: 942 aa
Molecular weight: 102.8666 kDa
Theoretical pI: 7.83
Total number of negatively charged residues(Asp + Glu): 97
Total number of positively charged residues (Arg + Lys): 99
Instability index (II): 48.76
Aliphatic index: 73.25
Grand average of hydropathicity (GRAVY): -0.457
GO: GO:0003677,GO:0005634,GO:0006351,GO:0006355
Protein ID | Ortholog | gene symbol |
---|---|---|
HORVU.MOREX.r2.5HG0417900.1 | AT1G20640.1 | NLP4 |
HORVU.MOREX.r2.5HG0417900.1 | Os09t0549450-00 | - |
Protein sequence:
>HORVU.MOREX.r2.5HG0417900.1
MKGRSKARAGPNKARTSTANFQFSFFHLAAAAAAAERASRQTGAPLERNGGGAALWWSTEPFLELLLAGGGGQRRAKEEEGGKSRRGTHHTITSSEGSLTCMEGGDAQPGISVGRALSSEGDLDLLEQLLSGDNAWLEVAPNTSRSPNFFASPSTFLSDATTTTTMPAGASSALWIQPPSSTVRQRFEQALDHIKQMQRDAGMLVQLWVPQKNSDGKLVLTTSGQPFTLDNTSDRLIQFRDVSTHYHFSADAGSEASPVGLPGRVFIGKLPEWSPDVRLFTSYEYPRVKYAQDLDVHGTMGLPVFEKGSYSCLGVMELIMTRQKLNFTSEINNICNALQAVNLRSTEVSSTPRATKFNSASYRDALPEILEVLRAACVTHNLPLAQTWVTCAQQGKGGSRHTDENYPYCISTIDTACYVNDPQMQNFHDSCSDHHLLRGQGVAGKAFETNQPCFLPDIGSSAKEHYPLSHHAKIFNLKGAVAIRLRCTRTGTADFVLEFFLPTNCEALEEQKAVLDSLSGTMRNTCRTLRVVTDKEMGDEAMLDRNELNTFGPQGKNKVEELSFGDQATEHREEASWTSLAGTSKESDLAELSMHGMLSPEGQGLSLAGAQTSAQGSKGKRRTKTEKTVSLPVLRQYFAGSLKDAARSLGVCPTTLKRICRQHGINRWPSRKIKKVDHSLRKLQQIIDSVHGGETAFQLNTLYKDLTNTSVSSDNNLSGSITVPPHKQSNLTDFERHGHHRLSNNVPSTSHSHSSCSQSSDSSPSSCSGGSTKYPPQAGVDLLMSGNPVNHSPVQTLQTENASIIGHFPVQEAPDLLHNLNQKALGGQHSSRSPSPPKQNADTGMRIKAAFGSEKVRFRLKPECSFQELKQEMARRLSIVDISFLIVKYLDDDLEWVLMTCDADLQECLHVYKLANLQTVKISVHLAPIPEARVTVGRTGLS
CDS sequence:
>HORVU.MOREX.r2.5HG0417900
ATGAAAGGCAGAAGCAAAGCGAGGGCAGGGCCAAACAAGGCCAGGACAAGCACAGCAAATTTCCAGTTTTCCTTTTTCCATCTCGCAGCAGCGGCAGCGGCAGCGGAGAGAGCGAGCAGGCAGACAGGGGCGCCTTTGGAAAGGAATGGTGGGGGTGCGGCGCTCTGGTGGAGCACTGAGCCTTTTCTAGAGCTGCTACTTGCCGGCGGAGGAGGCCAGCGTAGAGCCAAGGAGGAAGAGGGAGGGAAGAGCAGGAGGGGGACACACCATACGATTACCAGTTCTGAAGGAAGCCTTACCTGTATGGAAGGGGGAGACGCCCAGCCCGGCATCTCCGTGGGGCGCGCCTTGTCGTCGGAGGGCGACCTGGACCTCCTGGAGCAGCTGCTCTCCGGCGACAACGCCTGGCTTGAAGTGGCGCCCAACACTTCACGCTCACCCAACTTTTTTGCTTCTCCCTCCACCTTCCTCTCAGATGCCACAACCACCACCACGATGCCGGCAGGTGCAAGCAGCGCCCTGTGGATTCAGCCACCCTCCTCCACCGTCCGGCAAAGGTTCGAGCAAGCCCTGGATCACATCAAGCAGATGCAGAGAGATGCCGGCATGCTTGTGCAGCTATGGGTGCCGCAAAAAAACAGTGATGGGAAGCTGGTGCTGACAACGAGCGGGCAGCCGTTCACGCTGGACAACACCTCAGATAGACTCATACAGTTCAGGGACGTGTCCACGCATTACCACTTCTCTGCAGATGCTGGGTCTGAGGCCTCACCGGTTGGGCTCCCCGGGAGGGTGTTCATTGGCAAGCTACCTGAGTGGTCGCCGGACGTTCGGCTCTTCACCAGCTACGAATACCCCAGGGTAAAATATGCACAGGATTTGGACGTCCATGGGACGATGGGGCTGCCGGTGTTCGAGAAGGGGAGTTACTCGTGCTTGGGTGTCATGGAATTGATCATGACCAGGCAGAAGCTCAACTTCACCTCAGAGATCAACAACATCTGCAATGCTCTCCAGGCAGTTAACTTGAGAAGCACAGAAGTTTCAAGCACTCCACGCGCCACAAAGTTCAACAGTGCTTCCTACAGAGATGCTCTACCAGAGATACTAGAAGTCCTAAGAGCAGCCTGCGTCACCCACAATCTCCCATTAGCTCAGACCTGGGTCACATGTGCTCAACAAGGGAAAGGGGGAAGCCGCCACACCGATGAGAACTACCCGTACTGCATCTCCACCATCGACACAGCATGCTACGTCAATGATCCCCAGATGCAGAACTTCCATGACTCCTGCTCTGACCACCACCTTCTGCGTGGACAAGGGGTTGCAGGGAAAGCCTTTGAAACAAACCAGCCATGCTTCTTACCAGACATTGGATCTTCAGCTAAAGAACACTATCCATTGTCCCACCATGCCAAGATCTTCAACTTAAAAGGTGCCGTGGCAATTCGGTTGAGGTGCACACGGACTGGGACAGCGGACTTTGTGCTAGAGTTCTTTCTGCCGACCAACTGTGAAGCCCTCGAGGAGCAGAAGGCAGTGCTGGACTCCTTGTCAGGCACCATGCGCAATACTTGTCGAACTCTACGTGTGGTTACAGACAAGGAGATGGGTGATGAGGCAATGCTGGACAGGAATGAGCTGAACACATTCGGTCCTCAAGGGAAGAATAAAGTTGAAGAGTTGTCCTTTGGAGATCAAGCAACAGAACATAGAGAGGAGGCATCATGGACAAGTCTAGCAGGGACTTCAAAAGAATCAGATTTAGCTGAATTAAGTATGCATGGTATGCTATCACCTGAAGGACAAGGTCTATCTCTAGCTGGTGCTCAGACAAGTGCACAAGGCAGCAAAGGAAAAAGGCGCACAAAGACGGAGAAGACTGTGAGCTTGCCAGTTCTTCGGCAGTACTTTGCTGGTAGCCTGAAAGATGCAGCAAGGAGCCTTGGAGTGTGTCCTACCACCCTCAAAAGAATATGCAGGCAGCATGGCATAAATCGCTGGCCATCACGAAAGATCAAGAAGGTAGACCATTCTCTAAGAAAGCTGCAGCAGATCATTGATTCAGTTCATGGAGGAGAGACAGCTTTCCAGCTTAATACCCTGTACAAGGATCTCACAAACACCTCTGTATCATCTGACAACAATTTATCAGGGAGCATCACAGTTCCTCCACATAAGCAGAGCAATCTTACTGATTTTGAGAGGCACGGACACCACAGGCTAAGCAATAATGTGCCATCAACCTCACACTCACACTCATCATGCAGCCAAAGTTCTGATTCAAGCCCGTCATCGTGCAGTGGTGGATCAACAAAGTATCCACCTCAGGCTGGAGTTGATTTGCTTATGTCAGGAAATCCTGTAAATCACAGCCCTGTCCAGACTCTGCAAACAGAAAATGCATCAATAATAGGACATTTCCCAGTTCAGGAGGCACCAGATCTGTTACATAATCTGAACCAAAAGGCTTTAGGTGGGCAGCATTCTTCTCGAAGCCCATCACCCCCGAAACAGAATGCAGATACAGGTATGAGAATAAAGGCCGCATTTGGCTCAGAAAAGGTCAGGTTCAGATTGAAGCCTGAGTGTAGTTTTCAAGAACTGAAGCAGGAGATGGCAAGACGTTTGAGTATAGTAGACATAAGTTTTTTGATTGTAAAGTACCTGGATGATGATTTAGAGTGGGTCTTGATGACATGCGATGCAGATTTACAGGAATGCCTTCATGTATATAAACTAGCAAATCTCCAAACAGTGAAAATTTCAGTTCATCTAGCTCCTATTCCAGAGGCGAGGGTCACTGTTGGTCGCACTGGTTTGTCATGA