|
AtCHX08 / At2g28180 ›› ARAMEMNON · TAIR · PLANTS T · MIPS · TIGR
|
|
Exon/Intron Map (revised by K. Bock):
|
|
Sequences ›› partial cDNA [ Sze Lab ] · BAC F24D13 [ ATG-STOP: 11486 - 14231 ]
|
|
Genomic DNA (3746 bp): Revised coding sequence -- confirmed by cDNA from Sze Lab -- in blue. Start: 500 bp before ATG; End: 500 bp after STOP
1 ttttgaccaa tttagtgaca aacttggcat gaaactatct agaaatagtt tcctatctta
61 tgtgtgacca aagctgtttt cattcataga aagacagtca tcttatttaa agaaattcct
121 ttttttctgg tagattctta gttgacttct ttctgtggtt tataacttat aagaagctga
181 agacatatac tatttatagt ggagaaacaa ctcaaaaagg gtctaaaact acatatgtct
241 atagatgtcc aaagtgtgtt ttttaaaaac agcataacct ataaacagca tatacatctg
301 tgtgtcattt gtttacaaat tgtttctttg ttatcaaaag gaacacaaca tttattttgt
361 tatcgtcttc cccattttga caagaactca cttaagattt tgaagtcatg gaaatgggga
421 acgggacagg acccgggatg tcgggggctt ttagtggagg cggcaacgag attttgggaa
481 tgggaggtgg cggcggcata ATGGGAGGCG GTGATATTTC ACATATGAGT CCTGAAGTAA
541 AATGGATATT TGAGATGGCT TGGTATGGTG AAACTGTAAG ATATGATGGG TTAATTTGTG
601 AGGAGCATCC CCCTAAGCTC TCTTCAGATG GTATTTGGGA GAAACTTATT ATTAAATCGG
661 CAGGTCTATA TTTTTGGCAA TATCGTCTTC CGAAGCTCGA GATTGTCATC TTGCTCGTCT
721 TCTTTCTTTG GCAAGGCTTT AATATCTTGT TTAAGAAATT GGGTCTTTCT ATTCCCAAGT
781 TATCCTCTAT GATGCTTgtg agctcttctt cttcttgttt gtcttttttt tttcttcaaa
841 acttgtgtgt ttgctgctga gaaaatcgat gttgcattgc agGCAGGGCT ACTCTTGAAT
901 GTTCTAGTTA CTTTATCGGG AGAGAACTCG ATCATTGCGG ATATCTTGGT CACGAAAAAC
961 AGAATCGACG TAGCAGGATG CCTTGGATCA TTTGGATTCT TGATTTTCTG GTTCCTCAAA
1021 GGTGTAAGAA TGGACGTCAA GAGAATCTTC AAGGCTGAAG CAAAAGCAAG AGTCACTGGA
1081 GTTGCAGCGG TTACTTTCCC TATAGTTGTT GGCTTCTTGC TTTTCAATCT CAAATCAGCT
1141 AAGAATCGAC CTCTCACTTT CCAAGAGTAT GATGTAATGC TACTAATGGA AAGCATCACG
1201 TCCTTCTCGG GGATCGCAAG ACTCTTGCGT GACCTTGGCA TGAACCATTC ATCTATTGGC
1261 CGGGTTGCTT TATCCTCAGC CTTAGTCTCT GATATAGTTG GACTCCTGCT CTTGATTGCG
1321 AACGTTTCTA GAAGTTCAGC AACTTTAGCT GATGGTTTGG CTATACTTAC AGAGATAACC
1381 TTATTCCTCG TCATTGCATT TGCGGTTGTG AGGCCGATAA TGTTCAAAAT AATAAAGCGG
1441 AAAGGAGAAG GAAGACCAAT CGAAGACAAA TACATCCACG GGGTTCTCGT CTTGGTTTGC
1501 TTATCTTGTA TGTATTGGGA AGATCTTAGC CAGTTTCCTC CACTTGGAGC CTTCTTTCTT
1561 GGTCTCGCCA TTCCCAATGG ACCTCCTATT GGATCTGCAT TGGTCGAACG ATTAGAAAGC
1621 TTCAATTTTG GTATCATATT ACCTCTTTTC TTAACAGCCG TTATGCTCAG GACTGATACC
1681 ACTGCTTGGA AAGGCGCTTT GACATTCTTT AGTGGCGATG ATAAGAAATT TGCGGTTGCG
1741 TCTCTCGTCT TGCTCATTTT CTTGTTGAAG CTCTCTGTCT CAGTCATTGT TCCTTACCTC
1801 TATAAAATGC CGTTGAGAGA CTCTATTATC CTTGCCCTAA TAATGTCTCA TAAGGGTATT
1861 ATCGAACTCA GCTTCTACCT TTTCTCTCTA AGCCTCAAGg ttagttttct catattttga
1921 ttcttgatca tgattatgat ttcttgccaa aaacaaaaag ttgaaaagat tcattgtttt
1981 gattcttttg attgatgatt tggtacagTT GGTAACCAAA GATACATTCT CAATTCTAGT
2041 CTTGTCCATT GTCCTCAACT CTCTGCTCAT ACCAATGGCG ATCGGGTTTC TCTACGACCC
2101 ATCTAAACAA TTCATATGCT ACCAAAAGAG AAATTTAGCG AGTATGAAGA ACATGGGAGA
2161 GCTAAAGACT CTTGTGTGCA TCCATAGACC AGACCACATA TCTTCCATGA TCAACCTTCT
2221 TGAAGCTTCT TATCAATCCG AAGACAGTCC TCTCACTTGC TACGTCCTTC ACCTCGTCGA
2281 GTTACGAGGT CAAGACGTTC CCACTTTGAT CTCACACAAA GTTCAGAAAC TCGGAGTCGG
2341 GGCTGGAAAT AAATATTCCG AAAATGTCAT CCTCTCTTTT GAACATTTCC ACCGTTCTGT
2401 CTGCAGTTCC ATTTCCATAG ACACATTCAC TTGCATCGCA AACGCAAACC ATATGCAGGA
2461 TGACATTTGT TGGCTAGCTC TTGATAAAGC TGTCACGCTT ATCATTCTTC CTTTTCACCG
2521 GACTTGGTCA CTTGACCGAA CATCCATCGT ATCCGACGTT GAGGCGATCC GATTTCTGAA
2581 TGTCAACGTC TTGAAACAAG CACCTTGCTC TGTCGGCATT CTTATCGAAC GCCATCTCGT
2641 TAACAAGAAG CAAGAACCAC ATGAAAGCCT TAAGgtatac ttacaattac atctactatg
2701 cttccatgtt ggtcgttgta gatgctttaa ctatatatat agaaaaacta agcacactaa
2761 gctatttgat gatagGTGTG TGTAATATTC GTGGGAGGAA AAGACGATAG GGAAGCTTTG
2821 GCCTTTGCGA AGCGAATGGC CCGTCAAGAG AACGTAACAT TAACAGTTCT ACGCCTCCTA
2881 GCATCAGGAA AGAGCAAAGA CGCGACAGGA TGGGATCAAA TGCTTGACAC GGTGGAACTA
2941 AGAGAGTTGA TTAAAAGCAA CAATGCCGGA ATGGTAAAAG AAGAAACATC AACAATTTAT
3001 TTGGAACAAG AGATATTGGA TGGAGCGGAT ACGTCAATGC TTCTACGTTC CATGGCTTTC
3061 GATTACGATC TTTTCGTCGT GGGAAGAACA TGCGGCGAGA ACCACGAGGC AACCAAAGGT
3121 ATAGAGAATT GGTGTGAGTT TGAGGAGCTT GGAGTCATTG GTGATTTCTT GGCCTCGCCG
3181 GATTTTCCGA GTAAAACATC GGTGTTAGTA GTGCAACAAC AACGAACGGT AGCCAATAAT
3241 AATTAGaagc ggagaaagca tactatggtt gtggttgttg ttacatctat ctctcctttg
3301 tgtagcatct acaaaaagaa gtttgaaaag aagaaaaaaa atgtatgacc acattttatt
3361 tgtttttttc attgctactt gtggaatatt gttttttgta gagcagaaag aaagactttg
3421 aagaatgttt taatccttaa attttcctat ccgaatagtc tcctagggtg aataattata
3481 agaacgaaaa atagtttatt tactagaata attgaaaatt tcttattagt tactagaaat
3541 agtttcatac ttagtttact ttttgaagct ttcttacgaa aattagagtt cgtaatcaaa
3601 aggatctagt ggttgagggc tcgctttgtt agtcaacctt acagtttttg gctatttttg
3661 atcacttttt tgtccaaatt ccatttgcga attgcgacca ctgatttttt ttttttacac
3721 tttcatataa ggcaagtaac acgtct
|
|
Revised Coding Sequence (2451 bp):
1 ATGGGAGGCG GTGATATTTC ACATATGAGT CCTGAAGTAA AATGGATATT TGAGATGGCT
61 TGGTATGGTG AAACTGTAAG ATATGATGGG TTAATTTGTG AGGAGCATCC CCCTAAGCTC
121 TCTTCAGATG GTATTTGGGA GAAACTTATT ATTAAATCGG CAGGTCTATA TTTTTGGCAA
181 TATCGTCTTC CGAAGCTCGA GATTGTCATC TTGCTCGTCT TCTTTCTTTG GCAAGGCTTT
241 AATATCTTGT TTAAGAAATT GGGTCTTTCT ATTCCCAAGT TATCCTCTAT GATGCTTGCA
301 GGGCTACTCT TGAATGTTCT AGTTACTTTA TCGGGAGAGA ACTCGATCAT TGCGGATATC
361 TTGGTCACGA AAAACAGAAT CGACGTAGCA GGATGCCTTG GATCATTTGG ATTCTTGATT
421 TTCTGGTTCC TCAAAGGTGT AAGAATGGAC GTCAAGAGAA TCTTCAAGGC TGAAGCAAAA
481 GCAAGAGTCA CTGGAGTTGC AGCGGTTACT TTCCCTATAG TTGTTGGCTT CTTGCTTTTC
541 AATCTCAAAT CAGCTAAGAA TCGACCTCTC ACTTTCCAAG AGTATGATGT AATGCTACTA
601 ATGGAAAGCA TCACGTCCTT CTCGGGGATC GCAAGACTCT TGCGTGACCT TGGCATGAAC
661 CATTCATCTA TTGGCCGGGT TGCTTTATCC TCAGCCTTAG TCTCTGATAT AGTTGGACTC
721 CTGCTCTTGA TTGCGAACGT TTCTAGAAGT TCAGCAACTT TAGCTGATGG TTTGGCTATA
781 CTTACAGAGA TAACCTTATT CCTCGTCATT GCATTTGCGG TTGTGAGGCC GATAATGTTC
841 AAAATAATAA AGCGGAAAGG AGAAGGAAGA CCAATCGAAG ACAAATACAT CCACGGGGTT
901 CTCGTCTTGG TTTGCTTATC TTGTATGTAT TGGGAAGATC TTAGCCAGTT TCCTCCACTT
961 GGAGCCTTCT TTCTTGGTCT CGCCATTCCC AATGGACCTC CTATTGGATC TGCATTGGTC
1021 GAACGATTAG AAAGCTTCAA TTTTGGTATC ATATTACCTC TTTTCTTAAC AGCCGTTATG
1081 CTCAGGACTG ATACCACTGC TTGGAAAGGC GCTTTGACAT TCTTTAGTGG CGATGATAAG
1141 AAATTTGCGG TTGCGTCTCT CGTCTTGCTC ATTTTCTTGT TGAAGCTCTC TGTCTCAGTC
1201 ATTGTTCCTT ACCTCTATAA AATGCCGTTG AGAGACTCTA TTATCCTTGC CCTAATAATG
1261 TCTCATAAGG GTATTATCGA ACTCAGCTTC TACCTTTTCT CTCTAAGCCT CAAGTTGGTA
1321 ACCAAAGATA CATTCTCAAT TCTAGTCTTG TCCATTGTCC TCAACTCTCT GCTCATACCA
1381 ATGGCGATCG GGTTTCTCTA CGACCCATCT AAACAATTCA TATGCTACCA AAAGAGAAAT
1441 TTAGCGAGTA TGAAGAACAT GGGAGAGCTA AAGACTCTTG TGTGCATCCA TAGACCAGAC
1501 CACATATCTT CCATGATCAA CCTTCTTGAA GCTTCTTATC AATCCGAAGA CAGTCCTCTC
1561 ACTTGCTACG TCCTTCACCT CGTCGAGTTA CGAGGTCAAG ACGTTCCCAC TTTGATCTCA
1621 CACAAAGTTC AGAAACTCGG AGTCGGGGCT GGAAATAAAT ATTCCGAAAA TGTCATCCTC
1681 TCTTTTGAAC ATTTCCACCG TTCTGTCTGC AGTTCCATTT CCATAGACAC ATTCACTTGC
1741 ATCGCAAACG CAAACCATAT GCAGGATGAC ATTTGTTGGC TAGCTCTTGA TAAAGCTGTC
1801 ACGCTTATCA TTCTTCCTTT TCACCGGACT TGGTCACTTG ACCGAACATC CATCGTATCC
1861 GACGTTGAGG CGATCCGATT TCTGAATGTC AACGTCTTGA AACAAGCACC TTGCTCTGTC
1921 GGCATTCTTA TCGAACGCCA TCTCGTTAAC AAGAAGCAAG AACCACATGA AAGCCTTAAG
1981 GTGTGTGTAA TATTCGTGGG AGGAAAAGAC GATAGGGAAG CTTTGGCCTT TGCGAAGCGA
2041 ATGGCCCGTC AAGAGAACGT AACATTAACA GTTCTACGCC TCCTAGCATC AGGAAAGAGC
2101 AAAGACGCGA CAGGATGGGA TCAAATGCTT GACACGGTGG AACTAAGAGA GTTGATTAAA
2161 AGCAACAATG CCGGAATGGT AAAAGAAGAA ACATCAACAA TTTATTTGGA ACAAGAGATA
2221 TTGGATGGAG CGGATACGTC AATGCTTCTA CGTTCCATGG CTTTCGATTA CGATCTTTTC
2281 GTCGTGGGAA GAACATGCGG CGAGAACCAC GAGGCAACCA AAGGTATAGA GAATTGGTGT
2341 GAGTTTGAGG AGCTTGGAGT CATTGGTGAT TTCTTGGCCT CGCCGGATTT TCCGAGTAAA
2401 ACATCGGTGT TAGTAGTGCA ACAACAACGA ACGGTAGCCA ATAATAATTA G
|
|
Revised Protein Sequence (822 aa):
1 MGGGDISHMS PEVKWIFEMA WYGETVRYDG LICEEHPPKL SSDGIWEKLI IKSAGLYFWQ
61 YRLPKLEIVI LLVFFLWQGF NILFKKLGLS IPKLSSMMLA GLLLNVLVTL SGENSIIADI
121 LVTKNRIDVA GCLGSFGFLI FWFLKGVRMD VKRIFKAEAK ARVTGVAAVT FPIVVGFLLF
181 NLKSAKNRPL TFQEYDVMLL MESITSFSGI ARLLRDLGMN HSSIGRVALS SALVSDIVGL
241 LLLIANVSRS SATLADGLAI LTEITLFLVI AFAVVRPIMF KIIKRKGEGR PIEDKYIHGV
301 LVLVCLSCMY WEDLSQFPPL GAFFLGLAIP NGPPIGSALV ERLESFNFGI ILPLFLTAVM
361 LRTDTTAWKG ALTFFSGDDK KFAVASLVLL IFLLKLSVSV IVPYLYKMPL RDSIILALIM
421 SHKGIIELSF YLFSLSLKLV TKDTFSILVL SIVLNSLLIP MAIGFLYDPS KQFICYQKRN
481 LASMKNMGEL KTLVCIHRPD HISSMINLLE ASYQSEDSPL TCYVLHLVEL RGQDVPTLIS
541 HKVQKLGVGA GNKYSENVIL SFEHFHRSVC SSISIDTFTC IANANHMQDD ICWLALDKAV
601 TLIILPFHRT WSLDRTSIVS DVEAIRFLNV NVLKQAPCSV GILIERHLVN KKQEPHESLK
661 VCVIFVGGKD DREALAFAKR MARQENVTLT VLRLLASGKS KDATGWDQML DTVELRELIK
721 SNNAGMVKEE TSTIYLEQEI LDGADTSMLL RSMAFDYDLF VVGRTCGENH EATKGIENWC
781 EFEELGVIGD FLASPDFPSK TSVLVVQQQR TVANNN
›› Fasta: All CHX Protein Sequences
|
|
Back to the Top · Last Revision: October 16, 2006 |