AtCHX16 / At1g64170 ›› ARAMEMNON · TAIR · PlantsT · MIPS · TIGR
Exon/Intron Map (revised by K. Bock):
Sequences ›› partial cDNA [ Sze Lab ] · BAC F22C12 [ ATG-STOP: 32601 - 35655 ]
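The BAC coordinates above can be cross-checked against the sequence lengths quoted on this page. This is a minimal Python sketch. It assumes the coordinates are 1-based and inclusive, and that the gene runs in the same direction as the BAC numbering; neither is stated on this page. The function names are illustrative only.

```python
def inclusive_span(start, stop):
    """Length of a 1-based, inclusive interval."""
    return stop - start + 1


def excerpt_window(atg, stop, flank=500):
    """BAC coordinates of an excerpt that adds `flank` bp on each side.

    Assumes the gene is oriented along the BAC numbering (an assumption,
    not something stated on the page).
    """
    return atg - flank, stop + flank


atg_to_stop = inclusive_span(32601, 35655)
window = excerpt_window(32601, 35655)
print(atg_to_stop)               # 3055
print(inclusive_span(*window))   # 4055
```

Under these assumptions, the ATG-to-STOP span plus 500 bp on each side gives 4055 bp, which agrees with the length stated for the genomic DNA listing.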
Genomic DNA (4055 bp): The revised coding sequence, confirmed by cDNA from the Sze Lab, is shown in uppercase (blue on the original page). Start: 500 bp before ATG; End: 500 bp after STOP.
1 ttgcgcgaca tgaaaacttt ctgattatag aatatgaaag tataacgatg acgttttgtt
61 tttgacaaaa tacgatgaca tctaatttat ttatctttta tcaacacaca caacataaag
121 catagtccac aagcaatctt tttttacaat gtgatggaat ctgcttgaat attgaagtaa
181 cgaaccagtg tccggtttaa ccacaccaaa atttgaaatg gcataaaccg gtgacattgt
241 tagtccactt tataaggttt tgtcttggcc tgctctatca gatttcatta gtcagaaagt
301 aaatcgaaaa atcttatcac ctacttgata taccctacgg acactatttt caaaccaagt
361 tgttggtttg agatcttatg tatatatata ttacatacat acacatatat agaccacata
421 agatactcta gggaatcaca ataagtcaat aatgattata taagtcagac acaaagatat
481 ttacattttt gcagacaata ATGGGTACTT TGGTCAACGG TACTATTCCG GCGATGAAGT
541 GTCCCAAGAA TGTGGCGATG ATGAAGACAA CGTCTAACGG AGTGTTCGAT GGAGAGAGTC
601 CATTAGATTT CGCTTTTCCT CTTGTCATTC TTCAGATTTG CCTTGTCGTC GCCGTCACTC
661 GCTCTTTGGC CTTCCTTCTC CGTCCCATGA GACAACCACG TGTTGTCGCC GAGATCATTg
721 taagtcctcc atcaaccggt ttaggtcagt cttatagttt ccggtttaat aaatatccta
781 ccagactgaa atatgagtta taccgggatt ccgttcagct tattactgta tgtggtttct
841 tggttagaaa aaataataac tacaaaccta attcctcttt agtcacacat gtcaggttca
901 attttaatca attatctctt taattttcag acaattggtt cagttttttt tggacatgtt
961 ttttttttta tttttcgttt catcactaaa caatcatgaa gccatcttac taatttggtt
1021 tagaaattat gtagattcgt ttataaatag ttaatttgtt ctttagaaaa catccaataa
1081 aattggaaag atacaataat acaaataaga ttagcatggt ggatactgat tttaaaaaat
1141 tatatttggt tgtatttatt tgtattggta taaaaacatt tcacttatcc agtttaattc
1201 aaaaactaaa ccagacagaa attgatttaa tttgtaccgg gatttagGGC GGGATTCTTC
1261 TTGGCCCGTC GGCTCTCGGT AGAATCACGT CGTACAAGAA TTCAATTTTC CCGGCGAGAA
1321 GTCTAACGGT GCTCGACACT CTAGCAAACC TCGGCCTCCT TCTCTTCCTT TTCCTCGTCG
1381 GACTCGAGAT CGATCTGACA TCTCTCCGAC GTACCGGCAA AAAAGCGATT TCCATCGCCG
1441 CCGCCGGAAT GCTTCTCCCC TTCGGTATGG GCATTGTCAC CTCCTTCGCT TTCCCTGAAG
1501 CTTCTTCTTC TGGCGACAAT AGCAAAGTAC TTCCGTTCAT CATTTTCATG GGAGTCGCGC
1561 TCTCGATCAC TGCTTTCGGA GTTTTGGCGA GAATACTCGC TGAATTAAAG CTACTCACAA
1621 CCGATCTCGG CCGGATTTCA ATGAACGCCG CCGCAATTAA CGACGTCGCG GCTTGGGTTC
1681 TCCTAGCTCT CGCTGTATCT CTCTCCGGCG ATAGGAATTC TCCGCTTGTT CCTCTCTGGG
1741 TTCTGTTAAG TGGAATCGCG TTTGTGATCG CGTGTTTTTT AATCGTACCG CGAATTTTCA
1801 AATTCATCTC TCGGCGTTGC CCAGAAGGCG AACCCATAGG CGAAATGTAC GTTTGCGTCG
1861 CCCTTTGCGC GGTTCTACTC GCGGGTTTCG CGACAGACGC CATTGGGATT CACGCGATTT
1921 TCGGTGCGTT TGTGATGGGC GTTTTGTTTC CCAAAGGACA TTTCTCAGAC GCGATTGTGG
1981 AGAAGATTGA AGATCTTGTA ATGGGTCTTC TTCTACCACT GTACTTCGTT ATGAGCGGTT
2041 TGAAAACGGA TATAACTACG ATTCAAGGTG TGAAATCGTG GGGACGACTC GCGTTGGTGA
2101 TTGTTACGGC TTGTTTTGGC AAAATCGTTG GGACTGTGAG TGTTGCCTTG TTATGCAAGG
2161 TAAGGCTCCG TGAATCGGTT GTTCTTGGGG TTTTAATGAA CACAAAGGGT TTAGTGGAGC
2221 TAATTGTTCT CAACATTGGC AAAGACAGAA AGgtactttc atctattcta aaacaatata
2281 attacaagct gaaatgattt taatcaaata gttaaggatt taattaggat ttatgtaacg
2341 cagGTTTTGA GCGATCAGAC TTTTGCAATT ATGGTTCTCA TGGCGATATT CACAACATTC
2401 ATCACAACGC CAATAGTCTT GGCGTTATAC AAACCAAGCG AGACAACACA AACACATAGT
2461 AGCGTCAGCT ACAAGAACCG CAAACATAGA CGCAAGATTG AGAATGATGA AGAAGGCGAG
2521 AAGATGCAGC AGCTTAAGGT TTTGGTATGT CTTCAAAGCA GTAAAGATAT TGATCCCATG
2581 ATGAAAATAA TGGAAGCTAC TCGTGGAAGC AACGAAACCA AAGAAAGATT TTGCGTTTAC
2641 GTTATGCATT TAACTCAACT CTCCGAGAGA CCTTCTTCTA TTCGAATGGT TCAAAAGGTG
2701 AGAAGCAACG GTTTGCCCTT TTGGAACAAG AAAAGAGAGA ATTCTAGTGC CGTTACGGTC
2761 GCGTTCGAGG CGTCTAGTAA GCTAAGTAGC GTTTCGGTGC GTTCTGTGAC CGCGATTTCA
2821 CCGTTGTCAA CAATTCATGA GGATATATGT AGCTCTGCTG ATAGTAAATG CACAGCGTTT
2881 GTGATTTTGC CGTTCCATAA GCAATGGAGA TCTCTGGAGA AAGAATTTGA AACGGTGAGA
2941 TCGGAGTATC AGGGGATTAA CAAAAGAGTT CTTGAGAATT CACCGTGTTC TGTTGGAATT
3001 TTGGTTGATC GTGGTCTCGG CGACAACAAT TCTCCGGTAG CTTCGAGCAA CTTTTCACTT
3061 TCCGTCAATG TTCTGTTCTT TGGCGGTTGC GATGATCGTG AAGCTTTGGT TTACGGGTTA
3121 CGAATGGCTG AACATCCGGG CGTTAACTTG ACCGTTGTGG TTATCTCTGG TCCGGAGAGC
3181 GCAAGGTTTG ATAGGCTTGA AGCGCAAGAA ACATCACTAT GTTCCTTAGA CGAGCAATTC
3241 CTTGCAGCAA TCAAGAAAAG GGCCAATGCA GCTAGATTTG AAGAGAGGAC GGTGAATTCA
3301 ACGGAGGAAG TGGTTGAGAT TATCCGCCAA TTTTACGAGT GCGATATTTT ATTGGTGGGA
3361 AAATCTTCCA AAGGACCTAT GGTTTCAAGA TTACCGGTTA TGAAGATAGA GTGTCCAGAA
3421 CTGGGACCGG TCGGAAACTT GATCGTGTCA AATGAGATTT CTACTTCAGT GTCTGTTTTG
3481 GTGGTTCAAC AATACACCGG GAAAGGTCCT TCTGTGGTGG GTTCCGTCTC TGTCCCGGTG
3541 GTGGAGACGC CATGAaaatc tggaaacaga ggattgtgtt ttctttacct tgaagcacaa
3601 accatgatgg ataacacgaa acactcttat gtatcaacat gcatgaatca aacgactccc
3661 ctttttatga atttgtaaga taaaacatat ttatgaattt gagtcactga ttatatcata
3721 tccttaaaac atatgatttg atggatagat tattcaagtt gttaagagtt caaaattttg
3781 accaaagata tttatatttt ggtaagtcga tccttttttc aaaggaatac ttcatatagt
3841 aattttagat actacaatta tattgactag tggtgttgta gaaatatttt ttcctactca
3901 atttactatc atcccatgca tgtgtccatt ataaagctag attgaaattt cagtagtaat
3961 caaaatctga tattgtataa cactgtgttt ggtttgtcaa acaacacatg acataacaaa
4021 tcatatacta cataattatc aaaccacata tcgaa
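In the listing above, exons appear in uppercase, and introns and flanking sequence in lowercase, so the coding sequence can be recovered by joining the uppercase runs. This is a minimal Python sketch. It assumes the listing is pasted as plain text with position numbers and space-separated blocks of ten bases. `clean_listing` and `exon_runs` are illustrative names, not part of any published tool, and the input is a toy sequence rather than the AtCHX16 data.

```python
import re


def clean_listing(text):
    """Drop position numbers and whitespace, keeping letter case."""
    return "".join(re.findall(r"[A-Za-z]+", text))


def exon_runs(seq):
    """Return (start, end, sequence) for each uppercase run.

    Coordinates are 1-based and inclusive.
    """
    return [(m.start() + 1, m.end(), m.group())
            for m in re.finditer(r"[A-Z]+", seq)]


# Toy listing for illustration only (not the AtCHX16 sequence)
toy = "1 aacATGGCTg taagtttcag GTTTGAtttt"
seq = clean_listing(toy)
runs = exon_runs(seq)
cds = "".join(run[2] for run in runs)
print(runs)  # [(4, 9, 'ATGGCT'), (21, 26, 'GTTTGA')]
print(cds)   # ATGGCTGTTTGA
```

If the full genomic listing is intact, the joined uppercase runs should total 2436 bp, the length given for the Revised Coding Sequence.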
Revised Coding Sequence (2436 bp):
1 ATGGGTACTT TGGTCAACGG TACTATTCCG GCGATGAAGT GTCCCAAGAA TGTGGCGATG
61 ATGAAGACAA CGTCTAACGG AGTGTTCGAT GGAGAGAGTC CATTAGATTT CGCTTTTCCT
121 CTTGTCATTC TTCAGATTTG CCTTGTCGTC GCCGTCACTC GCTCTTTGGC CTTCCTTCTC
181 CGTCCCATGA GACAACCACG TGTTGTCGCC GAGATCATTG GCGGGATTCT TCTTGGCCCG
241 TCGGCTCTCG GTAGAATCAC GTCGTACAAG AATTCAATTT TCCCGGCGAG AAGTCTAACG
301 GTGCTCGACA CTCTAGCAAA CCTCGGCCTC CTTCTCTTCC TTTTCCTCGT CGGACTCGAG
361 ATCGATCTGA CATCTCTCCG ACGTACCGGC AAAAAAGCGA TTTCCATCGC CGCCGCCGGA
421 ATGCTTCTCC CCTTCGGTAT GGGCATTGTC ACCTCCTTCG CTTTCCCTGA AGCTTCTTCT
481 TCTGGCGACA ATAGCAAAGT ACTTCCGTTC ATCATTTTCA TGGGAGTCGC GCTCTCGATC
541 ACTGCTTTCG GAGTTTTGGC GAGAATACTC GCTGAATTAA AGCTACTCAC AACCGATCTC
601 GGCCGGATTT CAATGAACGC CGCCGCAATT AACGACGTCG CGGCTTGGGT TCTCCTAGCT
661 CTCGCTGTAT CTCTCTCCGG CGATAGGAAT TCTCCGCTTG TTCCTCTCTG GGTTCTGTTA
721 AGTGGAATCG CGTTTGTGAT CGCGTGTTTT TTAATCGTAC CGCGAATTTT CAAATTCATC
781 TCTCGGCGTT GCCCAGAAGG CGAACCCATA GGCGAAATGT ACGTTTGCGT CGCCCTTTGC
841 GCGGTTCTAC TCGCGGGTTT CGCGACAGAC GCCATTGGGA TTCACGCGAT TTTCGGTGCG
901 TTTGTGATGG GCGTTTTGTT TCCCAAAGGA CATTTCTCAG ACGCGATTGT GGAGAAGATT
961 GAAGATCTTG TAATGGGTCT TCTTCTACCA CTGTACTTCG TTATGAGCGG TTTGAAAACG
1021 GATATAACTA CGATTCAAGG TGTGAAATCG TGGGGACGAC TCGCGTTGGT GATTGTTACG
1081 GCTTGTTTTG GCAAAATCGT TGGGACTGTG AGTGTTGCCT TGTTATGCAA GGTAAGGCTC
1141 CGTGAATCGG TTGTTCTTGG GGTTTTAATG AACACAAAGG GTTTAGTGGA GCTAATTGTT
1201 CTCAACATTG GCAAAGACAG AAAGGTTTTG AGCGATCAGA CTTTTGCAAT TATGGTTCTC
1261 ATGGCGATAT TCACAACATT CATCACAACG CCAATAGTCT TGGCGTTATA CAAACCAAGC
1321 GAGACAACAC AAACACATAG TAGCGTCAGC TACAAGAACC GCAAACATAG ACGCAAGATT
1381 GAGAATGATG AAGAAGGCGA GAAGATGCAG CAGCTTAAGG TTTTGGTATG TCTTCAAAGC
1441 AGTAAAGATA TTGATCCCAT GATGAAAATA ATGGAAGCTA CTCGTGGAAG CAACGAAACC
1501 AAAGAAAGAT TTTGCGTTTA CGTTATGCAT TTAACTCAAC TCTCCGAGAG ACCTTCTTCT
1561 ATTCGAATGG TTCAAAAGGT GAGAAGCAAC GGTTTGCCCT TTTGGAACAA GAAAAGAGAG
1621 AATTCTAGTG CCGTTACGGT CGCGTTCGAG GCGTCTAGTA AGCTAAGTAG CGTTTCGGTG
1681 CGTTCTGTGA CCGCGATTTC ACCGTTGTCA ACAATTCATG AGGATATATG TAGCTCTGCT
1741 GATAGTAAAT GCACAGCGTT TGTGATTTTG CCGTTCCATA AGCAATGGAG ATCTCTGGAG
1801 AAAGAATTTG AAACGGTGAG ATCGGAGTAT CAGGGGATTA ACAAAAGAGT TCTTGAGAAT
1861 TCACCGTGTT CTGTTGGAAT TTTGGTTGAT CGTGGTCTCG GCGACAACAA TTCTCCGGTA
1921 GCTTCGAGCA ACTTTTCACT TTCCGTCAAT GTTCTGTTCT TTGGCGGTTG CGATGATCGT
1981 GAAGCTTTGG TTTACGGGTT ACGAATGGCT GAACATCCGG GCGTTAACTT GACCGTTGTG
2041 GTTATCTCTG GTCCGGAGAG CGCAAGGTTT GATAGGCTTG AAGCGCAAGA AACATCACTA
2101 TGTTCCTTAG ACGAGCAATT CCTTGCAGCA ATCAAGAAAA GGGCCAATGC AGCTAGATTT
2161 GAAGAGAGGA CGGTGAATTC AACGGAGGAA GTGGTTGAGA TTATCCGCCA ATTTTACGAG
2221 TGCGATATTT TATTGGTGGG AAAATCTTCC AAAGGACCTA TGGTTTCAAG ATTACCGGTT
2281 ATGAAGATAG AGTGTCCAGA ACTGGGACCG GTCGGAAACT TGATCGTGTC AAATGAGATT
2341 TCTACTTCAG TGTCTGTTTT GGTGGTTCAA CAATACACCG GGAAAGGTCC TTCTGTGGTG
2401 GGTTCCGTCT CTGTCCCGGT GGTGGAGACG CCATGA
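The coding sequence above can be translated to check it against the protein listing. This is a minimal Python sketch using the standard genetic code. The page does not say how the protein sequence was derived, so this is only a consistency check, and `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # tolerate spaced 10-mer blocks
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the listing above
print(translate("ATGGGTACTT TGGTCAACGG TACTATTCCG"))  # MGTLVNGTIP
print(translate("GAGACGCCATGA"))                      # ETP
```

The lengths are consistent as well: 2436 bp is 812 codons, and with the final TGA stop codon excluded that leaves 811 amino acids.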
Revised Protein Sequence (811 aa):
1 MGTLVNGTIP AMKCPKNVAM MKTTSNGVFD GESPLDFAFP LVILQICLVV AVTRSLAFLL
61 RPMRQPRVVA EIIGGILLGP SALGRITSYK NSIFPARSLT VLDTLANLGL LLFLFLVGLE
121 IDLTSLRRTG KKAISIAAAG MLLPFGMGIV TSFAFPEASS SGDNSKVLPF IIFMGVALSI
181 TAFGVLARIL AELKLLTTDL GRISMNAAAI NDVAAWVLLA LAVSLSGDRN SPLVPLWVLL
241 SGIAFVIACF LIVPRIFKFI SRRCPEGEPI GEMYVCVALC AVLLAGFATD AIGIHAIFGA
301 FVMGVLFPKG HFSDAIVEKI EDLVMGLLLP LYFVMSGLKT DITTIQGVKS WGRLALVIVT
361 ACFGKIVGTV SVALLCKVRL RESVVLGVLM NTKGLVELIV LNIGKDRKVL SDQTFAIMVL
421 MAIFTTFITT PIVLALYKPS ETTQTHSSVS YKNRKHRRKI ENDEEGEKMQ QLKVLVCLQS
481 SKDIDPMMKI MEATRGSNET KERFCVYVMH LTQLSERPSS IRMVQKVRSN GLPFWNKKRE
541 NSSAVTVAFE ASSKLSSVSV RSVTAISPLS TIHEDICSSA DSKCTAFVIL PFHKQWRSLE
601 KEFETVRSEY QGINKRVLEN SPCSVGILVD RGLGDNNSPV ASSNFSLSVN VLFFGGCDDR
661 EALVYGLRMA EHPGVNLTVV VISGPESARF DRLEAQETSL CSLDEQFLAA IKKRANAARF
721 EERTVNSTEE VVEIIRQFYE CDILLVGKSS KGPMVSRLPV MKIECPELGP VGNLIVSNEI
781 STSVSVLVVQ QYTGKGPSVV GSVSVPVVET P
›› FASTA: All CHX Protein Sequences
Last Revision: October 16, 2006