Rice P450
sequences
Sept. 25,
2002 D. Nelson
Note there are 957 sequence entries here (571 are indica, 386 are japonica). Some are duplicates. #n are numbers for the ortholog pairs or unique sequences. 489 numbers were given out, 27 of these were combined and 4 were not from rice. Therefore, there are 458 unique rice sequences. Fragments get the same number as parents. Order is by CYP name.
Three sequences aaaa01039155.1, aaaa01093055.1, aaaa01067419.1 are probable fungal P450 contaminants. One seq aaaa01062516.1 is a probable insect P450 contaminant. These are not counted in the total.
CYP names have now been assigned to all 458 sequences.
#300
>aaaa01012243.1
$FI CYP51A5 Indica
rice genome CYP51 New April 24, 2002
ortholog
of AB025047 99%
5108
MTLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 4926
4925
IREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQEVYKFNVPTFGPGVVF 4746
4745
DVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAE 4641
2774
EYFSKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSALFHDLDNGMQPVSV 2598
2597
IFPYLPIPAHRRRDRARQRLKEIFATIIKSRKASGQAEEDMLQCFIDSKYKSGRSTTEGE 2418
2417
ITGLLIAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVLYR 2223
2222
CIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFKNPDS 2043
2042
YDPDRYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEFELVSPF 1863
1862
PETNWKAMVVGIKDEVMVNFKRRKLVVDN* 1773
>AB025047
CYP51A5 rice (partial) 80%
to 51A2 missing N-term 64 aa
BE040549.1
OE08G10 OE Oryza sativa cDNA 5' Length = 255 I-helix CYP51
BE230288.1
99AS641 Rice Seedling cDNA clone 99AS641.Length = 586
BE230302.1
99AS655 Rice Seedling cDNA clone 99AS655.Length = 627
BE607441.1
OE202C10 OE cDNA clone ID707 C-term CYP51 Length = 428
REEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQE
VYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYFSKWGE
SGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIFPYLPI
PAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGEITGLL
IAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVL
YRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFK
NPDSYDPDPYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEF
ELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN
#300
>aaaa01066056.1 CYP51A5 (indica cultivar-group) = aaaa01012243.1 $FI Indica rice genome CYP51
591
DPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 761
762 IREEYARLGSVFTVPILRRKITFLI 836
#110
>aaaa01003099.1b
CYP51A6 (indica cultivar-group)
Nterm aa 4-160
ortholog
to AP005448.1b 100%
10626
VTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 10793
10794
RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 10973
10974
RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11117
>aaaa01003099.1c
CYP51A6 (indica cultivar-group)
Nterm aa 61-160
ortholog
to AP005448.1b 100% these two are duplicates only count once
11261
DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11437
11438
APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11617
>aaaa01003099.1d
CYP51A6 (indica cultivar-group)
Nterm aa 61-160
ortholog
to AP005448.1b 100% these two are duplicates only count once
11761
DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11937
11938
APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 12117
>aaaa01003099.1e
CYP51A6 (indica cultivar-group) nearly
gene, runs off end
ortholog
to AP005448.1b $F 99% plus one frameshifted region
21127
LQKRKISSPAAAAPPVVRGAGLVRLRARHGEGRAAGGDPRAAGEAGERVTAIAPF 20963
20962
GLFKVTFLIGPEVSSHFYLAAESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWD 20783
20782
VLKPRSIEARVGAMAEEVQ 20726 (0?)
18574
NYFSRWGEQGTVDLKKELERVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 18401
18400
TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTAGNGDDVLQRLIDGRYKD 18236
18235
ERALTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLAAVIAEQDRLMASRARTD 18056
18055
DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 17876
17875
LSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 17696
17695
KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRR 17570
>AP005448.1b
$F CYP51A6 (japonica
cultivar-group) chromosome 7 21 June 2002
100% to
AP005188.2c
32724
MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 32900
32901
RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 33080
33081
RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 33224
35381
NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 35554
35555
TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 35719
35720
ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 35899
35900
DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 36079
36080
MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 36259
36260
KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 36403
>AP005188.2c
$F CYP51A6 (japonica
cultivar-group) chr 7 orth to aaaa01003099.1e 99%
55155
MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 55331
55332
RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 55511
55512
RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ
57812
NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 57985
57986
TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 58150
58151
ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 58330
58331
DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 58510
58511
MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 58690
58691
KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 58834
note:
sequences aaaa01003099.1b to e are all probably from a single gene
#109
>aaaa01003099.1a
CYP51A7P (indica cultivar-group)
Nterm aa 4-94
ortholog
of AP005448.1a 100%
7681
TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 7857
7858
PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 7962
>AP005188.2b
$P CYP51A7P (japonica cultivar-group) chr 7 N-term fragment
orth to
aaaa01003099.1a 100% after frameshift
52199
MDHLTSS (frameshift)
TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 52375
52376
PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 52480
>AP005448.1a
CYP51A7P (japonica cultivar-group) chromosome 7 21 June 2002
29768
TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 29944
29945
PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 30049
#63
>aaaa01001626.1
$FI CYP51A8 (indica
cultivar-group) Cterm ONE FRAMESHIFT
ortholog
to AP005188.2a 98%
22316
MQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAPPPPVVQGVGLVRFV
RAMARDGPLEAIREQQAKLGSVFTASAPLGTFLIGSEVSSHFYVAPDSEISMGRLY
EFTVPIFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE (0) 22795
23040
NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 23153 (FS)
23156
VPGKLCELFGELDNGLHLISGLLPYLPIPAH
23249
RRRDRARQRLGEIITEVIRSRRNSSRGAAGTDENNDDMLQCLINSRYKDGCAMTDAE 23419
23420
TAGLVVALMFAGKHTSSGVSIWTGVHLLSNPNHLAAVVAEQDRLMASCPGRTDDYHRLD 23596
23597
YDTVQEMRSLHCCVKEALRLHPPVAAVSQAYKHFTVQTKEGKEYTIPGGHMVVSTILVNH 23776
23777
YLPHIYKDPHVFDPQRFAPGREEEKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLL 23956
23957
SNFEIKMVSPFLETEWSTVIPEPKGKVMVSYRRRTAPK* 24073
>AP005188.2a
$F CYP51A8 (japonica
cultivar-group) chr 7
12878
MDQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAAPPPVVQGVGLVRFVRAMAR 13063
13064
DGPLEAIREQQAKLGSVFTASAPLGLFKVTFLIGSEVSSHFYVASYSEISMGRLYEFTVP 13243
13244
IFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE 13372
13617
NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 13730
13727
ESVPGKLCELFGELDNGLHLISGLLPYLPIPAHRRRDRARQRLGEIITEVIRLRRNSSR 13903
13904
GAAGTDENNDDMLQCLINSRYKDGCAMTDAEIAGLVVALMFAGKHTSSGVSIWTGVHLLS 14083
14084
NPNHLAAVVAEQDRLMASCPGRTDDYHRLDYDTVQEMRSLHCCVKEALRLHPPVAAVRQA 14263
14264
YKHFTVQTKEDKEYTIPGGHMVVSTILVNHYLPHIYK
DPHVFDPQRFAPGREEDKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLLSN 14540
14541
FEIKMVSPFPET 14576 (frameshift)
QWSTVIPEPKGKVMVSYRRRTAPK* 14649
Note
this cluster continues on AP005188.2b
and 2c
#256
>aaaa01009323.1
CYP51A9 (indica cultivar-group) 55% to AP005448.1b $F
orth of
AP004890.1
6368
DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 6547
6548
GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 6727
6771
YKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAYQQIKVILSHLVSN 6950
6951
FELK 6962
>AP004890.1
$F CYP51A9 (japonica
cultivar-group) chr 2
78968
MDHIFSTNAWLTIALVFIITLAAKVVRSSVTLPAEKTSKPRPPPEAKGAPLV 79123
79124
GIIPAVLRRGLQAVIREQHRALGSVFTLRSLGLAVTFLVGPECSDHFFHAPELEIAID 79297
79298
GLYEVTVPIFGKEVGYDIDLDTRNEQHRFFAKMLRPAKLRGHVLPMVCEIEX 79450
79551
YFGKWGECGVVDLMQEVDHVLMLIASRCLLGKEVRENMFDEVASLFHELMGGMHLISMFF 79730
79731
PYLPTPGHRRRDKARAKLGEIFSQIVKTRKMSGRVEDDMLQDLIDSTYGDGRATT 79895
79896
DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 80075
80076
GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 80255
80256
ASPLVVNTLLPNIYKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAY 80435
80436
QQIKVILSHLVSNFELKLESPFPETEDMLSMRPKGKAIVSYKRRTLS 80576
#404
>aaaa01023253.1
CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chromosome 2
3179
YFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 3000
2999
FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 2820
2819
AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 2640
2639
MTTLTHCIKEALRLHP 2592
2584
LLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYIYKDPNVYDPSRFGPGR 2414
2412
EEDKVGGKFSYTPFSAGRHVCLGEDFAYMPN*GDMEPFAQGNFDLELISPFPEEEWEKFI 2233
2232
PGPKGKVMVTYKRRRL 2185
>AP004090.1
$F CYP51A10 chr 2
clone OJ1399_H05 49% to 51A2
AQ843111.1
nbxb0005D03r CUGI Rice BAC genomic cloneLength = 507 49% to 51A2
78158
MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH
77972
SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805
77804
IKPINLRGHVDSMVHEVE 77751 (0)
76666
GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484
76483
FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 76304
76303
AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124
76123
MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944
75943
YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764
75763
ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665
#404
>aaaa01024682.1
CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chr 2
Nterm join
with AAAA01023253.1 see this accession for ortholog
1522
LSMAVLFVATKMIQQRPRTLYLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVI 1701
1702
HDLHSRLGSVFTVSVFGLKKVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLY 1869
1870
DVDLATRSRQISFCTDSIKPINLRGHVDSMVHEVE 1974
#246
>aaaa01008685.1
CYP51A11 (indica cultivar-group) orth of AC108875.1a $F chr 5 100%
7539
DGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVRKHGII 7360
7359
NGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCVPAGHT 7180
7179
MASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGENYA 7009
7008
YMQIKAIWSHLLRNF 6964
>AC108875.1a
$F CYP51A11 chr 5 51%
to 51A2 same as AQ050946 AQ687182 AQ258479
58% to
AP004090 this might require subfamilies in CYP51
70310
MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489
70490
LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669
70670
EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE 70813 (0?)
71263
DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV 71439
71440
FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)
71910
YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083
72084
KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV 72263
72264
PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443
72444
YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL 72593
#276
>aaaa01010435.1 CYP51A12 (indica cultivar-group) orth of AC108875.1b $F 99% chr 5 similar to 51A2
1984
WGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSVFFPYTP 2163
2164
LIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRATTEA 2334
2335
*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGRITD 2514
2515
DRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIASP 2688
2689
IVISNQVPYIYMDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 2865
2866
AIWSHLLRNF 2895
>AC108875.1b
$F CYP51A12 chromosome
5 48% to 51A2
80009
MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176
80177
ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI 80356
80357
AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)
84741
DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917
84918
FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097
85098
TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277
85278
ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457
85458
PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637
85638
AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766
#276
>aaaa01067145.1 CYP51A12 (indica cultivar-group) orth of AC108875.1b $F chromosome 5 1 diff see aaaa01010435.1 for ortholog
27
GRTGCVGEGYAYMQIKAIWSHLLRNFELR*LSPLPKSDFTKFVPEPHGELMVSYKRRQL 203
#140
>aaaa01004091.1 CYP51A13 (indica cultivar-group) orth of AC108875.1c $F chr 5 similar to 51A2
12032
GSVIFPYIPIPSHIRRDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLIDSKHRDGSS 12208
12209
TTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQKHGDHIDYN 12388
12389
VLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLLSP 12550
12551
MIFNNRLPYIYKDPHMYDLDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIKV 12727
12728
IWSHLLRNF 12754
>AC108875.1c
$F CYP51A13 chromosome
5 50% to 51A2
122577
MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735
122736
LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915
122916
VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE 123050 (0?)
123296
DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF 123436
123437
HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616
123617
SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796
123797
HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976
123977
SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156
124157
VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285
#181
>aaaa01005681.1a
CYP51A14 (indica cultivar-group) orth AP003866.1a $F chr 7 >99%
2692
EQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLISLC 2853
2854
FPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYR 3003
3004
DGRAMSDNEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIG 3168
3169
DDRVDYDALTTGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVRTREGKEYRMPAGHS 3342
3343
VVSYAAFNHRLGYVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLK 3522
3523
MKVIWSYLLRNFELELVSPFPEVEL 3597
>AP003866.1a
$F CYP51A14 chr 7
clone OJ1092_A07 53% to 51A2
AQ326645
and AQ291927 mid to K-helix region 52% identical to wheat CYP51
60%
identical to AQ327456 68% to EST T88278 705 family
AQ689048.1
nbxb0078H10r CUGI Rice BAC genomic clone Length = 737
AQ396185.2
nbxb0066K16r CUGI Rice BAC genomic cloneLength = 327
50920
MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL
51082
PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)
GGFYSRPE 51261
51262
SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)
52114
EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299
52300
SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479
52480
NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT 52653
52654
TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833
52834
YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013
53014
ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124
no
japonica ortholog found 9/11/02
#346
>aaaa01014709.1
CYP51A15 (indica cultivar-group) 49% to 51A2
602
MDLTTGTIWLFLAQ
560
LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381
380
MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201
200
HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117
no
japonica ortholog found 9/11/02
#418
>aaaa01028263.1
CYP51A15 (indica cultivar-group) 73% to AP003866.1
8
GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187
188
LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364
365
FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526
VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*
no
japonica ortholog found 9/12/02
#418
>aaaa01028263.1
CYP51A15 (indica cultivar-group) 73% to AP003866.1
8
GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187
188
LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364
365
FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526
VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*
no
japonica ortholog found 9/12/02
all three
fragments CYP51A15 #418, #453, #346 joined reduce gene count by 2
602
MDLTTGTIWLFLAQ
560
LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381
380
MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201
200
HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117
(0)
GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV
STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH
DDMLQCLIDARYKDGRATTETEVAGMLVAALFA
8
GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187
188
LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364
365
FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526
VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*
#453
>aaaa01065204.1
CYP51A15 (indica cultivar-group) exon 3
ortholog
of AY022669.1 searched Genbank for extensions
(0)
GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV
STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH
DDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHT
>AY022669.1
CYP51A15 (partial)
microsatellite MRG4994 containing (CCG)X8, Length = 224
82% to
CYP51 pseudogene above
222 PRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAA 1
#182
>aaaa01005681.1b
$PI CYP51A16P (indica cultivar-group) ortholog of AP003866.1b
4595
VRFLHRKVTFLVGPEESSHFFTGLDAEISQDEVSRFIIPTFGS*VAFDA 4741
5197
GYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 5295
6670
VVTPIATRCLFGEVRSKMLGEVSTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARLGE 6849
6850
IFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG
6975
AEVAGMLVSALLAGQYTSSSTSTWTG 7052 frameshift
7055
ARLLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLML 7234
7235
LRHARRSFVVRARGSGDAEYEVPAGHTVAS 7324
PMVIHNALPHVY 7359
7360
EDAGSFDPGRFGPAREEYRAYAADHAYTVFGGGRHACVGE 7479 frameshift
7482
VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVTVGFSVQL 7655
>AP003866.1b
$P CYP51A16P chromosome 7 clone OJ1092_A07
No obvious
N-terminal, two in frame stops, three frameshifts = Pseudogene
82% to AY022669.1 seems to be a CYP51
pseudogene
54048
VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)
54642
AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa
56119
VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292
56293
GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift
56424
AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift
56510
LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698
56699
RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878
56879
DHAYTVFGGGRHACVGE 56929 frameshift
56932
VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108
#10
>aaaa01000238.1f $FI CYP71C12 (indica cultivar-group)
AP003909.1a 99%
also
aaaa01079567.1 (98%)
44400
MAEMLDGLRHDEQASLHAPQKASTMPTMSCSDLLLAMMCPLILLLIIFRCYAYATRSGGM 44221
44220
LSRVPSPPGRLPVIGHMHLISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQA 44041
44040
ILRTHDRVFASRPYNTIADILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQT 43861
43860
RQQEVRLVMAKIVEEAATHMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEI 43681
43680
NSSLLGGFNLEDYFPSLARLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDN 43501
43500
NDEESDFIDVLLSIQQEYGLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAK 43321
43320
LQAEVRGVVPKGQEVVTEEQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTI 43141
43140
PSGTRVIVNAWAIARDPSYWENAEEFIPERFLGNTMAGYNGNNFNFLPFGTGRRICPGMN 42961
42960
FAIAAIEVMLASLVYRFDWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 42796
>AP003909.1a $F CYP71C12 chromosome 8 clone OJ1300_E01
55% to 71C4
orth
aaaa01000238.1f
50394
MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD
50298
LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH 50161
50160
LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981
49980
DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801
49800
HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621
49620
RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441
49440
GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261
49260
EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS 49081
49080
YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910
48909
RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790
#10
part
>aaaa01079567.1
CYP71C12 (indica cultivar-group) orth AP003909.1a $F chr 8
99% 98% to
aaaa01000238.1f $FI see aaaa01000238.1f for ortholog
672
DQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEYGLTKDNIKANLVVM 511
510
FEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTEEQLGRMPYLKAVI 334
333
KETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPSYWENAEEFMPERF 154
153
LSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVYRFDWKL 4
#11
>aaaa01000238.1g
$PI CYP71C13P (indica cultivar-group) end of clone poor quality seq
allowing
frameshifts (fs) and deletions this seq 95% to AP003909.1b
(plus
strand)
46070
MAQMLGALLLFQDSQMSTMTRMSYSLLLPILCPLILLLLFRCYAYATRSGGL 46225
46226
LDKLPSPPGRLPLIGHMHLIGSFPHMSLRDLATKHGPDLMLLHLGTVPTLVVSSSRMAQV 46405
46406
ILRTHDRVFASRQQSAIT 46459 gap (frameshift) XILF (deletion and fs)
46485
YGDYWRQIKKIVTTNLLTI (fs) KKIRSYSQT (fs) RQQE (fs) VRL (fs) VM (fs)
AKI*EATTHMAV
46628 (deletion)
(minus
strand)
49427
LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNEX 49320
49320
ESDFIDVLLSIQQEYGLTKDNIKANLAIMFEAGTDTSFIELEYAMAELMQKPQMIAKLQA 49141
49140
EVRGVVSKGQEIVTEEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTTPSG 48961
48960
TRVIVNAWAIAR (fs) DPSY*ENAEEF (fs)
XQRFLSNTMADYNGNNFNFLPFWTGRRICPGINFA 48787
48786
ITTIEIMLASLVYRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 48658
>AP003909.1b
$P CYP71C13P chromosome 8 clone OJ1300_E01, 4 in frame stops pseudogene
orth
aaaa01000238.1g note this seq is out of order in this gene cluster
54948
MAQMLGALLLFQDSLMSTMTRMSY
54876
SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742
54741
HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574
54573
TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394
54393
THMA IDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214
54213
ARLPVRRL LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE 54037
54036
YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857
53856
EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677
53676
SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506
53505
YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389
#9
>aaaa01000238.1e
$FI
CYP71C14 (indica cultivar-group) AP003909.1c 99%
MAVMLVPIPLLLLHQHHNHEHEH
40499
PSPVAPQPTMASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPVIGHL 40326
40325
HLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRTYSAV 40146
40145
TDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARINEAAV 39966
39965
ARTTVDLSELLNWFTNDIVCHAVSGKFFREEGRNQMFWELIQANSLLLSGFNLEDYFPNL 39786
39785
ARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLSIQHE 39606
39605
YGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQEIVT 39426
39425
EEQLGRMPYLKAVIKETLRLHLAGPLLVPHLSIAECDIEGYTIPSGTRVFVNAWALSRDP 39246
39245
SFWENAEEFIPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRF 39066
39065
DWEIPADQAAKGGIDMTEAFGLTVHRKEKLLLVPRLTQD* 38946
>AP003909.1c
$F CYP71C14 chromosome
8 clone OJ1300_E01
orth
aaaa01000238.1e
same as
AP004462.1 152574-152287 region
58316
MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM
58217
ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086
58085
IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906
57905
YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726
57725
EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY 57546
57545
FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366
57365
IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186
57185
EIVTEEQLGRMPY 57153 frameshift
57147
LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF 56968
56967
IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797
56796
DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692
#8
>aaaa01000238.1c
$FI CYP71C15 (indica
cultivar-group) AP003909.1d 99%
25643
LLLPVALLLLLLRFARATTLAGDRNSELLLSKLPSPPLRLPVIGHMHLVGSLPHVSLRD 25467
25466
LAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRAMVPDIISYGATDSC 25287
25286
YGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEVRLVIAKLRGAAAMAGAPVDMTELL 25107
25106
HSFANDLICRAVSGKFFREEGRNKLFRELIDTNASLLGGFNLEDYFPSLARTKLLSKVIC 24927
24926
VRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQDSDFIDILLYHQEEYGFTRDNIKAI 24747
24746
LVX 24741
24592
MFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEIVNEDNIVDMVYLKAVI 24413
24412
KETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPERF 24233
24232
MDSNIDFKGHDFHYLPFGSG*RMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKEEDI 24059
24058
DMTEVFGLTVHRKEKLFLVP 23999
>AP003909.1d
$F CYP71C15 chromosome
8 clone OJ1300_E01
orth
aaaa01000238.1c
AQ868830.1
nbeb0032E11f CUGI Rice BAC genomicLength = 759 57% to 76C5
same as
AP004462.1 139663-140091 region
68223
MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387
68388
PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH 68567
68568
DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747
68748
RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927
68928
LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD 69107
69108
SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)
69331
DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510
69511
IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690
69691
FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE 69858
69859
EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939
#7
>aaaa01000238.1b
$FI CYP71C16 (indica
cultivar-group) AP003909.1e 100%
14489
LLPLALLFYFARAAISSRDSKTRELILSKLPSPPFKLPVIGHMHLIGPLPYVSLRDLAA 14313
14312
KHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRSMVTDIIMYGALDSCFAP 14133
14132
YSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVMARLRGAAAAAAAVDLSQTLQFFA 13953
13952
NDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFNLEAYFPGLARMPLISKLICARAI 13773
13772
RIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVLLSLQDEYGFTRDHIKAISIX 13608
13134
MFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAVI 12955
12954
KETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPERF 12775
12774
MDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKKE 12598
12597
DIDMTDVFGLAIHRKEKLFLVPQI 12526
>AP003909.1e
$F CYP71C16 chromosome
8 clone OJ1300_E01
orth
aaaa01000238.1b
same as
AP004462.1 128584-129021 region
78935
MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111
79112
LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291
79292
ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM 79471
79472
ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651
79652
LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828
79829
LSLQDEYGFTRDHIKAISI 79885 (0)
80359
DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV 80538
80539
IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718
80719
FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895
80896
EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982
#6
>aaaa01000238.1a
$FI CYP71C17 (indica
cultivar-group) orth of AP003909.1f
2 diffs
N-terminal Met not identified
MVVQLMLFFHDKFMAPMAEEPLPF
3340
VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 3161
3160
RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 2981
2980
ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 2801
2800
LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 2621
2620
VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 2441
2440
QEYNLTRHNIHAILM (0) 2396
2206
DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 2036
2035
KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 1856
1855
MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 1676
1675
DDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 1580
#6
>AP003909.1f
$F CYP71C17 chromosome
8 clone OJ1300_E01
orth
aaaa01000238.1a
AZ127316.1
OSJNBb0086E03f CUGI Rice BAC genomic Length = 498 54% to 71A14
AQ871024.1
nbeb0042C09f CUGI Rice BAC genomic Length = 495 56% to 71B23
same as
AP004462.1 147428-146820 region
63826
MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII
63733
LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV 63581
63580
SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401
63400
SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221
63220
ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR 63041
63040
RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861
62860
RQQEYNLTRHNIHAILM 62810 (0)
62626
DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453
62452
VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER 62273
62272
FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093
62092
KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994
#6
duplicate
>aaaa01000238.1d
$FI CYP71C17 (indica
cultivar-group) AP003909.1f 99%
this seq
100% identical to aaaa01000238.1a, probably an error in assembly
only count
this gene once see aaaa01000238.1a for ortholog
N-terminal
Met not identified
MVVQLMLFFHDKFMAPMAEEPLPF
30181
VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 30360
30361
RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 30540
30541
ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 30720
30721
LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 30900
30901
VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 31080
31081
QEYNLTRHNIHAILM (0) 31137
31309
DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 31485
31486
KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 31665
31666
MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 31845
31846
DDVDMTDQFALTMARKEKLYLIP RSHVIKIT* 31941
#105
>AP004232.1
$F CYP71C18 chromosome
1 clone OSJNBa0051H17 like CYP71C4
90% to
aaaa01002989.1 version 4 in Genbank does not allow for
frameshift
and skips beginning of heme signature
probably
not an ortholog no 99% match in indica 9/5/02
56483
MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 56650
56651
PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815
56816
THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995
56996
E 56998 (0)
57929
VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105
58106
VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285
58286
DQDEMDFVDVLLLQERGITRDHLKAIL 58366 (0)
58462
DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641
58642
VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821
58822
RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift
58899
RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078
59079
VEYKGSVQDSAVIL* 59123
#447
>aaaa01059480.1
CYP71C19 (indica cultivar-group) orth of AP004233.1 100%
602
VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKANSV 426
425
LLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMSKQQCEHDEGNDQ 246
245
DEMNFVNVLLLQEQGITREHLKAIL (0)
81 DMYQAGTETSSVVLVFAMAELMQKPHL 1
>AP004233.1
$F CYP71C19 chromosome
1 clone OSJNBa0065J17 50% to CYP71C4
= AQ857130
duplicate of the AP004232 gene at 27203-29726
probably
not ortholog only 91%
21862
MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP
21694
PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR 21518
21517
THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344
21343
E 21335 (0)
VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN
SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS
KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAIL (0)
20004
DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825
19824
VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645
19644
RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA 19465
19464
IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339
#135
>aaaa01003879.1
$FI CYP71C20 (indica
cultivar-group) ortholog to AP004757.1a 97%
4645
MAQMLAAFLLDDLISHEHGHESLGAPPQAGTMAWYSLVLMTSLLFPLLVLLVMRCYVTRS 4824
4825
GAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRNLATKHSPDMMLLHLGAVPTLVVSSSR 5004
5005
VAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNEYWWQIKKITTTHLLTVKKVRS 5184
5185
YVSARQREVRIVIARITEAASKHEVVDLTEMLSCYSNNIVCHVVCGKFS*KEGWNQLLRK 5364
5365
LVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKAHNINKRWDQLLEKLIDDHTTKHI 5544
5545
RSSSMLNHYDEEAGFIDVLLSIQHEYGLTKDNIKANLAAMLMAGTDTSFIELEYAMAELM 5724
5725
QKPHVMGKLQAEVRRVMPKGQDIVTEEQLGCMPYLKAVIKETLRLYPPAPLLMPHLSMSD 5904
5905
CNINGYTIPSGTRVIVNVWALARDSNYWENADEFIPERFIVNTLGDYNGNNFHFLSFGSG 6084
6085
RRIYPGINFAIATIEIMLANLVYRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPH 6264
6265
LHLR 6276
>AP004757.1a
$F CYP71C20 52% to
AF321858 Lolium rigidum 70% to AP003909
chromosome
6 clone P0652D10
MAQMLAAFLLDGLISHEHGHESLGAPPQAGTMAWYSLVLMTS
79980
LLFPLLVLLVMRCYVTRSGAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRDLATKH 79810
79809
SPDMMLLHLGAVPTLVVSSSRVAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNE 79630
79629
YWRQIKKITTTHLLTMKKVRSYVSARQREVRIVMARITEAASKHVVVDLTEMLSCYSNN 79453
79452
IVCHAVCGKFSLKEGWNQLLRELVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKA 79276
79275
HNINKRWDQLLEKLIDDHTTKHIRSSSMLNHYDEEAGFIDVLLSIQHEYGLTK 79117
79116
DNIKANLAAMLMAGMDTSFIELEYAMAELMQKPHVMGKLQAEVRRVMPKGQDIVTEEQLG 78937
78936
CMPYLKAVIKETLRLHPPAPLLMPHLSISDCNINGYTIPSGTRVIVNVWALARDSN 78769
78768
YWENADEFIPERFIVNTLGDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLV 78601
78600
YRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPHLHLR* 78472
#104
>aaaa01002989.1
$FI
CYP71C21 (indica
cultivar-group) 91% to AP004233.1
6020
MEQAAGLVYQLFQQEMFPWTFSVLALFPFLLL frameshift
SLHYLATNNRTPTTCKETKNHHPPPPSPPRLPIIGHLHLIGDLLHVSLRELA 5770
5769
HRYGPDLMLLHLGQVP (?)
5107
NLIVSSPRAAEAVLRTHDLVFVSRPYSLIADILLYGPSDIGLSPYGEQWRQSRRIVTTHL 4928
4927
LTNKKVRSYRVAREEE 4871 (0?)
4088
VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGDDGRNKLFRQLFKANS 3909
3908
VLLAGFNLEDYYPSLARLKAVSRVMCAKARKTRKLWDELLDKIIDDRMSKQQCEHDRGND 3729
3728
DQDEMDFVDVLLLQERGITREHLKAIL 3648 (0?)
3552
DMFQAGTETTSVVLVFAMAELMHKPHLMAKLQAELRTNISKQGQELLTECDLTNMTYLNA 3373
3372
VIKETLRLHPPTPLLLPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLSE 3193
3192
RFVDGGSAANVDLTGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVPAEAA 3013
3012
IDKAGIDMAEAFGLSVQLEEKLLLVPIEYKDSM 2914
#488
>aaaa01000238.1
CYP71C22P
Duplicated
end of exon 1 from CYP71C17 = AP003909.1g
ESDFVDILLDHQQEYNLTRHNIHAILM
32576
#488
>AP003909.1g
$P CYP2C22P chromosome 8 clone OJ1300_E01 lone pseudogene fragment
identical
to Duplicated end of exon 1 on aaaa01000238.1a
61469
EQESDFVDILLDHQQEYNLTRHNIHAILM 61383
#488
>aaaa01000238.1a
$FI CYP71C22P (indica
cultivar-group) orth of AP003909.1g
Duplicated
end of exon 1 same as AP003930.1g
1052
EQESDFVDILLDHQQEYNLTRHNIHAILM 966
#136
>aaaa01006247.1
CYP71C23P (indica cultivar-group) orth of AP004757.1b 2 diffs
2898
FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLTVHRKQKLLLVSWLPQD 3065
>AP004757.1b
$P CYP71C23P chr 6 Pseudogene fragment last
exon similar to AP003909
103765
FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLAVHRKEKLLLVSWLPQD*
103595
#28
>aaaa01000733.1
$FI CYP71E4 (indica
cultivar-group) 99% to AC092559.2
5765
MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLRLPPGPARLPVLGN 5586
5585
LLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLRTHDADCCSRPSSPG 5406
5405
PMRLSYGYKDVAFAPYDAYSRAARRLFVAELFSAPRVQAAWRARQDQ 5265
3896
VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDV 3720
3719
MDMLASFSAEDFFPNAAAARLFDHLTGLVARRERVFQQLDAFFEMVIEQHLDSDSSNAGG 3540
3539
GGGNLVGALIGLWKQGKQYGDRRFTRENVKAIIF 3438
3337
DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAYLK 3158
3157
MVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVFDP 2978
2977
DRFEAKRVEFNGGHFELLPFGSGRRICPGIAMAAANVEFTLANLLHCFDWALPVGMAPEE 2798
2797
LSMEESGGLVFHRKAPLVLVPTRYIQL 2717
>AC092559.2 $F CYP71E4 chromosome 3 clone
OSJNBb0096M04, 45% to 71B37
same as
AC096688.3 chromosome 3
96529
MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLR
96388
LPPGPARLPVLGNLLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLR 96212
96211
THDADCCSRPSSPGPMRLSYGYKDVAFAPYDAYGRAARRLFVAELFSAPRVQAAWRARQDQ 96017 (0)
94678
VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDVMDMLASFSAEDFFPNA
94454
94453
AAARLFDHLTGLVAHRERVFQQLDAFFEMVIEQHLDSDSSNAGGGGGNLVGALIGL 94286
94285
WKQGKQYGDRRFTRENVKAIIF 94220 (0)
94119
DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAY 93946
93945
LKMVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVF 93766
93765
DPDRFEAKRVEFNGGHFELLPFGSGRRICPGIAMGAANVEFTLANLLHCFDWALPVGMAP 93586
93585
EELSMEESGGLVLHRKAPLVLVPTRYIQL* 93496
#296
>aaaa01011852.1
$FI CYP71E5 (indica cultivar-group) ortholog of AL731888.1
58% to
AC092559.2 46% to 71B23
8074
MAISLITSLLFSLPQQWQP
8017
VVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLAGPQPHRALRDLARVHGPV 7841
7840
MRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRVTYGMKNVAFAPYGAYWR 7664
7663
EVRKLLMVELLSARRVKAAWYARHEQ (0) 7586
VEKLLSTLRRAEGKPVALDEHILSLSDGIIGRVAFGNIYGSDKFSQNK
NFQHALDDVMEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSF
FEMVIEQHLDPNRAPPENGGDLVDVLIDHWKKNEPRGTFSFTKDNVKAIIF (0)
6324
STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRK 6151
6150
VVKETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNP 5971
5970
ERFEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDN 5791
5790
VCMEEEGRLVCHRKTPLVLVPTVYRHGLE* 5701
>AL731888.1
CYP71E5 chr 12
31348
MAISLITSLLFSLPQQWQPVVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLA 31169
31168
GPQPHRALRDLARVHGPVMRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRV 30989
30988
TYGMKNVAFAPYGAYWREVRKLLMVELLSARRVKAAWYARHEQ 30860
30546
VEKLLSTLRRAEGKPVALDEHILSLSDGIIGTVAFGNIYGSDKFSQNKNFQHALDDV 30376
30375
MEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSFFEMVIEQHLDPNRAPP 30196
30195
ENGGDLVDVLIGHWKKNEPRGTFSFTKDNVKAIIF 30091
29601
STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRKVV 29422
29421
KETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNPER 29242
29241
FEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDNVC 29062
29061
LEEEGRLVCHRKTPLVLVPTVYRHGLE 28981
#352
>aaaa01015254.1
CYP71E6 (indica cultivar-group) 94% to AC084319.5a 7
diffs
53% to
AC092559.2
3134
LAVSVVLIFWSRHRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLGALAGWHGPVMAL 3313
3314
WLGTVPVVVLSSPKAEREALQVHDPECCNRSPT 3412
>aaaa01021677.1
CYP71E6 (indica cultivar-group) orth AC084319.5a 99%
838
DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLK 668
667
MVVKETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNEWAIGRDPNIWKDPEEFIP 488
487
ERFEEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKED 308
307
IDMEEAGKLTFHKKIPLLLVP 245
>AC084319.5a
CYP71E6 chr 3 Genbank translation is wrong at N-terminal
does
not identify frameshift and conserved motifs PPGPXXLPIIGNL
same as
AC084404.8 partial
2204
MAASLLLELLPQQWQLSITSLIL
2273
LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG 2437 (fs)
2437
LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 2532
2533
AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 2658
8170
VMLPDYYCCM 8199
8599
VEKLIEKLTRNGRNAVAINEHIFSTVDGIIGTFALGETYAAEEFKDISETMDLLSSSSAE 8778
8779
DFFPGSVAGRLVDRLTGLAARREAIFRKLDRFFERIVDQHAAADDDGPAAARRKADDKGS 8958
8959
AGSDLVHELIDLWKMEGNTKQGFTKDHVKAMLL 9057
9159
DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLKMVV 9338
9339
KETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNA*AIGRDPNIWKDPEEFIPERF 9518
9519
EEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKEDIDM 9698
9699
EEAGKLTFHKKIPLLLVPTPNKAPN* 9776
>AC084404.8
CYP71E6 chr 3 incomplete = AC084319.5a
153211
MAASLLLELLPQQWQLSITSLIL 153279
153280
LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG 153444 (fs)
153444
LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 153539
153540
AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 153665
#254
>aaaa01009177.1b
CYP71K1 (indica cultivar-group) orth AP002968 $F 98%
1277
LYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWALPVIGHLHHVAGALPHRAMRDL 1456
1457
ARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAFATRPITPTGKVLMADSVGVVFAP 1636
1637
YGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAVAALTTPGATAAVNL 1816
1817
SERISAYVADSAVRAVIGSRFKNRAAFLRMLERRMKLLPAQCLPDLFPSSRAAML 1981
1982
VSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAEEDLLDVLLRIQSQDKTNP 2143
2144
ALTNDNIKTVIX 2176
2244
DMFVASSETAATSLQWTMSELMRNPRVMRKAQDEVRRALAVAGQDGVTEESLPDLPYL 2423
2424
HLLIKESLRLHPPVTMLLPRECREPCRVMGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFA 2603
2604
PERFEGGGAADFKGTDFEYIPFGAGRRMCPGMAFGLANMELALAALLYHFDWELPGGMLP 2783
2784
GELDMTEALGLTTRRCSDLLLVPAL 2858
>AP002968
$F CYP71K1 40% to
71B24 complement(1875..2513,2584..3501)
AP003204
40% to 71B24 CDS complement(121487..122125,122196..123113)
AQ870215.1
nbeb0036N08f CUGI Rice BAC genomic Length = 754 58% to 99A1
MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWAL
PVIGHLHHVAGALPHRAMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAF
ATRPITPTGKVLMADSVGVVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGR
LLRAVAAAAAVAALTTPGATAAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLER
RMKLLPAQCLPDLFPSSRAAMLVSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAE
EDLLDVLLRIQSQDKTNPALTNDNIKTVIIDMFVASSETAATSLQWTMSELMRNPRVM
RKAQDEVRRALAIAGQDGVTEESLRDLPYLHLVIKESLRLHPPVTMLLPRECRETCRV
MGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFAPERFEGVGAADFKGTDFEYIPFGAGR
RMCPGMAFGLANMELALAALLYHFDWELPGGMLPGELDMTEALGLTTRRCSDLLLVPA
LRVPLRDHER
#253
>aaaa01009177.1a
$P CYP71K2P (indica cultivar-group) AP002968 $F 97%
only 363
bp away from start of second gene, cannot be complete gene
309
LYLLLLALLVAVPFLCLTRSSRRHGCGGGSRLPPSPWALPVIGHLHHVAGALPHRAMRDL 488
489
ARRHGPLMLLRLCELRVVVASTAEAAREVTKTHDLAFATRPITPTGKVLMADSVGVVFAP 668
669
YGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAAAAAALTTPGATAAV 848
849
NLSERISAYVADSAVRAVIGSR 914
no
ortholog found in japonica 9/13/02, may be indica unique pseudogene
does
not exist on AP002968 or AP003204 so it might be a sequence assembly
error.
#209
>aaaa01007181.1a
CYP71K3 (indica cultivar-group) orth AP003990.1h $F chr 2 99%
N-term
(orientation probably incorrect on either a or b)
1038
YLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHRAMRDMAR 859
858 RHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEGVIFAPYG
679
678
DGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAALSSSSPVNLTGMISAFV 499
498
ADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAMLLSRVPAKI 334
333
ERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 181
180 IKSILX 166
88 DMFGAGSETSATTLQWAMAELMRNPAV
2
>aaaa01007181.1b
CYP71K3 (indica cultivar-group) orth AP003990.1h $F chr 2 100% C-term
6892
VMRRAQDEVRRELAVAGNDRVTEDTLPSLHYLRLVIKETLRLHPPAPLLLPRECGGACKV 7071
7072
FGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEFSPERFERCERDFRGADFELIPFGAGRRI 7251
7252
CPGMAFGLAHVELALAALLFHFDWRLPGGMAAGEMDMTEAAGITVRRRSDL 7404
>AP003990.1h
$F CYP71K3 chromosome
2 clone OJ1073_F05
66403
MATELTEYLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHR 66582
66583
AMRDMARRHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEG 66762
66763
VIFAPYGDGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAASSSS 66924
66925
SPVNLTGMISAFVADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAML 67104
67105
LSRVPAKIERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 67281
67282
IKSILI 67299 (0)
67380
DMFGAGSETSATTLQWAMAELMRNPAVMRRAQDEVRRELAVAGNDRVTEDTLPSLHYL 67553
67554
RLVIKETLRLHPPAPLLLPRECGGACKVFGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEF 67733
67734
SPERFERCERDFRGADFELIPFGAGRRICPGMAFGLAHVELALAALLFHFDWRLPGGMA 67910
67911
AGEMDMTEAAGITVRRRSDLLVFAVPRVPVPAQ* 68012
#210
>aaaa01007181.1c
CYP71K4 (indica cultivar-group) orth AP003990.1i $F chr 2 99%
8111
LPPGPWALPVIGHLHHLAGDLPHRALSALARRHGALMLLRLGEVQAVVASSPDAAREIMR 8290
8291
THDAAFASRPLSPMQQLAYARDAEGVIFAPYGDGWRHLRKICTGELLSARRVQSFRPVRE 8470
8471
AELVRLLRSVAEATSSSSSGSLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRML 8638
8639
QDGLKIVPGMTLPDLFPSSRLALFLSRVPGR 8731
9019
DMFGAGSESSATVLQWTMAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRL 9192
9193
VIKETLRLHPPAPLLLPRKCGSTCKILGFDVPEGVMVIVNAWAIGRDPTYWDKPEEFVPE 9372
9373
RFEHNGRDFKGMDFEFIPFGAGRRICPGITFGMAHVELVL 9492 frameshift
9494
LYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNLLVRPI 9607
>AP003990.1i
$F CYP71K4 chromosome
2 clone OJ1073_F05
68586
MPLVVLLLATIPLLFFTIKRSAQRRGGGGGGEGRLPPGPWALPVIGHLHHLAGDLPHRA 68762
68763
LSALARRHGALMLLRLGEVQAVVASSPDAARDIMRTHDAAFASRPLSPMQQLAYGRDAEG 68942
68943
VIFAPYGDGWRHLRKICTAELLSARRVQSFRPVREAELGRLLRSVAEATSSSSSA 69107
69108
SLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRMLQDGLKIVPGMTLPDLFPSSRLALF 69287
69288
LSRVPGRIEHHRQGMQRFIDAIIVEHQEKRAAAAANDDDDEDEDFLDVLLKLQKEMGSQH 69467
69468
PLTTANIKTVML (0)
DMFGAGSESSATVLQWT 69647
69648
MAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRLVIKETLRLHPPAPLLLP 69821
69822
RKCGSTCKILGFDVPEGVMVIVNAWAIGRDLTYWDKPEEFVPERFEHNGRDFKGMDFEF 69998
69999
IPFGAGRRICPGITFGMAHVELVLSALLYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNL 70178
70179
LVRPIHRVSVPVE* 70220
#211
>aaaa01007181.1d
CYP71K5 (indica cultivar-group) orth AP003990.1j $F chr 2 99%
10296
LLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPNRAMRDLARWHGPLMLLRLGE 10475
10476
VEX 10481 frameshift
10486
VVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGLVFAPYGEAWRRLRRVCTQELL 10665
10666
SHRRVQSFRPVREDELGRLLRAVDAAAAAGTAVNLTAMMSTYVADSTVRAIIGSRRLKDR 10845
10846
DAFLRMLDELFTIMPGMSLPDLFPSSRLAMLVSRAPGRIMRYRRRMRRIMDSII 11007
11008
HEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGAQYPLTTENIKTVM 11157
11249
QDIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLNY 11428
11429
LKLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 11608
11609
EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVE 11737
>AP003990.1j
$F CYP71K5 chromosome
2 clone OJ1073_F05
70828
MAGELAFYLLLVGLVAVPLLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPHRA 71007
71008
MRDLARRHGPLMLLRLGEVEAVVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGL 71187
71188
VFAPYGEAWRRLRRVCTQELLSHRRVQSFRPVREDELGRLLRAVDAAAAAGT 71343
71344
AVNLTAMMSTYVADSTVRAIIGSRRLKDRDAFLRMLDELFTIMPGMSLPDLFPSSRLAML 71523
71524
VSRAPGRIMRYRRRMRRIMDSIIHEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGA 71703
71704 QYPLTTENIKTVMM 71745 (0)
71837
DIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLSYL 72016
72017
KLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 72193
72194
EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVELALAALLFHFDWSLPG 72370
72371
GMAADELDMAESSGLTTRRRLPLLVVARPHAALPTKYCN* 72490
#249
>aaaa01012971.1b
CYP71K6 (indica cultivar-group) orth AP003523.1c $F chr 6 99%
7052
LLRYLFSVPMLFFIVPLLFLVCSPGRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAM 7231
7232
RDIARRHGPLVLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGV 7411
7412
IFAPYGETWRQLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELM 7588
7589
SAYAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMP 7756
7757
RRMKRHRERMTAYLDAIIEEHQESRASREDDEDLLDVLL 7873
>AP003523.1c
$F CYP71K6 chromosome
6 clone P0416A11 six different genes
73% to
AP003523.1d 64% to AP003523.1f
128476
MAAELVHLLRYLFSVPM
128425
LFFIVPLLFLVCSPRRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAMRDIARRHGPL 128246
128245
VLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGVIFAPYGETWR 128066
128065
QLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELMSA 127913
127912
YAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMPRRMKRH 127733
127732
RERMTAYLDAIIEEHQESRASREDDEDLLDVLLRM 127628 frameshift
QREGDLEVSRESIRSTIG
bad exon boundary should be phase 0
126439
DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGY 126275
126274
MNLVIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEF 126095
126094
IPERFENAGINFKGTNFEYMPFGAGRRMCPGMAFGLATLELALASLLYHFDWKLPDGV 125921
125920
EIDMKEQSGVTTRRVHDLMLVPIIRVPLPV* 125828
#249
>aaaa01008944.1
CYP71K6 (indica cultivar-group) orth AP003523.1c $F chr 6 98%
see
aaaa01012971.1b for
ortholog
1338
DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGYMNL 1511
1512
VIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEFIPE 1691
1692
RFENAGINFKGTNFEYMPFGAGRRMCPGMAFSLVMLELALASLLYHFDWKLPDGVEIDM 1868
1869
KEQSGVTTRRVHDLMLVPII 1928
#42
>aaaa01001026.1a
$PI CYP71K7P (indica cultivar-group) 3 defects, probable pseudogene
probable
ortholog of AP003523.1d
MAEVVQLHHLILLLPLFILPSSSSVR (fs)
3368
RRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIARRHGPLVLLRLGELPVVVIASS 3189
3188
ADAARNVMKTHDLAFATRPITHMMRLVFPEGSEGIIFSPYGETWRQLRKICTVELLSARR 3009
3008
VNSFRSVREEEVNRLLRAVAAAAASATSPAKMVNLSELMSAYAADSSVRAMIGRRCKDRD 2829
2828
KFLEMLERGIKLFVTPSLPDLYPSSRLAMVVSRMPRRMRRHREEVFAFLDAIIAEHQENR 2649
2648
ASGEDEEDLLDVLLRIQREGCMEST (fs) 2580
2572
PLLSTESIRTTIG bad boundary 0 expected 2540
1662
DLFNGGSETTATTLQWIMAELMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYLVI 1483
1482
KEALRLHPPGPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSKEFIPERF 1303
1302
EHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKLSDKIKVGDLDM 1123
1122
TEERGATTRRLHDLLLVPVIRVPLPLDSRS* 1030
>AP003523.1d
$F CYP71K7P chromosome
6 clone P0416A11 six different genes
probable
ortholog of aaaa01001026.1a
138301
MAEVVQLHHLILLLPLFILPFLLLRSSRRRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIA 138152
138151
RRHGPLVLLRLGELPVVVASSADAARDVMKTHDLAFATRPITRMMRLVFPEGSEGIIFSP 137972
137971
YGETWRQLRKICTVELLSARRVNSFRSVREEEVNRLLRAVAAAAASATSPAKTVNL 137804
137803
SELMSAYAADSSVRAMIGRRCKDRDKFLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMP 137624
137623
RRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCME 137480 frameshift
SPLLSTESIRTTIG bad exon boundary should be phase 0
136562
DLFNGGSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYL 136389
136388
VIKEALRLHPPRPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILE 136209
136208
RFEHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKL 136056
136039
GDLDMTQERGATTRRLHDLLLVPVIRVPLPLDSRS* 135942
#42
>aaaa01012971.1a
CYP71K7P (indica cultivar-group) orth AP003523.1d $F chr 6 99%
see
aaaa01001026.1a for ortholog
4777
GSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYLVIKE 4607
4606
ALRLHPPGPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILERFEH 4427
4426
VDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKLSDKIKVGDLDMTE 4247
4246
ERGATTRRLHDLLLVPVI 4193
#43
>aaaa01001026.1b
$PI CYP71K8 (indica cultivar-group) Pseudogene of AP003523.1e
japonica
gene does not look like a pseudogene
MAGFPVYL
(deletion and fs) LAA (fs) LIILPMANLIRSARHRRLAGAR (fs)
16438
PPPGPWALPVIGHLHHLAGKLPHHHKLRDLAARHGPLMLLRFGELPVVVASSAGAAREITK 16620
16621
THDLAFATRPVTRTARLTLPEGAEGIIFAPYGDGWRQLRKICTLELLSARRVQSFRAVRE 16800
16801
EEVRRLLLAVASPSPEGTTATASVVNLSRMISSCVADSSV RAIIGSGRFKDRETFLRLME 16980
16981
RGIKLFSGPSLPDLFPSSRLAMLVSRVPGRMRRQRKEMMEFMDTIIEEHQAAREASM 17151
17152
ELEKEDLVDVLLRVQRDGSLQFSLTTDNIKAAIA (0) 17253
this
segment is homologous to 108 aa region before the sequence gap at 17972
gene
may not be assembled correctly
RAMIGSRFKDRN*FLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMPRRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCMESTVSTESIRTTIG
(0)
missing Ihelix
exon
19112
YLHLVIKETLRLHPPAPLLLARECREPCQILGFDVPKGAMVLINAWSIGRDPSNWHAPKK 19291
19292
FMPERFEQNNIDFKRTSFKYIPFGAGRRICPGMTFGLANIELLLASLLYHFDWELPHGMQ 19471
19472
AGDLDMTETLAVTARRKADLLVVPVVRVPIVG* 19570
>AP003523.1e
$F CYP71K8 chromosome
6 clone P0416A11 six different genes AQ331067 AQ364007.2
AQ331067
55% identical to AQ328148 47% identical to C74921 57% to 71B4 58% to 76C1 64%
to AP003523.1f
AQ364007.2
nbxb0060E04f CUGI Rice BAC Length = 393 65% to 99A1
End of
this gene matches AP003571 at 155149
151229
MAGFPVYLLFLAALIILPMANLIRSARHRRLAGARRPPPGPWALPVIGHLHHLLAGKLPH 151408
151409
HHKLRDLAARHGPLMLLRFGELPVVVASSADAAREIAKAHDLAFATRPVTRTARLTLPEG 151588
151589
GEGVIFAPYGDGWRQLRKICTLELLSARRVLSFRAVREQEVRCLLLAVASPSPEGTTAT 151765
151766
ASVVNLSRMISSCVADSSVRAIIGSGRFKDRETFLRLMERGIKLFSCPSLPDLFPSSR 151939
151940
LAMLVSRVPGRMRRQRKEMMEFMETIIEEHQAARQASMELEKEDLVDVLLRVQRDGSLQF 152119
152120
SLTTDNIKAAIA 152155 (0)
166133
DLFIGGSETAATTLQWAMSELLNNPKVMQKAQDEIRQVLYGQERITEETISSLHYLHL 166306
166307
VIKETLRLHPPTPLLLPRECREPCQILGFDVSKGAMVLINAWSIGRDPSNWHAPEKFMPE 166486
166487
RFEQNNIDFKETSFEYIPFGAGRRICPGMTFRLANIELLLASLLYHFDWELPYGMQAGD 166663
166664
LDMTETLAVTARRKADLLVVPVVRVPIVG* 166753
#44
>aaaa01001026.1c
$FI CYP71K9
(indica
cultivar-group) Cterminal differs from AAAA01001026.1b and AP003523.1f may be a
frameshift (check)
may be
ortholog of AP003523.1f 95%
20983 MAAAASSVLAYLLVVALLAIVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGAL
21162
21163
PHVAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPE 21342
21343
GGEGIIFAPYGDRWRELRKICTVELLSARRVQSFRPVREEEAGRLLRAVAAASSPSPAQ 21519
21520
AAVNLSALLSAYAADSAV RAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMW 21699
21700
LSRMPRRMMQHRREAYAFTDAIIREHQENRAAGAGDDKEDLLDVLLRIQREGDLQF 21867
21868
PLSTERIKTTVG (0) 21903
22325
DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 22498
22499
VIKEVLRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 22678
22679
RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFDWQLPDGMDTADL 22858
22859
DMTEEMVVSARRLXXXXXXXVVHVPLPVASS 22951
>AP003523.1f
$F CYP71K9 chromosome
6 clone P0416A11 six different genes
AU096456
AU096455 71% to AP002968 65% to 71B24 also = AU032983
may be
ortholog of aaaa01001026.1c 95%
168538
MAAAASSVLAYLLVVALLAIVPLVYFGWVARRRGEGGRLPPSPWGLPVIGHLHHLAGALPHHAMRDLA 168741
168742
RRHGPLMLLRLGELPVVVASSAEAAREVMRTRDIEFATRPMSRMTRLVFPAGTEGIIFAP 168921
168922
YGDEWRELRKVCTVELLSARRVQSFRAVREEEVGRLLRAVAATSSSPSPAQAAVNL 169089
169090
SALLSAYAADSAVHAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMWLSRMP 169269
169270
RRMMQHRREAYAFTDAIIREHQENRAAGAGDGDGDDKEDLLDVLLRIQREGDLQFPLSTE 169449
169450
RIKTTVG 169470 (0)
169893
DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 170066
170067
VIKEALRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 170246
170247
RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFNWQLPDGMDTAD 170423
170424
LDMTEEMVVSARRLHDLLLVPVVHVPLPVASS* 170522
#45
>aaaa01001026.1d
$FI CYP71K10 (indica
cultivar-group) orth of AP003571.1h $F 99%
28790
MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPHVA 28966
28967
MRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEGGEG 29146
29147
IIFAPYGDRWRELRKICTVELLSARRVQSFRPVREEEAGRLLRAVAAASPGQAVN 29311
29312
LSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAMLLSRM 29491
29492
PRRMKQHHRDMVAFLDAIIQEHQENRSAAGDDDDNDLLDVLLRIQREGDLQFPLSS 29659
29660
ESIKATIG (0) 29683
29867
DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEIRRELIGHRKVTEDTLCRLNYMHMVI 30046
30047
KEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPERF 30226
30227
EHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENLDM 30406
30407
TEEMRFTTRRLHDLVLIPVVHVPLPTI* 30490
>AP003571.1h $F CYP71K10 chromosome 6 clone P0458E02
continuation of contig AP003523.1
40% to
71B23
AQ687385
nbxb0074N19f 50% to 71B20
AQ258331
nbxb0020M04r 71-like sequence 36% to 71B33
145371
MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPH 145201
145200
VAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEG 145024
145023
GEGIIFAPYGDRWRELRKICTVELLSGRRVQSFRPVREEEAGRLLRAVAAASPG 144862
144861
QAVNLSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAM 144685
144684
LLSRMPRRMKQHHRDMVAFLDAIIQEHQENRSAAADDDNDLLDVLLRIQREGDLQFPLS 144508
144507 SESIKATIG 144481 (0)
144297
DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEVRRELIGHRKVTEDTLCRLNYMHM 144124
144123
VIKEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPE 143944
143943
RFEHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENL 143764
143763
DMTEEMRFTTRRLHDLVLIPVVHVPLPTI* 143674
#248
>aaaa01008885.1
$FI CYP71K11 (indica
cultivar-group) almost same as AAAA01011410.1
ortholog
to AC118346.1a gene 1
6884
MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 6705
6704
LPPHHAMRDIALRHGPLVRLRLGGLQVILASSVDAAREVMRTHDLAFATRPSTRVMQLVF 6525
6524
PEGSQ (0)
GIVFTPYGDSWRNLR 6345
6344
KICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNLSELISAYSADSTMRALI 6165
6164
GSRFKDRDRFLMLLERGVKLFATPSLPDLYPSSRLAELISRRPRQMRRHRDEVYAFLDII 5985
5984
IKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG 5853
5758
DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 5585
5584
VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 5405
5404
RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 5225
5224
DMKEEMGAIARRLHDLSLVPVIRHPLPVDM 5135
>AC118346.1a
$F CYP71K11 Gene 1,
94448-96200, 3 exons 97% identical to gene 2 (12 diffs)
36% to 41%
with 71As and 71Bs
94448
MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 94627
94628
LPPHHAMRDIALRHGPLVRLRLGGLQVI 94711
94712
LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0)
GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLL 95071
95072
RAVAAASPARRAVNLSELISAYSADSTMRALIGSRFKDRDRFLMLLERGVKLFATPSLPD 95251
95252
LYPSSRLAELISRRPRQMRRHRDEVYAFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQR 95431
95432
KGDFPLSTDNIKTTIG (0) 95479
95574
DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 95747
95748
VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 95927
95928
RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 96107
96108
DMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 96200
#286
>aaaa01011410.1
$FI CYP71K12 (indica cultivar-group) Ortholog to AC118346.1b gene 2
4935
MADQLVHLPQQLLVL
LLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGALPPQHAMRNIALRH 5156
5157
GPLVRLRLGGLQVILASSVDAAREVMRRHDLAFATRPSTRVMQLVFPEGSQ (0) 5309
5428
GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNLSE 5607
5608
LISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRPR 5784
5785
QMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 5964
6058
DLFNGGSETTATTLKWIMAELIRNPRVMQKAQDEVRQVLGKHHKVTEEALR 6210
6211
NLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFHVPQGTMILVNMWAISRDPMYWD 6387
6388
QAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFNWELP 6567
6568
DETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 6684
>AC118346.1b
$F CYP71K12 (japonica
cultivar-group) chromosome 11 clone Ba0039F06,
Gene 2 =
AU096586.1, D48250 97% identical to gene 1 (12 diffs)
113634
MADQLVHLPQQLLVLLLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGA 113813
113814
LPPQHAMRNIALRHGPLVRLRLGGLQVI 113897
113898
LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0) 114008
114127
GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNL 114300
114301
SELISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRP 114480
114481
RQMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 114663
DLFNGGSETTATTLKWIMAELIRNPRVM 114840
114841
QKAQDEVRQVLGKHHKVTEEALRNLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFH 115020
115021
VPQGTMILVNMWAISRDPMYWDQAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIA 115200
115201
FGLVNLELVLASLLYHFNWELPDETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 115383
#287
>AC118346.1c
Gene 3 $P CYP71K13P pseudogene 56% to AC118346.1 genes 1, 2 no ortholog
117756
DAAREVMRTHDLAFATRPSTRVMQLVFLEGSQ 117661
117553
GDRFTPYGDIWRNLRRSAPLAVSAKRVQFFRPIHQEEVCRLLQAVAVASPA 117395
117394
RGPPETLTSSFRPTWATLQCAP**GARLRDRDKSLMLLYRGVKPIRHARACQIFTQSIAL 117215
117214
ADLIIKSLSPMRRASYPMSNLLDIIFK 117134
117108
SDNHMDLTLVAFLLRFHKKGACPLSFCYIRKQFG*AF 116998
#172
>aaaa01005413.1
$FI CYP71P1 (indica
cultivar-group) ortholog to AL713951.1
5862
MSLALLVLSAAYVLVALRRSRSSSLKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELARTMRAPLFRMRL
GSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAGPYHRMARR
VVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLANDVLCRVAFG
RRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCLADLREACD
VIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0) 4972
4875
DMFVAGTDTTFATLEWVMTELVRHPRILKKAQEE
4773
VRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPARTR 4594
4593
VFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGYTFA 4414
4413
LATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFKGEE 4234
4233
LSEV* 4219
>AL713951.1
$F CYP71P1 chromosome
12 clone Monsanto- 39% to 83B1
AF088221
BI305808.1 49% to 76C6 mRNA
44616
MSLALLVLSAAYVLVALRRSRSSSSKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELART 44437
44436
MRAPLFRMRLGSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAG 44257
44256
PYHRMARRVVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLAND 44077
44076
VLCRVAFGRRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCL 43897
43896
ADLREACDVIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0)
DMFVAGTDTTFATLEWVMTELVRHPRILKKA 43537
43536
QEEVRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPA 43357
43356
RTRVFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGY 43177
43176
TFALATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFK 42997
42996
GEELSEV* 42973
#264
>aaaa01009895.1
$PI CYP71P2P (indica cultivar-group) orth of AP003544.1
6273
GSMPAMVISKPNLARPALTTNDAVLASRQHLLNG*FLSFG frameshift
CSDVTFAPAGPYHRM
frameshift
QMAR 6094
6093
GVEVSELLSAHHVAMYGVVRVKELQRLLAHLTNNTSSAKPIDLSECFLNLANDVLCRVAF 5914
5913
GRRFPRDEGDKLSAVLANAQDL 5848 frameshift
5848
LAGFTISDFFLELEPVASTVTGLCHRLKKCLADLYEACDVIVDVHISGNRQRIPSDREED 5669
5668
FVDVLLRVQ 5642
>AP003544.1
$P CYP71P2P chr 6 clone P0599C12 same as AP003686.1 8668-8037 pseudogene of
AL713951.1
this gene
matches a barley EST BF255745.2 75% and a sorghum EST BE354971.1 79%
so there
must be a functional copy of this gene in rice
107880
GSMPAVVISKPNLARPALTTNDAVLASRQHLLNG*FLSF 107764 frameshift
107762
GCSDVTFAPAGPYHRM 107715 frameshift
107713
QMARGVEVSELLSAHHVAMYGVVRVKELQRLLAHLTKNTSSAKPIDLSECFLNLANDVLCRVAF 107521
107520
GRRFPRDEGDKLSAVLANAQDLL 107452 frameshift
107452
AGFTISDFFLELEPVASTVTGLCHRLKKCLADLCEACDVIVDVHISGNRQRIPSDREEDFVDVLLRVQ 107249
#127
>aaaa01003512.1
i CYP71Q1 (indica cultivar-group) ortholog to AP004346.1a 95%
6414
SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVHSFAYARTAEVARLVDTLAASPPGVPF 6593
6594
DISCTLYQLLDGIIGTVAFGKVYGAAQWSTERAVFQDVLSELLLVLGSFSFEDFFPSSAL 6773
6774
ARWGDALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQEDMVDALVRMWREQQDR 6953
6954
PSGVLTREHIKAILM 6998
8541
NTFAGGIDTTAITAIWIMSELMRNPRVMQKAQAEVRNTVKNKPLVDEEDIQNLKYLE 8711
8712
MIIKENFRLHPPGTLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRDPMIWDNPEEFYP 8891
8892
ERFEDRNIDFRGSHFELVPFGSGRRICPGIAMAVASLELVVANLLYCFDWKLPKGM*EED 9071
9072
IDMEEIGQLSFHRKVELFIVPVKHEQCEP*DQLMGH 9179
>AP004346.1a
$F CYP71Q1 two genes
and a pseudogene
71B like
47% to AC092559.2 75% to AP004346.1b
22020
MADDFLSSQPQPW 22058
22059
PPLLQLSAAVLFFLLPLLYLLFLRGSNGEVRGRQGNSASAPSLPGPCRQLPVLGNLLQIG 22238
22239
SRPHRYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRPPSPG 22405 (2)
26427
SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVRSFAYARAAEVARLVDTL 26577
26578
AASPPGVPVDLSCALYQLLDGIIGTVAFGKGYGAAQWSTERAVFQDVLSELLLVLG 26745
26746
SFSFEDFFPSSALARWADALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQED 26919
26920
MVDALVKMWREQQDRPSGVLTREHIKAILM 27009 (0)
28586
NTFAGGIDTTAITAIWIMSEIMRNPRVMQKARAEVRNTVKNKPLVDEEDSQNLKYLEMIIKEN 28774
28775
FRLHPPGNLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRGPMIWDNPEEFYPERFE 28948
28949
DRNMDFRGSNFELVPFGSGRRICPGVAMAVTSLELVVANLLYCFDWKLPKGMKEEDIDM 29125
29126
EEIGQISFISFRRKVELFIVPVKHEQYQLMGHIN* 29221
#127
>aaaa01043764.1
CYP71Q1 (indica cultivar-group) orth AP004346.1a $F 97%
see
aaaa01003512.1 for ortholog
905
LSAAVLFFFLLPFLYLLFLRGSNGEVRGRQGNSASAPSPPGPCRQLPVLGNLLQIGSRPH 726
725
RYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRP 585
#121
>aaaa01017559.1
CYP71Q2 (indica cultivar-group) orth AP004346.1b $F 98%
see
aaaa01003239.1b above
3049
DCCLHPVCTRFFSPYSAYWREMRKLLVIELTSIRRVQSFAYARAAEVAR 3195
3196
LVDTLAASLAGVPVDLSSALYTFSDGVIGTVAFGKVYGSAAWSSSEWGGSFQEAMDETM 3372
3373
QVLGSFSFEDFFPSSALARWADALTGAAGQRRRVFHRIDGFFDAVIDKHLEPERLSAGV 3549
3550
QEDMVDATVKVWREQKDEAFGLTCDHIKAIL 3642
#121
>aaaa01025743.1
CYP71Q2 (indica cultivar-group) orth AP004346.1b $F 99%
see
aaaa01003239.1b for ortholog
1225
FLLLPLVYLLFFKGDGNGGVMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRY 1404
1405
GPVVQVQLGSIRTVVVHSPEAAKDVLRTNDLQCCSRPSS 1521
#121
>aaaa01003239.1b
i CYP71Q2 (indica cultivar-group) exon 2 ortholog to AP004346.1b 95%
17640
LQDAFVGGIDTTAVTTTWIMSELMRNPRVVQKA*AEVHNIVKNKSKVCKEDIQNMKYLKM 17461
17460
IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNICDNPEQFYPE 17281
17280
RFEDKGIDFRGSHFELLPFGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDI 17101
17100
DMDEIGQLAFRK 17065
>AP004346.1b
$F CYP71Q2 two genes
and a pseudogene 48% to AC092559.2 75% to AP004346.1a
69192
MATELLASQLLPWQPLVQLLAAGLFLLPLVYLLFFKGDGNGG 69317
69318
VMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRYVPVVQVQLGSIRTVVVHS 69494
69495
PEAAKDVLRTNDLQCCSRPSSPG 69565 (2)
72047
NYNYLDVAVSPYS 72083 (frameshift)
72085
YWREMRKLLVIELTSIRRVQSFAYARAAEVARLVDTLAASPAGVPVDLSSALYTF 72249
72250
SDGVIGTVAFGKVYGSAAWSSWEWGASFQEAMDETMQVLGSFSFEDFFPSSALARWADALTGA 72438
72421
AGRRRRVFHRIDGFFDAVIDKHLEPERLSAGVQEDMVDAMVMVWREQKDEAFGLTRDHIKAILL 72630 (0)
84351
DAFVGGIDTTAVTVTWIMSELMRNPRVMQKAQAEVHNIVKNKSKVCEEDIQNMKYLKM 84524
84525
IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNIWDNPEQFY 84698
84699
PERFEDKGIDFRGSHFELLPFGSGRRICPGIAMGVANVELVVANLLYCFNWQLPKGMKEE 84878
84879
DIDMDEIGQLAFRKNFLF* 84935
note
there is another gene on AP004346.1
#120
>aaaa01003239.1a $PI CYP71Q3 (indica cultivar-group) exon 2 ortholog to AP004346.1c 96%
2517
DAFAGGIDTTVVTTTWIMSELMRNPTVMQ 2431 frameshift
2432
KAQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCKLHPPGTLLIPRHTMKTCTIGGYN 2253
2252
VPSKTRIYVNVWAMWRDPNIWDNPEQFYLERFEDKGIDFRGSHFELLT 2109 (?)
1820
FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGTKEEDIDMDEIG*LAFRK 1659
>AP004346.1c
$P CYP71Q3 two genes and a pseudogene
probable
pseudogene 89% to AP004346.1b
92865
DAFAGGIDTTVVTTTWIMSELMRNPRVMQK 92954 (frameshift)
92956
AQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCRLHPPGTLLIPRHTMKTCTIGGYSV 93132
93133
PSKRRIYVNVWAMWRDPNIWDNLEQFYLERFEDKGIDFRGSHFELLT 93273 (insertion)
93561
FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDTDMDEIG*LAFRKKLPLFI 93740
93741
VPMKH* 93758
#84
>aaaa01002200.1 $PI CYP71Q4P (indica cultivar-group) ortholog of AC087599.11 $P 94% PERF region resembles CYP71Q sequences
2930
PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSPEEFWPERFLASREAMD 3109
3110
FQGNNYQLILFITDRRICPDINFAVPVLETALVGLLHPTNELLGGGGGLMWLQRSCSRAR 3289
3290
RLRSTGHRRHRSGTHPAAAVAAAAT 3364
>AC087599.11
$P CYP71Q4P chromosome 10 clone OSJNBa0057L21, pseudogene fragment like 71A1
44% to AAAA01006105.1b
16812
GGGGRWTETLEWIMAELTANTRVMAKLQDEISRAADGK 16925 24 aa deletion and frameshift
16931
PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSSEEFWPEQFLASREAVD 17110
17111
FQGNNYQLILFITDRRIFPDINFAVPVLETALVGLLHPTNELLGGG 17248
17249
GGLMWLQRSCSRARRPRSTAHRRHRSGTHPAAIAAAAAT* 17368
#145
>aaaa01004200.1
CYP71R1 (indica cultivar-group)
16322
MAAVQLDFGLLVGFLFLATCLAVAIRSYLRSGGAAIPSPPALPVIGNLHQLGR 16480
16481
GRHHRALRELARRHGPLFQLRLGSVRALVVSSAPMAEAVLRHQDHVFCGRPQQRTARGTL 16660
16661
YGCRDVAFSPYGERWRRLRRVAVVRLLSARRVDSFRALREEEVASFVNRIRAASGGGV 16834
16835
VNLTELIVGLTHAVVSRAAFGKKLGGVDPAKVRETIGELADLLETIAVSDMFPRLRWVDW 17014
17015
ATGLDARTKRTAAKLDEVLEMALRDHEQSRGDDDDGGGGDGEPRDLMDDLLSMANDGGGD 17194
17195
HGHKLDRIDVKGLILV (1)
NMFIA
(frameshift)
GTDTIYKSIEWT
17374
17375
MAELIKNPAEMAKVQAEVRHVAAAAHGDEDEDTVAVVREQQLGKMTLLRAA 17527
17528
MKEAMRLHPPVPLLIPREAIEDTVLHGHRVAAGTRVMINAWAIGRDEAAWEGAAEFRPGR 17707
17708
FAGGGDAAGVEYYGGGDFRFVPFGAGRRGCPGVAFGTRLAELAVANMACWFEWELPDGQ 17884
17885
DVESFEVVESS 17917
aaaa01004200.1
no ortholog in japonica 9/6/02
#214
>aaaa01007242.1
$PI CYP71R2P (indica cultivar-group) ortholog of AP003575.1 99%
11280
MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAAITSPPALPVIGNLHQLGR 11459
11460
GRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRHQDHVFCGRPQQHTARGTL 11639
11640
YGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEEVASFVNRIRAASGGGGGV 11819
11820
VNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGELADLLGTIAVSDMFPRLRWVDW 11999
12000
ATGLDARTKRTAAKLDEVLEMVLRDHEQSRGDDDDDDGDGEARDLMDDLLSMANGGDDHG 12179
12180
YKLDRIDVKGLLILV (0)
DMFAAGTDTVYKSIE
frameshift
MAEL 12359
12360
IKNPAEMAKVQAEVRHVVAAAHGGEGDEDAVVIVKEEQASS frameshift
12482
LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 12661
12662
EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 12841
12842
WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYAVETT 12976
>AP003575.1
$P CYP71R2P chromosome 6 clone P0528B02, similar to 71A24 one in frame stop
codon 395 to 71A14
54816
MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAA
54690
ITSPPALPVIGNLHQLGRGRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRH 54511
54510
QDHVFCGRPQQHTARGTLYGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEE 54331
54330
VASFVNRIRAASGGGGGVVNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGEL 54160
54159
ADLLGTIAVSDMFPRLRWVDWATGLDARTKRTAAKLDEVLEMVLRDHEPSRGDDDDDDGD 53980
53979
GEARDLMDDLLSMANGGDDHGYKLDRIDVKGLLIL 53875 (0)
DMFAAGTDTVYKSIE*TMAELIKNPAEMAKVQAEVRHVVAAAHGGEGDEDA (0) may be incorrect
joint
53613
LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 53434
53433
EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 53254
53253
WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYVVQTTRM* 53110
#448
>aaaa01059584.1
CYP71R3 (indica cultivar-group) 59% to CYP71R1
596
VAAKTRVIINTWAIGRDSIIRENAEEFLPERFIDNGIDYNSKDFSFIPFGAGRRGCPGIAFATRLA 793
794
ELALANLMYHFDWELQEGQDLESFQLVSPSVIQTWGSS 907
no
japonica ortholog found 9/12/02
#362
>aaaa01016223.1
CYP71S1 (indica cultivar-group) orth AL606614.1b $F chr 4 96%
1585
PRPRGLPLIGNLHQVGALPHRSLAALAARHATPLMLLHLGSVPTLVVSTADAARALFRDN 1764
1765
DRALSGRPALYAATRLSYGQKNISFAPDGAYWRAARRACMSALLGAPRVRELRDAREREA 1944
1945
AALIAAVAAAGASPVNLSDMVAATSSRIVRRVALGDGDGDESMDVKAVLDETQA 2106
2107
LLGGLWVADYVPWLRWVDTLSGMRRRLELRFHQLDALYERVIDDHLNNRKHASDEE 2274
2275
DDLVDVLLRLHGDPAHRSTFGSRSHIKGIL 2364
2699
DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHY 2872
2873
LRLVIKETLRLHPAAPLLVPREMTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAE 3052
3053
RFVPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWR 3223
3224
APPGREVDVEEENGLVVHKKNPLVLI 3301
>AL606614.1b
$F CYP71S1 chromosome
4 clone OSJNBb0011N17 40% to 71A25
23233 MSMASLQAPEFLASCLLLATILFFKQLLAPSSKQRAASPSLPRPRGLPLIGNLHQVGALPHRSLAALAAR 23096
23095
HAAPLMLLRLGSVPTLVVSTADAARALFRDNDRALSGRPALYAATRLSYGQKSISFAPD 22919
22918
GAYWRAARRACMSELLGPPRVRGLRDAREREAAALVAAVAAAGASPVNLSDMVAATSSR 22742
22741
IVRRVAFGDGDGDESMDVKAVLNETQALLGGLWVADYVPWLRWVDTLSGKRWRLERRFRQ 22562
22561
LDALYERVIDDHLNKRKHASDEEDDLVDVLLRLHGDPAHRSTFGSRSHIKGILT 22400 (0)
22059
DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHYLR 21889
21888
LVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAERF 21709
21708
VPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWRAP 21538
21537
PGREVDVEEENGLAVHKKNPLVLIATKSKRNTGGH* 21427
#362
>aaaa01040889.1
CYP71S1 (indica cultivar-group) orth AL606614.1b $F chr 4 100%
see
aaaa01016223.1 for ortholog
822
DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHY 986
987
LRLVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAE 1166
1167RFVPERHRD
1193
#298
>aaaa01011971.1
CYP71S2 (indica cultivar-group) orth AL606614.1a $F chr 4 96%
987 DMFIAGSDTSAVTVQWAMTELVRNPDVLAR 1085
AQHEVRRVVAAAGGGDKDGAMVREADLPELHYLRLVIKETLRLHPASPLVQR 1240
1241
ETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWGPDAERFVPERHRAHDADGGQQHDGF 1420
1421
ALVPFGIGRRSC 1456
1450
ELLLANLLFCFDWSAPPGREVDVEEENGLAVRKKNPLVLI 1569
>AL606614.1a
$F CYP71S2 chromosome
4 clone OSJNBb0011N17 90% to AL606614.1b
19270
MASLQAPEFLASCLLLLATILLFKQLLAPSSKKRAASPSLPRPKGLPLIGNLHQVGALPHRSLAAL 19073
19072
AARHAAPLMLLRLGSVPTLVVSTADAARALFRNNDRALSGRPALYAATRLSYGQKNISF 18896
18895
APDGAYWRAARRACMSALLGAPRVCELRDAREREAAALIAAVAAAGASPVNLSDMVAAT 18719
18718
SSRIVRRVAFGDGDGDESMDVKAVLDETQSLLGGLWVADYVPWLRWVDTLSGMRRRLERR 18539
18538
FRQLDAFYERVIDDHINKRKHASDEEDDLVDVLLRLHGDPAHRSMFGSRTHIKGILT (0)18368
17337
DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVIAGGGGGDKDGAMVREADL 17149
17148
PELHYLRLVIKETLRLHPASPLVQRETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWG 16969
16968
PNAERFLPERHRAHDADGEQQHEHDGFALVPFGIGRRSCPGVHFAAAAAELLLANLLFCF 16789
16788
DWRALPGREVDVEEENGLAVRKKNPLVLIATKSKSNRDAH* 16666
#24
>aaaa01000559.1
$FI CYP71T1 (indica
cultivar-group) 98% to AP003434.1a
22050
MELSSSLAAVLHSPLFLLAALLLLPVFTLLSFSSAKKPGDGGGRRLPLPPSPRGVPFLGH 21871
21870
LPLLGSLPHRKLRSMAEAHGPVMLLWFGRVPTVVASSAAAAQEAMRARDAAFASRARVSM 21691
21690
AERLIYGRDMVFAPYGEFWRQARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGV 21511
21510
RGGGETVNLSDMLMSYANGVISRAAFGDGAYGLDGDEGGGKLRELFANFEALLGTATVGE 21331
21330
FVPWLAWVDKLMGLDAKAARISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDH 21151
21150
RDFVDVLLDVSEVEEGAGAGEVLLFDAVAIKAIIL 21046
20444
DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELRL 20265
20264
LRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRPXAAWGDRAEE 20085
20084
FVPERWLDGGGGGEAVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLL 19917
YHFDWELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV 19769
>AP003434.1a
$F CYP71T1 chromosome
1, PAC clone:P0452F10, complete 41% to 71A24 = C98812
C98812 52%
identical to D48413 43% to 71A13, 44% to 71B10
34853
MELSSSLAAVLHSPLFLLAAL
34916
LLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGHLPLLGSLPHRKLRSMAEAHGP 35095
35096
VMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRMAERLIYGRDMVFAPYGEFWR 35272
35273
QARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGVRGGGETVNLSDLLMSYANGV 35452
35453
ISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGEFVPWLAWVDKLMGLDAKAA 35629
35630
RISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDHRDFVDVLLDVSEVEEGAG
AGEVLLFDTVAIKAIIL (0)
36458
DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELR 36634
36635
LLRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRDAAAWGDRAE 36814
36815
EFVPERWLDGGGEEVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDW 36994
36995
ELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV* 37129
#76
>aaaa01002066.1
$PI CYP71T2 (indica cultivar-group) ortholog of AP003434.1b $F 99%
1728
MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLPLPPSPPGVPLLGH 1549
1548
LPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRTRDLAFASRPRVRM 1369
1368
SERLFYGRDM 1339 frameshift and deltion
DFVDVMLDVSEAEEGAGAGAGGVLLDTVAIKAVIL 1233
>AP003434.1b
$F CYP71T2 chromosome
1, PAC clone:P0452F10, complete = AA754300
AA754300 42% IDENTICAL
TO 71A14 1/98 I-HELIX 43% to
703A2
39698
MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP
39839
LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018
40019
RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195
40196
VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375
40376
DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552
40553
VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)
42074
DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169
42170
QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349
42350
DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529
42530
RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709
42710
VRLKADLNLVAKPWSPGAS* 42769
note
there are 4 sequences on AP003434.1
#206
>aaaa01006724.1
CYP71T3 (indica cultivar-group) orth AP003434.1c $F chr 1 99% also AU163704.1
7723
RRRLPPSPPWGLPLLGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEE 7544
7543
VMRTRDLEFASRPRVAMAERLLYGGRDVAFAPYGEYWRQTRRICVVHLLSARRVLSFRRV 7364
7363
REEEAAALVARVRAAGGAVDLVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRVLR 7193
7192
KLFDDFVELLGQEPMGELLPWLGWVDALNGMEVKVQRTFEALDGILEKVIDDHRRRRR 7019
7018
EVGRQMDDGGGGDHRDFVDVLLDVNETDMDAGVQLGTIEIKAII 6887
5268
DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 5095
5094
AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPARTRIVINAWTIGRDQATWGEHAEEFI 4915
4914
PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 4735
4734
EFGTSSLDMSEMNGLSVHLKYGLPLIAI 4651
>AP003434.1c
$F CYP71T3 AU163704.1
chromosome 1, PAC clone:P0452F10, complete 44% 71A14
48011
MAVSLVVVVVV
48044
VIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHLLGALPHRALRS 48223
48224
LAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAERLLYGGRDVAFA 48403
48404
PYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVDLVEHLTAY 48574
48575
SNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLGWVDALN 48748
48749
GMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVNETDMD 48925
48926 AGVQLGTIEIKAIIL 48970 (0)
51142
DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 51312
51313
AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAWTIGRDQATWGEHAEEFI 51492
51493
PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 51672
51673
EFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP* 51771
#204
>aaaa01006398.1c $FI CYP71T4 (indica cultivar-group)
AP003434.1d 95%
probably
orthologs since 6398 and 3434 have only 3 nuc diffs and two
1 nuc
indels in the intron.
9740
MAVSLLVVLLVVLAIVVPLLYLVLLPAGNTTRNGAARWEDDGGDGRRRRRLPPSPRGLPL 9919
9920 LGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDVEFASRP
10096
10097
RMAMAELLLYGGRDVAFAPYGEYWRQAPRICVVHLLSARRILSFRRVREEEAAALVGRV 10273
10274
RAAAADVVDLSDLLIAYSNTVLTRIAF GDESARGGGGGDRGRELRKVFDDFARL 10435
10436
LGTEPMGELLPWFWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRLMDDDGGG 10615
10616
DHRDFVDVLLDVNETDKDAGIQLGTVEIKAIIM (0) 10711
11174
DMFVGGSDTTTTMIAWTMAELINHPRAMHKAQNEIRAVVGNTSHVTKDHVDKLPYLKAVF 11353
11354
KETLRLHPPLPLLIPREPLADAQILGYTIPAHTRVVINAWAIGRDPAAWGQQPDEFSPEK 11533
11534
FLNGAIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWE 11680
11681
AAATDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 11812
>AP003434.1d
$F CYP71T4 chromosome
1, PAC clone:P0452F10, complete like 71A
58119
MAVSLLPAVL
58149
VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328
58329
LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508
58509
GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685
58686
LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859
58860
FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036
59037
VNETDKDAGIQLGTVEIKAIIM 59102 (0)
59562
DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717
59718
LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897
59898
PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077
60078
TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200
#205
= #203 = #445 reduce gene count by 2
>aaaa01006398.1d
$PI CYP71T5 (indica cultivar-group) seq gap before 12414
first exon
has two frameshifts and part is missing (1205) no ortholog
GDESARG (fs)
RALRKLFENFARLLGTEPMGELLPWLGWVDAV (fs)
WLDGKVQRTFEALDSIIEKVIDDHRRRRRRREVGRQMDSDDDGGGGG
DHRDFVDVLLDVNETDKDAGIRLGTIEIKAIIL (0)
12924
DMFAAGTDTTTTAMEWAMAELITHRDAMHKVQDEIRAVVGVTGCVTEDHIDRLPYLKAVL 13103
13104
KETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPATWGEHAEKFIPER 13283
13284
FLNNNVDYKGQDFGLVPFGAGRRGCPGMGFAVPTIEMALASLLYNFSWETRPVDRRCKSG 13463
13464
TSSLDMSEMNGISVRLKYGLPLIAKSHFP* 13553
#203
= #205 = #445 reduce gene count by 2
>aaaa01006398.1b
$FI CYP71T5 (indica
cultivar-group) no ortholog
4560
MAVSLLPAVLVLLAIVAPLLYLVLLPAVKYTTSNGAARWEDDDGGDGRRRRRLPPSPRGLPLLGHLHLLGAL 4775
4776
PHRALRSLAAAHGPVLLLRLGRVPAVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLYG 4955
4956
GRDVAFAPYGEYWRHARRICVVHLLSARRVLSFRRVREEEAAALVARVRAAARAPGAR 5129
5130
GAVDLVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRALRKLFDDFVELLGQEPMGELL 5309
5310
PWLGWVDAVRGLDGKVQRTFEALDSIIEKVIDDHRRRRRRHEVGRQMDSDDDGG 5471
5472
GGGDHRDFVDVLLDVNETDKDAGIRLGTIEIKAIIL (0)
DMFAAGTDTTTTAMEWAMAELITHRDAMHKVQDEIRAVVGVTG 5831
5832
CVTEDHIDRLPYLKAVLKETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIG 6011
6012
RDPVTWGEHAEKFIPERFLNNNVDYKGQDFGLVPFGAGRRGCPGMGFAVPTIEMALASLL 6191
6192
YNFSWETRPVDRRCKSGTSSLDMSEVNGISVHLKYGLPLMAKFYSS* 6332
aaaa01006398.1b
no ortholog found in japonica 9/7/02
#445
= #203 = #205 reduce gene count by 2
>aaaa01054542.1
CYP71T5 (indica cultivar-group) 76% to AP003434.1c $F
96% to
aaaa01006398.1d $PI >99% over 970 bp eve outside the coding region
581
DMFAAGTDTTTTAMEWAMAELITHRNAMHKVQDEIRAVVGVTGCVTEDH 408
407
IDRLPYLKAVLKETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPATW 228
227
GEHAEKFIPERFLNNYVDYKGQDYGLVPFGAGRRGCPGMGFAVPTIEMALASLLYISAWE 48
47 TRPVDR 30
no
japonica ortholog found 9/12/02
#202
>aaaa01006398.1a CYP71T6 (indica cultivar-group) (partialI)
1854
MVVVVVVVAIAIVVPLLYLVLLPPARRGGGDSARRRLPPSPRGLPLLGHLHLLGALP 2024
2025
HRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLY 2198
2199
GGRDVAFAPYGEYWR 2243 sequence gap here
aaaa01006398.1a
no ortholog found in japonica 9/7/02
#89
>aaaa01002288.1
$FI CYP71T7 (indica
cultivar-group) 13560 MDISLASLVLVLLAFVLPLLYLLLQLPGKKSGGGGGDGPRLPPSPAGCLPLLGHLHL
13390
13389
LGPLPHVALRSMAAAHGPVLRLRLGRVPTVVVSSAAAAEEVLRARDAAFSSRPRSAMAER 13210
13209
ILYGRDIAFAPYGEYWRQARRVCVVHLLSAQRVSSFRRVREEEAAALADAVRAAGRGGG 13033
13032
RAFDLSGLIVAYASAVVSRAAFGDESARGMYGGADGGRAVRKAFSDFSHLFGTKPVSDYL 12853
12852
PWLGWVDTLRGRERKARRTFEALDGVLDKVIDDHRRRRDSGRRQTGDADAGHRDFVDVL 12676
12675
LDVNEMDNEAGIHLDAIEIKAIIM 12604
12529
DMFVAGSDATSKPMEWAMAELVSHPRHMRRLQDEIRAVVGGGRVTEDHVDKLPYLRAAL 12353
12352
KEALRLHAPLPLLVARETVADTEIMGYHVAARTRVVINGWAIGRDTAVWGETAEEFMPER 12173
12172
FLAGGNGGGAAAADYKVQGFEMLPFGGGRRGCPGVTFGMATVE 12044
12041
SAVASLLYHFDWEAAAADGKGGREGTPLLDMSETSGISMGLKHGLPLVAKPRFP 11880
aaaa01002288.1
$FI has no ortholog in nr or HTGS on 9/5/02
#33
>aaaa01000893.1
CYP71T8 (indica cultivar-group) Nterm 49% to AP003434.1
33765
MSSYVVVAAALLVFVVVVVAAIKNLGKGKLPPSPPSLPFVGHLHLVGELPH 33917
33918
RSLDALHRRYGSDGGLMFLRLGRAGALVVSTAAAAADLYRGHDLAFASRPPSHSAERLFY 34097
34098
GGRNMSFAPLGDAWRRTKKLAVAHLLSPRRARPRRRGR 34211
aaaa01000893.1
may not have an ortholog
#437
>aaaa01042159.1
CYP71T9 (indica cultivar-group) 60% to AP003434.1b
58% to
wheat AL821861 This seq may belong in another family or
subfamily
like CYP703
DIMGAATDTSFVTLEWIMTELIRNTQVMSKLQNEIIQVTGS
3
no
japonica ortholog found 9/12/02
#275 =
#377 reduce gene count by 1
>aaaa01010398.1
CYP71T10 (indica cultivar-group) not an exact match 71 like pseudogene fragment
75% to 71T5 in a small region
2295
LMYLVLLPDVNRSNRPERWEDSDGWQRLPP*PRRLPLLRYLHLLSVPLHQAFHPLPR 2465
2466
HMAWCCYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY* 2612
2613
S*ECRICVLFFRCIREEEVAVLVKHVRHPCR 2705
no
japonica ortholog found 9/10/02
#377
= #275 reduce gene count by 1
>aaaa01017833.1
CYP71T10 (indica cultivar-group) N-term pseudogene fragment
3692
LMYLVLLPDVNRSNRPERWEDGDGWQRLPP*PRRLPLLRYLHLLGAPLHQAFHPLPR 3862
3863
HMAWCYYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY* 4009
4010
S*ECRICVLFFRCIREEEVAVLVKHVRHPCR 4102
No
japonica ortholog found 9/11/02
#179
>aaaa01005635.1
$PI CYP71U1P (indica cultivar-group) one stop codon, one fs
62% to
AAAA01000843.1
8796
MDELSAGSLYLVVLGTLALALAFKRVLRGKETGVKLPPGPWNLPIIGSLHHLVGAHLPHRALLRVSR 8596
8595
RQGPLMLLRLGEVPAVVVSSPEAAMEFLRTRDPVFASRPRGALRSTSSASAVK 8437
8436
GSSWRHTASTGGRCARSAWWSCSAQGRCSGWSLSGRRRGGVPPRRGHRHDIA 8281
8280
CYSQHRHDPDASGAQ*RHHREGGVRRQVPTAGLRYLRVLKVVATLAGSFN 8131
8130
MVDLFPSSRLVRWLSCVERRLREHHAQTVRIVDSIIQDRKENEASASPGASAEDDDNDDL 7951
7950
LDVLLRLQREDNLTFPITAEIIGALIS (0) 7852
7202
DIFGAATDTTGSTLEWAMAELMRNPRTMEKAKQEVQNALGQGRAMVTGADIGDLHYLQMV 7023
7022
IKETLR (fs) 7005
7000
LHPSIPLIVRASEESTLVMGYDIPQGTNIFINAFAVARDPRYWKDADEFMPERFEKN 6830
6829
GDDIKATTVHMGFIPFGAGR (deletion of 18 aa heme signature region in seq gap) 6770
6669
NLLYHFDWTLINGESPESLDMGEVWGISIHRRSDLRLHAALSVSSGFLRHSDRDS* 6496
aaaa01005635.1
no ortholog in japonica on 9/7/02
#31
>aaaa01000843.1
$FI CYP71U2 (indica
cultivar-group) 63% to AAAA01005635.1 37% to 71B2
92% to AP004872.1 and AP005536.1 (00843 is best indica match for these seqs)
21357
MDELSIENHSPISMDELSFG
21297
SLCLVAMATLALALALMVVMGAHRRGGEKGATTGAKNLPPGPWNLPVTGSLHHLLGASP 21121
21120
PPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEGWVLKAHDPAFADRARSTTVDAV 20941
20940
SFGGKGIIFAPYGEHWRQARRVCLAELLSARQVRRLESIRQEEVSRLVGSIAGSSNA 20770
20769
AAVDMTRALAALTNDVIARAVFGGKCARQEEYLRELGVLTALVAGFSMADLFPSSRV 20599
20598
VRWLSRRTERRLRRSHAQMARIVGSIIEERKEKKASDDGVGAKDEDDDLLGVLLRLQEED 20419
20418
SLTSPLTAEVIGALVI (0) 20371
17791
DIFGAATDTTASTLEWVMVELMRNPRAMEKAQQEVRNTLGHEKGKLIGTDISELHYLRMV 17612
17611
IKETLRLHPSSALILRQS (fs) 17558
17558
QGNCRVMGYDIPQATPVLINTFAVARDAKYWDNAEEFKPERFENSGADIRTSTAHLGFVP 17379
17378
FGAGCRQCPGALFATTTLELILANLLYHFDWALPDGVSPESLDMSEVMGITLHRSSSLHL 17199
17198
HATLSRLGFVSHSGQ* 17151
aaaa01000843.1
$FI may not have an ortholog
#41
>AP004872.1
$F CYP71U3 (japonica
cultivar-group) chr 2 = AP005536.1
92% to aaaa01000843.1 (best match in indica) low percent for an ortholog
98325
MDELSIENHSPISMDELSFGSLCMVAMATLALALALMVMGAHRRGGEKGATTGAKNLPP 98149
98148
GPWNLPVIGSLHHLLGASPPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEVL 97978
97977
KARDPAFADRARSTTVDAVSFGGKGVIFAPYGEHWRHARRVCLAELLSARQVRRLESIRQ 97798
97797
EEVSRLVDSIIAGSSNAAAVDMTRALAALTNDVIARAVFGGKCARQEEYRRELGVLTTLV 97618
97617
AGYSMVDLFPSSRVVRWLSRRTERRLRRSHAEMARIVGSIIEERKEKKGSDAGVGAKDED 97438
97437
DDLLGVLLRLQEEDGLTSPLTAEVIAALV 97351
94360
XDIFGAATDTTASTLEWIMVELMRNPRAMDKAQQEVRNTLGHEKGKLIGIDISELHYLCMV 94181
94180
IKETLRLHPASALILRQSRENCRVMGYDIPQATPVLINTFAVARDPKYWDNAEEFKPE 94007
94006
RFENSGADIRTSIAHLGFIPFGAGCRQCPGALLATTTLELTLANLLYHFDWALPDGVSPK 93827
93826
SLDMSEVMGITLHRRSSLHLHTTLTRSGFFSHSGR 93722
#486
incorrectly labeled as #200
>aaaa01025826.1
CYP71V1 (indica cultivar-group) orth AC096855.1 $F chr 3 98%
604
YFFFLQSLLLCIAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHRALRD 783
784
LAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGWADILFS 963
964
PSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPV 1116
1672
DMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFHGKAVVMEADLQASNLRYL 1851
1852
KLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVNVWAIGRHPKYWDDAEEFK 2031
2032
PERFDDGAIDFMGGSYKFIPFGSGRRMCPGFNYGLASMELVLVAMLYHFDWSLPVGVKEV 2211
2212
DMEEAPGLGVRRRSPLLL 2265
>AC096855.1
$F CYP71V1 chromosome
3 clone OJ1365_D05 54% to AC087550 frameshift before PERF?
= AQ326032
AQ329780 73% to AF321860 Lolium rigidum similar to CYP71D sequences
87309
MDDYFFLQSLLLCVAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHR 87136
87135
AMRDLAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGW 86962
86961
ADILFSPSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPVNLS 86782
86781
VLFHSTTNDIVARAAFGRKRKSAPEFMAAIKAGVGLSSGFKIPDLFPTWTTALAAVTGMK 86602
86601
RSLRGIHKTVDAILQEIIDERRCVRGDKINNGGAADDQNADENLVDVLIALQEKGGF 86431 (1)
86339
GKSVTTPWVIVTHMICTLDVQDMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFH 86157
86156
RKAVVTEADLQASNLRYLKLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVN 85977
85976
VWAIGRDPK Y*Y*E DAEEFKPEQFDDDAIDFMGGSYEFIPFGSGRRMCPGFNYGLASMEL 85797
85796
VLVAMLYHFDWSLLVGVKEVDMEEAPGLGVRRRSPLLLCATPFVPAAVSADY* 85638
#200
>aaaa01006345.1
CYP71V2 (indica cultivar-group) 77% to AC096855.1 $F
no orth
9/15/02
10409
VLQLLKLLLVRHRRPRTPPGPWRLPVIGSMHHLVNVLPHRKLRELAAVHGPLMMLQLGET 10230
10229
PLVVATSKETARAVLKTHDTNFATRPRLLAGEIVGYEWADILFSPSGDYWRKLRQLCAAE 10050
10049
ILSPKRVLSFRHIREDE 9999
9730
VNLSVMFHSVTNSIVSRAAFGKKRKNAAEFLAAIKSGVGLASGFNIPDLFPT 9575
9574
WTGILATVTGMKRSLRAIYTTVDGILEEIIAERKGIRDEKISGGAENVDENLVDVLIGL 9398
9397
QGKGGFGFHLDNSKIKAIILQDMFA 9218
9217
GGTGTSASAMEWGMSELMRNPSVMKKLQAEIREVLRGKTTVTEADMQAGNLRYLKMVI 9044
9043
REALRLHPPAPLLVPRESIDVCELDGYTIPAKSRVIINAWAIGRDPKYWDNPEEFRPERF 8864
8863
EDGTLDFTGSNYEFIPFGSGRRMCPGFNYGLASMELMFTGLLYHFDWSLPEGVNEVDMAE 8684
8683
APGLGVRRRSPLMLCATPFVPVV 8615
#139
>aaaa01004037.1 CYP71V3 (indica cultivar-group)
ortholog to AL732378.3 99%
12867
LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARD 12697
12696
ILKTHDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHI 12517
12516
REDEVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIM 12337
12336
ASGFYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNL 12169
12168
VDVLLSLKDKGDFGFPITRDTIKAIVL 12088
11880
DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 11701
11700
VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNAWAISRDPRYWEDAEEFKPE 11521
11520
RFAEGGIDFYGSNYEYTPFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEV 11347
11346
DMTEAPGLGVRRKTPLLLCAAPYVASPI 11263
>AL732378.3
$F CYP71V3
MAWLDDVLSLCNNNTRMCNALVLSVVVVSFLQLLKHVLLTPSRLP
64951
LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARDILK 64772
64771
THDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHIRED 64592
64591
EVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIMASG 64412
64411
FYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNLVDVLLSL 64232
64231
KDKGDFGFPITRDTIKAIVL 64172
63964
DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 63785
63784
VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNSWAISRDPRYWEDAEEFKPE 63605
63604
RFAEGGIDFYGSNYEYTQFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEVDM 63425
63424
TEAPGLGVRRKTPLLLCAAPYVASHIYA* 63338
#193
>aaaa01006105.1a
$FI CYP71V4 (indica
cultivar-group) similar to Lolium rigidum AF321859
MDELLYRALLLSVLAVALLQIIEAFLIIIRAKPAAPPLPPGPWRLPVIGSMHHLAGKLPHRALRD
3196
LAAAHGPLMMLRLGETPLVVASSREMAREVLRTHDANFATRPRLLAGEVVLYGGADILF
3019
SPSGEYWRRLRQLCAAEVLGPKRVLSFRHIREQE
(0) 2914
MESQVEEIRAAGPSTP
VDLTAMFSFLVISNVSRASFGSKHRNAKKFLSAVKTGVTLASGFKIPDLFPTWRKVLAAV
1750
TGMRRALEDIHRVVDSTLEEVIEERRSAREDKARCGMVGTEENLVDVLIGLHEQGGC
1579
LSRNSIKSVIFDMFTAGTGTLSSTLGWGMSELMRSPMVMSKLQGEIREVFYGKATVGEED
1399
IQASRLTYLGLFIKETLRLHPPVPLLVPRESIDTCEIKGYMIPARSRIIVNAWAIGRDPR
1219
YWDDAEEFKPERFEKNIVDFTGSCYEYLPFGAGRRMCPGVAYGIPILEMALVQLLYHFDW
1039
SLPKGVVDVDMEESSGLGARRKTPLLLCATPFVVPVL*
925
aaaa01006105.1a no japonica ortholog found 9/7/02
#439
>aaaa01045745.1
CYP71V4 (indica cultivar-group) 64% to AC096855.1 $F
98% to
aaaa01006105.1a $FI 2 diffs
2
ESIDTCEIKGYMIPARSRIIVNAWAIGRDPRYWDDAEEFKPKRFEKNMVDFTGSCYEYLP 181
182
FGAGRRMCPGVAYGIPILEMALVQLLYHFDWSLPKGVVDVDMEESSGLGARRKTPLLL 355
no
japonica ortholog found 9/12/02
#194
>aaaa01006105.1b
$FI CYP71V5 (indica
cultivar-group)
6903
MDGLLYQALLLSALAVAVLQIVKLAVVNRGKKQAAAAAPTPPGPWRLPVIGSMHHLAGKLAHRALRD 6703
6702
LAAVHGPLMMLQLGETPLVVVSSREVAREVLRTHDANFATRPRLLAGEVVLYGGADILF 6526
6525
SPSGEYWRKLRQLCAAEVLGPKRVLSFRHIREQE (0) 6421
MASRVERIRAVGPSVP
5914
VDVSALFYDMAISIVSCASFGKKQRNADEYLSAIKTGISLASGFKIPDLFPTWRTVLAAV 5735
5734
TGMRRALENVHRIVDSTLEEVIEERRGAARECKGRLDMEDNEENLVDVLIKLHEQGG 5564
5563
HLSRNSIKSVIFDMFTAGTGTLASSLNWGMSELMRNPRVMTKLQGEIREAFHGKATV 5393
5392
GEGDIQVSNLSYLRLFIKETLRLHPPVPLLVPRESIDMCEVNGYTIPARSRIVVNAWAIG 5213
5212
RDPKYWDDPEEFKPERFEGNKVDFAGTSYEYLPFGAGRRICPGITYALPVLEIALVQLIY 5033
5032
HFNWSLPKGVTEVDMEEEPGLGARRMTPLLLCATPFVVPVL* 4907
aaaa01006105.1b
no japonica ortholog found 9/7/02
#195
>aaaa01006105.1c
$PI CYP71V6P (indica cultivar-group) 79% to AAAA01006105.1b
but no
Nterm exon present in 2000bp to end of clone
12642
(0) IASRVDLICAVGPLTL
12594
VDVSALFYDITISIASCASFGKKHRNVDEYLSSIKTRVSLASRFKIPDLFPSWRTMLAMV 12415
12414
TGMRRALEEVHGIVDSTLEDVIEERQGEKEDKTRPDMVDTKENLVDVLIGLHENGA 12247
12246
HLSRDSIKAVIFDMFTAGTGTLASALNWGMSKLMRNPRVMTKLQGEIRKAFHGKVTVG 12073
12072
EDDIQAANLPYIRLFIEETLLLHPVVPLLVPRESIDVCEVNGYTILARSRIVVNAWAIGR 11893
11892
DPKYWDNPEEFKPEWFEGNIVDFPGSSYEYLPFGAG*RMCPGIAYGLPVLEMALVQLLYH 11713
11712
FD*SLPNGVMKVDMEEEPGLGARRKTPLLLNLFVIPVLQGQQ* 11578
aaaa01006105.1c
no japonica ortholog found 9/7/02
#406
>aaaa01023722.1
$FI CYP71W1 (indica cultivar-group
71
MELTTLLLLALISFFFLVKLIARYASPSGRESALRLPPGPSQLPLIGSLHHLLLSRYGDL 250
251
PHRAMRELSLTYGPLMLLRLGAVPTLVVSSAEAAAEVMRAHDAAFAGRHLSATIDILSC 427
428
GGKDIIFGPYTERWRELRKVCALELFNHRRVLSFRPVREDEVGRLLRSVSAASAEGGA 601
602
ACFNLSERICRMTNDSVVRAAFGARCDHRDEFLHELDKAVRLTGGINLADLYPSSRLVRR 781
782
LSAATRDMARCQRNIYRIAESIIRDRDGAPPPERDEEDLLSVLLRLQRSGGLKF 943
944
ALTTEIISTVIF (0) 979
1125
DIFSAGSETSSTTLDWTMSELMKNPRILRKAQSEVRETFKGQDKLTEDDVAKLSYLQLVI 1304
1305
KETLRLHPPAPLLIPRECRETCQVMGYDVPKGTKVFVNVWKIGREGEYWGDGEIFRPERF 1484
1485
ENSTLDFRGADFEFIPFGAGRRMCPGIALGLANMELALASLLYHFDWELPDGIKSEELDM 1664
1665
TEVFGITVRRKSKLWLHAIPRVPYYSTY* 1751
no
japonica ortholog found 9/12/02
#330
>aaaa01013880.1b
CYP71W2 (indica cultivar-group) orth of AC120537.1a
stops are
even in the same location
4325
ARRAQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCAR 4507
gap
4868
ETFFNMDNLRTHDTYRKKNHSGNSQHCTASFIVFSFSELQLKMTIWQSHHYKLPINLRK 5044
5045
YFQQGARQLNDTLVGNI*ASEKYPQVMQKAQTEVREKFR 5161
G*DKLIKDDMNRLSYLHLVIQE
5226
5227
TLRLH 5241
>AC120537.1a
CYP71W2 chromosome 3 clone pseudogene fragment
2543
ARRVQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCA 2364
2363
RRDEFLHVQARGLRQARGRVQLGRPVPIVVASELAQRRAAVGRPSVAAGAFARCGRPAET 2184
2183
FFNMDNLRTHDTYRKKNHSGNSQHCTAFSALSFSELQLKMTIWQSHHYKLPINLREIFS
2006
SAGSETLNDTLVGNI*ANEKYPQVMQKAQTEVREKFRG*DKLIKDDMNRLSYLHL 1846
1845
VIQETLRLH
#329
>aaaa01013880.1a
$FI CYP71W3 (indica
cultivar-group) ortholog of AC120537.1b AQ869247.1
801 MEVSLPLLIGVVLAFLLLFVLVNVKNSCRSWWPPPEKEKKKLRLPPGPWRLPLVGSLHHVLLS
(fs) 989
991
RHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYLTPTLA 1170
1171
VLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHVREDEAARLVRSVAAECAG 1347
1348
RGGAAVVSVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLYPSSW 1527
1528
LARRLSCAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPL 1689
1690
TTDLITNVVL (0)
2513
DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEVMDKL 2671
2672
SYLRLVIRETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPE 2851
2852
VFKPERFENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRD 3031
3032
RNDEIDLSETFGITAKRKSKLMVYATQRIPCLG* 3133
>AC120537.1b
$F CYP71W3 chromosome
3 clone
AQ869247.1
nbeb0034D08r CUGI Rice BAC genomic Length = 447 53% to 99A1
AZ130570.1
OSJNBb0104D19r CUGI Rice BAC genomic Length = 327
80150
MEVSLPLLIGVVLAFLLLFVLVNIKNSCRSWWPPPEKEKKKLRLPPGPWQLPLVGSLHHV 80329
80330
LLSRHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYL 80503
80504
TPTLAVLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHGREDEAARLVRSVAA 80683
80684
ECAARGGAAVVNVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLY 80863
80864
PSSWLARRLSGAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPLTT 81043
81044
DLITNVVL (0) 81067
81865
DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEMMDKLSYLRLVI 82044
82045
RETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPEVFKPERF 82224
82225
ENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRDRNDEIDL 82404
82405
SETFGITAKRKSKLMVYATQRIPCLG 82482
#392
>aaaa01021177.1
$FI CYP71W4 (indica
cultivar-group) ortholog to AC120537.1c
AAAA01039974.1
(indica cultivar-group) Nterm 132 aa
396
MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERRLRLPPGPWRLPLVGSLHHVLLSR 217
216
HGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAEAAREVLKTHDACFASRHMTPTLAV 37
36 FTRGGRDILF 7
3930
SPYGDLWRQLRRICVLELFSARRVQSLRHVREDEAARLVRAVAEECAIGGGGGAVVPIGD 3751
3750
MMSRMVNDSVVRSAIGGRCARRDEFLRELEVSVRLTGGFNLADLYPSSSLARWLSGALRE 3571
3570
TEQCNRRVRAIMDDIIRERAAGKDDGDGEDDLLGVLLRLQKNGGVQCPLTTDM 3412
3411
IATVIM (0) 3394
2892
EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLSYLHLVI 2713
2712
RETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEIFKPERF 2533
2532
NANLVDFKGNYFEYIPFGSGRRVCPGITLGLTSMELVLASLLYYFDWELPGGKRCEEIDM 2353
2352
SEAFGITVRRKSKLVLHATPRVPCLH* 2272
>AC120537.1c
$F CYP71W4 chromosome
3 clone OSJNBb0042N11
AQ573952
nbxb0083G09r 60% to AQ259669 52% to 99A1 51% to 71B23
BM039053
clone V013G04.Length = 527
103769
MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERR
103880
LRLPPGPWRLPLVGSLHHVLLSRHGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAE 104056
104057
AAREVLKTHDACFASRHMTPTLAVFTRGGRDILF SPYGDLWRQLRRICVLELFSARRVQS 104236
104237
LRHVREDEAARLVRAVAEECAIGGGGGAVVPIGDMMSRMVNDSVVRSAIGGRCARRDEFL 104416
104417
RELEVSVRLTGGFNLADLYPSSSLARWLSGALRETEQCNRRVRAIMDDIIRERAAGKDDG 104596
104597
DGEDDLLGVLLRLQKNGGVQCPLTTDMIATVIM (0) 104695
105190
EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLS 105351
105352
YLHLVIRETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEI 105531
105532
FKPERFNANLVDFKGNDFEYIPFGSGRRVCPGITLGLTSMELVLASLLYHFDWELPGGKR 105711
105712
CEEIDMSEAFGITVRRKSKLVLHATPRVPCLH* 105810
#393
>AC078894.1
$P CYP71W5P chromosome 10 clone OSJNBa0096G08 12 unordered pieces starts 123
probable
pseudogene fragment 47% to 71B1 2 diffs with AP004175.1 $P
80% to
71W4
51119
LRRICVLELFSAHRV*SLHHVREEEAAPLVRVVADIRSPLGP 50994
#394
>AP004175.1
$P CYP71W6P chromosome 2 clone OJ1006_B12 pseudogene fragment
94% to
AC078894 51119-50994
64520
LRRICMLELFSAHRV*SLHHVREEEAARLVRVVA 64419
#311
>aaaa01012657.1
CYP71X1 (indica cultivar-group) orth AP003990.1g $P chr 2 99%
7170
FTPLFLLAVLPLKLTNGGDGV*LPPGPWRLPVIGSMHHLMGESLVHRAMADLARRLDAPL 7349
7350
MYLKLGEVPVVLASSPCAAREIMRVHDVAFASRP 7451
7490
RQLRKICVVELLSARRVRTFRRVREEE 7570
6081
DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 5908
5907
VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 5782
5782
AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCPGLAFAEAIMDLLFST 5603
5602
LLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPIL 5483
>AP003990.1g
$P CYP71X1 chromosome 2 clone OJ1073_F05 pseudogene
42530
MDHVLACVGILVAFTPLFLLAVLPLKLTNGGDGVKLPPGPWRLPVIGSMHHLMGESLVHRAMAD 42721
42722
LARRLDAPLMYLKLGEVPVVLASSPCAAREIMRAHDVAFASRPLSPTVRRMR 42877
42878
PPPPRRRQLRKICVVELLSARRVRTFRRVREEEVARLVGALVCLAHVA 43021 gap
AMIGARFERRDEFLE
missing mid
region
43862
DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 44035
44036
VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 44161 frameshift
44161
AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCSGLAFAEAIIDLLFS 44337
44338
TLLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPILRVPQTQTSSALLF* 44502
#222
>aaaa01007431.1b
CYP71X2 (indica cultivar-group) orth AP003990.1f
10785
MYDAVACVVAVVVVVIFAMLRVKLARSGDGGGGGGGGVR
10668
LPPGPWRLPVIGSLHHVVGDRLLHRSMARIARRLGDAPLVYLQLGEVPVVVASSPGAARE 10489
10488
VTRTHDLAFADRALNPTARRLRPGGAGVALAPYGALWRQLRKICVVELLSARRVRSFRRV 10309
10308
REEEAGRLVGALAAAAASPGEEAAVNFTERIAEAVSDAALRAMIGDRFERRDEFLQ 10141
10140
ELTEQMKLLGGFSLDDLFPSSWLASAIGGRARRAEANSRKLYELMDCAIRQHQQQRAE 9967
9966
AAVVDGGAGVEDDKNQDLIDVLLNIQKQGELETPLTMEQIKAVIL 9790
9595
DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 9422
9421
IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNVWAIGRDPKYWDDAEEFRPE 9242
9241
RFEHSTVDFKGVDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMVASEL 9062
9061
DMTEEMGITVRRKNDLHLRPXXXXXXXXXXXXXXXRERERHFV 8936
>AP003990.1f
$F CYP71X2 chromosome
2 clone OJ1073_F05
38238
MYDAVACVVAVVVVVVFAMLWVKLARSGDGGGGGSGGVRLPPGPWRLPVIGSLHHVVGDRLLHRSMA 38438
38439
RIARRLGDAPLVYLQLGEVPVVVASSPGAAREVTRTHDLAFADRALNPTARRLRPGGAGV 38618
38619
ALAPYGALWRQLRKICVVELLSARRVRSFRRVREEEAGRLVGALAAAAASPGEEA 38783
38784
AVNFTERIAEAVSDAALRAMIGDRFERRDEFLQELTEQMKLLGGFSLDDLFPSSWLASAI 38963
38964
GGRARRAEANSRKLYELMDCAIRQHQQQRAEAAVVDGGAGVEDDKNQDLIDVLLNIQKQG 39143
39144
ELETPLTMEQIKAVIL 39191 (0)
39428
DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 39601
39602
IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNAWAIGRDPKYWDDAEEFRPE 39781
39782
RFEHSTVDFKGIDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMAASE 39958
39959
LDMTEEMGITVRRKNDLHLRPHPPCVVRSNFRSFVERERERHFV* 40093
#221
>aaaa01007431.1a
CYP71X3 (indica cultivar-group) orth AP003990.1e $F chr 2 99%
lower case
does not match japonica seq, but matches seq b
4252
NLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADAAR 4431
4432
EIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSFHG 4611
4612
VREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERREDFLEV 4773
4774
LPEIVKLASGFSLDDLFPSSwlagaiggsrrRGEAVNRASYELVDSAFRQRQQQKEAM 4947
4948
AAPPPDIAKEEEDDLMDELIRIHKEGSLEVPLTAGNLKAVI
5070
5313
ELFCAGSETSSNAIQWAMSELVRNPRVMEKAQNEVRSILKGKPTVTEADMVDLTY 5486
5487
VKMIVKETHRLHPVLPLLTPRVC*QTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 5666
5667
KPERFEDSEIDLKGTNYEFIPYGAGRRICPGLALAQVSIEFILTTLLYHFNWELPKGAAP 5846
5847
KELDMTEDMGLTIRRKNDLYLLPTL 5921
>AP003990.1e
$F CYP71X3 chromosome
2 clone OJ1073_F05
33613
MEQVSCFAAAAAAVLVVLSLARMLLAPRREWD
33709
GLNLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADA 33888
33889
AREIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSF 34068
34069
HGVREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERRE 34221
34222
DFLEVLPEIVKLASGFSLDDLFPSS 34296 check joint
GSPAPSAARGEAVNRASYELVDSAFRQRQQQKEAMAAPPPDIAKEE
EDDLMDELIRIHKEGSLEVPLTAGNLKAVIL 34528 (0)
34777
ELFCAGSETSSNAIQWAMSELVRNPKVMEKAQNEVRSILKGKPTVTEADMVDLTY 34941
34942
VKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFIKSWAIMRDPKHWDDAETF 35121
35122
KPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILTMLLYHFNWELPNGAA 35298
35299
PEELDMTEDMGLTIRRKNDLYLLPTLRVPLTA* 35397
#221
>aaaa01070587.1
CYP71X3 (indica cultivar-group) orth AP003990.1e $F chr 2 96%
see
aaaa01007431.1a for ortholog
84
YQVKVSHMLHFGIV*ELFCAGSETSSNAIQWAMSELVRNPRVMEKAQNEVRSILKGKP 257
258
TVTEANMVDLTYVKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFINSWTIM 437
438
RDPKHWDDAETFKPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILATLLY 617
618
HFNWELPNGAAPKELDMTEDMGLTIRRKNDLYLLPTL 728
#376
>AP003990.1d
$F CYP71X4 chromosome
2 clone OJ1073_F05 no ortholog
27391
MEQVSCFAAAAAVVVVVLLLARMLLAPRGEWDGLNLPPSPPRLPFIGSFHLLRRSPLVHRALADVARQL 27597
27598
GSPPLMYMRIGELPAIVVSSADAAREVMKTHDIKFASRPWPPTIRKLRAQGKGIFFEPYG 27777
27778
ALWRQLRKICIVKLLSVRRVSSFHGVREEEAGRLVAAVAATPPGQAVNLTE 27930
27931
RIEVVIADTTMRPMIGERFERREDFLELLPEIVKIASGFSLDDLFPSSWLACAIGGSQRR 28110
28111
GEASHRTSYELVDSAFRQRQQQREAMAASPPDIAKEEEDDLMDELIRIHKEGSLEVPLTA 28290
28291
GNLKAVIL 28314 (0)
28577
DLFGAGSETSSDALQWAMSELMRNPRVMEKAQNEVQSILKGKPSVTEADVANLKY 28741
28742
LKMIVKETHRLHPVLPLLIPRECQQTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 28921
28922
KPERFEDGEIDLKGTNYEFTPFGAGRRICPGLALAQASIEFMLATLLYHFDWELPNRAA 29098
29099
PEELDMTEEMGITIRRKKDLYLLPTLRVPLTA* 29197
#375
>aaaa01017763.1
CYP71X5 (indica cultivar-group) orth AP003990.1c $F chr 2 97%
1306
FLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 1485
1486
APLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSPTTRRLRCDGEGVVFATYGAL 1665
1666
WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERITAVITDAT 1842
1843
MRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAEANH 2007
2008
RRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGNI 2181
2182
KAIIL
2562
DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKY 2735
2736
LKLVIKETLRLHPVLPLLLPRECQEACNVIGYDVPKYTTVFINVWAINRDPKYWDMAEMF 2915
2916
KPERFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYHFDWELPSGMSP 3095
3096
EELDMTEDMGLSVRRKNDLYLHPTV 3170
>AP003990.1c
$F CYP71X5 chromosome
2 clone OJ1073_F05 one in frame stop at W
AQ259669
61% identical to AQ328148 53% to 76C2 also has stop at W
AQ690680.1
nbxb0082B18f CUGI Rice BAC genomic clone Length = 768
AQ579195
nbxb0084A11f AQ509836 nbxb0094K16f 72% identical to AQ259671
16507
MEKVAWCACFLLLALMVVRLTAKRRGDNGAERLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 16710
16711
APLMSLRLGEVPVVVASSADAAREIMRTHDVAFATRPWNPTTRRLRCDGEGVVFATYGAL 16890
16891
WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERI 17043
17044
TAVITDATMRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAE 17223
17224
ANHRRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGN 17403
17404
IKAIIL 17421 (0)
17795
DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKYLKL 17968
17969
VIKETLRLHPVLPLLLPRECREACNVIGYDVPKYTTVFINV*AINRDPKYWDMAEMFKPE 18148
18149
RFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYYFDWELPSGMSPEE 18325
18326
LDMTEDMGLSVRRKNDLYLHPTVCVPL* 18409
#75
>aaaa01002047.1c
$PI CYP71X6 (indica cultivar-group) ortholog of AP003990.1b 99%
20381
MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRT 20560
20561
MADLARRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSEGVG 20740
20741
LVFAPYGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNVSERI 20920
20921
AALVSDAAVRTIIG 20962
20979
VAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMA 21125 aa 452 out of sequence
21126
DLFPSSRLASFIGGTTRRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDI 21299
21300
VDVLLRIQKEGSLQVPLTMGNIKAVVL 21380
22004
DLFSAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKLII 22183
22184
KETLRLHPVVPLLLPRECQETCKVMDYDIPIGTIVLVNVWVIGRDPKYWD 22333
22334
DAKTFRLERFEDGHVDFKGMNFEYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFP 22513
22514
DGILPAKMDMMEVMGSTV*KKNDLYLVPNAHVPVAP 22621
>AP003990.1b
$P CYP71X6 chromosome 2 clone OJ1073_F05
11020
MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLA 11214
11215
RRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 11394
11395
YGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNV 11547
11548
SERIAALVSDAAVRTIIGDRFERRDEFLEGLAEGIKITSGFSLGDLFPSSRLASFIGGTT 11727
11728
RRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDIVDVLLRIQKEGSLQVPLT 11907
11908
MGNIKAVVL 11934 (0)
12556
DLFGAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKL 12729
12730
IIKETLRLHPVVPLLLPRE 12786 frameshift
CQETCKVMDYDVPIGTIVLVNMWVIGRDPKYWEDAKTFRPERFEDGHIDFKGMNF 12955
12956
EYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFPDGISPAKMDMMEVMGSTVRKKN 13135
13136
DLYLVPNAHVPVAP* 13180
note
cluster continues on AP003990.1 to sequence j
#74
>aaaa01002047.1b
$FI CYP71X7 (indica
cultivar-group) ortholog of AP003990.1a 99%
16718
MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 16897
16898
LVHRTMAGLARGLGDAPLLSLRLGEVPVVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 17077
17078
MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAATRRPG 17257
17258
EAAVNVGERLTVLITDIAMRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPSSRLAS 17437
17438
FVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRIQKEGG 17617
17618
LEVPLTMGVIKGVIR 17662
17911
DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKYLK 18081
18082
LVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETFIP 18261
18262
ERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVAPSN 18441
18442
LDMEEEMGITIRRKNDLYLVPKVHVPL 18522
>AP003990.1a
$F CYP71X7 chromosome
2 clone OJ1073_F05 42% to 71B24
AQ259671
323-379 region I-helix 55% to 71B4
AQ691116.1
nbxb0088K01f CUGI Rice BAC genomic clone Length = 544
7359
MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 7538
7539
LVHRTMAGLARGLGDAPLLSLRLGEVPIVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 7718
7719
MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAA 7883
7884
TRRPGEAAVNVGERLTVLITDIAVRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPS 8063
8064
SRLASFVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRI 8243
8244
QKEGGLEVPLTMGVIKGVIR 8303 (0)
8551
DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKY 8715
8716
LKLVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETF 8895
8896
IPERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVA 9072
9073
PSNLDMEEEMGITIRRKNDLYLVPKVRVPL* 9165
#72
>aaaa01002047.1a
$FI CYP71X8 (indica cultivar-group)
ortholog
of AP004000.1a 99%
2682
MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMG 2861
2862
GPLVHRTMADLARRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWSSTIRV 3041
3042
LMSDGVGLVFAPYGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQP 3221
3222
VNVSERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVG 3401
3402
GTTRRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGN 3581
3582
IKAVVL 3599
4057
ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKLII 4236
4237
KETLRLHPVVPLLLPRECRETCEVMGYDIPIGTIVLVNVWAIGRDPKYWEDAETFIPERF 4416
4417
EDGHIDFKGTNFEFIPFGAGRRMCPGMVFAEVIMELALASLLYHFDWELPDGISPTKVDM 4596
4597
MEELGATIRRKNDLYLIPAVRVPLSTVL 4680
>AP004000.1a
$F CYP71X8 chromosome
2 clone OJ1115_B01
95316
MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLA 95101
95100
RRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWTSTIRVLMSDGVGLVFAP 94921
94920
YGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQPVNV 94768
94767
SERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVGGTT 94588
94587
RRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGNIKA 94408
94407
VVL 94399 (0)
93945
ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKL 93772
93771
IIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTVLVNVWAIGRDPKYWEDAETFIPE 93592
93591
RFEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTK 93415
93414
VDMMEELGATIRRKNDLYLIPTVRVPLSTVL* 93319
#72
>aaaa01027906.1
CYP71X8 (indica cultivar-group) orth AP004000.1a $F chr 2 99%
see
aaaa01002047.1a for ortholog
1930
LFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKLI 1757
1756
IKETLRLHPVVPLLLPRECRETCEVMGYDIPIGITVLVNVWAIGRDPKYWEDAETFIPER 1577
1576
FEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTKMD 1397
1396
MMEELGATIRRKNDLYLIPAV 1334
#72
>aaaa01012191.1
CYP71X8 (indica cultivar-group) orth AP004000.1a $F chr 2 98%
see
aaaa01002047.1a for ortholog
762
KPRLPPGPWRLPVIGNLHQIMVGGPLVHRTMADLARRLDAPLMSLRLGELRVVVLYYRFI 583
582
*IPALSPFYLATRPWSSTIRVLMSDGVGLVFAPYGALWRQLRKIAVVELLSARRVQSF 409
408
RRIREDEVGRLVAAVAAAAAASAAQPVNVSERIAALISDSAVRTIIGDRFERRDEF 241
240
LEGLAEGIKITSGFSLGDLFPS 175
411
TARRKADLHLRPCL 370
#73
>aaaa01030108.1
CYP71X9P orth of AP004000.1b
1684
RVIASSTGAACREFTETHDVKFATRPWSSTVRVLMADGLG 1565
1556
GLVFAPYGALWRQLRKIAMVELLSARRVQSHRRYRRRGDAAR 1431
>AP004000.1b
$P CYP71X9P chromosome 2 clone OJ1115_B01 pseudogene fragment
3 aa diffs
with AAAA01030108.1
101503
RVVASSTDAACREFTKTHDVKFATRPWSSTVRVLMADGLG 101393
#414
>aaaa01025401.1
CYP71X10 (indica cultivar-group) orth AP004000.1c $F chr 2 98%
2324
KPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLARRLDAPLMSLRLGELRVVVASSADA 2145
2144
AREITKTHDVAFATRPWSPTIRVLMSDGVGLVFAPYGALWRQLRKIAMVELLSARRVQSF 1965
1964
RRIREDEVGRLVADVAAAQPGEAVNVSERITALISDSAVRTIMGDRFEK 1818
917
LFGAGSETSASTLHWAMTELIMNPKVI 837
832
DELSNVIKGKQTISEDDLVELRYLKLVIKETLRLHPVVPLLLPRECRETCEVMGY 659
658
DIPIGTTMLVNVWAIGRDPKYWEDAETFRPERFEDGHIDFKGTDFEFIPFGAGRRKCPGM 479
478
AFAEAIMELVLASLLYHFDWELPDGISPTKVDMMEELGATIRKKNDLYLVPTV 320
>AP004000.1c
$F CYP71X10 chromosome
2 clone OJ1115_B01
109813
MAMVQYVTGYLCLLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRL
PVIGNLHQVAMGGPLVHRTMADLA 109595
109594
RRHDAPLMSLRLGELRVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 109415
109414
YGALWRQLRKIAVVELLSARRVQSFRRIREDEVCRLVAAVAAAQPGEAVNV 109262
109261
SERITALISDSAVRTIMGDRFEKRDEFLEGLAEGDRIASGFSLGDLFPSSRLASFVGGTT 109082
109081
RRAEANHRKNFGLIECALRQHEERRAAGAVDDDEDLVDVLLRVQKEGSLQVPLTMGNIKAVIL 108893 (0)
107479
ELFGAGSETSASTLHWAMTELIMNPKVMLKAQDELSNVIKGKQTISEDDLVELRYLKL 107306
107305
VIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTMLVNVWAIGRDPKYWEDAETFRPE 107126
107125
RFEDGHIDFKGTDFEFIPFGAGRRMCPGMAFAEAIMELVLASLLYHFDWELPDGISPTK 106949
106948
VDMMEELGATIRKKNDLYLVPTVRVPMSTAL* 106853
#106
>aaaa01002996.1a
$FI CYP71X11 (indica
cultivar-group) ortholog of AP004000.1d >99%
13279
MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPL 13458
13459
VHRALADLARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTA 13638
13639
DGEGLVFAPYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 13818
13819
NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 13998
13999
AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 14178
14179
TMGIIKAVIL 14208
14345
DLFSAGSETSATTIQWAMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLADLNYLKLII 14524
14525
KETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVFVNAWAIGRDPKYWDDPEEFKPERF 14704
14705
EDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSELDM 14884
14885
TEEMGITVRRKNDLYLHAVVRVPLHATTP 14971
>AP004000.1d
$F CYP71X11 chromosome
2 clone OJ1115_B01
123850
MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPLVHRALAD 124050
124051
LARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTADGEGLVF 124230
124231
APYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 124389
124390
NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 124569
124570
AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 124749
124750
TMGIIKAVIL 124779 (0)
124917
DLFSAGSETSATTIQW AMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLTDLNYLKL 125090
125091
IIKETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVLVNAWAIGRDPKYWDDPEEFKPE 125270
125271
RFEDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSE 125447
125448
LDMTEEMGITVRRKNDLYLHAVVRVPLHATTP* 125546
#107
>aaaa01002996.1b
$FI CYP71X12 (indica
cultivar-group) ortholog of AP004000.1e 99%
16497
MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPQ 16676
16677
VHRAMADLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMA 16856
16857
DGKGLTFARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAVV 17036
17037
NVSERAAVLVTDTTVRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFP 17186
17187
SSRLASLVSGTARRAAASHRKMFELMDCAIRHHQERKAAMDADEDILDVLLRMQKEGGHD 17366
17367
APLTMGDVKDTIL 17405
17534
DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKLVI 17713
17714
KETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPDRC 17893
17894
ENNKYDFRGTDFEYIPFGSRRKICPCPAFTHAILELALAALLYHFDWELPCGVAQ 18058 frameshift
18055
SGEVDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT 18174
>AP004000.1e
$F CYP71X12 chromosome
2 clone OJ1115_B01
AP004066.1
chromosome 2 clone OJ1572_F02, 55% to 71B17 aa 342-511 runs off beginning
contig of
AA751324 and AQ327456 54% IDENTICAL TO 71B24 1/98 K-HELIX
58%
identical to AQ328148
127153
MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPHVHRAMA 127350
127351
DLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMADGEGLA 127530
127531
FARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAV 127689
127690
VNVSERAAVLVTDTX 127731 frameshift
127734
VRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFPSSRLASLVSGTARRAAASHRKMFE 127913
127914
LMDCAIRHHQERKAAMDADEDILDVLLRIQKEGGHDAPLTMGDVKDTIL 128060 (0)
128189
DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKL 128362
128363
VIKETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPD 128542
128543
RCENNKYNFRGTDFEYIPFGSRRKICPGPAFTHAILELALAALLYHFDWELPCGVAPGE 128719
128720
VDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT* 128833
#96
>aaaa01002645.1a
$PI CYP71X13P (indica cultivar-group) sequence gap at 950 sequence
similarity
stops at 236 (80% identical to AAAA01002645.1b)
ortholog
to AP005385.1b $P 99%
949
PLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRLRPHREGVVFATYGAM 773
772
WRQLRKVCIVEMLSARRVRSFRRVREEEAASLAAAVAASLSSPPARRDAVNVSALVALAV 593
592
ADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFPSSRIAAAVGGMTRRAEASHR 413
412
KGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLLRIQKEGALDMPLTMDNIKAVI 236
>AP005385.1b
$P CYP71X13P (japonica cultivar-group) chr 2 = aaaa01012992.1
146254
MDQVACWSICAFLALLLLVRIGGKRGRGGDGARLRQPPPGPWRLPVIGNLHQLMLRGP 146427
146428
LVHRTMADLARGLDDAPLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRL 146607
146608
RPHREGVVFAPYGAMWRQLRKVCIVEMLSARRVRSFRRVREEEAANLAAAVAASLSSPPA 146787
146788
RRDAVNVSALVAAAVADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFP 146952
146953
SSRIAAAVGGMTRRAEASHRKGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLL 147126
147127
RIQKEGALDMPLTMDNIKAVI 147189
147557
DIFGAGSDTSSNIIQW
FS and
Small deletion 19aa
147612
RNTLQGKHPVKEDDLVNIKYLKLIIKETLRLHPVVPLLLPRECLHACKVMGYDVPKGTTV 147791
147792
FVNIWAINRDPKHWDDPEVFKPERFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVE 147971
147972
LMLATLLYHFKWELLEGVAPNELDMTEEIGINVGRKNPLWLCPIVRVPLQ* 148124
#96
>aaaa01012992.1
CYP71X13P (indica cultivar-group) 80% to AAAA01002645.1b
runs off
end of clone (partialI) orth of AP005385.1b
see
aaaa01002645.1a for ortholog
175
DIFGAGSDTSSNIIQWAMSELMRNPKVMQKAQVELRNTLQGKHPVKEDDLVNIKYLKL 348
349
IIKETLRLHPMVPLLLPRECLHACKVMGYDVPKGTTVFVNIWAINRDPKHWDDPEVFKPE 528
529
RFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVELMLATLLYHFKWELLEGVAPNEL 708
709
DMT 717 (fs)
717
EEIGINVGRKNPLWLCPIVRVPLQ* 791
#97
>aaaa01002645.1b
$FI CYP71X14 (indica
cultivar-group) no introns 40% to 71B23
ortholog
to AP005385.1a $F 99%
2048
MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTMADLA 2239
2240
RGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREGVVF 2416
2417
APYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGAAP 2593
2594
AVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAAAV 2773
2774
GGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKEDNL 2953
2954
DVPLTTGNIKAVLLDIF 3133
3134
GARSDTSSHMVQWVLSELMRNPEAMHKAQTELRSTLQGKQMVSEDDFASLTYLKLVIKET 3313
3314
LRLHPMVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFHSG 3493
3494
KIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMTEE 3673
3674
MGITVGRKNALYLHPIVRVSLEQASMS* 3757
>AP005385.1a
CYP71X14 (japonica cultivar-group) chr 2
142591
MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTM 142770
142771
ADLARGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREG 142950
142951
VVFAPYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGA 143130
143131
APAVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAA 143310
143311
AVGGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKED 143490
143491
NLDVPLTTGNIKAVLL (0)
143668
DIFGAGSDTSSHMVQWVLSELMRNPEAMHKAQIELRSTLQGKQMVSEDDLASLTYLKLVIK 143850
143851
ETLRLHPVVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFH 144030
144031
SGKIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMT 144210
144211
EEMGITVGRKNALYLHPIVRVPLEQATMS 144297
#237
>aaaa01008333.1a
$FI CYP71X15 (indica cultivar-group) very similar to AP003990.1
6914
MAMVQDATGYLSLFLALLSITLVLHKVARKASGDGAGKPRLPPGPWRLPVIGNLHQIAMGG 6732
6731
PLVHRTMADLARRHDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTVRVL 6552
6551
MSDGVGLVFAPYGALWRQLRKIAMVELLSARRVQSFRGIREDEVGRLVAAVAAASAAQ 6378
6377
PGEAVNVSERIAVLIA DSVVRALMGDRFDRRDEFLDQLAERVKITSGFSLGDLFPSSRL 6201
6200
ASFIGGTTRRAEANHRKNFELIECALRQHEERRAARAGAAAAGAVDDDEDLVDVLLRIQK 6021
6020
EGKLEVPLTMGNINAVIY (0) 5967
5378
DLFGAGSETSANTLQWVMSELILNPRVMLKLQAELRGILQGKQRVTEDDLVELKYLKLVI 5199
5198
KETLRLHPVVPLLLARECQDTCKIMGYDIPVGTIVFVNVWVICRESKYWKDAETFRPERF 5019
5018
ENVCVDFKGTHFEYIPFGAGRRMCP 4944
4945
PGVAFAEASMELVLASLLYHFDWKLPNDILPTKLDMTEEMGLSIRRKNDLYLIPTICVPPLAA* 4754
no
japonica ortholog on 9/7/02
#238
>aaaa01008333.1b
CYP71X16 (indica cultivar-group) runs off end of clone (partialI)
like
AP004000 exon 2
11117
EMFGAGSETSANTLQWLMSELILNPRVMSKAQVELSDTLRGKQTVTEDDLAGLKYLKLII 10938
10937
KENLRLHPVVPLLLPRECQKTCKVMMYDVPVGTTVLVNVWSINRDPKYWEDPETFKPERF 10758
10757
EDGHIDFKGTDFEFIPFGAGRRMCPGITFAEAIMELALASLLYHFDWKLLGNGISSTKLD 10578
10577
MTEELGATVRRKNDLYLVPTIRVPLPADS* 10488
no
japonica ortholog on 9/7/02
#68
>aaaa01001712.1
$PI CYP71X17P (indica cultivar-group) missing C-terminal exon
not found
in 20000bp of seq.
6693
MAMAQDVTGYLCLFVALLVLLKVVRKASGNGAAGRLRLPPGPWRLPVIGNLHQVAMGG 6866
6867
PLVHRTMADMARRLDAPLMSLRLGEIPVVVASSADAAREITKTHDVAFATRPLSSTIRVM 7046
7047
VSDGEGLVFTPYGALWRRLRKIAMLELLSARRVQSFRRVREEEVGRLVAAVAAAAAAR 7220
7221
PGEAVNLSQLIAELISDTAARTIIGDRFEKRQELLEGLTEGIRISSGFSLGDLFPSSRL 7397
7398
ANLIGGTTRRAEANHRKNLALIECALRQHEERRAAGDEEDDEDLVDVLLRVQKEGG 7565
7566
GEVPLTMGNVKVVIR (0) 7610
aaaa01001712.1
$PI no ortholog yet, no match in nr or HTGS 9/5/02
#413
>aaaa01025223.1
CYP71Y1 (indica cultivar-group) orth? AP003571.1g $F chr 6 95%
1835
QRLPPGPWMLPAIGSLHHLAGKLPHRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREV 1656
1655
MKTHDTAFATRPLSATLRVLTNGGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIR 1476
1475
EEEVAAVLRAVAVAAGTVEMRAALSALVSDITARTVFGNRCKDRGEFLFL 1326
1325
LDRTIEFAGGFNPADLWPSSRLAGRLSGVVRRAEECRNSVYKILDGIIQEHQER 1164
1163
TGAGGEDLVDVLLRIQKEGELQFPLAMDDIKSII 1062
991
QDIFSAGSETSATTLAWAMAELIRNPTAMHKATPEVRRAFAAAGAVSEDALGELPYLH 818
817
LVIRETLRLHPPLPLLLPRECREPCRVLGYDVPRGTQVLVNAWAIGRDERCWPGGSPEEF 638
637
RPERF 623
588
RGADFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFDWEVPGLADPAKLDMTEAFGI 412
411
TARRKADLHLRPCL 370
>AP003571.1g
$F CYP71Y1 chromosome
6 clone P0458E02
139036
MEDATHGYVYVGLALVSLFVVLLARRRRSPPPAAHGDGGLRLPPGPWTLPIIGSLHHLVGQIP 139224
139225
HRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREVTKTHDTAFAMRPLSATLRVLTN 139398
139399
GGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIREEEVAALLRAVAVA 139554
139555
AGTVEMRAALSALVSDITARTVFDNRCKDRGEF
LVLLERTIEFAGGFNPADLWPS 139719 (?) bad exon boundary
140222
SRLAGRLSSVVRRAEECRNSVYKILDGIIQEHQERTSAGGEDLVDVLLRIQKEGG 140386
140387
LQFPLAMDDIKSIIF 140428 (0)
DIFSAGSETSATTLAWAMAELIRNPTAMHKVMAEVRRAFAAAGAVSEDALGE 140655
140656
LRYLQLVIRETLRLHPPLPLLLPRECREPCRVLGYDVTRGTQVLVNAWAIGLDERYWPGG 140835
140836
SPEEFRPERFEDGEATAAVDFRGTDFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFD 141015
141016
WEVPGLADPAKLDMTEAFGITARRKADLHLRPCLLVSVPGV* 141141
#474
>aaaa01101459.1
CYP71Y2P (indica cultivar-group) orth AP003571.1f $P chr 6 97%
463
LDFRGADFELLPFGXARRMCPGMAFGLANVELPLSSLLFHFDWEVPGMADPTKLDMTEAF 284
283
GITSRRKENLHLRPLL 236
>AP003571.1f
$P CYP71Y2P chromosome 6 clone P0458E02 pseudogene fragment
136961
VSEDALGELRYLQLVIRETLRLHPPLPLLLPRECTIGR 137074
137075
DERYWPGGSPEEFRPERFDDGEATAAVDFRGADFELLPFGGGRRMCPGMAFGLANVELPL 137254
137255
SSLLFHFDWEVPGMADPTKLDMTEAFGITSRRKENLHLRPLLRVSVPG 137398
#215
>aaaa01007286.1
CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6 100%
4839
YFYLGLALASLLVVLFARRRRSAAHGDGGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDL 4660
4659
ARRHGPVMMLRLGEVPTLVVSSRDAAREVMRAHDAAFASRPLSATVRVLTSGGRGIIFAP 4480
4479
YGGSWRQLRKIAVTELLTARRVASFRAIREEEVAAMLRAVAAAAAAGRAVELRAALSALV 4300
4299
AETTVRAVIGDRCKDRDVFLRKLQRTIELSAGFNPADLWPSSRLAGRLGG 4150
4149
AVREAEECHDTVYGILDGIIQEHMERTSSGSCGAGDGDGDGEDLLDVLL 4003
>AP003571.1e
$F CYP71Y3 chromosome
6 clone P0458E02
128926
MADDYFYLGLALASLLVVLFARRRRSAAHGDGGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 129120
129121
RRHGPVMMLRLGEVPTLVVSSRDAAREVMRAHDAAFASRPLSATVRVLTSGGRGIIFAP 129297
129298
YGGSWRQLRKIAVTELLTARRVASFRAIREEEVAAMLRAVAAAAAAGRAVELRAA 129462
129463
LSALVAETTVRAVIGDRCKDRDVFLRKLQRTIELSAGFNPADLWPSSRLAGRLGGA 129630
129631
VREAEECHDTVYGILDGIIQEHMERTSSGSCGAGDGDGDGEDLLDVLLRIQKEGGLEFPV 129810
129811
DMLAIKQVIF 129837 (0)
132964
DIFGAGSETSATTLEWVMAELIRNPKAMRKATAEVRRAFAADGVVLESALGKLHYMHLVI 133143
133144
RETFRLHTPLPLLLPRECREPCRVLGYDVPRGTQVLVNVWAIGRDERYWPGGSPEEFRPE 133323
133324
RFEDGEAAAAVDFRGADFELLPFGAGRRMCPGLAFGLANVELALASLLFHFDWEAPDVAD 133503
133504
PAEFDMTEGFGITARRKADLPLRPTLRVPVLVSVG* 133611
#215
>aaaa01048884.1
CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6
97% 3
diffs see aaaa01007286.1 for ortholog
1024
VRLHTPLPLLLPRECREPCRVLGYDVPRGSQVLVNVWAIGRDERYWPGGSPEEFRPERFE 845
844
DGEAAAAVDLRGADFELLPFGAGRRMCPGLAFGLANVELALASLLFHFDWEAPDVADPAE 665
664 FDMTEGFGITARRKANLPLRPTL 596
#215
>aaaa01040160.1
CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6 96%
see
aaaa01007286.1 for ortholog
667
MQDIFGAGSETSATTLEWVMAELIRNPKAMRKATAEVRRAFAANGVVSESALGKLHYL 840
841
HLVIRETFRLHTPLPLLLPRECREPCRVLGYDVPRGSQVLVNVWAIGRDERYWPGGSPEE 1020
1021FRPERFED
1044
1064
LDFRGADFELLPFGAGRWMCPGFGVRARQRG 1156
#301
>aaaa01012291.1
CYP71Y4 (indica cultivar-group) orth AP003571.1d $F chr 6 99%
6215
DAGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLARRHGPVMMLRLGEVPTLVVSSRDAA 6394
6395
REVMRTHDAAFASRPLSASVRAATKGGRDIAFAPYGDYWRQLRKIAVTELLSARRVLSFR 6574
6575
PIREEEVGRSPATLQPGQHAASGRTVELRAALCALVADSTVRAVVGERCAGLDVF 6739
6740
LRQLDRAIELAAGLNVADLWPSSRLAGRPSQRRRAPGREVRDTMFGVLDGII 6895
6896
QAHLEKTGGAGEDILDVLLRIHKEGGLEFPLDMDAVKCV
DVISGGCETSATTLGWAFAELIRNPAAMK 7249
7250
KATAEVRRDFEAAGAVSESALSVGELPYLRLVVRETLRLHPPLPLLLPRECREPCRVLGY 7429
7430
DVPRGAQVLVNAWAIGRDERYWPGGSPEEFRPERFGDGEAAAAVDFKGADFELLPFGGGR 7609
7610
RMCPGMAFGLANVELPLASLLFHFDWEASGVADPTEFDMTEAFGITARRKANLLLRPIL 7786
>AP003571.1d
$F CYP71Y4 chromosome 6 clone P0458E02
118418
MADGYFYLGLALVSLLVVLFARRRRSAAAAHGDAGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 118618
118619
RRHGPVMMLRLGEVPTLVVSSRDAAREVMRTHDAAFASRPLSASVRAATKGGRDIAFAP 118795
118796
YGDYWRQLRKIAVTELLSARRVLSFRPIREEEVAATLRAVAAAAADGRTVELRAA 118960
118961
LCALVADSTVRAVVGERCAGLDVFLRQLDRAIELAAGLNVADLWPSSRLAGRLSGAVRQ 119137
119138
AERCRDTMFGVLDGIIQAHLEKTGGAGEDILDVLLRIHKEGGLEFPLDMDA 119290
119291 VKCVVV 119308 (0)
119451
DVISGGCETSATTLGWAFAELIRNPAAMKKATAEVRRDFEAAGAVSESALAVGELPYLRL 119630
119631
VVRETLRLHPPLPLLLPRECREPCRVLGYDVPRGAQVLVNAWAIGRDERYWPGGSPEEFR 119810
119811
PERFGDGEAAAAVDFKGADFELLPFGGGRRMCPGMAFGFANVELPLASLLFHIDWEASGV 119990
119991
ADPTEFDMTEAFGITARRKANLLLRPILRVPVPGV* 120098
#390
>aaaa01020516.1
CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 95%
3
IVFAPYGDYWRQLRKITVTELLSARRVASFRAIREEEVAAMLRAVAASAAAGRAVEMRPL 182
183
LSALVSDSTVRAVMGDQFPHRDVFLRELDRSIELVAGFNPADLWPSSRLAGCLT 344
345
GTMRQAKKCWDTMSSVLESTIQEHLQKNGSSGGGAGATDEDLIDVLLRIQKEGGLQFP 518
519
FDMDVIKSVI 548
>AP003571.1c
$F CYP71Y5 chromosome
6 clone P0458E02
AQ328148
49% identical to C72289 58% to AQ327456 61% to AQ259669
56% to
71B3
106935
MADLHTYLYLGLALVSLLAVQLARRRRSSAAHGSGALRLPPGPWQLPVIGSLHHLVGKL 107111
107112
PHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTKTHDVSFATRPLSSTTRVFS 107285
107286
NGGRDIVFAPYGDYWRQLRKITVTELLSARRVASFRAIREEEVAAMLRAVGGYAA 107450
107451
AGCAVEIRPLLAALVSDSTVRAVMGDRFPHRDVFLRELDRSIELTAGFNPADLWPSSRL 107627
107628
AGCLTGTIRQAKKCWDTMSSVLESTIQEHLQKNGSSGGGAGATDEDLIDVLLRIQKEGGL 107807
107808
QFPFDMDVIKSVIH 107846 (0)
110756
NVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYLHLVI 110935
110936
KETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEEFRPE 111115
111116
RFGDGEPAAALDFKGTDYELLPFGAGRRMCPGLAFGLANVELPLASLLFHFDWEVPGMAD 111295
111296
PTKLDMTEAFGIGVRRKADLIIRPILRVPVPGV* 111397
#390
>aaaa01021346.1
CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 98%
1 diff see
aaaa01020516.1
158
LPPGPWQLPIIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSS 6
#390
>aaaa01083019.1
CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 99%
see
aaaa01020516.1 for ortholog
501
MQNVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYL 328
327
HLVIKETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEE 148
147
FRPERFGDGEPAAA*DFKGTDYELLTFGAGRRMCPGLAFGLANVELPL 4
#390
>aaaa01032282.1
CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6
100% see
aaaa01020516.1 for ortholog
297
NVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYL 470
471
HLVIKETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEE 650
651
FRPERFGDGEPAAALDFKGTDYELLPFGAGRRMCPGLAFGLANVELPLASLLFHFDWEVP 830
831
GMADPTKLDMTEAFGIGVRRKADLIIRPIL 920
#390
>aaaa01032612.1
CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 100%
see
aaaa01020516.1 for ortholog
1464
LPPGPWQLPVIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTK 1643
1644
THDVSFATR 1670
#444
>aaaa01053818.1
CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6 100%
600
LLTKRSRKATAQRLPPGPWQLPVIGSLHHLAGKLPHHAMRDLARRHGPVMMLRLGEVPTL 779
780
VVSSPEAAQEVMRTHDAVFATRALSATVRAATMGGRDIAFAPYGDRWRQLRKIAATQLLS 959
960
ARRVASF 980
>AP003571.1b
$F CYP71Y6 chromosome
6 clone P0458E02
80945
MEDASHGYVYLAMAVVALLGVLLTKRSRKATAQRLPPGPWQLPVIGSLHHLAGKLPHHAMRDLARRHG 81148
81149
PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAATMGGRDIAFAPYGDR 81325
81326
WRQLRKIAATQLLSARRVASFRAIREEEVATMLRAVAAAAADGRAVEMRAALCVV 81490
81491
VADSTARAMVGESCQERDAFLREIDRSMELVSGFNPEDLWPSSRLAGRLSGAVRKIEAS 81667
81668
LHTVLGILDRIIQKRLQEKIGGAGAAAASEDILDVLLRIHKDGGAGGLQVPLDMDDITLV 81847
81848
IT 81853 (0)
83437
SMQLQNAHACALFLTIVSSTYSYSLFNDPPSLHMQDLFSGGGETVATLLVWAMAELIRN 83613
83614
PMAMQKATAEVRRAFALPGVVSEGEGALGELRYLHLVIRETFRLHPPGPLLLPRECSEPC 83793
83794
QVLGYDVPRGTQVLVNVWAIGRDERCWPAAAGGGSPEEFWPERFEDGAEAVDLRGNNFEL 83973
83974
LPFGAGRRMCPGVAFALANIELTLASLLFHFDWEVPGMADPAKLDMAEALGITARRKGDL 84153
84154
LLRPVLRMPVPGV* 84195
#444
>aaaa01076398.1
CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6
99% see
aaaa01053818.1 for ortholog
725
QLLSARRVASFRAIREEEVATMLRAVAAAAADGRAVEMRAALCVVVADSTARAMVGESCQ 546
545
ERDAFLREIDRSMELVSGFNPEDLWPSSRLAGRLSGAVRKIEASLHTVLGILDRIIQKRL 366
365
QEKIGGAGAAAASEDILDVLLRIHKDGGAGGLQVPLDMDDITLVITVSDQL 213
#444
>aaaa01098934.1
CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6 99%
see
aaaa01053818.1 for ortholog
131
MQDLFSGGGETVATLLVWAMAELIRNPMAMQKATAEVRRAFALPGVVSEGEGALGELRYL 310
311
HFVIRETFRLHPPGPLLLPRECSEPCQVLGYDVPRGTQVLVNVWAIGRDERCWPAAAGGG 490
491
SPEEFWPERFED 526
#310
>aaaa01012578.1
CYP71Y7 (indica cultivar-group) orth AP003571.1a $F chr 6 99%
5308
LVALLGVLLTKRSRTATAQRRLPPGPWQLPVIGSLHHLIGKLPHHAMRDLTRRHGPVMML 5487
5488
RLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAGTMGGRDIAFAPYGDYWRQLRK 5667
5668
IAATELLSAPRVASFRAIREEEVAATLRTVAAAAADGRAVELRAALCALVTDSTSRAVVG 5847
5848
DRCKESDALIRAFDRSMELASGFNPADLWPSSRLAGLLSGGVREIEANLHTVFGIL 6015
6016
DRLIEKRLQQKKTAPSSAAGEDILDALLRIHKEGGGLQFPLDMDSIKLII 6165
>AP003571.1a
$F CYP71Y7 chromosome
6 clone P0458E02
67847
MADVLSQGYVYLAMALVALLGVLLTKCSRTATAQRRLPPGPWQLPVIGSLHHLIGKLPHHAMRDLTRRHG 68056
68057
PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAGTMGGRDIAFAPYGDY 68233
68234
WRQLRKIAATELLSAPRVASFRAIREEEVAATLRTVAAAAADGRAVELRAALCAL 68398
68399
VTDSTSRAVVGDRCKESDALIRAFDRSMELASGFNPAADLWPSSRLAGLLSGGVREIEA 68575
68576
NLHTVFGILDRLIEKRLQQKKTAPSSAAGEDILDALLRIHKEGGGLQFPLDMDSIKLIIA 68755 (0)
73915
DLFSGGGETVATLLVWAMAELIRNPMAMQKATTEVRRAFALAGAVSEGKGALGELRYLHL 74094
74095
VIKEASRLHPPAPLLLPRECSEPCQVLGYDVPRGTQVLVNAWAIGRDERCWTGGSGDGSS 74274
74275
PEEFRPERFEDGAEAVDLRGNNFELLPFGAGRRMCPGMAFALANIELTLASLLFHFDWEV 74454
74455
PDMADPAKLDMTETLGITARRKGDLLLRPVLRMPVPGVY* 74574
#289
>aaaa01011521.1b
$FI CYP71Y8 (indica cultivar-group)
6071
MADTSHGYVYIGLALVSLFVVLLDRRRRSPPPPAAH
6179
GDGGLRLPPGPWTLPIIGSLHHLVGKLPHHAMRDLARRHGPVMLLRIGQVPTLVVSSRDA 6358
6359
AREMMKTHDMAFATRPLSATLHVITCDGRDLVFAPYGDYWRQLRKIAVTELLTARRVNS 6535
6536
YRAIREEEVAAMLRAVAAAAEGSGAAAGTVEMRAALTALSTDITARAVFGNRCKDREEYL 6715
6716
AQVDHTIELTAGFNPADLWPSSRLAGRLSGIVRRAEECRDTAFKILDRIIQERLE 6880
6881
MARSDGAAGEYLIDVLLRIQKEGGLQFPLAMDDIKANIF (0) 6994
7066
DIFGAGSETSGTALAWAMAELIRNPTVMRKATAEVRRAFAAAGAVSEDGLGELPYLHLVI 7245
7246
RETFRLHPPLPLLLPRECREPCRLLGYDVPRGTQVLVNAWALGRDERYWPGGSPEEFRPE 7425
7426
RFEDGEATAAVNFRGADFEFLPFGGGRRMCPGIAFALATVELPLASLLFHFDWEVPGMAD 7605
7606
PTKLDMTEAFGITARRKADLHLRPLLRVSVPGV* 7707
no
japonica ortholog found 9/10/02
#288
>aaaa01011521.1a $PI CYP71Y9P (indica cultivar-group)
1377
MADASDGYVYVG
1413
LAVVSLFVVLLAWRSRSPAAHGVGDGGLRLPPGPWTLPVIGSLHHLAGQLPHRAMRDLAR 1592
1593
RHGPLMLLRIGEVPTLVVSSRDAAREVMKTHDMAFATRPLSATLRVITCDGRDLVFAPY 1769
1770
GDYWRQVRKIAVTELLTVRRVSSFRSIREEEVAAVLRAVAAAAAVEEATPAMATVEMRAA 1949
1950
LSALVTDITARTAFGNRCKDREEYLVLLERIVEIAGGFNPADLWPSSRLAGRLKRCRAPR 2129
2130
RGVPQLGVILDGIIQEERTGAGSEDLVDVLLRIQKEGELQFPLAMDD 2270
2271
IKSIDIFNAGIETSGTTLQWAMAELIRNPTVM 2450
2451
HKATAEVRHAFAAAGDVSEDALGELRYLQL 2540 (deletion of about 104 aa)
2539
FDWEVPGMADLTKLDMTEAFGITARRKENLHLRPLLRVSVPAASS 2673
2674
RLRWTTTAFSICCHDTHLV*
no
japonica ortholog found 9/10/02
#285
>aaaa01011369.1
CYP71Z1 (indica cultivar-group) orth AL606625.1 $F chr 4 99%
8999
LWFGEVGTVFASSPEAAREVLRSHDLAFADRHLTAAAAAFSFGGRDVVLSPYGERWRQLR 8820
8819
KLLTQELLTASRVRSFRRVREEEVARLMRDLSAAATAGAAVNLSEMVTRMVNDTVLRCSV 8640
8639
GSRCEHSGEYLAALHAVVRLTSGLSVADLFPS 8544
5576
KSLFQDMFAGGTDTSSTTLIWAMAELIRSPRVMAKVQSEMRQIFDGKNTITEDDLVQL 5403
5402
SYLKMVIKETLRLHCPLPLLAPRKCRETCKIMGYDVPKGTSAFVNVWAICRDSKYWEDAE 5223
5222
EFKPERFENNDIEFKGSNFEFLPFGSGRRVCPGINLGLANMEFALANLLYHFDWKLPNRM 5043
5042
LHKDLDMREAPGLLVYKHTSLNVCPVTH 4959
>AL606625.1
$F CYP71Z1 chromosome
4 clone OSJNBa0032I19 similar to 71B28 = AQ858445.1
AQ858445.1
nbeb0013M22r CUGI Rice BAC genomic Length = 824 54% to 71B23
82576
MGASILLVVVVSKLMISFAAKPRLNLPPGPWTLPLIGSIHHVVSSRESVHSAMRRLARRHGAPLM 82770
82771
QLWFGEVGTVVASSPEAAREVLRSHDLAFADRHLTAAAAAFSFGGRDVVLSPYGERWRQL 82950
82951
RKLLTQELLTASRVRSFRRVREEEVARLMRDLSAAATAGAAVNLSEMVTRMVNDTVLRCS 83130
83131
VGSRCEHSGEYLAALHAVVRLTSGLSVADLFP 83226
83227
SSRLAAMVSAAPRAAIANRDKMVRIIEQIIRERKAQIEADDRAADSKSC 83373
83374
ACSLDDLLRLQKEGGSPIPITNEVIVVLLM 83463 (0)
84970
DMFAGGTDTSSTTLIWAMAELIRSPRVMAKVQSEMRQIFDGKNTITEDDLVQLSY 85134
85135
LKMVIKETLRLHCPLPLLAPRKCRETCKIMGYDVPKGTSAFVNVWAICRDSKYWEDAEEF 85314
85315
KPERFENNDIEFKGSNFEFLPFGSGRRVCPGINLGLANMEFALANLLYHFDWKLPNGMLH 85494
85495
KDLDMREAPGLLVYKHTSLNVCPVTHIASSCA* 85593
#88
>aaaa01002274.1a
CYP71Z2 (indica cultivar-group) ortholog of AP003805.1 $F 100%
a
duplicate of 1b
2088
MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVI 1909
1908
GKLAREHGPVMQLWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVV 1729
1728
MAQYGERWRHLRKLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVN 1549
1548
RLVNDTVLRCSVGSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLA 1369
1368
NRNKVERIIEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPI 1189
1188
TNQVITVLLW 1159
>aaaa01002274.1b
CYP71Z2 (indica cultivar-group) ortholog of AP003805.1 $F 100%
duplicate
of 1a count only once
22179
MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVI 22358
22359
GKLAREHGPVMQLWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVV 22538
22539
MAQYGERWRHLRKLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVN 22718
22719
RLVNDTVLRCSVGSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLA 22898
22899
NRNKVERIIEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPI 23078
23079
TNQVITVLLW 23108
>AP003805.1
$F CYP71Z2 chromosome
7 clone OJ1080_F08, similar to AC087550.2
39% to
71B23
10416
MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVIGKLAREHGPVMQ 10201
10200
LWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVVMAQYGERWRHLR 10021
10020
KLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVNRLVNDTVLRCSV 9841
9840
GSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLANRNKVERI 9673
9672
IEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPITNQVITVLLW (0)9487
3785
DMFGAGTDTSSTTLIWTMAELMRSPRVMAKVQAEMRQAFQGKNTITEDDLAQLSYLKMVL 3606
3605 KESFRLHCPVPLLSPRKCRETCKIMGYDVPKGTSVFVNVWAICRDSMYWKNAEEFKPERF
3426
3425
EDNDIELKGSNFKFLPFGSGRRICPGINLGWANMEFALANLLYHFDWNLPDGMLHKDLDM 3246
3245 QESPGLVAAKCSDLNVCPVTHISSSCA* 3162
#13
>aaaa01000275.1
CYP71Z3 (indica cultivar-group) orth AC087550.2 $F chr 10, 100%
same as
AAAA01002847.1 $FI see that accession below for ortholog
42371
MEDKRPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHRSMRALAE 42171
42170
KHGRHHLMQISLGEVFAVVVSSPEAAEEILR 42078
#13
>aaaa01002847.1a
$FI CYP71Z3 (indica
cultivar-group) ortholog to AC087550.2a 99%
also aaaa01000275.1
part
14507
MDDKLLQLLLLALAVSVVSSIVTISKLVYRATNKPRLNLPPGPWTLPVIGSLHHLVMRSP 14328
14327
SIHRSMRALAEKHGPLMQVWLGEVPAVVVSSTEAAEEVLKNQDARFADRFITTTLGAI 14154
14153
TFGGGDLAFAPYGERWRHLKMLCTQQLLTAARVRSFRRIREEEVARLVRDLAASAGGGSE 13974
13973
VAVNLSERVARLVNDIMVRCCVGGRSKHRDEFLGALCTALSQTSWLTVADLFPSSRLARM 13794
13793
LGTAPRRALASRKKMELILEQIIQEREEMTTDRSGDGEAGPTNECFLDVLLRLQK 13629
13628
EGDTPIPITMELIVMLLF 13575
12405
DIVSGGTETSTIVLNWTMAELIRTPRVMAKAHAEVRQTFQAKSTITEDDDISGL 12244
12243
TYLKMVIKESLRMHCPVPLLGPRRCRETCKVMGYDILKDTTVFVNVWAMCRSSIYWNDAE 12064
12063
EFKPERFENKCIDYKGSNFEFVPFGSGRRMCAGMNLGMADVEFPLASLLYHFDWKLPDGM 11884
11883
SPEDIDMQEAPGLFGGRRTSLILYPITRVAPSDLQVI 11773
>AC087550.2a
$F CYP71Z3 chromosome
10 clone nbeb0016G17 74% to AC087554 seq 14167
132002
MDDKLLQLLLLALAVSVVSIVTISKLVYRATNKPRLNLPPGPWTLPVIGSLHHLVMRSPS 131823
131822
IHRSMRALAEKHGPLMQVWLGEVPAVVVSSTEAAEEVLKNQDARFADRFITTTLGAITF 131646
131645
GGGDLAFAPYGERWRHLKMLCTQQLLTAARVRSFRRIREEEVARLVRDLAASAAGGGEVA 131466
131465
VNLSERVARLVNDIMVRCCVGGRSKHRDEFLGALCTALSQTSWLTVADLFPSSRLARML 131289
131288
GTAPRRALASRKKMELILEQIIQEREEMTTDRSGDGEAGPTNECFLDVLLRLQK 131127
131126
EGDTPIPITMELIVMLLF 131073 (0)
DIVSGGTETSTIVLNWTMAELIRTPRVMTKAHAEVRQTFQAKSTITEDDDISGL 129741
129740
TYLKMVIKESLRMHCPVPLLGPRRCRETCKVMGYDILKDTTVFVNAWAMCRSSIYWNDAE 129561
129560
EFKPERFENKCIDYKGSNFEFIPFGSGRRMCAGMNLGMADVEFPLASLLYHFDWKLPDGM 129381
129380
SPEDIDMQEAPGLFGGRRTSLILCPITRVAPSDLQVIV* 129264
#101
>aaaa01002847.1b
$FI CYP71Z4 (indica cultivar-group) =
aaaa01000275.1
ortholog
to AC087550.2b >99% 1 diff
21763
MEDKRPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHR 21584
21583
SMRALAEKHGRHHLMQISLGEVFAVVVSSPEAAEEILRNQDVTFADRFLSTTIGVITFGG 21404
21403
NDMAFAPYGERWRQLRKLCTLELLSAARVRSFRRIREEEVARLVRDLAASAAAGEAVNLS 21224
21223
GRIAKLINDVVVRCCVGGRSEHRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAPR 21044
21043
KALASRKKIEHILEQIIQERKRIMDRSSHGGDGDGEAMNTSECFLDVLLRLQKDGNTPIP 20864
20863
ITNEVIVVLLF 20831
18797
DMFSGGSETSSSTLIWTMAELIRKPKVMAKAHVEVRQAFQGKNTITEDDGVNELTYLKMV 18618
18617
IKESLRMHCPVPLLGPRKCRETCKVMGYDIPKDTTVFVNAWAICRDPKYWDDAEEFQPER 18438
18437
FENKSIDFKGSNFEFLPFGSGRRMCAAMNLGIANVELPLASLLYHFDWKLPDGMMPEDVD 18258
18257
MQDAPGILVGKRSSLIMCPVTRVAPSNPQVIAS 18159
>AC087550.2b
$F CYP71Z4 chromosome
10 clone nbeb0016G17 same as seq on AC087544 from 1-3082
AQ330340
nbxb0046P18r 60% to D48250 65% to 76C4 almost identical to AC087550.2
139422
MEDKLPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHR 139243
139242
SMRALAEKHGRHHLMQISLGEVFAVVVSSPEAAEEILRNQDVTFADRFLSTTIGVITFGG 139063
139062
NDMAFAPYGERWRQLRKLCTLELLSAARVRSFRRIREEEVARLVRDLAASAAAGEAVNLS 138883
138882
GRIAKLINDVVVRCCVGGRSEHRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAP 138706
138705
RKALASRKKIEHILEQIIQERKRIMDRSSHGGDGDGEAMNTSECFLDVLLRLQKDGNT 138532
138531
PIPITNEVIVVLLF (0)
136455
DMFSGGSETSSSTLIWTMAELIRKPKVMAKAHVEVRQAFQGKNTITEDDGVNELTYLKMV 136276
136275
IKESLRMHCPVPLLGPRKCRETCKVMGYDIPKDTTVFVNAWAICRDPKYWDDAEEFQPER 136096
136095
FENKSIDFKGSNFEFLPFGSGRRMCAAMNLGIANVELPLASLLYHFDWKLPDGMMPEDVD 135916
135915
MQDAPGILVGKRSSLIMCPVTRVAPSNPQVIAS* 135814
#184
>aaaa01005737.1
$FI CYP71Z5 (indica
cultivar-group) orth of AP004790.1 >99%
14185
MEDKTILLSLALSMLLAILLSKLVSISKKPRLNLPPGPWTLPVIGSIHHLASNPNTHRALRALSQK 13988
13987
HGPLMQLWLGEVPAVVASTPEAAREILRNQDLRFADRHVTSTVATVSFDASDIFFSPY 13814
13813
GERWRQLRKLCTQELLTATRVRSFSRVREDEVARLVRELAGGGGAAVDLTERLG 13652
13651
RLVNDVVMRCSVGGRCRYRDEFLGALHEAKNQLTWLTVADLFPSSRLARMLGAAPRRGLA 13472
13471
SRKRIERIIADIVREHEGYMGSGGGGGDEAAAAAAGKDCFLSVLLGLQKEGGTPIPITNEIIVVLLF (0) 13271
10674
DMFSGGSETSATVMIWIMAELIRWPRVMTKVQAEVRQALQGKVTVTEDDIV 10522
10521
RLNYLKMVIKETLRLHCPGPLLVPHRCRETCKVMGYDVLKGTCVFVNVWALGRDPKYWED 10342
10341
PEEFKPERFENSDMDYKGNTFEYLPFGSGRRICPGINLGIANIELPLASLLYHFDWKLPD 10162
10161
EMASKDLDMQEAPGMVAAKLTSLCVCPITRVAPLISA* 10048
>AP004790.1
$F CYP71Z5 (japonica
cultivar-group) chr 2
51668
MEDKTILLSLALSMLLAILLSKLVSISKKPRLNLPPGPWTLPVIGSIHHLASNPNTHRAL 51847
51848
RALSQKHGPLMQLWLGEVPAVVASTPEAAREILRNQDLRFADRHVTSTVATVSFDASDIF 52027
52028
FSPYGERWRQLRKLCTQELLTATRVRSFSRVREDEVARLVRELAGGGGAAVDLTERLGRL 52207
52208
VNDVVMRCSVGGRCRYRDEFLGALHEAKNQLTWLTVADLFPSSRLARMLGAAPRRGLASR 52387
52388
KRIERIIADIVREHEGYMGSGGDGGDEAAAAAAGKDCFLSVLLGLQKEGGTPIPITNEII 52567
52568
VVLLF 52582
55179
DMFSGGSETSATVMIWIMAELIRWPRVMTKVQAEVRQALQGKVTVTEDDIVRLNYLK 55349
55350
MVIKETLRLHCPGPLLVPHRCRETCKVMGYDVLKGTCVFVNVWALGRDPKYWEDPEEFMP 55529
55530
ERFENSDMDYKGNTFEYLPFGSGRRICPGINLGIANIELPLASLLYHFDWKLPDEMASKD 55709
55710
LDMQEAPGMVAAKLTSLCVCPITRVAPLISA 55802
#16
>aaaa01000393.1
$FI CYP71Z6 (indica
cultivar-group) 89% to AP005114.1b
9756
MEDKLILALCLSALFVVVLSKLVSSAVKPRLNLPPGPWTLPLIGSLHHLAMTKSPQTHRSLRALS 9562
9561
EKHGPIMQLWMGEVPAVVVSSPAVAEEVLKNQDLRFADRHLTATTEEIFFGGRDVIFGP 9385
9384
YGERWRHLRKICMQELLTAARVRSFRGVREGEVARLVRELAASAAGAGAGAVGAAAGVNL 9205
9204
NERISKLANDIVMVSSVGGRCSHRDEFMEALEVAKKQITWLSVADLFPSSKLARMVAVAP 9025
9024
RKGLASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVP 8848
8847
VTDEIIVVLLF (0) 88315
4681
DMISGASETSPTVLIWTLAELMRNPRIMAKAQAEVRQAVAGKTTITEDDIVG 4526
4525
LSYLKMVIKETLRLHPPAPLLNPRKCRETSQVMGYDIPKGTSVFVNMWAICRDSRYWEDP 4346
4345
EEYKPERFENNSVDYKGNNFEFLPFGSGRRICPGINLGVANLELPLASLLYHFDWKLPNG 4166
4165
MAPKDLDMHETSGMVAAKLITLNICPITHIAPSSA* 4058
aaaa01000393.1
has no ortholog in nr or HTGS 9/2/02
#328
>aaaa01013736.1
$FI CYP71Z7 (indica
cultivar-group) ortholog of BI811079.1 AP005114.1b
6446
MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSLRALSE 6252
6251
KHGPIMQLWMGEVPAVVVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVTFAPYS 6072
6071
ERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNERISKLA 5892
5891
NDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKGLASRK 5712
5711
RMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDEIIVVL 5532
5531
LF (0) 5526
4289
DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 4110
4109
KEALRLHSPAPLLNPRKCRETTQVIGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 3930
3929
ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNEMLPKDLDM 3750
3749
QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 3654
>AP005114.1b
$F CYP71Z7 (japonica
cultivar-group) chromosome 2
BI811079.1
clone K015D02.Length = 347 57% to AC087550.2 C-helix
41% to
71B11
120645
MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSL 120824
120825
RALSEKHGPIMQLWMGEVPAVIVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVT 121004
121005
FAPYSERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNER 121184
121185
ISKLANDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKG 121364
121365
LASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDE 121544
121545
IIVVLLF (0) 121565
122803
DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 122982
122983
KEALRLHSPAPLLNPRKCRETTQVMGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 123162
123163
ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNGMLPKDLDM 123342
123343
QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 123438
#29
>aaaa01000805.1a
CYP71Z8 partial (indica cultivar-group) 100% to AC087544.2
4553
MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLNLPPGPWTLPVIGSIHHLVGSHPIHRS 4374
4373
MRALAEKHGRDLMQVWLGELPAVVVSSPEAARDVLRSQDLAFADRYVSTTIAAIYLGGRD 4194
4193
LAFAPYGERWRQLRKLCTQRLLTAARVRSFRCVREEEVARLVRDLAASAAAGEAVDLTAR 4014
4013
VAELVNDVVVRCCIGGRRSRYRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAPRK 3834
3833
ALASRKKMERILEQIIQERKQIKERSTGAGAGADDEAAAAGNECFLDVLLRLQKEGDTPI 3654
3653
PITNETMMLLLH 3618 sequence gap
#29
>aaaa01000805.1b
CYP71Z8 partial (indica cultivar-group) 100% to AC087544.2
duplicate
of first 46 aa probably an assembly error. Count only once.
18349
MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLNLPPGPWTLPVIG 18212
>AC087544.2
$F CYP71Z8 chromosome
10 clone nbxb0046P18,
47% to
CYP71D7
AZ131846.1
OSJNBb0111D08r CUGI Rice BAC Length = 377 59% to 71B9
AZ132319.1
OSJNBb0062F12r CUGI Rice BAC genomicLength = 683
14167
MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLN
14065
LPPGPWTLPVIGSIHHLVGSHPIHRSMRALAEKHGRDLMQVWLGELPAVVVSSPEAARDV 13886
13885
LRSQDLAFADRYVSTTIAAIYLGGRDLAFAPYGERWRQLRKLCTQRLLTAARVRSFRCVR 13706
13705
EEEVARLVRDLAASAAAGEAVDLTARVAELVNDVVVRCCIGGRRSRYRDEFLDALRTALD 13526
13525
QTTWLTVADVFPSSKLARMLGTAPRKALASRKKMERILEQIIQERKQIKERSTGAGAGAD 13346
13345
DEAAAAGNECFLDVLLRLQKEGDTPIPITNETMMLLLH 13232 (0)
10760
NMFSAGSETSSTTLNWTMAELIKSPRVMAKVHDEVRQAFQGKNTITDDDVAKLSYLKMVT 10581
10580
KESLRMHCPVPLLGPRRCRETCKVMGYDVPKGTIVFVNAWAICRDSKYWKSAEEFKPERF 10401
10400
ENISIDYNGNNFEFLPFGSGRRICPGITLGMANVEFPLASLLYHFDWKLPNQMEPEEIDM 10221
10220
REAPGLVGPKRTSLYLHPVTRVAPSSV* 10119
#29
>aaaa01011405.1
CYP71Z8 (indica cultivar-group) orth AC087544.2 $F chr 10 99%
see
aaaa01000805.1a = aaaa01000805.1b for ortholog
2160
SPRVMAKVHDEVRQAFQGKNTITDDDVAKLSYLKMVTKESLRMHCPVPLLGPRRCRET 2333
2334
CKVMGYDVPKGTIVFVNAWAICRDSKYWKSAEEFKPERFENISIDYNGNNFEFLPFGSGR 2513
2514
KICPGITLG 2540
#468
>aaaa01092069.1
CYP71Z9 (indica cultivar-group) 61% to AC087544.2 frag = 623bp
623
AIAMAFRQTSVLTLADLFPSSRLMQALGTAPRKVLACRDKIQRILEQVIQEKAQEMGRGDEATAGNEGFV 414
413
GVLLRLQKEGSTPVQLTNDTI 351
207
DMFSAGSETSSTTLNWCMTELVRSPVVMAKAQAELRDAFKGKNTITENDLEGLSYLKLVI 28
27 KEALRMHAP 1
no
japonica ortholog found 9/12/02
#466
>aaaa01088222.1
CYP71Z10 i not an exact match 64%
to AP005114.1b $F
651bp
frag. N-term runs off the end
247
MEENKALLAAVSLSILLVILSKLKSFLATKPKLNLSPGPWTLPVIG
SLHHLVRSPNIYRAMRALAQKHGQLMTLRLGEVQCM
2
no
japonica ortholog found 9/12/02
#431
>aaaa01035499.1 CYP71Z11 (indica cultivar-group) 53% to AP005114.1b (partialI)
no
ortholog in known set might be a new subfamily
3
TMPTTIQGYHIPAKTIAFINVWAIGRDPAAWDTPDEFRPERFMGSAVDFRGNDYKFIPFG 182
183
AGRRLCPGIILALPGLEMVIASLLYHFDWELPDGMDVQDLDMAEAPGLTTPPMNPVWLIP 362
363
RCRTI* 380
no
japonica ortholog found 9/12/02
#475
>BI808626.1 CYP71Z12 (partial) clone D005B07.Length = 538 EST with numerous frameshifts similar to AAAA01035499.1 and 71Bs no ortholog found in indica
no
extensions in htgs nr gss or est sections of Genbank 8/3/02
66% to
AY104083.1 Zea mays
11
LFVHAWAIGRDPXAWXXPEEFRPDRFLXXSVDFRGNDYQLVPFGAAPRICPGISFXX
PVLEMALFALLHHFDWELPAGMXXXXXDMSEAPGLTTPLRVPLRLVPKRKARLPRHIYKRNVIGE*
#93
>aaaa01002599.1a
CYP71AA1P (indica cultivar-group) orth of AP004326.2d 100%
even the
frameshifts are the same
8731
FFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 8552
8552
QETLRLHPPVPLLLPRLWSEPCKIMGYDIP
KNTAIFVNTWALGR
KIKNTGLMQVSSG 8382
8381
LKYSRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSN 8202
8201
KLDMTEANGITTHRRIDIWLEATPFVPR 8118
>AP004326.2d
$P CYP71AA1P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 4 pseudogene
81031
DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift
81213
QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift
81304 KNTAIFVNTWALGR 81345 frameshift
81344
KIKNTGLMQVSSGLKY
81393
SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566
81567
LDMTEANGITTHRRIDIWLEATPFVPR 81647
#94
>aaaa01002599.1b
$FI CYP71AA2 (indica
cultivar-group) ortholog of AP004326.2c $F 99%
12298
MAGIMDSTTASYYTTLLCGALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHC 12119
12118
LLGSLPHHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTAS 11939
11938
ILTYGARDIVFAPFGKHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASAS 11759
11758
SAVNVSELVKIMTNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARV 11579
11578
LGGRSLRTTKRVHEKLHQITEAIIQGHGIKDTVGDEHHECEDIL 11447
11446
DVLLRFQRDGGLGITLTKEIVSAVLF 11369
11213
DLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHYLQLV 11034
11033
IKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKNVNEFRPE 10854
10853
RFKDDIVDFSGTDFRFIPGGSGRRMCPGLTFGVSNIEIALVTLLYHFDWKLPSETDTHEL 10674
10673
DMRETYGLTTRRRSELLLKATPSY 10602
>AP004326.2c
$F CYP71AA2 genomic
DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence Gene 3 39% to 71B11
77487
MAGIMDSTTASYYTTLLCG
77544
ALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRRY 77717
77718
GPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTASIDIVFAPFG 77873
77874
KHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVSELVKIM 78044
78045
TNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSLRTTKRV 78224
78225
HEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVSA 78389
78390
VLF 78398 (0)
78554
DLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHYLQLV 78733
78734
IKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNEFRPE 78913
78914
RFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETDTHEL 79093
79094
DMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE* 79228
#95
>aaaa01002599.1c
$FI CYP71AA3 (indica cultivar-group)
ortholog of AP004326.2b $F >99%
22237
MAGIVDTAAFCTLLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLP 22058
22057
HHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGA 21878
21877
RDIVFAPFSKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSEL 21698
21697
VKIMANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRA 21518
21517
TKRVHQKLHQITDTIIQGHEIIEDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLLRFHR 21338
21337
DGGLGITLTKEIVSAVLF 21284
20770
DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIHQVLQGKTVVSEADIEGRLHYLQLV 20591
20590
IRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEFRPE 20411
20410
RFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASSCKL 20231
20230
DMRETHGVTARRRTELLLKATPLYT 20156
>AP004326.2b
$F CYP71AA3 genomic
DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Gene 2 no
good matches in NR 79% to AP004326.2c
71860
MAGIVDTAAFCT
71896
LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069
72070
YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243
72244
SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408
72409
MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588
72589
VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747
72748
RFHRDGGLGITLTKEIVSAVLF 72813 (0)
73327
DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497
73498
QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677
73678
RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857
73858
CKLDMRETHGVTARRRTELLLKATPLYT* 73944
cluster
continues on AP004326.2 seq a
#334
>aaaa01014066.1
CYP71AA4P (indica cultivar-group) orth AP004326.2a $P
chr 1 100%
5245
YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 5424
5425
F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 5604
5605
RMELDMTESAGLT 5643
>AP004326.2a
$P CYP71AA4P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence
Length =
102983
4 genes
71B like
Gene 1
pseudogene 71 family
67989
LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion
68060
RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227
68228
GARP 68239 frameshift with small deletion
68238
RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396
68397
DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)
68886
GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion
69023
YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202
69203
F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382
69383
RMELDMTESAGLTASRLTDLFG* 69451
#442
>aaaa01051575.1
CYP71AA5 (indica cultivar-group) 69% to AP004326.2b
855
DVFAAGSETTATATIWAMSELVRTPRLMERAQAEIRQLLQGKTRVAEEDIQGRLPYLQMV 676
675
IKETLRLHPPAPLILPRLCAESTKILGFDVPEGTTVFVNAWALGRDDKSWVDANEFKPER 496
495
FEDDDRVDFSGADFRFIPGGSGRRMCPGLTFGLANIETTLANLLYHFDWKLPGGANPYEL 316
315
DMAESYGITARRTTDLLLEATPYVPHGSVS* 223
no
japonica ortholog found 9/12/02
#273
>aaaa01010273.1
$FI CYP71AB1 (indica
cultivar-group) ortholog of AC113337.1
6315
MANLIYYSLLIILPFLFLIKFYKAMFSSRKQARRLPPCPWQLPIMGSIHHLIGDLPHRAL 6494
6495
RDLSRRYGPVMLLKFGQVPFIIVSSPEAAKDIMKTHDSIFATRPQSEIMKIITKRGQGLV 6674
6675
FAPYDDQWRQLRKICIRELLCAKRVQSFCAIREEEAARLVKSISSDQAHLVNLSKKLADY 6854
6855
ATDAAIRIITGTRFENQE VRDKFQYYQDEGVHLAASFCPANLCPSLQLGNTLSRTAHKA 7031
7032
EIYREGMFAFIGGIIDEHQERRAQDMSHKEDLIDVLLRIQQEGSLESPVSMETIKFLIF (0) 7208
7297
DILAGGSETVTTVLQWAMAELMRNPTVMSKVQDEVREVFKWKEMVSNDDINKLTYLQFVI 7476
7477
KETLRLHTPGPLFMRECQEQCQVMGYDMPKGTKFLLNLWSISRDPKYWDDPETFKPERF 7653
7654
EDDARDFKGNDF EFISFGAGRRMCPGMLFGLANIELALANLLFYFDWSLPDGVLPSELDM 7833
7834
TENFGVTVRKKEDLLLHASLYAQLSC* 7914
>AC113337.1
$F CYP71AB1 (japonica
cultivar-group) cultivar Nipponbare clone OSJNBa0061H20,
from chromosome
10
AC074355.2
Oryza sativa clone OSJNBa0071I20, gene 1 43% to 71A13
AQ288798
65-164 region C-helix 54% to 71A12 same as AC074355.2
AQ840770.1
nbxb0071I20f CUGI Rice BAC genomic cloneLength = 754
AQ840078.1
nbxb0051B18f CUGI Rice genomic cloneLength = 694
similar to
lotus 71D
AQ865944.1
nbeb0026D10f BAC genomic Length = 473 59% to 99A1 69% to AP004000.1
23542
MANLIYYSLLIILPFLLLINFYKAMFSSRKQAGRLPPCPWQLPIMGSIHHLIGDLPHRSL 23721
23722
HDLSRRYGPVMLLKFGQVPFIIVSSPEAAKDIMKTHDSIFAMRPQSEIMKIITKRGQGLV 23901
23902
FAPYDDQWRQLRKICIRELLCAKRVQSFCAIREEEAARLVKSISSDQAHLVNLSKKLADY 24081
24082
ATDAAIRIITGTRFENQEVRDKFQYYQDEGVHLAASFCTANLCPSLQLGNTLSRTARKAE 24261
24262
IYREGMFAFIGGIIDEHQERRAQDMYHKEDLIDVLLRIQQEGSLESPVSMETIKFLIF (0)
DILAGGSETVTTVLQWAMTELMRNPTVMSKAQ 24621
24622
DEVREVFKWKKMVSNDDINKLTYLQFVIKETVRLHTPGPLFMRECQEQCQVMGYDVPKGT 24801
24802
KFLLNLWSISRDPKYWDDPETFKPERFENDARDFKGNDFEFIPFGAGRRMCPGMLFGLAN 24981
24982
IELALANLLFYFDWSLPDGVLPSELDMTENFGVTVRKKEDLLLHASLYAQLSC* 25143
#263
>aaaa01009869.1
CYP71AB2 (indica cultivar-group) orth
AP004684.1b $F chr6 98%
2834
LPLVHYLITLFLHGSRDSDLRLPPGPWRLPLIGSLHHLFFGALPHRALRDLARRHGPLML 3013
3014
LAFGDAPVVVVASTAAAAREILRTHDDNFSSRPLSAVVKACTRRGAGITFAPYGEHWRQV 3193
3194
RKICRLELLSPRRILAFRAIREEEAARLVRAIGVASPPLVTNLSQLLGNYVTDTTVHIV 3370
3371
MGERFRERDALLRYVDEAVRLAGSLTMADLFPSSRLAHAMSSTTLRRAEAFVES 3532
3533
LMEFMDRVIREHLEKKRSCQGGEREEDLIDVLLRLQAEGSLHFELTMGIIRAVIF
DLFSGGSETATTT 3886
3887
LQWAMAELMRNPGVMSRAQAEVREAYKDKMEVTEEGLTNLTYLQCIIKETLRLHTPGP 4060
4061
LALPRECQEQCRILGYDIPKGATVLVNVWAICTDTEFWDESEKFMPERFEGSTIEHKGNN 4240
4241
FEFIPFGAGRRICPGMQFGIANIELALANLLFHFDWTLPEGTIHSDLDMTETMGITARRK 4420
4421
EDL 4429
>AP004684.1b
$F CYP71AB2 chromosome 6 clone P0012H03, Length = 163117
New seq
similar to AP004000 57% to AP003523.1 78% to AP004688.1
36% to 41%
with 71A and 71B sequences possibly new subfamily in 71
117908
MDAAVFCCLLALLPLLHYLITLFLHGSRDSDLRLPPGPWRLPLIGSLHHLF 118060
118061
FGALPHRALRDLARRHGPLMLLAFGDAPVVVVASTAGAAREILRTHDDNFSSRPLSAV 118234
118235
VKVCTRRGAGITFAPYGEHWRQVRKICRLELLSPRRILAFRAIREEEAARLVRAIGVASP 118414
118415
PLVTNLSELLGNYVTDTTVHIVMGERFRERD ALLRYVDEAVRLAGSLTMADLFPSSRLAR 118594
118595
AMSSTTLRRAEAFVESLMEFMDRVIREHLEKKRSCQGGEREEDLIDVLLRLQAEGSLHFE 118774
118775
LTMGIIRAVIF 118807 (0)
118932
DLFSGGSETATTTLQWAMAELMRNPGVMSRAQAEVREAYKDKMEVTEEGLTNLTYLQCII 119111
119112
KETLRLHTPGPLALPRECQEQCQILGYDIPKGATVLVNVWAICTDNEFWDESEKFMPERF 119291
119292
EGSTIEHKGNNFEFIPFGAGRRICPGMQFGIANIELALANLLFHFDWTLPEGTLHSDLDM 119471
119472
TETMGITARRKEDLYVHAIPFVQLP* 119549
#71
>aaaa01002000.1 CYP71AB3 (indica cultivar-group) ortholog of AP004688.1 $F 98%
6046
METADLCCLLALLPLVYCLLTLFHGSRESDLRLPPGPWRLPLIGSLHHLFGRTLPHHALR 5867
5866
DLARLHGPLMLLSFGQASPVVIASTAIAAREIMRTHDDNFSTRPLSTVLKVCTRYGAGMT 5687
5686
FVPYGEHWRQVRKICSLELLSPRRILKFRSIREEEVARLVLAIASSSTPTPTPPAPVNLS 5507
5506
KLLSNYMTDATVHIIMGQCFRDRDTLVRYVDEAVRLASSLTMADLFPSWRLPRVMCATTL 5327
5326
HRAEVFVESVMEFMDRVISEHLEKRSCQGGDREEDLIDVLLRLQAEGNLEFELTTSIIKA 5147
5146
IIF 5138
ELLAGGSEAPITTLQWAMAELMRNPDVMSRAQAEVREAYKEKMKV 4911
4910
TEEGLTNLPYLHCIIKETLRLHTPGPFVLPRECQEQCQILGYDVPKRATVVVNIWAICRD 4731
4730
AEIWDEPEKFMPDRFEGSAIEHKGNHFEFIPFGAGRRICPGMNFALANMELALASLLFYF 4551
4550
DWSLPEDVLPGDLDMTETMGLTARRKEDLYVCAIPFVQLP 4431
>AP004688.1
$F CYP71AB3 chromosome
6 clone P0036C11, Length = 137929
New seq
similar to AP004000 37% to 71B23 52% to AP003523.1 78% to AP004684.1b
57304
METAELCCLLALLPLVYCLLTLFHGSRESDLRLPPGPWRLPLIGSLHHLFGRTLPHRA 57477
57478
LRDLARLHGPLMLLSFGQAAPVVIASTAIAAREIMRTHDDNFSTRPLSTVLKVCTRYGA 57654
57655
GMTFVPYGEHWLQVRKICSLELLSPRRILKFRSIREEEVARLVLAIASSSTPTPTPPAPV 57834
57835
NLSKLLSNYMTDATVHIIMGQCFRDRDTLVRYVDEAVRLASSLTMADLFPSWRLPRVMCA 58014
58015
TTLHRAEVFVESVMEFMDRVISEHLEKRSCQGGDREEDLIDVLLRLQAEGNLEFELSTSI 58194
58195
IKAIIF 58209 (0)
58290
ELLAGGSEAPITTLQWAMAELMRNPDVMSRAQAEVREAYKEKMKVTEEGLTNLPY 58469
58470
LHCIIKETLRLHTPGPFVLPRKCQEQCQILSYDVPKRATVVVNIWAICRDAEIWDEPEKF 58649
58650
MPDRFEGSAIEHKGNHFEFIPFGAGRRICPGMNFALANMELALASLLFYFDWSLPEDVLP 58829
58830
GDLDMTETMGLTARRKEDLYVCAIPFVQLP* 58919
#267
>aaaa01010030.1b CYP71AC1 (indica cultivar-group) orth AP003523.1b $F chr 6 99%
6579
QDMFAGGSESTSTTLEWALSELVRNPHVMQKAQAEIRHALQGRTRVTEDDLINLKYPK 6406
6405
NVIKETLRLHPVAPLLVPKECQESCKILGYDVPKGTIMFVNAWAIGRDPRYWNDAEVFMP 6226
6225
ERFEKVAVDFRGTNFEFIPFGAGRRMCPGITFANATIEMALTALLYHFDWHLPPGVTPDG 6046
6045
LDMEEEFGMSVSRKRDLYLRPTLH 5974
>AP003523.1b
$F CYP71AC1 chromosome
6 clone P0416A11 six different genes
54-58%
with genes in AP003090 and AP004000 group
38% to 41%
with 71A and 71B sequences possibly new subfamily in 71
118132
MDLMKSNPLQGSPWSL
118084
LNLLVLIIVAAMICGELCRRRRRRRGDENGGATRLPPGPWRLPFVGSLHHLAVMRPRGVV 117905
117904
VHRALAELARRHDAPVMYLRLGELPVVVASSPEAAREVLKTHDAAFATRAMSVTVRESIG 117725
117724
DKVGILFSPYGKKWRQLRGICTLELLSVKRVRSFRPIREEQVARLVDAIAAAAASS 117557
117556
TAEAAAVNISRQITGPMTDLALRAIMGECFRWREEFLETLAEALKKTTGLGVADMFPSSR 117377
117376
LLRAVGSTVRDVKLLNAKLFELVECAIEQHREQIRAAHDNGGDDDDAHGHGDKECFLNTL 117197
117196
MRIQKEGDDLDD 117161 (frameshift) LTMATVKAVIL (0)
DMFAGGSESTSTTLEWALSELVRNPHVMQKAQAEIRHALQGRTRV 115967
115966
TEDDLINLKYPKNIIKETLRLHPVAPLLVPKECQESCKILGYDVPKGTIMFVNAWAIGRD 115787
115786
PRYWNDAEVFMPERFEKVAVDFRGTNFEFKPFGAGRRMCPGITFANATIEMALTALLYH 115610
115609
FDWHLPPGVTPDGLDMEEEFGMSVSRKRDLYLRPTLHMGLETI* 115478
#267
>aaaa01031277.1
CYP71AC1 (indica cultivar-group) orth AP003523.1b $F chr 6 99%
see
AP003523.1b above for ortholog
1393
LPPGPWRLPFVGSLHHLAVMRPRGVVVHRALAELARRHDAPVMYLRLGELPVVVASSPEA 1572
1573
AREVLKTHDAAFATRAMSVTVRESIGDKVGILFSPYGKKWRQLRGICTLELLSVKRVRSF 1752
1753
RPIREEQVARLVDAIAAGA 1809
#25
>aaaa01000575.1
$FI CYP71AC2 (indica
cultivar-group) 74% to AP003523.1
same seq
as aaaa01002303.1 $FI orth to AP005610.1 AP005192.1
31724
MDMEMGKLLHRPWKWSLNSPLL
31558
LLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVIGSLHHLAMNPKAVHRALADLAR 31379
31378
RCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAFATRAMSVTVRDSIGDTVGILF 31202
31201
SPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLVGAIAAAAAAPGGDQPPPVNVS 31022
31021
WQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKASRFGVADLFPSSRLLRAVGSTA 30842
30841
VRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGGDDDARDDNECLLNTLMRIQKE 30662
30661
GGGTLSMSTVKAVIL (0) 30617
28748
DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSY 28584
28583
PKNIIKETLRLHPVAPLLGXKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVF 28404
28403
LPERFEEITVDFGGTNYEFIPFGGGRRICPGITFAHATLEWALTALLYHFDWHLPPSVTP 28224
28223
DGLDMEEEFGMNVRRKRDLHLHPVIHVGVEKGIMS* 28116
#25
>aaaa01002303.1
$FI CYP71AC2 (indica
cultivar-group) same seq as AAAA01000575.1
except 2
aa diffs and one short frameshifted region
see
AAAA01000575.1 for ortholog
3994
MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAVGTRLPPGPWRLPVI 4173
4174
VQSAPPRHEPEGGARALADLARRCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 4353
4354
ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 4533
4534
GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 4713
4714
RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 4893
4894
DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 5001
6869
DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 7048
7049
KETLRLHPVAPLLGXKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPERF 7228
7229
EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLEWALTALLYHFDWHLPPSVTPDGLDM 7408
7409
EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 7498
>AP005610.1
$F CYP71AC2 (japonica
cultivar-group) chr 6 = AP005192.1
115277
MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVI 115456
115457
GSLHHLAMNPKAVHRALADLARRCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 115636
115637
ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 115816
115817
GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 115996
115997
RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 116176
116177
DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 116284
118153
DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 118332
118333
KETLRLHPVAPLLMPKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPKRF 118512
118513
EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLELALTALLYHFDWHLPPSVTPDGLDM 118692
118693
EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 118782
>AP005192.1
$F CYP71AC2 (japonica
cultivar-group) chr 6
83856
MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVI 83677
83676
GSLHHLAMNPKAVHRALADLARRCGGXGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 83497
83496
ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 83317
83316
GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 83137
83136
RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 82957
82956
DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 82849
80980
DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 80801
80800
KETLRLHPVAPLLMPKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPKRF 80621
80620
EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLELALTALLYHFDWHLPPSVTPDGLDM 80441
80440
EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 80351
#208
>aaaa01007044.1
$PI CYP71AC3P (indica cultivar-group) seq gap at 5222 no Nterm exon
might be
in this gap but also one frameshift so probably a pseudogene of an
AP003523
like gene.
6146
DMFAGGSETTSTTLEWA 6196 (frameshift)
6196
PEVMQKAQAEIRHALQGKSRVTEDDLINLKYPKNIIKETMRLHPLASLLVPRKCQESCKI 6375
6376
LGYDIPKGTILIMNVWTIGRDHRYWDDAEVFIPERFEDTTIDFKGTHFEFIPFGAGRRMC 6555
6556
LGMTFAHATIELALTALLYHFDWHLPHGVTHDGMDMEEQFSVTVSRKRDLYLHPIQHVGVEEI* 6747
aaaa01007044.1
no japonica ortholog found 9/7/02
#427
>aaaa01034252.1
CYP71AC4 (indica cultivar-group) 77% to AP003523.1b
may be a
pseudogene
1319 R*VVHRALADLVRRCDDLAPLMYLCLSELRVVVASTPDAAREVLKTHDAAMSTVVSAN
1146
1122 FAPYGKRWRHLRGICTLELLSAKRVRSFRPIREEQDARLVGAVVAAAAPSGESVNVRRLI
943
942 GGPMTDLALRAIMGE
898
no
japonica ortholog found 9/12/02
#37
>aaaa01021566.1
CYP71AC5P (indica cultivar-group) orth of AL606658.1 2 diffs
lone
pseudogene fragment
1048
SALNVSRQITGTLTDLTLRAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLP 1227
1228
AVGSRSGD 1251
>AL606658.1
$P CYP71AC5P chromosome 4 clone OSJNBb0016D16 lone pseudogene fragment
72% to
AP003523 118132-115478
120987 SALNVSWQITGTLTDLTLHAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLPAVGSRSGD 120784
#38
>aaaa01013200.1
$PI CYP71AC6P (indica cultivar-group) 3 diffs with AL606658.1 95%
94% to
AP004571.1 and AP004327.1 lone pseudogene fragment
7465
ALNVSRQITGTLTDLTLHAIMGECGFRWREEFLETLGEAQKKATRFGVADLFPSSRLLPA 7644
7645
VESRSGD 7665
>AP004571.1
$P CYP71AC6P (japonica cultivar-group) chr 6 94% to AAAA01013200.1
identical
to AP004327.1 lone pseudogene fragment
60465
ALNVSRQITGTLTDLTLRAIMWECGFRWREEFLETLGEAQKKATRFGVADLFLSSRLLPA 60286
60285
VGSRSGD 60265
>AP004327.1
$P CYP71AC6P (japonica cultivar-group) chr 6 94% to AAAA01013200.1 4 diffs
identical
to AP004571.1 lone pseudogene fragment
105764
ALNVSRQITGTLTDLTLRAIMWECGFRWREEFLETLGEAQKKATRFGVADLFLSSRLLPA 105943
105944
VGSRSGD 105964
#39
>aaaa01017762.1
$PI CYP71AC7P (indica cultivar-group) 89% to AL606658.1
92% to
AAAA01013200.1 lone pseudogene fragment
4196
ALNVSRQITGTLTDLTLRAIMGECGFRWREEFLETLGEAQKKATRFGVADLFPLSRLLPV 4017
4016
IRSRSGD 3996
#290
>aaaa01011555.1
CYP71AD1 (indica cultivar-group) orth AC109595.1 $F chr 5 >99%
8324
NARRRLAPAPRGLPVIGNLHQVGALPHRALRALAAATGAPHLLRLRLGHVTALVASSPAA 8145
8144
AAAVMREHDHVFATRPYFRTAEILTYGFKDLVFAPYGEHWRHARRLCSEHVLSAARSH 7971
7970
RY 7965
7950
QEVALLVNAIRTEAAAAAVDVSKALYAFTNAVICRAVSGRLSREDEGRSELFRELIEE 7777
7776
NATLLGGFCVGDYFPALAWADAFLSGFAARACRNLRRWDELLEEVIAEHEARLRGGDDG 7600
7599
GGEEHREEDFVDVLLALQEESQRHDGSFKLTRDIIKSLLQDMFAAGTDTSFITLEWAMSE 7420
7419
LVKNPAAMRKLQDEVRRGGGATTAATPYLKAVVKETLRLHPPVPLLVPREC 7267
7266
ARDTDDDATVLGYHVAGGTRVFVNAWAIHRDAGAWSSPEEFRPERFLPGGGEAEAVDLRG 7087
7086
GHFQLVPFGAGRRVCPGMQFALATVELALASLVRLFDWEIPPPGELDMSDDPGFTVRRR 6910
6909
IPLRLV 6892
>AC109595.1
$F CYP71AD1 chr 5 39%
to 71As 40% to 71Bs
44% to
71Cs clone OJ1212B02, Length = 126962
72095
MEIELSPVLLLLPFLLLGFLYLTGGVLRSGGNARRRLAPAPRGLPVIGNLHQVGALP 71925
71924
HRALRALAAATGAPHLLRLRLGHVTALVASSPAAAAAVMREHDHVFATRPYFRTAEILTY 71745
71744
GFKDLVFAPYGEHWRHARRLCSEHVLSAARSHRYGPMREQEVALLVNAIRTEAAAAAV 71571
71570
DVSKALYAFTNAVICRAVSGRLSREDEGRSELFRELIEENATLLGGFCVGDYFPALAWA 71394
71393
DAFLSGFAARACRNLRRWDELLEEVIAEHEARLRGGDDGGGEEHREEDFVDVLLALQE 71220
71219
ESQRHDGSFKLTRDIIKSLLQDMFAAGTDTSFITLEWAMSELVKNPAAMRKLQDEVRRGG 71040
71039
GATTAATPYLKAVVKETLRLHPPVPLLVPRECARDTDDDATVLGYHVAGGTRV 70881
70880
FVNAWAIHRDAGAWSSPEEFRPERFLPGGGEAEAMDLRGGHFQLVPFGAGRRVCPGMQFA 70701
70700
LATVELALASLVRLFDWEIPPPGELDMSDDPGFTVRRRIPLRLVAKPVGSEDDK* 70536
#383
>aaaa01019060.1
$FI CYP71AE1 (indica
cultivar-group) one stop in exon 2
4042
MASLATVPNLPLLLLLHYALATFTASRARKNNKDRLPPSPLALLVIGHLLHLMGSLPRTSPSAASPHG 3839
3838
TGPTCSSGLAPCRCSLRRRRVPAAEAILRTHDHVFASRPRTVLLANIVFYRSRDVRFAPY 3659
3658
GDHWRQARKLVTTHLLSAKKVRSLRLAREEE (0) 3584
2413
VSLVMTKISKAATASAVVDIGQILRSFTNDMICRTVSGKCPRDDR*KRIFQELANETSLL 2234
2233
LGGFDIEEYFPVLARVGLVGKMMCLKAERLKKRWDELLEELINDHENDDHSCNLISDQND 2054
2053
EDFVDILLSVRQEYGFTREHVKAIL (0) 1979
1622
DVFFGGIDTSALVLEFTIAELMQRPRMLKKLQDEVRACIPKGQKIVSEVDINNMAYLRAV 1443
1442
IKEGIRLHPVAPVLAPHISMDDCNIDGYMIPSGTRVLVNVWAIGRDPRFWEDAEEFVPER 1263
1262
FIDSMSSAAANVNFTENDYQYLPFGYGRRMXPGMKFGIAVVEIMLANLMWKFDWTLPPG 1086
1085
TEIDMSEVFGLSVHRKEKLLLVPNNMSSC* 996
No
japonica ortholog found 9/11/02
#382
= #425 = #40 reduce gene count by 2
>AQ573853
nbxb0085A03r CYP71AE2 (partial)
50% to 71A24 AQ691042 nbxb0086M20r
AQ795917.1
nbxb0058F03f CUGI Rice BAC genomic clone Length = 684
No indica
ortholog found
MAARTWLWLLLSPLILLLLHYALALLTARRARKNPLPPSPPALPFIGHLHLIGALPHVSLCCLAT
KHAPDLMFLRLGTSLPVLVASSPCAAEAILRTHDDVFASRPRTVLADIIFYGSRDIGFAPYGEDWRQAR
#40
= #382 =
#425 reduce gene count by 2
>AQ573853 nbxb0085A03r CYP71AE2
(partial) 50% to 71A24
AQ691042 nbxb0086M20r