Rice P450
sequences
Dec. 29,
2003 D. Nelson
#n are numbers for the ortholog pairs or unique sequences. 489 numbers were given out, 31 of these were combined and 4 were not from rice. Therefore, there are 454 unique rice sequences. Fragments get the same number as parents. Order is by CYP name.
Three sequences aaaa01039155.1, aaaa01093055.1, aaaa01067419.1 are probable fungal P450 contaminants. One seq aaaa01062516.1 is a probable insect P450 contaminant. These are not counted in the total.
CYP names have now been assigned to all 454 sequences.
27
sequences are partial and they may join to make a smaller number of genes.
This will
probably reduce the gene count by 4 to 450 genes and pseudogenes
#300
>aaaa01012243.1
$FI CYP51G1 = old CYP51A5
Indica rice genome CYP51 New April 24, 2002
ortholog
of AB025047 99%
5108
MTLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 4926
4925
IREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQEVYKFNVPTFGPGVVF 4746
4745
DVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAE 4641
2774
EYFSKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSALFHDLDNGMQPVSV 2598
2597
IFPYLPIPAHRRRDRARQRLKEIFATIIKSRKASGQAEEDMLQCFIDSKYKSGRSTTEGE 2418
2417
ITGLLIAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVLYR 2223
2222
CIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFKNPDS 2043
2042
YDPDRYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEFELVSPF 1863
1862
PETNWKAMVVGIKDEVMVNFKRRKLVVDN* 1773
>AB025047
CYP51G1 = old CYP51A5 rice (partial) 80% to 51A2 missing N-term 64 aa
BE040549.1
OE08G10 OE Oryza sativa cDNA 5' Length = 255 I-helix CYP51
BE230288.1
99AS641 Rice Seedling cDNA clone 99AS641.Length = 586
BE230302.1
99AS655 Rice Seedling cDNA clone 99AS655.Length = 627
BE607441.1
OE202C10 OE cDNA clone ID707 C-term CYP51 Length = 428
REEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQE
VYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYFSKWGE
SGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIFPYLPI
PAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGEITGLL
IAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVL
YRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFK
NPDSYDPDPYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEF
ELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN
#300
>aaaa01066056.1 CYP51G1 = old CYP51A5 (indica cultivar-group) = aaaa01012243.1 $FI Indica rice genome CYP51
591
DPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 761
762 IREEYARLGSVFTVPILRRKITFLI 836
#346
>aaaa01014709.1
CYP51G3 = old CYP51A15 (indica cultivar-group) 49% to 51A2
602
MDLTTGTIWLFLAQ
560
LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381
380
MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201
200
HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117
no
japonica ortholog found 9/11/02
#418
>aaaa01028263.1
CYP51G3 = old CYP51A15 (indica cultivar-group) 73% to AP003866.1
8 GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET
187
188
LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364
365
FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526
VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*
no
japonica ortholog found 9/12/02
#418
>aaaa01028263.1
CYP51G3 = old CYP51A15 (indica cultivar-group) 73% to AP003866.1
8
GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187
188
LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364
365
FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526
VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*
no
japonica ortholog found 9/12/02
all three
fragments CYP51G3 = old CYP51A15 #418, #453, #346 joined reduce gene count by 2
602
MDLTTGTIWLFLAQ
560
LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381
380
MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201
200
HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117
(0)
GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV
STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH
DDMLQCLIDARYKDGRATTETEVAGMLVAALFA
8
GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187
188
LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364
365
FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526
VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*
#453
>aaaa01065204.1
CYP51G3 = old CYP51A15 (indica cultivar-group) exon 3
ortholog
of AY022669.1 searched Genbank for extensions
(0)
GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV
STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH
DDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHT
>AY022669.1
CYP51G3 = old CYP51A15 (partial)
microsatellite MRG4994 containing (CCG)X8, Length = 224
82% to
CYP51 pseudogene above
222
PRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAA 1
>AK107185.1
CYP51A15 (japonica cultivar-group) cDNA clone:002-124-H08
AC135914.2
genomic seq
MDLTTGAIWLFLAQLFVAATMLSKIATRERTRTTGTKFSRPPPPPLARGAPLVGVLPSLLANGPVEFIRH
182
183
HYEKMGSVFTVSLLQQKVTFLVGSEASSHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVD 362
363
YATRHEQFRFFGDIMKPAKLRTYVDLMVAEVE (0)
GYFARWGQSGTVNMKQEFEQLVTLIASR 542
543
CLLGEEVRDKMFDEVSTLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVR 722
723
SRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHTSSSTST 902
903
WAGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKETLRLHPPALML 1082
1083LRHARRSFVVRGGSGEREYEVPEGHTVASPLLLHNALPRVYRDPGEFDPGRFGAGRE
1253
1254EGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQLVSPFPETDWTVVMPGPK
1433
1434GKVMVTYNRRKLT*
1475
#182
>aaaa01005681.1b
$PI CYP51G4P = old CYP51A16P (indica cultivar-group) ortholog of AP003866.1b
4595
VRFLHRKVTFLVGPEESSHFFTGLDAEISQDEVSRFIIPTFGS*VAFDA 4741
5197
GYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 5295
6670
VVTPIATRCLFGEVRSKMLGEVSTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARLGE 6849
6850
IFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG
6975
AEVAGMLVSALLAGQYTSSSTSTWTG 7052 frameshift
7055
ARLLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLML 7234
7235
LRHARRSFVVRARGSGDAEYEVPAGHTVAS 7324
PMVIHNALPHVY 7359
7360
EDAGSFDPGRFGPAREEYRAYAADHAYTVFGGGRHACVGE 7479 frameshift
7482
VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVTVGFSVQL 7655
>AP003866.1b
$P CYP51G4P = old CYP51A16P chromosome 7 clone OJ1092_A07
No obvious
N-terminal, two in frame stops, three frameshifts = Pseudogene
82% to AY022669.1 seems to be a CYP51
pseudogene
54048
VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)
54642
AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa
56119
VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292
56293
GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift
56424
AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift
56510
LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698
56699
RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878
56879
DHAYTVFGGGRHACVGE 56929 frameshift
56932
VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108
#110
>aaaa01003099.1b
CYP51H1 = old CYP51A6 (indica cultivar-group) Nterm aa 4-160
ortholog
to AP005448.1b 100%
10626
VTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 10793
10794
RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 10973
10974
RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11117
>aaaa01003099.1c
CYP51H1 = old CYP51A6 (indica cultivar-group) Nterm aa 61-160
ortholog
to AP005448.1b 100% these two are duplicates only count once
11261
DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11437
11438
APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11617
>aaaa01003099.1d
CYP51H1 = old CYP51A6 (indica cultivar-group) Nterm aa 61-160
ortholog
to AP005448.1b 100% these two are duplicates only count once
11761
DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11937
11938
APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 12117
>aaaa01003099.1e
CYP51H1 = old CYP51A6 (indica cultivar-group) nearly gene, runs off end
ortholog
to AP005448.1b $F 99% plus one frameshifted region
21127
LQKRKISSPAAAAPPVVRGAGLVRLRARHGEGRAAGGDPRAAGEAGERVTAIAPF 20963
20962
GLFKVTFLIGPEVSSHFYLAAESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWD 20783
20782
VLKPRSIEARVGAMAEEVQ 20726 (0?)
18574
NYFSRWGEQGTVDLKKELERVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 18401
18400
TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTAGNGDDVLQRLIDGRYKD 18236
18235
ERALTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLAAVIAEQDRLMASRARTD 18056
18055
DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 17876
17875
LSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 17696
17695
KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRR 17570
>AP005448.1b
$F CYP51H1 = old CYP51A6
(japonica cultivar-group) chromosome 7 21 June 2002
100% to
AP005188.2c
32724
MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 32900
32901
RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 33080
33081
RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 33224
35381
NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 35554
35555
TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 35719
35720
ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 35899
35900
DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 36079
36080
MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 36259
36260
KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 36403
>AP005188.2c
$F CYP51H1 = old CYP51A6
(japonica cultivar-group) chr 7 orth to aaaa01003099.1e 99%
55155
MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 55331
55332
RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 55511
55512
RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ
57812
NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 57985
57986
TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 58150
58151
ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 58330
58331
DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 58510
58511
MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 58690
58691
KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 58834
note:
sequences aaaa01003099.1b to e are all probably from a single gene
#109
>aaaa01003099.1a
CYP51H2P = old CYP51A7P (indica cultivar-group) Nterm aa 4-94
ortholog
of AP005448.1a 100%
7681
TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 7857
7858
PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 7962
>AP005188.2b
$P CYP51H2P = old CYP51A7P (japonica cultivar-group) chr 7 N-term fragment
orth to
aaaa01003099.1a 100% after frameshift
52199
MDHLTSS (frameshift)
TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 52375
52376
PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 52480
>AP005448.1a
CYP51H2P = old CYP51A7P (japonica cultivar-group) chromosome 7 21 June 2002
29768
TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 29944
29945
PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 30049
#63
>aaaa01001626.1
$FI CYP51H3 = old CYP51A8
(indica cultivar-group) Cterm ONE FRAMESHIFT
ortholog
to AP005188.2a 98%
22316
MQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAPPPPVVQGVGLVRFV
RAMARDGPLEAIREQQAKLGSVFTASAPLGTFLIGSEVSSHFYVAPDSEISMGRLY
EFTVPIFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE (0) 22795
23040
NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 23153 (FS)
23156
VPGKLCELFGELDNGLHLISGLLPYLPIPAH
23249
RRRDRARQRLGEIITEVIRSRRNSSRGAAGTDENNDDMLQCLINSRYKDGCAMTDAE 23419
23420
TAGLVVALMFAGKHTSSGVSIWTGVHLLSNPNHLAAVVAEQDRLMASCPGRTDDYHRLD 23596
23597
YDTVQEMRSLHCCVKEALRLHPPVAAVSQAYKHFTVQTKEGKEYTIPGGHMVVSTILVNH 23776
23777
YLPHIYKDPHVFDPQRFAPGREEEKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLL 23956
23957
SNFEIKMVSPFLETEWSTVIPEPKGKVMVSYRRRTAPK* 24073
>AP005188.2a
$F CYP51H3 = old CYP51A8
(japonica cultivar-group) chr 7
12878
MDQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAAPPPVVQGVGLVRFVRAMAR 13063
13064
DGPLEAIREQQAKLGSVFTASAPLGLFKVTFLIGSEVSSHFYVASYSEISMGRLYEFTVP 13243
13244
IFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE 13372
13617
NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 13730
13727
ESVPGKLCELFGELDNGLHLISGLLPYLPIPAHRRRDRARQRLGEIITEVIRLRRNSSR 13903
13904
GAAGTDENNDDMLQCLINSRYKDGCAMTDAEIAGLVVALMFAGKHTSSGVSIWTGVHLLS 14083
14084
NPNHLAAVVAEQDRLMASCPGRTDDYHRLDYDTVQEMRSLHCCVKEALRLHPPVAAVRQA 14263
14264
YKHFTVQTKEDKEYTIPGGHMVVSTILVNHYLPHIYK
DPHVFDPQRFAPGREEDKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLLSN 14540
14541
FEIKMVSPFPET 14576 (frameshift)
QWSTVIPEPKGKVMVSYRRRTAPK* 14649
Note
this cluster continues on AP005188.2b
and 2c
#256
>aaaa01009323.1
CYP51H4 = old CYP51A9 (indica cultivar-group) 55% to AP005448.1b $F
orth of
AP004890.1
6368
DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 6547
6548
GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 6727
6771
YKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAYQQIKVILSHLVSN 6950
6951
FELK 6962
>AP004890.1
$F CYP51H4 = old CYP51A9
(japonica cultivar-group) chr 2
78968
MDHIFSTNAWLTIALVFIITLAAKVVRSSVTLPAEKTSKPRPPPEAKGAPLV 79123
79124
GIIPAVLRRGLQAVIREQHRALGSVFTLRSLGLAVTFLVGPECSDHFFHAPELEIAID 79297
79298
GLYEVTVPIFGKEVGYDIDLDTRNEQHRFFAKMLRPAKLRGHVLPMVCEIEX 79450
79551
YFGKWGECGVVDLMQEVDHVLMLIASRCLLGKEVRENMFDEVASLFHELMGGMHLISMFF 79730
79731
PYLPTPGHRRRDKARAKLGEIFSQIVKTRKMSGRVEDDMLQDLIDSTYGDGRATT 79895
79896
DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 80075
80076
GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 80255
80256
ASPLVVNTLLPNIYKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAY 80435
80436
QQIKVILSHLVSNFELKLESPFPETEDMLSMRPKGKAIVSYKRRTLS 80576
#404
>aaaa01023253.1
CYP51H5 = old CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chromosome
2
3179
YFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 3000
2999
FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 2820
2819
AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 2640
2639
MTTLTHCIKEALRLHP 2592
2584
LLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYIYKDPNVYDPSRFGPGR 2414
2412
EEDKVGGKFSYTPFSAGRHVCLGEDFAYMPN*GDMEPFAQGNFDLELISPFPEEEWEKFI 2233
2232
PGPKGKVMVTYKRRRL 2185
>AP004090.1
$F CYP51H5 = old CYP51A10
chr 2 clone OJ1399_H05 49% to 51A2
AQ843111.1
nbxb0005D03r CUGI Rice BAC genomic cloneLength = 507 49% to 51A2
78158
MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH
77972
SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805
77804
IKPINLRGHVDSMVHEVE 77751 (0)
76666
GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484
76483
FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 76304
76303
AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124
76123
MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944
75943
YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764
75763
ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665
#404
>aaaa01024682.1
CYP51H5 = old CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chr 2
Nterm join
with AAAA01023253.1 see this accession for ortholog
1522
LSMAVLFVATKMIQQRPRTLYLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVI 1701
1702
HDLHSRLGSVFTVSVFGLKKVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLY 1869
1870
DVDLATRSRQISFCTDSIKPINLRGHVDSMVHEVE 1974
#246
>aaaa01008685.1
CYP51H6 = old CYP51A11 (indica cultivar-group) orth of AC108875.1a $F chr 5
100%
7539
DGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVRKHGII 7360
7359
NGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCVPAGHT 7180
7179
MASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGENYA 7009
7008
YMQIKAIWSHLLRNF 6964
>AC108875.1a
$F CYP51H6 = old CYP51A11
chr 5 51% to 51A2 same as AQ050946 AQ687182 AQ258479
58% to
AP004090 this might require subfamilies in CYP51
70310
MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489
70490
LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669
70670
EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE 70813 (0?)
71263
DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV 71439
71440
FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)
71910
YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083
72084
KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV 72263
72264
PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443
72444
YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL 72593
#276
>aaaa01010435.1 CYP51H7 = old CYP51A12 (indica cultivar-group) orth of AC108875.1b $F 99% chr 5 similar to 51A2
1984
WGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSVFFPYTP 2163
2164
LIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRATTEA 2334
2335
*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGRITD 2514
2515
DRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIASP 2688
2689
IVISNQVPYIYMDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 2865
2866
AIWSHLLRNF 2895
>AC108875.1b
$F CYP51H7 = old CYP51A12
chromosome 5 48% to 51A2
80009
MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176
80177
ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI 80356
80357
AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)
84741
DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917
84918
FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097
85098
TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277
85278
ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457
85458
PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637
85638
AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766
#276
>aaaa01067145.1 CYP51H7 = old CYP51A12 (indica cultivar-group) orth of AC108875.1b $F chromosome 5 1 diff see aaaa01010435.1 for ortholog
27
GRTGCVGEGYAYMQIKAIWSHLLRNFELR*LSPLPKSDFTKFVPEPHGELMVSYKRRQL 203
#140
>aaaa01004091.1 CYP51H8 = old CYP51A13 (indica cultivar-group) orth of AC108875.1c $F chr 5 similar to 51A2
12032
GSVIFPYIPIPSHIRRDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLIDSKHRDGSS 12208
12209
TTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQKHGDHIDYN 12388
12389
VLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLLSP 12550
12551
MIFNNRLPYIYKDPHMYDLDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIKV 12727
12728
IWSHLLRNF 12754
>AC108875.1c
$F CYP51H8 = old CYP51A13
chromosome 5 50% to 51A2
122577
MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735
122736
LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915
122916
VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE 123050 (0?)
123296
DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF 123436
123437
HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616
123617
SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796
123797
HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976
123977
SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156
124157
VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285
#181
>aaaa01005681.1a
CYP51H9 = old CYP51A14 (indica cultivar-group) orth AP003866.1a $F chr 7
>99%
2692
EQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLISLC 2853
2854
FPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYR 3003
3004
DGRAMSDNEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIG 3168
3169
DDRVDYDALTTGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVRTREGKEYRMPAGHS 3342
3343
VVSYAAFNHRLGYVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLK 3522
3523
MKVIWSYLLRNFELELVSPFPEVEL 3597
>AP003866.1a
$F CYP51H9 = old CYP51A14
chr 7 clone OJ1092_A07 53% to 51A2
AQ326645
and AQ291927 mid to K-helix region 52% identical to wheat CYP51
60%
identical to AQ327456 68% to EST T88278 705 family
AQ689048.1
nbxb0078H10r CUGI Rice BAC genomic clone Length = 737
AQ396185.2
nbxb0066K16r CUGI Rice BAC genomic cloneLength = 327
50920
MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL
51082
PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)
GGFYSRPE 51261
51262
SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)
52114
EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299
52300
SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479
52480
NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT 52653
52654
TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833
52834
YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013
53014
ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124
no
japonica ortholog found 9/11/02
#10
>aaaa01000238.1f $FI CYP71C12 (indica cultivar-group)
AP003909.1a 99%
also
aaaa01079567.1 (98%)
44400
MAEMLDGLRHDEQASLHAPQKASTMPTMSCSDLLLAMMCPLILLLIIFRCYAYATRSGGM 44221
44220
LSRVPSPPGRLPVIGHMHLISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQA 44041
44040
ILRTHDRVFASRPYNTIADILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQT 43861
43860
RQQEVRLVMAKIVEEAATHMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEI 43681
43680
NSSLLGGFNLEDYFPSLARLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDN 43501
43500
NDEESDFIDVLLSIQQEYGLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAK 43321
43320
LQAEVRGVVPKGQEVVTEEQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTI 43141
43140
PSGTRVIVNAWAIARDPSYWENAEEFIPERFLGNTMAGYNGNNFNFLPFGTGRRICPGMN 42961
42960
FAIAAIEVMLASLVYRFDWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 42796
>AP003909.1a $F CYP71C12 chromosome 8 clone OJ1300_E01
55% to 71C4
orth
aaaa01000238.1f
50394
MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD
50298
LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH 50161
50160
LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981
49980
DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801
49800
HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621
49620
RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441
49440
GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261
49260
EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS 49081
49080
YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910
48909
RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790
#10
part
>aaaa01079567.1
CYP71C12 (indica cultivar-group) orth AP003909.1a $F chr 8
99% 98% to
aaaa01000238.1f $FI see aaaa01000238.1f for ortholog
672
DQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEYGLTKDNIKANLVVM 511
510
FEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTEEQLGRMPYLKAVI 334
333
KETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPSYWENAEEFMPERF 154
153
LSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVYRFDWKL 4
#11
>aaaa01000238.1g
$PI CYP71C13P (indica cultivar-group) end of clone poor quality seq
allowing
frameshifts (fs) and deletions this seq 95% to AP003909.1b
(plus
strand)
46070
MAQMLGALLLFQDSQMSTMTRMSYSLLLPILCPLILLLLFRCYAYATRSGGL 46225
46226
LDKLPSPPGRLPLIGHMHLIGSFPHMSLRDLATKHGPDLMLLHLGTVPTLVVSSSRMAQV 46405
46406
ILRTHDRVFASRQQSAIT 46459 gap (frameshift) XILF (deletion and fs)
46485
YGDYWRQIKKIVTTNLLTI (fs) KKIRSYSQT (fs) RQQE (fs) VRL (fs) VM (fs)
AKI*EATTHMAV
46628 (deletion)
(minus
strand)
49427
LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNEX 49320
49320
ESDFIDVLLSIQQEYGLTKDNIKANLAIMFEAGTDTSFIELEYAMAELMQKPQMIAKLQA 49141
49140
EVRGVVSKGQEIVTEEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTTPSG 48961
48960
TRVIVNAWAIAR (fs) DPSY*ENAEEF (fs)
XQRFLSNTMADYNGNNFNFLPFWTGRRICPGINFA 48787
48786
ITTIEIMLASLVYRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 48658
>AP003909.1b
$P CYP71C13P chromosome 8 clone OJ1300_E01, 4 in frame stops pseudogene
orth
aaaa01000238.1g note this seq is out of order in this gene cluster
54948
MAQMLGALLLFQDSLMSTMTRMSY
54876
SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742
54741
HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574
54573
TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394
54393
THMA IDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214
54213
ARLPVRRL LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE 54037
54036
YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857
53856
EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677
53676
SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506
53505
YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389
#9
>aaaa01000238.1e
$FI
CYP71C14 (indica cultivar-group) AP003909.1c 99%
MAVMLVPIPLLLLHQHHNHEHEH
40499
PSPVAPQPTMASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPVIGHL 40326
40325
HLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRTYSAV 40146
40145
TDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARINEAAV 39966
39965
ARTTVDLSELLNWFTNDIVCHAVSGKFFREEGRNQMFWELIQANSLLLSGFNLEDYFPNL 39786
39785
ARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLSIQHE 39606
39605
YGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQEIVT 39426
39425
EEQLGRMPYLKAVIKETLRLHLAGPLLVPHLSIAECDIEGYTIPSGTRVFVNAWALSRDP 39246
39245
SFWENAEEFIPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRF 39066
39065
DWEIPADQAAKGGIDMTEAFGLTVHRKEKLLLVPRLTQD* 38946
>AP003909.1c
$F CYP71C14 chromosome
8 clone OJ1300_E01
orth
aaaa01000238.1e
same as
AP004462.1 152574-152287 region
58316
MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM
58217
ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086
58085
IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906
57905
YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726
57725
EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY 57546
57545
FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366
57365
IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186
57185
EIVTEEQLGRMPY 57153 frameshift
57147
LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF 56968
56967
IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797
56796
DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692
#8
>aaaa01000238.1c
$FI CYP71C15 (indica
cultivar-group) AP003909.1d 99%
25643
LLLPVALLLLLLRFARATTLAGDRNSELLLSKLPSPPLRLPVIGHMHLVGSLPHVSLRD 25467
25466
LAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRAMVPDIISYGATDSC 25287
25286
YGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEVRLVIAKLRGAAAMAGAPVDMTELL 25107
25106
HSFANDLICRAVSGKFFREEGRNKLFRELIDTNASLLGGFNLEDYFPSLARTKLLSKVIC 24927
24926
VRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQDSDFIDILLYHQEEYGFTRDNIKAI 24747
24746
LVX 24741
24592
MFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEIVNEDNIVDMVYLKAVI 24413
24412
KETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPERF 24233
24232
MDSNIDFKGHDFHYLPFGSG*RMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKEEDI 24059
24058
DMTEVFGLTVHRKEKLFLVP 23999
>AP003909.1d
$F CYP71C15 chromosome
8 clone OJ1300_E01
orth
aaaa01000238.1c
AQ868830.1
nbeb0032E11f CUGI Rice BAC genomicLength = 759 57% to 76C5
same as
AP004462.1 139663-140091 region
68223
MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387
68388
PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH 68567
68568
DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747
68748
RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927
68928
LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD 69107
69108
SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)
69331
DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510
69511
IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690
69691
FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE 69858
69859
EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939
#7
>aaaa01000238.1b
$FI CYP71C16 (indica
cultivar-group) AP003909.1e 100%
14489
LLPLALLFYFARAAISSRDSKTRELILSKLPSPPFKLPVIGHMHLIGPLPYVSLRDLAA 14313
14312
KHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRSMVTDIIMYGALDSCFAP 14133
14132
YSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVMARLRGAAAAAAAVDLSQTLQFFA 13953
13952
NDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFNLEAYFPGLARMPLISKLICARAI 13773
13772
RIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVLLSLQDEYGFTRDHIKAISIX 13608
13134
MFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAVI 12955
12954
KETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPERF 12775
12774
MDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKKE 12598
12597
DIDMTDVFGLAIHRKEKLFLVPQI 12526
>AP003909.1e
$F CYP71C16 chromosome
8 clone OJ1300_E01
orth
aaaa01000238.1b
same as
AP004462.1 128584-129021 region
78935
MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111
79112
LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291
79292
ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM 79471
79472
ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651
79652
LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828
79829
LSLQDEYGFTRDHIKAISI 79885 (0)
80359
DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV 80538
80539
IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718
80719
FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895
80896
EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982
#6
>aaaa01000238.1a
$FI CYP71C17 (indica
cultivar-group) orth of AP003909.1f
2 diffs
N-terminal Met not identified
MVVQLMLFFHDKFMAPMAEEPLPF
3340
VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 3161
3160
RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 2981
2980
ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 2801
2800
LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 2621
2620
VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 2441
2440
QEYNLTRHNIHAILM (0) 2396
2206
DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 2036
2035
KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 1856
1855
MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 1676
1675
DDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 1580
#6
>AP003909.1f
$F CYP71C17 chromosome
8 clone OJ1300_E01
AK067200 2151 bp mRNA linear PLN 24-JUL-2003
Oryza
sativa (japonica cultivar-group) cDNA clone:J013097P19, full
insert
sequence.
orth
aaaa01000238.1a
AZ127316.1
OSJNBb0086E03f CUGI Rice BAC genomic Length = 498 54% to 71A14
AQ871024.1
nbeb0042C09f CUGI Rice BAC genomic Length = 495 56% to 71B23
same as
AP004462.1 147428-146820 region
63826
MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII
63733
LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV 63581
63580
SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401
63400
SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221
63220
ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR 63041
63040
RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861
62860
RQQEYNLTRHNIHAILM 62810 (0)
62626
DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453
62452
VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER 62273
62272
FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093
62092
KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994
#6
duplicate
>aaaa01000238.1d
$FI CYP71C17 (indica
cultivar-group) AP003909.1f 99%
this seq
100% identical to aaaa01000238.1a, probably an error in assembly
only count
this gene once see aaaa01000238.1a for ortholog
N-terminal
Met not identified
MVVQLMLFFHDKFMAPMAEEPLPF
30181
VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 30360
30361
RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 30540
30541
ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 30720
30721
LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 30900
30901
VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 31080
31081
QEYNLTRHNIHAILM (0) 31137
31309
DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 31485
31486
KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 31665
31666 MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGM