Rice P450 sequences

Dec. 29, 2003 D. Nelson

 

#n are the numbers assigned to ortholog pairs or unique sequences.  489 numbers were given out; 31 of these were combined and 4 were not from rice.  Therefore, there are 454 unique rice sequences.  Fragments get the same number as their parents.  Entries are ordered by CYP name.

Three sequences (aaaa01039155.1, aaaa01093055.1, aaaa01067419.1) are probable fungal P450 contaminants.  One sequence (aaaa01062516.1) is a probable insect P450 contaminant.  These are not counted in the total.

CYP names have now been assigned to all 454 sequences.

27 sequences are partial, and they may join to make a smaller number of genes.

This will probably reduce the gene count by 4, to 450 genes and pseudogenes.
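
The counts quoted above are mutually consistent. A minimal sketch of the bookkeeping follows; the variable names are illustrative, and treating the 4 non-rice entries as the three fungal and one insect contaminants listed above is an assumption.

```python
# Bookkeeping for the counts quoted in the notes above.
# Variable names are illustrative; the numbers come from the text.
numbers_assigned = 489        # "#n" identifiers given out
combined = 31                 # identifiers merged into other entries
not_rice = 4                  # assumed: the 3 fungal + 1 insect contaminants
partial_join_reduction = 4    # expected drop once partial sequences are joined

unique_rice_sequences = numbers_assigned - combined - not_rice
expected_genes = unique_rice_sequences - partial_join_reduction

print(unique_rice_sequences)  # 454
print(expected_genes)         # 450
```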

 

#300

>aaaa01012243.1 $FI CYP51G1 = old CYP51A5 Indica rice genome CYP51 New April 24, 2002

ortholog of AB025047 99%

5108 MTLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 4926

4925 IREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQEVYKFNVPTFGPGVVF 4746

4745 DVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAE 4641

2774 EYFSKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSALFHDLDNGMQPVSV 2598

2597 IFPYLPIPAHRRRDRARQRLKEIFATIIKSRKASGQAEEDMLQCFIDSKYKSGRSTTEGE 2418

2417 ITGLLIAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVLYR 2223

2222 CIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFKNPDS 2043

2042 YDPDRYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEFELVSPF 1863

1862 PETNWKAMVVGIKDEVMVNFKRRKLVVDN* 1773
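
The coordinate-flanked lines in the entry above can be checked mechanically. The sketch below assumes that the flanking integers are nucleotide positions on the contig and that each symbol (including a stop `*`) corresponds to 3 nucleotides; both are inferences from the layout, not stated in the notes. The function names are hypothetical. A line that fails the check deserves a second look but is not necessarily wrong.

```python
import re

# Assumed layout of a segment line: "<start> <residues> <end>", where the
# flanking integers are nucleotide coordinates on the genomic contig.
LINE_RE = re.compile(r"^\s*(\d+)\s+([A-Z*]+)\s+(\d+)\s*$")

def parse_segment(line):
    """Return (start, end, residues, strand), or None if the line does not match."""
    m = LINE_RE.match(line)
    if m is None:
        return None
    start, residues, end = int(m.group(1)), m.group(2), int(m.group(3))
    # Descending coordinates are read here as a minus-strand hit.
    strand = "+" if end >= start else "-"
    return start, end, residues, strand

def span_matches_length(start, end, residues):
    """True when the coordinate span equals 3 nt per symbol ('*' included)."""
    return abs(end - start) + 1 == 3 * len(residues)

seg = parse_segment(
    "5108 MTLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 4926"
)
start, end, residues, strand = seg
print(strand, len(residues), span_matches_length(start, end, residues))
```

For the first segment of aaaa01012243.1 this reports a minus-strand segment of 61 residues whose span (183 nt) matches its length.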

 

>AB025047 CYP51G1 = old CYP51A5 rice (partial)   80% to 51A2 missing N-term 64 aa

BE040549.1 OE08G10 OE Oryza sativa cDNA 5' Length = 255 I-helix CYP51

BE230288.1 99AS641 Rice Seedling cDNA clone 99AS641. Length = 586

BE230302.1 99AS655 Rice Seedling cDNA clone 99AS655. Length = 627

BE607441.1 OE202C10 OE cDNA clone ID707 C-term CYP51 Length = 428

REEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQE

VYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYFSKWGE

SGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIFPYLPI

PAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGEITGLL

IAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVL

YRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFK

NPDSYDPDPYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEF

ELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN

 

#300

>aaaa01066056.1 CYP51G1 = old CYP51A5 (indica cultivar-group) = aaaa01012243.1 $FI Indica rice genome CYP51

 591 DPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 761

 762 IREEYARLGSVFTVPILRRKITFLI 836
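
A fragment such as the one above can be compared with the matching stretch of its parent position by position. This is a minimal sketch of an ungapped identity calculation, not the alignment method behind the percentages quoted in these notes; the function name is hypothetical.

```python
def ungapped_identity(a, b):
    """Fraction of identical positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length for an ungapped comparison")
    if not a:
        raise ValueError("sequences must not be empty")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return matches / len(a)

# Segment at coordinates 762-836 of aaaa01066056.1, and the start of the
# corresponding line (4925...) of aaaa01012243.1, copied from the entries above.
fragment = "IREEYARLGSVFTVPILRRKITFLI"
parent_line = "IREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQEVYKFNVPTFGPGVVF"
parent = parent_line[:len(fragment)]

identity = ungapped_identity(fragment, parent)
# 1-based positions where the two sequences differ.
mismatches = [(i + 1, x, y) for i, (x, y) in enumerate(zip(fragment, parent)) if x != y]
print(f"{identity:.2%}", mismatches)
```

Over these 25 positions the two sequences differ at a single residue (R versus S at position 18).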

 

#346

>aaaa01014709.1 CYP51G3 = old CYP51A15 (indica cultivar-group) 49% to 51A2

602 MDLTTGTIWLFLAQ

560 LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381

380 MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201

200 HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117

 

no japonica ortholog found 9/11/02

 

#418

>aaaa01028263.1 CYP51G3 = old CYP51A15 (indica cultivar-group) 73% to AP003866.1

8   GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187

188 LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364

365 FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526

    VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*

 

no japonica ortholog found 9/12/02

 

All three fragments of CYP51G3 = old CYP51A15 (#418, #453, #346) are joined below; joining them reduces the gene count by 2.

602 MDLTTGTIWLFLAQ

560 LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381

380 MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201

200 HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117

(0) GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV

    STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH

    DDMLQCLIDARYKDGRATTETEVAGMLVAALFA

8   GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187

188 LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364

365 FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526

    VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*

 

#453

>aaaa01065204.1 CYP51G3 = old CYP51A15 (indica cultivar-group) exon 3

ortholog of AY022669.1 searched Genbank for extensions

(0) GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV

STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH

DDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHT

 

>AY022669.1 CYP51G3 = old CYP51A15 (partial)   microsatellite MRG4994 containing (CCG)X8, Length = 224

82% to CYP51 pseudogene above

222 PRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAA 1

 

>AK107185.1 CYP51G3 = old CYP51A15 (japonica cultivar-group) cDNA clone:002-124-H08

AC135914.2 genomic seq

    MDLTTGAIWLFLAQLFVAATMLSKIATRERTRTTGTKFSRPPPPPLARGAPLVGVLPSLLANGPVEFIRH 182

183 HYEKMGSVFTVSLLQQKVTFLVGSEASSHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVD 362

363 YATRHEQFRFFGDIMKPAKLRTYVDLMVAEVE (0)

    GYFARWGQSGTVNMKQEFEQLVTLIASR 542

543 CLLGEEVRDKMFDEVSTLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVR 722

723 SRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHTSSSTST 902

903 WAGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKETLRLHPPALML 1082

1083 LRHARRSFVVRGGSGEREYEVPEGHTVASPLLLHNALPRVYRDPGEFDPGRFGAGRE 1253

1254 EGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQLVSPFPETDWTVVMPGPK 1433

1434 GKVMVTYNRRKLT* 1475

 

#182

>aaaa01005681.1b $PI CYP51G4P = old CYP51A16P (indica cultivar-group) ortholog of AP003866.1b

4595 VRFLHRKVTFLVGPEESSHFFTGLDAEISQDEVSRFIIPTFGS*VAFDA 4741

5197 GYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 5295

6670 VVTPIATRCLFGEVRSKMLGEVSTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARLGE 6849

6850 IFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG

6975 AEVAGMLVSALLAGQYTSSSTSTWTG 7052 frameshift

7055 ARLLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLML 7234

7235 LRHARRSFVVRARGSGDAEYEVPAGHTVAS 7324

     PMVIHNALPHVY 7359

7360 EDAGSFDPGRFGPAREEYRAYAADHAYTVFGGGRHACVGE 7479 frameshift

7482 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVTVGFSVQL 7655

 

>AP003866.1b $P CYP51G4P = old CYP51A16P chromosome 7 clone OJ1092_A07

No obvious N-terminal, two in frame stops, three frameshifts = Pseudogene

82% to AY022669.1; seems to be a CYP51 pseudogene

54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)

54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa

56119 VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292

56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift

56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift

56510 LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698

56699 RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878

56879 DHAYTVFGGGRHACVGE 56929 frameshift

56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108
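
The pseudogene call in the note above ("two in frame stops, three frameshifts") can be tallied directly from the annotated lines. The sketch below is illustrative only: it assumes each segment line has the shape `<start> <residues> <end> [annotation]`, treats a `*` at the very end of the joined translation as the normal terminal stop, and uses a hypothetical function name.

```python
import re

# Lines copied from the AP003866.1b entry above; trailing words are the
# source's own annotations.
ENTRY = """\
54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)
54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa
56119 VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292
56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift
56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift
56510 LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698
56699 RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878
56879 DHAYTVFGGGRHACVGE 56929 frameshift
56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108
"""

SEGMENT_RE = re.compile(r"^\s*\d+\s+([A-Z*]+)\s+\d+\s*(.*)$")

def summarize(entry_text):
    """Return (internal stop count, frameshift annotation count)."""
    residues, frameshifts = [], 0
    for line in entry_text.splitlines():
        m = SEGMENT_RE.match(line)
        if m is None:
            continue
        residues.append(m.group(1))
        frameshifts += m.group(2).count("frameshift")
    protein = "".join(residues)
    # A '*' at the very end is treated as the normal terminal stop.
    internal = protein[:-1] if protein.endswith("*") else protein
    return internal.count("*"), frameshifts

internal_stops, frameshifts = summarize(ENTRY)
print(internal_stops, frameshifts)
```

For this entry the tally is 2 internal stops and 3 frameshift annotations, in agreement with the curator's note.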

 

#110

>aaaa01003099.1b CYP51H1 = old CYP51A6 (indica cultivar-group)  Nterm aa 4-160

ortholog to AP005448.1b 100%

10626 VTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 10793

10794 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 10973

10974 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11117

 

>aaaa01003099.1c CYP51H1 = old CYP51A6 (indica cultivar-group)  Nterm aa 61-160

ortholog to AP005448.1b 100%; these two (1c and 1d) are duplicates, only count once

11261 DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11437

11438 APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11617

 

>aaaa01003099.1d CYP51H1 = old CYP51A6 (indica cultivar-group)  Nterm aa 61-160

ortholog to AP005448.1b 100%; these two (1c and 1d) are duplicates, only count once

11761 DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11937

11938 APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 12117

 

>aaaa01003099.1e CYP51H1 = old CYP51A6 (indica cultivar-group) nearly complete gene, runs off end

ortholog to AP005448.1b $F 99% plus one frameshifted region

21127 LQKRKISSPAAAAPPVVRGAGLVRLRARHGEGRAAGGDPRAAGEAGERVTAIAPF 20963

20962 GLFKVTFLIGPEVSSHFYLAAESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWD 20783

20782 VLKPRSIEARVGAMAEEVQ 20726 (0?)

18574 NYFSRWGEQGTVDLKKELERVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 18401

18400 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTAGNGDDVLQRLIDGRYKD 18236

18235 ERALTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLAAVIAEQDRLMASRARTD 18056

18055 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 17876

17875 LSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 17696

17695 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRR 17570

 

>AP005448.1b $F CYP51H1 = old CYP51A6 (japonica cultivar-group) chromosome 7 21 June 2002

100% to AP005188.2c

32724 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 32900

32901 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 33080

33081 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 33224

35381 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 35554

35555 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 35719

35720 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 35899

35900 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 36079

36080 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 36259

36260 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 36403

 

>AP005188.2c $F CYP51H1 = old CYP51A6 (japonica cultivar-group) chr 7 orth to aaaa01003099.1e 99%

55155 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 55331

55332 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 55511

55512 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ

57812 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 57985

57986 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 58150

58151 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 58330

58331 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 58510

58511 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 58690

58691 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 58834

 

note: sequences aaaa01003099.1b to e are all probably from a single gene
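
Because several records can describe one gene, as the note above says, a count of genes should group records by CYP name rather than by accession. The sketch below assumes a header shape of `>accession [optional $FLAG] CYPname ...`; the `$` flags are captured but not interpreted, since this section does not define them. The headers are shortened copies of those above, and the function name is hypothetical.

```python
import re
from collections import defaultdict

# Headers shortened from the CYP51H1 entries above (text after the
# cultivar group omitted).
HEADERS = [
    ">aaaa01003099.1b CYP51H1 = old CYP51A6 (indica cultivar-group)",
    ">aaaa01003099.1c CYP51H1 = old CYP51A6 (indica cultivar-group)",
    ">aaaa01003099.1d CYP51H1 = old CYP51A6 (indica cultivar-group)",
    ">aaaa01003099.1e CYP51H1 = old CYP51A6 (indica cultivar-group)",
    ">AP005448.1b $F CYP51H1 = old CYP51A6 (japonica cultivar-group)",
    ">AP005188.2c $F CYP51H1 = old CYP51A6 (japonica cultivar-group)",
]

# Assumed header shape: ">accession [optional $FLAG] CYPname ..."
HEADER_RE = re.compile(r"^>(\S+)\s+(?:\$(\w+)\s+)?(CYP\w+)")

def group_by_name(headers):
    """Map each CYP name to the list of accessions that carry it."""
    groups = defaultdict(list)
    for header in headers:
        m = HEADER_RE.match(header)
        if m:
            accession, _flag, name = m.groups()
            groups[name].append(accession)
    return dict(groups)

groups = group_by_name(HEADERS)
print(len(groups), groups["CYP51H1"])
```

All six records collapse to the single name CYP51H1, so they contribute one gene to the count.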

 

#109

>aaaa01003099.1a CYP51H2P = old CYP51A7P (indica cultivar-group)  Nterm aa 4-94

ortholog of AP005448.1a 100%

7681 TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 7857

7858 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 7962

 

>AP005188.2b $P CYP51H2P = old CYP51A7P (japonica cultivar-group) chr 7 N-term fragment

orth to aaaa01003099.1a 100% after frameshift

52199 MDHLTSS (frameshift)

      TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 52375

52376 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 52480

 

>AP005448.1a CYP51H2P = old CYP51A7P (japonica cultivar-group) chromosome 7 21 June 2002

29768 TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 29944

29945 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 30049

 

#63

>aaaa01001626.1 $FI CYP51H3 = old CYP51A8 (indica cultivar-group) Cterm ONE FRAMESHIFT

ortholog to AP005188.2a 98%

22316 MQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAPPPPVVQGVGLVRFV

      RAMARDGPLEAIREQQAKLGSVFTASAPLGTFLIGSEVSSHFYVAPDSEISMGRLY

      EFTVPIFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE (0) 22795

23040 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 23153 (FS)

23156 VPGKLCELFGELDNGLHLISGLLPYLPIPAH

23249 RRRDRARQRLGEIITEVIRSRRNSSRGAAGTDENNDDMLQCLINSRYKDGCAMTDAE 23419

23420 TAGLVVALMFAGKHTSSGVSIWTGVHLLSNPNHLAAVVAEQDRLMASCPGRTDDYHRLD 23596

23597 YDTVQEMRSLHCCVKEALRLHPPVAAVSQAYKHFTVQTKEGKEYTIPGGHMVVSTILVNH 23776

23777 YLPHIYKDPHVFDPQRFAPGREEEKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLL 23956

23957 SNFEIKMVSPFLETEWSTVIPEPKGKVMVSYRRRTAPK* 24073

 

>AP005188.2a $F CYP51H3 = old CYP51A8 (japonica cultivar-group) chr 7

12878 MDQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAAPPPVVQGVGLVRFVRAMAR 13063

13064 DGPLEAIREQQAKLGSVFTASAPLGLFKVTFLIGSEVSSHFYVASYSEISMGRLYEFTVP 13243

13244 IFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE 13372

13617 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 13730

13727 ESVPGKLCELFGELDNGLHLISGLLPYLPIPAHRRRDRARQRLGEIITEVIRLRRNSSR 13903

13904 GAAGTDENNDDMLQCLINSRYKDGCAMTDAEIAGLVVALMFAGKHTSSGVSIWTGVHLLS 14083

14084 NPNHLAAVVAEQDRLMASCPGRTDDYHRLDYDTVQEMRSLHCCVKEALRLHPPVAAVRQA 14263

14264 YKHFTVQTKEDKEYTIPGGHMVVSTILVNHYLPHIYK

      DPHVFDPQRFAPGREEDKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLLSN 14540

14541 FEIKMVSPFPET 14576 (frameshift)

      QWSTVIPEPKGKVMVSYRRRTAPK* 14649

 

Note: this cluster continues on AP005188.2b and AP005188.2c

 

#256

>aaaa01009323.1 CYP51H4 = old CYP51A9 (indica cultivar-group) 55% to AP005448.1b $F

orth of AP004890.1

6368 DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 6547

6548 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 6727

6771 YKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAYQQIKVILSHLVSN 6950

6951 FELK 6962

 

>AP004890.1 $F CYP51H4 = old CYP51A9 (japonica cultivar-group) chr 2

78968 MDHIFSTNAWLTIALVFIITLAAKVVRSSVTLPAEKTSKPRPPPEAKGAPLV 79123

79124 GIIPAVLRRGLQAVIREQHRALGSVFTLRSLGLAVTFLVGPECSDHFFHAPELEIAID 79297

79298 GLYEVTVPIFGKEVGYDIDLDTRNEQHRFFAKMLRPAKLRGHVLPMVCEIEX 79450

79551 YFGKWGECGVVDLMQEVDHVLMLIASRCLLGKEVRENMFDEVASLFHELMGGMHLISMFF 79730

79731 PYLPTPGHRRRDKARAKLGEIFSQIVKTRKMSGRVEDDMLQDLIDSTYGDGRATT 79895

79896 DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 80075

80076 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 80255

80256 ASPLVVNTLLPNIYKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAY 80435

80436 QQIKVILSHLVSNFELKLESPFPETEDMLSMRPKGKAIVSYKRRTLS 80576

 

#404

>aaaa01023253.1 CYP51H5 = old CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chromosome 2

3179 YFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 3000

2999 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 2820

2819 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 2640

2639 MTTLTHCIKEALRLHP 2592

2584 LLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYIYKDPNVYDPSRFGPGR 2414

2412 EEDKVGGKFSYTPFSAGRHVCLGEDFAYMPN*GDMEPFAQGNFDLELISPFPEEEWEKFI 2233

2232 PGPKGKVMVTYKRRRL 2185

 

>AP004090.1 $F CYP51H5 = old CYP51A10 chr 2 clone OJ1399_H05 49% to 51A2

AQ843111.1 nbxb0005D03r CUGI Rice BAC genomic clone Length = 507 49% to 51A2

78158 MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH

77972 SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805

77804 IKPINLRGHVDSMVHEVE 77751 (0)

76666 GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484

76483 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 76304

76303 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124

76123 MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944

75943 YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764

75763 ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665

 

#404

>aaaa01024682.1 CYP51H5 = old CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chr 2

N-term; joins with aaaa01023253.1; see that accession for ortholog

1522 LSMAVLFVATKMIQQRPRTLYLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVI 1701

1702 HDLHSRLGSVFTVSVFGLKKVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLY 1869

1870 DVDLATRSRQISFCTDSIKPINLRGHVDSMVHEVE 1974

 

#246

>aaaa01008685.1 CYP51H6 = old CYP51A11 (indica cultivar-group) orth of AC108875.1a $F chr 5 100%

7539 DGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVRKHGII 7360

7359 NGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCVPAGHT 7180

7179 MASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGENYA 7009

7008 YMQIKAIWSHLLRNF 6964

 

>AC108875.1a $F CYP51H6 = old CYP51A11 chr 5 51% to 51A2 same as AQ050946 AQ687182 AQ258479

58% to AP004090; this might require subfamilies in CYP51

70310 MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489

70490 LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669

70670 EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE 70813 (0?)

71263 DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV 71439

71440 FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)

71910 YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083

72084 KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV 72263

72264 PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443

72444 YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL 72593

 

#276

>aaaa01010435.1 CYP51H7 = old CYP51A12 (indica cultivar-group) orth of AC108875.1b $F 99% chr 5 similar to 51A2

1984 WGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSVFFPYTP 2163

2164 LIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRATTEA 2334

2335 *VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGRITD 2514

2515 DRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIASP 2688

2689 IVISNQVPYIYMDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 2865

2866 AIWSHLLRNF 2895

 

>AC108875.1b $F CYP51H7 = old CYP51A12 chromosome 5 48% to 51A2

80009 MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176

80177 ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI 80356

80357 AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)

84741 DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917

84918 FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097

85098 TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277

85278 ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457

85458 PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637

85638 AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766

 

#276

>aaaa01067145.1 CYP51H7 = old CYP51A12 (indica cultivar-group) orth of AC108875.1b $F chromosome 5 1 diff see aaaa01010435.1 for ortholog

27  GRTGCVGEGYAYMQIKAIWSHLLRNFELR*LSPLPKSDFTKFVPEPHGELMVSYKRRQL 203

 

#140

>aaaa01004091.1 CYP51H8 = old CYP51A13 (indica cultivar-group) orth of AC108875.1c $F chr 5 similar to 51A2

12032 GSVIFPYIPIPSHIRRDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLIDSKHRDGSS 12208

12209 TTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQKHGDHIDYN 12388

12389 VLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLLSP 12550

12551 MIFNNRLPYIYKDPHMYDLDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIKV 12727

12728 IWSHLLRNF 12754

 

>AC108875.1c $F CYP51H8 = old CYP51A13 chromosome 5 50% to 51A2

122577 MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735

122736 LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915

122916 VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE 123050 (0?)

123296 DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF 123436

123437 HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616

123617 SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796

123797 HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976

123977 SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156

124157 VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285

 

#181

>aaaa01005681.1a CYP51H9 = old CYP51A14 (indica cultivar-group) orth AP003866.1a $F chr 7 >99%

2692 EQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLISLC 2853

2854 FPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYR 3003

3004 DGRAMSDNEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIG 3168

3169 DDRVDYDALTTGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVRTREGKEYRMPAGHS 3342

3343 VVSYAAFNHRLGYVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLK 3522

3523 MKVIWSYLLRNFELELVSPFPEVEL 3597

 

>AP003866.1a $F CYP51H9 = old CYP51A14 chr 7 clone OJ1092_A07 53% to 51A2

AQ326645 and AQ291927 mid to K-helix region 52% identical to wheat CYP51

60% identical to AQ327456 68% to EST T88278 705 family

AQ689048.1 nbxb0078H10r CUGI Rice BAC genomic clone Length = 737

AQ396185.2 nbxb0066K16r CUGI Rice BAC genomic clone Length = 327

50920 MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL

51082 PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)

      GGFYSRPE 51261

51262 SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)

52114 EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299

52300 SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479

52480 NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT 52653

52654 TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833

52834 YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013

53014 ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124

 

no japonica ortholog found 9/11/02

 

#10

>aaaa01000238.1f $FI CYP71C12 (indica cultivar-group) AP003909.1a 99%

also aaaa01079567.1 (98%)

44400 MAEMLDGLRHDEQASLHAPQKASTMPTMSCSDLLLAMMCPLILLLIIFRCYAYATRSGGM 44221

44220 LSRVPSPPGRLPVIGHMHLISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQA 44041

44040 ILRTHDRVFASRPYNTIADILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQT 43861

43860 RQQEVRLVMAKIVEEAATHMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEI 43681

43680 NSSLLGGFNLEDYFPSLARLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDN 43501

43500 NDEESDFIDVLLSIQQEYGLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAK 43321

43320 LQAEVRGVVPKGQEVVTEEQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTI 43141

43140 PSGTRVIVNAWAIARDPSYWENAEEFIPERFLGNTMAGYNGNNFNFLPFGTGRRICPGMN 42961

42960 FAIAAIEVMLASLVYRFDWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 42796

 

>AP003909.1a $F CYP71C12 chromosome 8 clone OJ1300_E01 55% to 71C4

orth aaaa01000238.1f

50394 MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD

50298 LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH 50161

50160 LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981

49980 DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801

49800 HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621

49620 RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441

49440 GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261

49260 EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS 49081

49080 YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910

48909 RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790

 

#10 part

>aaaa01079567.1 CYP71C12 (indica cultivar-group) orth AP003909.1a $F chr 8

99% to AP003909.1a; 98% to aaaa01000238.1f $FI; see aaaa01000238.1f for ortholog

672 DQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEYGLTKDNIKANLVVM 511

510 FEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTEEQLGRMPYLKAVI 334

333 KETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPSYWENAEEFMPERF 154

153 LSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVYRFDWKL 4

 

#11

>aaaa01000238.1g $PI CYP71C13P (indica cultivar-group) end of clone poor quality seq

allowing frameshifts (fs) and deletions this seq 95% to AP003909.1b

(plus strand)

46070 MAQMLGALLLFQDSQMSTMTRMSYSLLLPILCPLILLLLFRCYAYATRSGGL 46225

46226 LDKLPSPPGRLPLIGHMHLIGSFPHMSLRDLATKHGPDLMLLHLGTVPTLVVSSSRMAQV 46405

46406 ILRTHDRVFASRQQSAIT 46459 gap (frameshift) XILF (deletion and fs)

46485 YGDYWRQIKKIVTTNLLTI (fs) KKIRSYSQT (fs) RQQE (fs) VRL (fs) VM (fs)

      AKI*EATTHMAV 46628 (deletion)

(minus strand)

49427 LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNEX 49320

49320 ESDFIDVLLSIQQEYGLTKDNIKANLAIMFEAGTDTSFIELEYAMAELMQKPQMIAKLQA 49141

49140 EVRGVVSKGQEIVTEEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTTPSG 48961

48960 TRVIVNAWAIAR (fs) DPSY*ENAEEF (fs)

      XQRFLSNTMADYNGNNFNFLPFWTGRRICPGINFA 48787

48786 ITTIEIMLASLVYRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 48658

 

>AP003909.1b $P CYP71C13P chromosome 8 clone OJ1300_E01, 4 in frame stops pseudogene

orth aaaa01000238.1g note this seq is out of order in this gene cluster

54948 MAQMLGALLLFQDSLMSTMTRMSY

54876 SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742

54741 HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574

54573 TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394

54393 THMA IDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214

54213 ARLPVRRL LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE 54037

54036 YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857

53856 EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677

53676 SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506

53505 YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389

 

#9

>aaaa01000238.1e $FI CYP71C14 (indica cultivar-group) AP003909.1c 99%

      MAVMLVPIPLLLLHQHHNHEHEH

40499 PSPVAPQPTMASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPVIGHL 40326

40325 HLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRTYSAV 40146

40145 TDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARINEAAV 39966

39965 ARTTVDLSELLNWFTNDIVCHAVSGKFFREEGRNQMFWELIQANSLLLSGFNLEDYFPNL 39786

39785 ARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLSIQHE 39606

39605 YGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQEIVT 39426

39425 EEQLGRMPYLKAVIKETLRLHLAGPLLVPHLSIAECDIEGYTIPSGTRVFVNAWALSRDP 39246

39245 SFWENAEEFIPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRF 39066

39065 DWEIPADQAAKGGIDMTEAFGLTVHRKEKLLLVPRLTQD* 38946

 

>AP003909.1c $F CYP71C14 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1e

same as AP004462.1 152574-152287 region

58316 MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM

58217 ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086

58085 IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906

57905 YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726

57725 EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY 57546

57545 FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366

57365 IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186

57185 EIVTEEQLGRMPY 57153 frameshift

57147 LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF 56968

56967 IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797

56796 DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692

 

#8

>aaaa01000238.1c $FI CYP71C15 (indica cultivar-group) AP003909.1d 99%

25643 LLLPVALLLLLLRFARATTLAGDRNSELLLSKLPSPPLRLPVIGHMHLVGSLPHVSLRD 25467

25466 LAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRAMVPDIISYGATDSC 25287

25286 YGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEVRLVIAKLRGAAAMAGAPVDMTELL 25107

25106 HSFANDLICRAVSGKFFREEGRNKLFRELIDTNASLLGGFNLEDYFPSLARTKLLSKVIC 24927

24926 VRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQDSDFIDILLYHQEEYGFTRDNIKAI 24747

24746 LVX 24741

24592 MFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEIVNEDNIVDMVYLKAVI 24413

24412 KETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPERF 24233

24232 MDSNIDFKGHDFHYLPFGSG*RMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKEEDI 24059

24058 DMTEVFGLTVHRKEKLFLVP 23999

 

>AP003909.1d $F CYP71C15 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1c

AQ868830.1 nbeb0032E11f CUGI Rice BAC genomic Length = 759 57% to 76C5

same as AP004462.1 139663-140091 region

68223 MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387

68388 PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH 68567

68568 DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747

68748 RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927

68928 LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD 69107

69108 SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)

69331 DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510

69511 IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690

69691 FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE 69858

69859 EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939

 

#7

>aaaa01000238.1b $FI CYP71C16 (indica cultivar-group) AP003909.1e 100%

14489 LLPLALLFYFARAAISSRDSKTRELILSKLPSPPFKLPVIGHMHLIGPLPYVSLRDLAA 14313

14312 KHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRSMVTDIIMYGALDSCFAP 14133

14132 YSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVMARLRGAAAAAAAVDLSQTLQFFA 13953

13952 NDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFNLEAYFPGLARMPLISKLICARAI 13773

13772 RIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVLLSLQDEYGFTRDHIKAISIX 13608

13134 MFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAVI 12955

12954 KETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPERF 12775

12774 MDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKKE 12598

12597 DIDMTDVFGLAIHRKEKLFLVPQI 12526

 

>AP003909.1e $F CYP71C16 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1b

same as AP004462.1 128584-129021 region

78935 MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111

79112 LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291

79292 ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM 79471

79472 ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651

79652 LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828

79829 LSLQDEYGFTRDHIKAISI 79885 (0)

80359 DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV 80538

80539 IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718

80719 FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895

80896 EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982

 

#6

>aaaa01000238.1a $FI CYP71C17 (indica cultivar-group) orth of AP003909.1f

2 diffs; N-terminal Met not identified

     MVVQLMLFFHDKFMAPMAEEPLPF

3340 VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 3161

3160 RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 2981

2980 ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 2801

2800 LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 2621

2620 VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 2441

2440 QEYNLTRHNIHAILM (0) 2396

2206 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 2036

2035 KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 1856

1855 MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 1676

1675 DDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 1580

 

#6

>AP003909.1f $F CYP71C17 chromosome 8 clone OJ1300_E01

AK067200 2151 bp mRNA linear PLN 24-JUL-2003

Oryza sativa (japonica cultivar-group) cDNA clone:J013097P19, full insert sequence.

orth aaaa01000238.1a

AZ127316.1 OSJNBb0086E03f CUGI Rice BAC genomic Length = 498 54% to 71A14

AQ871024.1 nbeb0042C09f CUGI Rice BAC genomic Length = 495 56% to 71B23

same as AP004462.1 147428-146820 region

63826 MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII

63733 LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV 63581

63580 SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401

63400 SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221

63220 ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR 63041

63040 RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861

62860 RQQEYNLTRHNIHAILM 62810 (0)

62626 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453

62452 VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER 62273

62272 FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093

62092 KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994

 

#6 duplicate

>aaaa01000238.1d $FI CYP71C17 (indica cultivar-group) AP003909.1f 99%

this seq 100% identical to aaaa01000238.1a, probably an error in assembly

only count this gene once see aaaa01000238.1a for ortholog

N-terminal Met not identified

      MVVQLMLFFHDKFMAPMAEEPLPF

30181 VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 30360

30361 RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 30540

30541 ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 30720

30721 LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 30900

30901 VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 31080

31081 QEYNLTRHNIHAILM (0) 31137

31309 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 31485

31486 KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 31665

31666 MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGM