Rice P450 sequences

Sept. 25, 2002 D. Nelson

 

Note there are 957 sequence entries here (571 are indica, 386 are japonica). Some are duplicates.  #n are numbers for the ortholog pairs or unique sequences.  489 numbers were given out, 27 of these were combined and 4 were not from rice.  Therefore, there are 458 unique rice sequences.  Fragments get the same number as parents.  Order is by CYP name.

Three sequences aaaa01039155.1, aaaa01093055.1, aaaa01067419.1 are probable fungal P450 contaminants.  One seq aaaa01062516.1 is a probable insect P450 contaminant.  These are not counted in the total. 

CYP names have now been assigned to all 458 sequences.

 

#300

>aaaa01012243.1 $FI CYP51A5 Indica rice genome CYP51 New April 24, 2002

ortholog of AB025047 99%

5108 MTLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 4926

4925 IREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQEVYKFNVPTFGPGVVF 4746

4745 DVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAE 4641

2774 EYFSKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSALFHDLDNGMQPVSV 2598

2597 IFPYLPIPAHRRRDRARQRLKEIFATIIKSRKASGQAEEDMLQCFIDSKYKSGRSTTEGE 2418

2417 ITGLLIAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVLYR 2223

2222 CIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFKNPDS 2043

2042 YDPDRYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEFELVSPF 1863

1862 PETNWKAMVVGIKDEVMVNFKRRKLVVDN* 1773

 

>AB025047 CYP51A5 rice (partial)   80% to 51A2 missing N-term 64 aa

BE040549.1 OE08G10 OE Oryza sativa cDNA 5' Length = 255 I-helix CYP51

BE230288.1 99AS641 Rice Seedling cDNA clone 99AS641.Length = 586

BE230302.1 99AS655 Rice Seedling cDNA clone 99AS655.Length = 627

BE607441.1 OE202C10 OE cDNA clone ID707 C-term CYP51 Length = 428

REEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAEMSQQE

VYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYFSKWGE

SGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIFPYLPI

PAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGEITGLL

IAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILAEMDVL

YRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRLPHIFK

NPDSYDPDPYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLLRNFEF

ELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN

 

#300

>aaaa01066056.1 CYP51A5 (indica cultivar-group) = aaaa01012243.1 $FI Indica rice genome CYP51

 591 DPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIPAAPLVGGLLRFMRGPIPM 761

 762 IREEYARLGSVFTVPILRRKITFLI 836

 

#110

>aaaa01003099.1b CYP51A6 (indica cultivar-group)  Nterm aa 4-160

ortholog to AP005448.1b 100%

10626 VTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 10793

10794 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 10973

10974 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11117

 

>aaaa01003099.1c CYP51A6 (indica cultivar-group)  Nterm aa 61-160

ortholog to AP005448.1b 100% these two are duplicates only count once

11261 DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11437

11438 APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 11617

 

>aaaa01003099.1d CYP51A6 (indica cultivar-group)  Nterm aa 61-160

ortholog to AP005448.1b 100% these two are duplicates only count once

11761 DGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYL 11937

11938 APESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 12117

 

>aaaa01003099.1e CYP51A6 (indica cultivar-group) nearly  gene, runs off end

ortholog to AP005448.1b $F 99% plus one frameshifted region

21127 LQKRKISSPAAAAPPVVRGAGLVRLRARHGEGRAAGGDPRAAGEAGERVTAIAPF 20963

20962 GLFKVTFLIGPEVSSHFYLAAESEMGQGSIYRFTVPLFGPEVGYAVDPDTRAEQMRLFWD 20783

20782 VLKPRSIEARVGAMAEEVQ 20726 (0?)

18574 NYFSRWGEQGTVDLKKELERVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 18401

18400 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTAGNGDDVLQRLIDGRYKD 18236

18235 ERALTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLAAVIAEQDRLMASRARTD 18056

18055 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 17876

17875 LSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 17696

17695 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRR 17570

 

>AP005448.1b $F CYP51A6 (japonica cultivar-group) chromosome 7 21 June 2002

100% to AP005188.2c

32724 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 32900

32901 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 33080

33081 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ 33224

35381 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 35554

35555 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 35719

35720 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 35899

35900 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 36079

36080 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 36259

36260 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 36403

 

>AP005188.2c $F CYP51A6 (japonica cultivar-group) chr 7 orth to aaaa01003099.1e 99%

55155 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 55331

55332 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 55511

55512 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ

57812 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 57985

57986 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 58150

58151 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 58330

58331 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 58510

58511 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 58690

58691 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 58834

 

note: sequences aaaa01003099.1b to e are all probably from a single gene

 

#109

>aaaa01003099.1a CYP51A7P (indica cultivar-group)  Nterm aa 4-94

ortholog of AP005448.1a 100%

7681 TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 7857

7858 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 7962

 

>AP005188.2b $P CYP51A7P (japonica cultivar-group) chr 7 N-term fragment

orth to aaaa01003099.1a 100% after frameshift

52199 MDHLTSS (frameshift)

      TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 52375

52376 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 52480

 

>AP005448.1a CYP51A7P (japonica cultivar-group) chromosome 7 21 June 2002

29768 TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 29944

29945 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 30049

 

#63

>aaaa01001626.1 $FI CYP51A8 (indica cultivar-group) Cterm ONE FRAMESHIFT

ortholog to AP005188.2a 98%

22316 MQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAPPPPVVQGVGLVRFV

      RAMARDGPLEAIREQQAKLGSVFTASAPLGTFLIGSEVSSHFYVAPDSEISMGRLY

      EFTVPIFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE (0) 22795

23040 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 23153 (FS)

23156 VPGKLCELFGELDNGLHLISGLLPYLPIPAH

23249 RRRDRARQRLGEIITEVIRSRRNSSRGAAGTDENNDDMLQCLINSRYKDGCAMTDAE 23419

23420 TAGLVVALMFAGKHTSSGVSIWTGVHLLSNPNHLAAVVAEQDRLMASCPGRTDDYHRLD 23596

23597 YDTVQEMRSLHCCVKEALRLHPPVAAVSQAYKHFTVQTKEGKEYTIPGGHMVVSTILVNH 23776

23777 YLPHIYKDPHVFDPQRFAPGREEEKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLL 23956

23957 SNFEIKMVSPFLETEWSTVIPEPKGKVMVSYRRRTAPK* 24073

 

>AP005188.2a $F CYP51A8 (japonica cultivar-group) chr 7

12878 MDQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAAPPPVVQGVGLVRFVRAMAR 13063

13064 DGPLEAIREQQAKLGSVFTASAPLGLFKVTFLIGSEVSSHFYVASYSEISMGRLYEFTVP 13243

13244 IFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE 13372

13617 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 13730

13727 ESVPGKLCELFGELDNGLHLISGLLPYLPIPAHRRRDRARQRLGEIITEVIRLRRNSSR 13903

13904 GAAGTDENNDDMLQCLINSRYKDGCAMTDAEIAGLVVALMFAGKHTSSGVSIWTGVHLLS 14083

14084 NPNHLAAVVAEQDRLMASCPGRTDDYHRLDYDTVQEMRSLHCCVKEALRLHPPVAAVRQA 14263

14264 YKHFTVQTKEDKEYTIPGGHMVVSTILVNHYLPHIYK

      DPHVFDPQRFAPGREEDKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLLSN 14540

14541 FEIKMVSPFPET 14576 (frameshift)

      QWSTVIPEPKGKVMVSYRRRTAPK* 14649

 

Note this cluster continues on AP005188.2b and 2c

 

#256

>aaaa01009323.1 CYP51A9 (indica cultivar-group) 55% to AP005448.1b $F

orth of AP004890.1

6368 DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 6547

6548 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 6727

6771 YKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAYQQIKVILSHLVSN 6950

6951 FELK 6962

 

>AP004890.1 $F CYP51A9 (japonica cultivar-group) chr 2

78968 MDHIFSTNAWLTIALVFIITLAAKVVRSSVTLPAEKTSKPRPPPEAKGAPLV 79123

79124 GIIPAVLRRGLQAVIREQHRALGSVFTLRSLGLAVTFLVGPECSDHFFHAPELEIAID 79297

79298 GLYEVTVPIFGKEVGYDIDLDTRNEQHRFFAKMLRPAKLRGHVLPMVCEIEX 79450

79551 YFGKWGECGVVDLMQEVDHVLMLIASRCLLGKEVRENMFDEVASLFHELMGGMHLISMFF 79730

79731 PYLPTPGHRRRDKARAKLGEIFSQIVKTRKMSGRVEDDMLQDLIDSTYGDGRATT 79895

79896 DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 80075

80076 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 80255

80256 ASPLVVNTLLPNIYKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAY 80435

80436 QQIKVILSHLVSNFELKLESPFPETEDMLSMRPKGKAIVSYKRRTLS 80576

 

#404

>aaaa01023253.1 CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chromosome 2

3179 YFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 3000

2999 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 2820

2819 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 2640

2639 MTTLTHCIKEALRLHP 2592

2584 LLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYIYKDPNVYDPSRFGPGR 2414

2412 EEDKVGGKFSYTPFSAGRHVCLGEDFAYMPN*GDMEPFAQGNFDLELISPFPEEEWEKFI 2233

2232 PGPKGKVMVTYKRRRL 2185

 

>AP004090.1 $F CYP51A10 chr 2 clone OJ1399_H05 49% to 51A2

AQ843111.1 nbxb0005D03r CUGI Rice BAC genomic cloneLength = 507 49% to 51A2

78158 MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH

77972 SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805

77804 IKPINLRGHVDSMVHEVE 77751 (0)

76666 GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484

76483 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 76304

76303 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124

76123 MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944

75943 YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764

75763 ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665

 

#404

>aaaa01024682.1 CYP51A10 (indica cultivar-group) orth of AP004090.1 $F chr 2

Nterm join with AAAA01023253.1 see this accession for ortholog

1522 LSMAVLFVATKMIQQRPRTLYLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVI 1701

1702 HDLHSRLGSVFTVSVFGLKKVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLY 1869

1870 DVDLATRSRQISFCTDSIKPINLRGHVDSMVHEVE 1974

 

#246

>aaaa01008685.1 CYP51A11 (indica cultivar-group) orth of AC108875.1a $F chr 5 100%

7539 DGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVRKHGII 7360

7359 NGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCVPAGHT 7180

7179 MASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGENYA 7009

7008 YMQIKAIWSHLLRNF 6964

 

>AC108875.1a $F CYP51A11 chr 5 51% to 51A2 same as AQ050946 AQ687182 AQ258479

58% to AP004090 this might require subfamilies in CYP51

70310 MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489

70490 LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669

70670 EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE 70813 (0?)

71263 DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV 71439

71440 FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)

71910 YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083

72084 KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV 72263

72264 PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443

72444 YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL 72593

 

#276

>aaaa01010435.1 CYP51A12 (indica cultivar-group) orth of AC108875.1b $F 99% chr 5 similar to 51A2

1984 WGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSVFFPYTP 2163

2164 LIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRATTEA 2334

2335 *VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGRITD 2514

2515 DRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIASP 2688

2689 IVISNQVPYIYMDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 2865

2866 AIWSHLLRNF 2895

 

>AC108875.1b $F CYP51A12 chromosome 5 48% to 51A2

80009 MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176

80177 ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI 80356

80357 AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)

84741 DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917

84918 FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097

85098 TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277

85278 ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457

85458 PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637

85638 AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766

 

#276

>aaaa01067145.1 CYP51A12 (indica cultivar-group) orth of AC108875.1b $F chromosome 5 1 diff see aaaa01010435.1 for ortholog

27  GRTGCVGEGYAYMQIKAIWSHLLRNFELR*LSPLPKSDFTKFVPEPHGELMVSYKRRQL 203

 

#140

>aaaa01004091.1 CYP51A13 (indica cultivar-group) orth of AC108875.1c $F chr 5 similar to 51A2

12032 GSVIFPYIPIPSHIRRDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLIDSKHRDGSS 12208

12209 TTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQKHGDHIDYN 12388

12389 VLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLLSP 12550

12551 MIFNNRLPYIYKDPHMYDLDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIKV 12727

12728 IWSHLLRNF 12754

 

>AC108875.1c $F CYP51A13 chromosome 5 50% to 51A2

122577 MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735

122736 LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915

122916 VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE 123050 (0?)

123296 DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF 123436

123437 HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616

123617 SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796

123797 HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976

123977 SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156

124157 VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285

 

#181

>aaaa01005681.1a CYP51A14 (indica cultivar-group) orth AP003866.1a $F chr 7 >99%

2692 EQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLISLC 2853

2854 FPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYR 3003

3004 DGRAMSDNEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIG 3168

3169 DDRVDYDALTTGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVRTREGKEYRMPAGHS 3342

3343 VVSYAAFNHRLGYVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLK 3522

3523 MKVIWSYLLRNFELELVSPFPEVEL 3597

 

>AP003866.1a $F CYP51A14 chr 7 clone OJ1092_A07 53% to 51A2

AQ326645 and AQ291927 mid to K-helix region 52% identical to wheat CYP51

60% identical to AQ327456 68% to EST T88278 705 family

AQ689048.1 nbxb0078H10r CUGI Rice BAC genomic clone Length = 737

AQ396185.2 nbxb0066K16r CUGI Rice BAC genomic cloneLength = 327

50920 MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL

51082 PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)

      GGFYSRPE 51261

51262 SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)

52114 EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299

52300 SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479

52480 NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT 52653

52654 TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833

52834 YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013

53014 ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124

 

no japonica ortholog found 9/11/02

 

#346

>aaaa01014709.1 CYP51A15 (indica cultivar-group) 49% to 51A2

602 MDLTTGTIWLFLAQ

560 LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381

380 MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201

200 HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117

 

no japonica ortholog found 9/11/02

 

#418

>aaaa01028263.1 CYP51A15 (indica cultivar-group) 73% to AP003866.1

8   GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187

188 LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364

365 FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526

    VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*

 

no japonica ortholog found 9/12/02

 

#418

>aaaa01028263.1 CYP51A15 (indica cultivar-group) 73% to AP003866.1

8   GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187

188 LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364

365 FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526

    VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*

 

no japonica ortholog found 9/12/02

 

all three fragments CYP51A15 #418, #453, #346 joined reduce gene count by 2

602 MDLTTGTIWLFLAQ

560 LFIIATILSKIATRERTRTTSTKISRPPPPPMARGAPLVGVLPSLLAKGPVAFIRHHYEK 381

380 MGSVFTVSLLQQKVTFLVGSEAASHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVDYATR 201

200 HEQFRFFGDIMKPAKLRTYVDLMVAEVE 117

(0) GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV

    STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH

    DDMLQCLIDARYKDGRATTETEVAGMLVAALFA

8   GQHTSSSTSTWTGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKET 187

188 LRLHPPALMLLRHARRSFVVHSEDSGGGEREYEVPEGHTMASPLLLHNALPRVYRDPGE 364

365 FDPGRFGAGREEGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQL 526

    VSPFPETDWTVVMPGPKGKVMVTYKRRKLT*

 

#453

>aaaa01065204.1 CYP51A15 (indica cultivar-group) exon 3

ortholog of AY022669.1 searched Genbank for extensions

(0) GYFARWGQSGTVNMKQEFEQLVTLIASRCLLGEEVRDKMFDEV

STLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSVGGGGGAPH

DDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHT

 

>AY022669.1 CYP51A15 (partial)   microsatellite MRG4994 containing (CCG)X8, Length = 224

82% to CYP51 pseudogene above

222 PRLPIPAHRRRDRARARLGEIFSDIVRSRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAA 1

 

#182

>aaaa01005681.1b $PI CYP51A16P (indica cultivar-group) ortholog of AP003866.1b

4595 VRFLHRKVTFLVGPEESSHFFTGLDAEISQDEVSRFIIPTFGS*VAFDA 4741

5197 GYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 5295

6670 VVTPIATRCLFGEVRSKMLGEVSTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARLGE 6849

6850 IFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG

6975 AEVAGMLVSALLAGQYTSSSTSTWTG 7052 frameshift

7055 ARLLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLML 7234

7235 LRHARRSFVVRARGSGDAEYEVPAGHTVAS 7324

     PMVIHNALPHVY 7359

7360 EDAGSFDPGRFGPAREEYRAYAADHAYTVFGGGRHACVGE 7479 frameshift

7482 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVTVGFSVQL 7655

 

>AP003866.1b $P CYP51A16P chromosome 7 clone OJ1092_A07

No obvious N-terminal, two in frame stops, three frameshifts = Pseudogene

82%  to AY022669.1 seems to be a CYP51 pseudogene

54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)

54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa

56119 VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292

56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift

56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift

56510 LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698

56699 RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878

56879 DHAYTVFGGGRHACVGE 56929 frameshift

56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108

 

#10

>aaaa01000238.1f $FI CYP71C12 (indica cultivar-group) AP003909.1a 99%

also aaaa01079567.1 (98%)

44400 MAEMLDGLRHDEQASLHAPQKASTMPTMSCSDLLLAMMCPLILLLIIFRCYAYATRSGGM 44221

44220 LSRVPSPPGRLPVIGHMHLISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQA 44041

44040 ILRTHDRVFASRPYNTIADILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQT 43861

43860 RQQEVRLVMAKIVEEAATHMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEI 43681

43680 NSSLLGGFNLEDYFPSLARLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDN 43501

43500 NDEESDFIDVLLSIQQEYGLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAK 43321

43320 LQAEVRGVVPKGQEVVTEEQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTI 43141

43140 PSGTRVIVNAWAIARDPSYWENAEEFIPERFLGNTMAGYNGNNFNFLPFGTGRRICPGMN 42961

42960 FAIAAIEVMLASLVYRFDWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 42796

 

>AP003909.1a $F CYP71C12 chromosome 8 clone OJ1300_E01 55% to 71C4

orth aaaa01000238.1f

50394 MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD

50298 LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH 50161

50160 LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981

49980 DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801

49800 HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621

49620 RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441

49440 GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261

49260 EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS 49081

49080 YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910

48909 RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790

 

#10 part

>aaaa01079567.1 CYP71C12 (indica cultivar-group) orth AP003909.1a $F chr 8

99% 98% to aaaa01000238.1f $FI see aaaa01000238.1f for ortholog

672 DQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEYGLTKDNIKANLVVM 511

510 FEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTEEQLGRMPYLKAVI 334

333 KETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPSYWENAEEFMPERF 154

153 LSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVYRFDWKL 4

 

#11

>aaaa01000238.1g $PI CYP71C13P (indica cultivar-group) end of clone poor quality seq

allowing frameshifts (fs) and deletions this seq 95% to AP003909.1b

(plus strand)

46070 MAQMLGALLLFQDSQMSTMTRMSYSLLLPILCPLILLLLFRCYAYATRSGGL 46225

46226 LDKLPSPPGRLPLIGHMHLIGSFPHMSLRDLATKHGPDLMLLHLGTVPTLVVSSSRMAQV 46405

46406 ILRTHDRVFASRQQSAIT 46459 gap (frameshift) XILF (deletion and fs)

46485 YGDYWRQIKKIVTTNLLTI (fs) KKIRSYSQT (fs) RQQE (fs) VRL (fs) VM (fs)

      AKI*EATTHMAV 46628 (deletion)

(minus strand)

49427 LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNEX 49320

49320 ESDFIDVLLSIQQEYGLTKDNIKANLAIMFEAGTDTSFIELEYAMAELMQKPQMIAKLQA 49141

49140 EVRGVVSKGQEIVTEEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTTPSG 48961

48960 TRVIVNAWAIAR (fs) DPSY*ENAEEF (fs)

      XQRFLSNTMADYNGNNFNFLPFWTGRRICPGINFA 48787

48786 ITTIEIMLASLVYRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 48658

 

>AP003909.1b $P CYP71C13P chromosome 8 clone OJ1300_E01, 4 in frame stops pseudogene

orth aaaa01000238.1g note this seq is out of order in this gene cluster

54948 MAQMLGALLLFQDSLMSTMTRMSY

54876 SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742

54741 HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574

54573 TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394

54393 THMA IDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214

54213 ARLPVRRL LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE 54037

54036 YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857

53856 EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677

53676 SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506

53505 YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389

 

#9

>aaaa01000238.1e $FI CYP71C14 (indica cultivar-group) AP003909.1c 99%

      MAVMLVPIPLLLLHQHHNHEHEH

40499 PSPVAPQPTMASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPVIGHL 40326

40325 HLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRTYSAV 40146

40145 TDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARINEAAV 39966

39965 ARTTVDLSELLNWFTNDIVCHAVSGKFFREEGRNQMFWELIQANSLLLSGFNLEDYFPNL 39786

39785 ARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLSIQHE 39606

39605 YGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQEIVT 39426

39425 EEQLGRMPYLKAVIKETLRLHLAGPLLVPHLSIAECDIEGYTIPSGTRVFVNAWALSRDP 39246

39245 SFWENAEEFIPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRF 39066

39065 DWEIPADQAAKGGIDMTEAFGLTVHRKEKLLLVPRLTQD* 38946

 

>AP003909.1c $F CYP71C14 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1e

same as AP004462.1 152574-152287 region

58316 MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM

58217 ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086

58085 IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906

57905 YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726

57725 EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY 57546

57545 FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366

57365 IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186

57185 EIVTEEQLGRMPY 57153 frameshift

57147 LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF 56968

56967 IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797

56796 DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692

 

#8

>aaaa01000238.1c $FI CYP71C15 (indica cultivar-group) AP003909.1d 99%

25643 LLLPVALLLLLLRFARATTLAGDRNSELLLSKLPSPPLRLPVIGHMHLVGSLPHVSLRD 25467

25466 LAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRAMVPDIISYGATDSC 25287

25286 YGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEVRLVIAKLRGAAAMAGAPVDMTELL 25107

25106 HSFANDLICRAVSGKFFREEGRNKLFRELIDTNASLLGGFNLEDYFPSLARTKLLSKVIC 24927

24926 VRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQDSDFIDILLYHQEEYGFTRDNIKAI 24747

24746 LVX 24741

24592 MFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEIVNEDNIVDMVYLKAVI 24413

24412 KETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPERF 24233

24232 MDSNIDFKGHDFHYLPFGSG*RMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKEEDI 24059

24058 DMTEVFGLTVHRKEKLFLVP 23999

 

>AP003909.1d $F CYP71C15 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1c

AQ868830.1 nbeb0032E11f CUGI Rice BAC genomicLength = 759 57% to 76C5

same as AP004462.1 139663-140091 region

68223 MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387

68388 PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH 68567

68568 DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747

68748 RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927

68928 LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD 69107

69108 SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)

69331 DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510

69511 IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690

69691 FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE 69858

69859 EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939

 

#7

>aaaa01000238.1b $FI CYP71C16 (indica cultivar-group) AP003909.1e 100%

14489 LLPLALLFYFARAAISSRDSKTRELILSKLPSPPFKLPVIGHMHLIGPLPYVSLRDLAA 14313

14312 KHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAFASRPRSMVTDIIMYGALDSCFAP 14133

14132 YSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVMARLRGAAAAAAAVDLSQTLQFFA 13953

13952 NDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFNLEAYFPGLARMPLISKLICARAI 13773

13772 RIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVLLSLQDEYGFTRDHIKAISIX 13608

13134 MFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAVI 12955

12954 KETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPERF 12775

12774 MDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKKE 12598

12597 DIDMTDVFGLAIHRKEKLFLVPQI 12526

 

>AP003909.1e $F CYP71C16 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1b

same as AP004462.1 128584-129021 region

78935 MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111

79112 LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291

79292 ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM 79471

79472 ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651

79652 LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828

79829 LSLQDEYGFTRDHIKAISI 79885 (0)

80359 DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV 80538

80539 IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718

80719 FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895

80896 EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982

 

#6

>aaaa01000238.1a $FI CYP71C17 (indica cultivar-group) orth of AP003909.1f

2 diffs N-terminal Met not identified

     MVVQLMLFFHDKFMAPMAEEPLPF

3340 VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 3161

3160 RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 2981

2980 ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 2801

2800 LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 2621

2620 VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 2441

2440 QEYNLTRHNIHAILM (0) 2396

2206 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 2036

2035 KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 1856

1855 MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 1676

1675 DDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 1580

 

#6

>AP003909.1f $F CYP71C17 chromosome 8 clone OJ1300_E01

orth aaaa01000238.1a

AZ127316.1 OSJNBb0086E03f CUGI Rice BAC genomic Length = 498 54% to 71A14

AQ871024.1 nbeb0042C09f CUGI Rice BAC genomic Length = 495 56% to 71B23

same as AP004462.1 147428-146820 region

63826 MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII

63733 LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV 63581

63580 SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401

63400 SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221

63220 ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR 63041

63040 RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861

62860 RQQEYNLTRHNIHAILM 62810 (0)

62626 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453

62452 VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER 62273

62272 FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093

62092 KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994

 

#6 duplicate

>aaaa01000238.1d $FI CYP71C17 (indica cultivar-group) AP003909.1f 99%

this seq 100% identical to aaaa01000238.1a, probably an error in assembly

only count this gene once see aaaa01000238.1a for ortholog

N-terminal Met not identified

      MVVQLMLFFHDKFMAPMAEEPLPF

30181 VLIMIIILLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHVSL 30360

30361 RDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGSSN 30540

30541 ISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMREL 30720

30721 LGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLRRV 30900

30901 VSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLDRQ 31080

31081 QEYNLTRHNIHAILM (0) 31137

31309 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTKMVSEDDLNNMPYLKAVV 31485

31486 KETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDARCWENSEEFMPERF 31665

31666 MDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMDK 31845

31846 DDVDMTDQFALTMARKEKLYLIP RSHVIKIT* 31941

 

#105

>AP004232.1 $F CYP71C18 chromosome 1 clone OSJNBa0051H17 like CYP71C4

90% to aaaa01002989.1 version 4 in Genbank does not allow for

frameshift and skips beginning of heme signature

probably not an ortholog no 99% match in indica 9/5/02

56483 MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 56650

56651 PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815

56816 THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995

56996 E 56998 (0)

57929 VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105

58106 VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285

58286 DQDEMDFVDVLLLQERGITRDHLKAIL 58366 (0)

58462 DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641

58642 VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821

58822 RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift

58899 RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078

59079 VEYKGSVQDSAVIL* 59123

 

#447

>aaaa01059480.1 CYP71C19 (indica cultivar-group) orth of AP004233.1 100%

602 VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKANSV 426

425 LLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMSKQQCEHDEGNDQ 246

245 DEMNFVNVLLLQEQGITREHLKAIL (0)

81  DMYQAGTETSSVVLVFAMAELMQKPHL 1

 

>AP004233.1 $F CYP71C19 chromosome 1 clone OSJNBa0065J17 50% to CYP71C4

= AQ857130 duplicate of the AP004232 gene at 27203-29726

probably not ortholog only 91%

21862 MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP

21694 PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR 21518

21517 THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344

21343 E 21335 (0)

      VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN

      SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS

      KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAIL (0)

20004 DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825

19824 VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645

19644 RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA 19465

19464 IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339

 

#135

>aaaa01003879.1 $FI CYP71C20 (indica cultivar-group) ortholog to AP004757.1a 97%

4645 MAQMLAAFLLDDLISHEHGHESLGAPPQAGTMAWYSLVLMTSLLFPLLVLLVMRCYVTRS 4824

4825 GAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRNLATKHSPDMMLLHLGAVPTLVVSSSR 5004

5005 VAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNEYWWQIKKITTTHLLTVKKVRS 5184

5185 YVSARQREVRIVIARITEAASKHEVVDLTEMLSCYSNNIVCHVVCGKFS*KEGWNQLLRK 5364

5365 LVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKAHNINKRWDQLLEKLIDDHTTKHI 5544

5545 RSSSMLNHYDEEAGFIDVLLSIQHEYGLTKDNIKANLAAMLMAGTDTSFIELEYAMAELM 5724

5725 QKPHVMGKLQAEVRRVMPKGQDIVTEEQLGCMPYLKAVIKETLRLYPPAPLLMPHLSMSD 5904

5905 CNINGYTIPSGTRVIVNVWALARDSNYWENADEFIPERFIVNTLGDYNGNNFHFLSFGSG 6084

6085 RRIYPGINFAIATIEIMLANLVYRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPH 6264

6265 LHLR 6276

 

>AP004757.1a $F CYP71C20 52% to AF321858 Lolium rigidum 70% to AP003909

chromosome 6 clone P0652D10

      MAQMLAAFLLDGLISHEHGHESLGAPPQAGTMAWYSLVLMTS

79980 LLFPLLVLLVMRCYVTRSGAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRDLATKH 79810

79809 SPDMMLLHLGAVPTLVVSSSRVAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNE 79630

79629 YWRQIKKITTTHLLTMKKVRSYVSARQREVRIVMARITEAASKHVVVDLTEMLSCYSNN 79453

79452 IVCHAVCGKFSLKEGWNQLLRELVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKA 79276

79275 HNINKRWDQLLEKLIDDHTTKHIRSSSMLNHYDEEAGFIDVLLSIQHEYGLTK 79117

79116 DNIKANLAAMLMAGMDTSFIELEYAMAELMQKPHVMGKLQAEVRRVMPKGQDIVTEEQLG 78937

78936 CMPYLKAVIKETLRLHPPAPLLMPHLSISDCNINGYTIPSGTRVIVNVWALARDSN 78769

78768 YWENADEFIPERFIVNTLGDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLV 78601

78600 YRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPHLHLR* 78472

 

#104

>aaaa01002989.1 $FI CYP71C21 (indica cultivar-group) 91% to AP004233.1

6020 MEQAAGLVYQLFQQEMFPWTFSVLALFPFLLL frameshift

     SLHYLATNNRTPTTCKETKNHHPPPPSPPRLPIIGHLHLIGDLLHVSLRELA 5770

5769 HRYGPDLMLLHLGQVP (?)

5107 NLIVSSPRAAEAVLRTHDLVFVSRPYSLIADILLYGPSDIGLSPYGEQWRQSRRIVTTHL 4928

4927 LTNKKVRSYRVAREEE 4871 (0?)

4088 VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGDDGRNKLFRQLFKANS 3909

3908 VLLAGFNLEDYYPSLARLKAVSRVMCAKARKTRKLWDELLDKIIDDRMSKQQCEHDRGND 3729

3728 DQDEMDFVDVLLLQERGITREHLKAIL 3648 (0?)

3552 DMFQAGTETTSVVLVFAMAELMHKPHLMAKLQAELRTNISKQGQELLTECDLTNMTYLNA 3373

3372 VIKETLRLHPPTPLLLPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLSE 3193

3192 RFVDGGSAANVDLTGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVPAEAA 3013

3012 IDKAGIDMAEAFGLSVQLEEKLLLVPIEYKDSM 2914

 

#488

>aaaa01000238.1 CYP71C22P

Duplicated end of exon 1 from CYP71C17 = AP003909.1g

ESDFVDILLDHQQEYNLTRHNIHAILM 32576

 

#488

>AP003909.1g $P CYP2C22P chromosome 8 clone OJ1300_E01 lone pseudogene fragment

identical to Duplicated end of exon 1 on aaaa01000238.1a

61469 EQESDFVDILLDHQQEYNLTRHNIHAILM 61383

 

#488

>aaaa01000238.1a $FI CYP71C22P (indica cultivar-group) orth of AP003909.1g

Duplicated end of exon 1 same as AP003930.1g

1052 EQESDFVDILLDHQQEYNLTRHNIHAILM 966

 

#136

>aaaa01006247.1 CYP71C23P (indica cultivar-group) orth of AP004757.1b 2 diffs

2898 FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLTVHRKQKLLLVSWLPQD 3065

 

>AP004757.1b $P CYP71C23P chr 6 Pseudogene fragment last exon similar to AP003909

103765 FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLAVHRKEKLLLVSWLPQD* 103595

 

#28

>aaaa01000733.1 $FI CYP71E4 (indica cultivar-group) 99% to AC092559.2

5765 MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLRLPPGPARLPVLGN 5586

5585 LLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLRTHDADCCSRPSSPG 5406

5405 PMRLSYGYKDVAFAPYDAYSRAARRLFVAELFSAPRVQAAWRARQDQ 5265

3896 VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDV 3720

3719 MDMLASFSAEDFFPNAAAARLFDHLTGLVARRERVFQQLDAFFEMVIEQHLDSDSSNAGG 3540

3539 GGGNLVGALIGLWKQGKQYGDRRFTRENVKAIIF 3438

3337 DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAYLK 3158

3157 MVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVFDP 2978

2977 DRFEAKRVEFNGGHFELLPFGSGRRICPGIAMAAANVEFTLANLLHCFDWALPVGMAPEE 2798

2797 LSMEESGGLVFHRKAPLVLVPTRYIQL 2717

 

>AC092559.2 $F CYP71E4 chromosome 3 clone OSJNBb0096M04, 45% to 71B37

same as AC096688.3 chromosome 3

96529 MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLR

96388 LPPGPARLPVLGNLLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLR 96212

96211 THDADCCSRPSSPGPMRLSYGYKDVAFAPYDAYGRAARRLFVAELFSAPRVQAAWRARQDQ 96017 (0)

94678 VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDVMDMLASFSAEDFFPNA

94454

94453 AAARLFDHLTGLVAHRERVFQQLDAFFEMVIEQHLDSDSSNAGGGGGNLVGALIGL 94286

94285 WKQGKQYGDRRFTRENVKAIIF 94220 (0)

94119 DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAY 93946

93945 LKMVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVF 93766

93765 DPDRFEAKRVEFNGGHFELLPFGSGRRICPGIAMGAANVEFTLANLLHCFDWALPVGMAP 93586

93585 EELSMEESGGLVLHRKAPLVLVPTRYIQL* 93496

 

#296

>aaaa01011852.1 $FI CYP71E5 (indica cultivar-group) ortholog of AL731888.1

58% to AC092559.2 46% to 71B23

8074 MAISLITSLLFSLPQQWQP

8017 VVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLAGPQPHRALRDLARVHGPV 7841

7840 MRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRVTYGMKNVAFAPYGAYWR 7664

7663 EVRKLLMVELLSARRVKAAWYARHEQ (0) 7586

     VEKLLSTLRRAEGKPVALDEHILSLSDGIIGRVAFGNIYGSDKFSQNK

     NFQHALDDVMEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSF

     FEMVIEQHLDPNRAPPENGGDLVDVLIDHWKKNEPRGTFSFTKDNVKAIIF (0)

6324 STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRK 6151

6150 VVKETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNP 5971

5970 ERFEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDN 5791

5790 VCMEEEGRLVCHRKTPLVLVPTVYRHGLE* 5701

 

>AL731888.1 CYP71E5 chr 12

31348 MAISLITSLLFSLPQQWQPVVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLA 31169

31168 GPQPHRALRDLARVHGPVMRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRV 30989

30988 TYGMKNVAFAPYGAYWREVRKLLMVELLSARRVKAAWYARHEQ 30860

30546 VEKLLSTLRRAEGKPVALDEHILSLSDGIIGTVAFGNIYGSDKFSQNKNFQHALDDV 30376

30375 MEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSFFEMVIEQHLDPNRAPP 30196

30195 ENGGDLVDVLIGHWKKNEPRGTFSFTKDNVKAIIF 30091

29601 STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRKVV 29422

29421 KETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNPER 29242

29241 FEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDNVC 29062

29061 LEEEGRLVCHRKTPLVLVPTVYRHGLE 28981

 

#352

>aaaa01015254.1 CYP71E6 (indica cultivar-group) 94% to AC084319.5a 7 diffs

53% to AC092559.2

3134 LAVSVVLIFWSRHRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLGALAGWHGPVMAL 3313

3314 WLGTVPVVVLSSPKAEREALQVHDPECCNRSPT 3412

 

>aaaa01021677.1 CYP71E6 (indica cultivar-group) orth AC084319.5a  99%

838 DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLK 668

667 MVVKETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNEWAIGRDPNIWKDPEEFIP 488

487 ERFEEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKED 308

307 IDMEEAGKLTFHKKIPLLLVP 245

 

>AC084319.5a CYP71E6 chr 3 Genbank translation is wrong at N-terminal

does not identify frameshift and conserved motifs PPGPXXLPIIGNL

same as AC084404.8 partial

2204 MAASLLLELLPQQWQLSITSLIL

2273 LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG 2437 (fs)

2437 LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 2532

2533 AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 2658

8170 VMLPDYYCCM 8199

8599 VEKLIEKLTRNGRNAVAINEHIFSTVDGIIGTFALGETYAAEEFKDISETMDLLSSSSAE 8778

8779 DFFPGSVAGRLVDRLTGLAARREAIFRKLDRFFERIVDQHAAADDDGPAAARRKADDKGS 8958

8959 AGSDLVHELIDLWKMEGNTKQGFTKDHVKAMLL 9057

9159 DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLKMVV 9338

9339 KETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNA*AIGRDPNIWKDPEEFIPERF 9518

9519 EEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKEDIDM 9698

9699 EEAGKLTFHKKIPLLLVPTPNKAPN* 9776

 

>AC084404.8 CYP71E6 chr 3 incomplete = AC084319.5a

153211 MAASLLLELLPQQWQLSITSLIL 153279

153280 LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG 153444 (fs)

153444 LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 153539

153540 AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 153665

 

#254

>aaaa01009177.1b CYP71K1 (indica cultivar-group) orth AP002968 $F 98%

1277 LYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWALPVIGHLHHVAGALPHRAMRDL 1456

1457 ARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAFATRPITPTGKVLMADSVGVVFAP 1636

1637 YGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAVAALTTPGATAAVNL 1816

1817 SERISAYVADSAVRAVIGSRFKNRAAFLRMLERRMKLLPAQCLPDLFPSSRAAML 1981

1982 VSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAEEDLLDVLLRIQSQDKTNP 2143

2144 ALTNDNIKTVIX 2176

2244 DMFVASSETAATSLQWTMSELMRNPRVMRKAQDEVRRALAVAGQDGVTEESLPDLPYL 2423

2424 HLLIKESLRLHPPVTMLLPRECREPCRVMGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFA 2603

2604 PERFEGGGAADFKGTDFEYIPFGAGRRMCPGMAFGLANMELALAALLYHFDWELPGGMLP 2783

2784 GELDMTEALGLTTRRCSDLLLVPAL 2858

 

>AP002968 $F CYP71K1 40% to 71B24 complement(1875..2513,2584..3501)

AP003204 40% to 71B24 CDS complement(121487..122125,122196..123113)

AQ870215.1 nbeb0036N08f CUGI Rice BAC genomic Length = 754 58% to 99A1

MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWAL

PVIGHLHHVAGALPHRAMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAF

ATRPITPTGKVLMADSVGVVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGR

LLRAVAAAAAVAALTTPGATAAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLER

RMKLLPAQCLPDLFPSSRAAMLVSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAE

EDLLDVLLRIQSQDKTNPALTNDNIKTVIIDMFVASSETAATSLQWTMSELMRNPRVM

RKAQDEVRRALAIAGQDGVTEESLRDLPYLHLVIKESLRLHPPVTMLLPRECRETCRV

MGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFAPERFEGVGAADFKGTDFEYIPFGAGR

RMCPGMAFGLANMELALAALLYHFDWELPGGMLPGELDMTEALGLTTRRCSDLLLVPA

LRVPLRDHER

 

#253

>aaaa01009177.1a $P CYP71K2P (indica cultivar-group) AP002968 $F 97%

only 363 bp away from start of second gene, cannot be complete gene

309 LYLLLLALLVAVPFLCLTRSSRRHGCGGGSRLPPSPWALPVIGHLHHVAGALPHRAMRDL 488

489 ARRHGPLMLLRLCELRVVVASTAEAAREVTKTHDLAFATRPITPTGKVLMADSVGVVFAP 668

669 YGDGWRTLRRICTLELLSARRVRSFRAVREEEVGRLLRAVAAAAAAAAAALTTPGATAAV 848

849 NLSERISAYVADSAVRAVIGSR 914

 

no ortholog found in japonica 9/13/02, may be indica unique pseudogene

does not exist on AP002968 or AP003204 so it might be a sequence assembly

error.

 

#209

>aaaa01007181.1a CYP71K3 (indica cultivar-group) orth AP003990.1h $F chr 2 99%

N-term (orientation probably incorrect on either a or b)

1038 YLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHRAMRDMAR 859

858  RHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEGVIFAPYG 679

678  DGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAALSSSSPVNLTGMISAFV 499

498  ADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAMLLSRVPAKI 334

333  ERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 181

180  IKSILX 166

88    DMFGAGSETSATTLQWAMAELMRNPAV 2

 

>aaaa01007181.1b CYP71K3 (indica cultivar-group) orth AP003990.1h $F chr 2 100% C-term

6892 VMRRAQDEVRRELAVAGNDRVTEDTLPSLHYLRLVIKETLRLHPPAPLLLPRECGGACKV 7071

7072 FGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEFSPERFERCERDFRGADFELIPFGAGRRI 7251

7252 CPGMAFGLAHVELALAALLFHFDWRLPGGMAAGEMDMTEAAGITVRRRSDL 7404

 

>AP003990.1h $F CYP71K3 chromosome 2 clone OJ1073_F05

66403 MATELTEYLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHR 66582

66583 AMRDMARRHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEG 66762

66763 VIFAPYGDGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAASSSS 66924

66925 SPVNLTGMISAFVADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAML 67104

67105 LSRVPAKIERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 67281

67282 IKSILI 67299 (0)

67380 DMFGAGSETSATTLQWAMAELMRNPAVMRRAQDEVRRELAVAGNDRVTEDTLPSLHYL 67553

67554 RLVIKETLRLHPPAPLLLPRECGGACKVFGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEF 67733

67734 SPERFERCERDFRGADFELIPFGAGRRICPGMAFGLAHVELALAALLFHFDWRLPGGMA 67910

67911 AGEMDMTEAAGITVRRRSDLLVFAVPRVPVPAQ* 68012

 

#210

>aaaa01007181.1c CYP71K4 (indica cultivar-group) orth AP003990.1i $F chr 2 99%

8111 LPPGPWALPVIGHLHHLAGDLPHRALSALARRHGALMLLRLGEVQAVVASSPDAAREIMR 8290

8291 THDAAFASRPLSPMQQLAYARDAEGVIFAPYGDGWRHLRKICTGELLSARRVQSFRPVRE 8470

8471 AELVRLLRSVAEATSSSSSGSLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRML 8638

8639 QDGLKIVPGMTLPDLFPSSRLALFLSRVPGR 8731

9019 DMFGAGSESSATVLQWTMAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRL 9192

9193 VIKETLRLHPPAPLLLPRKCGSTCKILGFDVPEGVMVIVNAWAIGRDPTYWDKPEEFVPE 9372

9373 RFEHNGRDFKGMDFEFIPFGAGRRICPGITFGMAHVELVL 9492 frameshift

9494 LYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNLLVRPI 9607

 

>AP003990.1i $F CYP71K4 chromosome 2 clone OJ1073_F05

68586 MPLVVLLLATIPLLFFTIKRSAQRRGGGGGGEGRLPPGPWALPVIGHLHHLAGDLPHRA 68762

68763 LSALARRHGALMLLRLGEVQAVVASSPDAARDIMRTHDAAFASRPLSPMQQLAYGRDAEG 68942

68943 VIFAPYGDGWRHLRKICTAELLSARRVQSFRPVREAELGRLLRSVAEATSSSSSA 69107

69108 SLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRMLQDGLKIVPGMTLPDLFPSSRLALF 69287

69288 LSRVPGRIEHHRQGMQRFIDAIIVEHQEKRAAAAANDDDDEDEDFLDVLLKLQKEMGSQH 69467

69468 PLTTANIKTVML (0)

      DMFGAGSESSATVLQWT 69647

69648 MAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRLVIKETLRLHPPAPLLLP 69821

69822 RKCGSTCKILGFDVPEGVMVIVNAWAIGRDLTYWDKPEEFVPERFEHNGRDFKGMDFEF 69998

69999 IPFGAGRRICPGITFGMAHVELVLSALLYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNL 70178

70179 LVRPIHRVSVPVE* 70220

 

#211

>aaaa01007181.1d CYP71K5 (indica cultivar-group) orth AP003990.1j $F chr 2 99%

10296 LLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPNRAMRDLARWHGPLMLLRLGE 10475

10476 VEX 10481 frameshift

10486 VVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGLVFAPYGEAWRRLRRVCTQELL 10665

10666 SHRRVQSFRPVREDELGRLLRAVDAAAAAGTAVNLTAMMSTYVADSTVRAIIGSRRLKDR 10845

10846 DAFLRMLDELFTIMPGMSLPDLFPSSRLAMLVSRAPGRIMRYRRRMRRIMDSII 11007

11008 HEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGAQYPLTTENIKTVM 11157

11249 QDIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLNY 11428

11429 LKLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 11608

11609 EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVE 11737

 

>AP003990.1j $F CYP71K5 chromosome 2 clone OJ1073_F05

70828 MAGELAFYLLLVGLVAVPLLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPHRA 71007

71008 MRDLARRHGPLMLLRLGEVEAVVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGL 71187

71188 VFAPYGEAWRRLRRVCTQELLSHRRVQSFRPVREDELGRLLRAVDAAAAAGT 71343

71344 AVNLTAMMSTYVADSTVRAIIGSRRLKDRDAFLRMLDELFTIMPGMSLPDLFPSSRLAML 71523

71524 VSRAPGRIMRYRRRMRRIMDSIIHEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGA 71703

71704 QYPLTTENIKTVMM 71745 (0)

71837 DIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLSYL 72016

72017 KLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 72193

72194 EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVELALAALLFHFDWSLPG 72370

72371 GMAADELDMAESSGLTTRRRLPLLVVARPHAALPTKYCN* 72490

 

#249

>aaaa01012971.1b CYP71K6 (indica cultivar-group) orth AP003523.1c $F chr 6 99%

7052 LLRYLFSVPMLFFIVPLLFLVCSPGRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAM 7231

7232 RDIARRHGPLVLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGV 7411

7412 IFAPYGETWRQLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELM 7588

7589 SAYAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMP 7756

7757 RRMKRHRERMTAYLDAIIEEHQESRASREDDEDLLDVLL 7873

 

>AP003523.1c $F CYP71K6 chromosome 6 clone P0416A11 six different genes

73% to AP003523.1d 64% to AP003523.1f

128476 MAAELVHLLRYLFSVPM

128425 LFFIVPLLFLVCSPRRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAMRDIARRHGPL 128246

128245 VLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGVIFAPYGETWR 128066

128065 QLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELMSA 127913

127912 YAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMPRRMKRH 127733

127732 RERMTAYLDAIIEEHQESRASREDDEDLLDVLLRM 127628 frameshift

       QREGDLEVSRESIRSTIG bad exon boundary should be phase 0

126439 DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGY 126275

126274 MNLVIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEF 126095

126094 IPERFENAGINFKGTNFEYMPFGAGRRMCPGMAFGLATLELALASLLYHFDWKLPDGV 125921

125920 EIDMKEQSGVTTRRVHDLMLVPIIRVPLPV* 125828

 

#249

>aaaa01008944.1 CYP71K6 (indica cultivar-group) orth AP003523.1c $F chr 6 98%

see aaaa01012971.1b for ortholog

1338 DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGYMNL 1511

1512 VIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEFIPE 1691

1692 RFENAGINFKGTNFEYMPFGAGRRMCPGMAFSLVMLELALASLLYHFDWKLPDGVEIDM 1868

1869 KEQSGVTTRRVHDLMLVPII 1928

 

#42

>aaaa01001026.1a $PI CYP71K7P (indica cultivar-group) 3 defects, probable pseudogene

probable ortholog of AP003523.1d

     MAEVVQLHHLILLLPLFILPSSSSVR (fs)

3368 RRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIARRHGPLVLLRLGELPVVVIASS 3189

3188 ADAARNVMKTHDLAFATRPITHMMRLVFPEGSEGIIFSPYGETWRQLRKICTVELLSARR 3009

3008 VNSFRSVREEEVNRLLRAVAAAAASATSPAKMVNLSELMSAYAADSSVRAMIGRRCKDRD 2829

2828 KFLEMLERGIKLFVTPSLPDLYPSSRLAMVVSRMPRRMRRHREEVFAFLDAIIAEHQENR 2649

2648 ASGEDEEDLLDVLLRIQREGCMEST (fs) 2580

2572 PLLSTESIRTTIG bad boundary 0 expected 2540

1662 DLFNGGSETTATTLQWIMAELMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYLVI 1483

1482 KEALRLHPPGPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSKEFIPERF 1303

1302 EHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKLSDKIKVGDLDM 1123

1122 TEERGATTRRLHDLLLVPVIRVPLPLDSRS* 1030

 

>AP003523.1d $F CYP71K7P chromosome 6 clone P0416A11 six different genes

probable ortholog of aaaa01001026.1a

138301 MAEVVQLHHLILLLPLFILPFLLLRSSRRRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIA 138152

138151 RRHGPLVLLRLGELPVVVASSADAARDVMKTHDLAFATRPITRMMRLVFPEGSEGIIFSP 137972

137971 YGETWRQLRKICTVELLSARRVNSFRSVREEEVNRLLRAVAAAAASATSPAKTVNL 137804

137803 SELMSAYAADSSVRAMIGRRCKDRDKFLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMP 137624

137623 RRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCME 137480 frameshift

       SPLLSTESIRTTIG bad exon boundary should be phase 0

136562 DLFNGGSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYL 136389

136388 VIKEALRLHPPRPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILE 136209

136208 RFEHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKL 136056

136039 GDLDMTQERGATTRRLHDLLLVPVIRVPLPLDSRS* 135942

 

#42

>aaaa01012971.1a CYP71K7P (indica cultivar-group) orth AP003523.1d $F chr 6 99%

see aaaa01001026.1a for ortholog

4777 GSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYLVIKE 4607

4606 ALRLHPPGPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILERFEH 4427

4426 VDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKLSDKIKVGDLDMTE 4247

4246 ERGATTRRLHDLLLVPVI 4193

 

#43

>aaaa01001026.1b $PI CYP71K8 (indica cultivar-group) Pseudogene of AP003523.1e

japonica gene does not look like a pseudogene

      MAGFPVYL (deletion and fs) LAA (fs) LIILPMANLIRSARHRRLAGAR (fs)

16438 PPPGPWALPVIGHLHHLAGKLPHHHKLRDLAARHGPLMLLRFGELPVVVASSAGAAREITK 16620

16621 THDLAFATRPVTRTARLTLPEGAEGIIFAPYGDGWRQLRKICTLELLSARRVQSFRAVRE 16800

16801 EEVRRLLLAVASPSPEGTTATASVVNLSRMISSCVADSSV RAIIGSGRFKDRETFLRLME 16980

16981 RGIKLFSGPSLPDLFPSSRLAMLVSRVPGRMRRQRKEMMEFMDTIIEEHQAAREASM 17151

17152 ELEKEDLVDVLLRVQRDGSLQFSLTTDNIKAAIA (0) 17253

this segment is homologous to 108 aa region before the sequence gap at 17972

gene may not be assembled correctly

RAMIGSRFKDRN*FLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMPRRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCMESTVSTESIRTTIG (0)

      missing Ihelix exon

19112 YLHLVIKETLRLHPPAPLLLARECREPCQILGFDVPKGAMVLINAWSIGRDPSNWHAPKK 19291

19292 FMPERFEQNNIDFKRTSFKYIPFGAGRRICPGMTFGLANIELLLASLLYHFDWELPHGMQ 19471

19472 AGDLDMTETLAVTARRKADLLVVPVVRVPIVG* 19570

 

>AP003523.1e $F CYP71K8 chromosome 6 clone P0416A11 six different genes AQ331067 AQ364007.2

AQ331067 55% identical to AQ328148 47% identical to C74921 57% to 71B4 58% to 76C1 64% to AP003523.1f

AQ364007.2 nbxb0060E04f CUGI Rice BAC Length = 393 65% to 99A1

End of this gene matches AP003571 at 155149

151229 MAGFPVYLLFLAALIILPMANLIRSARHRRLAGARRPPPGPWALPVIGHLHHLLAGKLPH 151408

151409 HHKLRDLAARHGPLMLLRFGELPVVVASSADAAREIAKAHDLAFATRPVTRTARLTLPEG 151588

151589 GEGVIFAPYGDGWRQLRKICTLELLSARRVLSFRAVREQEVRCLLLAVASPSPEGTTAT 151765

151766 ASVVNLSRMISSCVADSSVRAIIGSGRFKDRETFLRLMERGIKLFSCPSLPDLFPSSR 151939

151940 LAMLVSRVPGRMRRQRKEMMEFMETIIEEHQAARQASMELEKEDLVDVLLRVQRDGSLQF 152119

152120 SLTTDNIKAAIA 152155 (0)

166133 DLFIGGSETAATTLQWAMSELLNNPKVMQKAQDEIRQVLYGQERITEETISSLHYLHL 166306

166307 VIKETLRLHPPTPLLLPRECREPCQILGFDVSKGAMVLINAWSIGRDPSNWHAPEKFMPE 166486

166487 RFEQNNIDFKETSFEYIPFGAGRRICPGMTFRLANIELLLASLLYHFDWELPYGMQAGD 166663

166664 LDMTETLAVTARRKADLLVVPVVRVPIVG* 166753

 

#44

>aaaa01001026.1c $FI CYP71K9 (indica cultivar-group) Cterminal differs from AAAA01001026.1b and AP003523.1f may be a frameshift (check)

may be ortholog of AP003523.1f 95%

20983 MAAAASSVLAYLLVVALLAIVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGAL 21162

21163 PHVAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPE 21342

21343 GGEGIIFAPYGDRWRELRKICTVELLSARRVQSFRPVREEEAGRLLRAVAAASSPSPAQ 21519

21520 AAVNLSALLSAYAADSAV RAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMW 21699

21700 LSRMPRRMMQHRREAYAFTDAIIREHQENRAAGAGDDKEDLLDVLLRIQREGDLQF 21867

21868 PLSTERIKTTVG (0) 21903

22325 DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 22498

22499 VIKEVLRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 22678

22679 RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFDWQLPDGMDTADL 22858

22859 DMTEEMVVSARRLXXXXXXXVVHVPLPVASS 22951

 

>AP003523.1f $F CYP71K9 chromosome 6 clone P0416A11 six different genes

AU096456 AU096455 71% to AP002968 65% to 71B24 also = AU032983

may be ortholog of aaaa01001026.1c 95%

168538 MAAAASSVLAYLLVVALLAIVPLVYFGWVARRRGEGGRLPPSPWGLPVIGHLHHLAGALPHHAMRDLA 168741

168742 RRHGPLMLLRLGELPVVVASSAEAAREVMRTRDIEFATRPMSRMTRLVFPAGTEGIIFAP 168921

168922 YGDEWRELRKVCTVELLSARRVQSFRAVREEEVGRLLRAVAATSSSPSPAQAAVNL 169089

169090 SALLSAYAADSAVHAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMWLSRMP 169269

169270 RRMMQHRREAYAFTDAIIREHQENRAAGAGDGDGDDKEDLLDVLLRIQREGDLQFPLSTE 169449

169450 RIKTTVG 169470 (0)

169893 DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 170066

170067 VIKEALRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 170246

170247 RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFNWQLPDGMDTAD 170423

170424 LDMTEEMVVSARRLHDLLLVPVVHVPLPVASS* 170522

 

#45

>aaaa01001026.1d $FI CYP71K10 (indica cultivar-group) orth of AP003571.1h $F 99%

28790 MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPHVA 28966

28967 MRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEGGEG 29146

29147 IIFAPYGDRWRELRKICTVELLSARRVQSFRPVREEEAGRLLRAVAAASPGQAVN 29311

29312 LSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAMLLSRM 29491

29492 PRRMKQHHRDMVAFLDAIIQEHQENRSAAGDDDDNDLLDVLLRIQREGDLQFPLSS 29659

29660 ESIKATIG (0) 29683

29867 DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEIRRELIGHRKVTEDTLCRLNYMHMVI 30046

30047 KEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPERF 30226

30227 EHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENLDM 30406

30407 TEEMRFTTRRLHDLVLIPVVHVPLPTI* 30490

 

>AP003571.1h $F CYP71K10 chromosome 6 clone P0458E02 continuation of contig AP003523.1

40% to 71B23

AQ687385 nbxb0074N19f 50% to 71B20

AQ258331 nbxb0020M04r 71-like sequence 36% to 71B33

145371 MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPH 145201

145200 VAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEG 145024

145023 GEGIIFAPYGDRWRELRKICTVELLSGRRVQSFRPVREEEAGRLLRAVAAASPG 144862

144861 QAVNLSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAM 144685

144684 LLSRMPRRMKQHHRDMVAFLDAIIQEHQENRSAAADDDNDLLDVLLRIQREGDLQFPLS 144508

144507 SESIKATIG 144481 (0)

144297 DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEVRRELIGHRKVTEDTLCRLNYMHM 144124

144123 VIKEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPE 143944

143943 RFEHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENL 143764

143763 DMTEEMRFTTRRLHDLVLIPVVHVPLPTI* 143674

 

#248

>aaaa01008885.1 $FI CYP71K11 (indica cultivar-group) almost same as AAAA01011410.1

ortholog to AC118346.1a gene 1

6884 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 6705

6704 LPPHHAMRDIALRHGPLVRLRLGGLQVILASSVDAAREVMRTHDLAFATRPSTRVMQLVF 6525

6524 PEGSQ (0)

     GIVFTPYGDSWRNLR 6345

6344 KICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNLSELISAYSADSTMRALI 6165

6164 GSRFKDRDRFLMLLERGVKLFATPSLPDLYPSSRLAELISRRPRQMRRHRDEVYAFLDII 5985

5984 IKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG 5853

5758 DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 5585

5584 VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 5405

5404 RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 5225

5224 DMKEEMGAIARRLHDLSLVPVIRHPLPVDM 5135

 

>AC118346.1a $F CYP71K11 Gene 1, 94448-96200, 3 exons 97% identical to gene 2 (12 diffs)

36% to 41% with 71As and 71Bs

94448 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 94627

94628 LPPHHAMRDIALRHGPLVRLRLGGLQVI 94711

94712 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0)

      GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLL 95071

95072 RAVAAASPARRAVNLSELISAYSADSTMRALIGSRFKDRDRFLMLLERGVKLFATPSLPD 95251

95252 LYPSSRLAELISRRPRQMRRHRDEVYAFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQR 95431

95432 KGDFPLSTDNIKTTIG (0) 95479

95574 DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 95747

95748 VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 95927

95928 RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 96107

96108 DMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 96200

 

#286

>aaaa01011410.1 $FI CYP71K12 (indica cultivar-group) Ortholog to AC118346.1b gene 2

4935 MADQLVHLPQQLLVL

     LLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGALPPQHAMRNIALRH 5156

5157 GPLVRLRLGGLQVILASSVDAAREVMRRHDLAFATRPSTRVMQLVFPEGSQ (0) 5309

5428 GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNLSE 5607

5608 LISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRPR 5784

5785 QMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 5964

6058 DLFNGGSETTATTLKWIMAELIRNPRVMQKAQDEVRQVLGKHHKVTEEALR 6210

6211 NLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFHVPQGTMILVNMWAISRDPMYWD 6387

6388 QAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFNWELP 6567

6568 DETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 6684

 

>AC118346.1b $F CYP71K12 (japonica cultivar-group) chromosome 11 clone Ba0039F06,

Gene 2 = AU096586.1, D48250 97% identical to gene 1 (12 diffs)

113634 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGA 113813

113814 LPPQHAMRNIALRHGPLVRLRLGGLQVI 113897

113898 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0) 114008

114127 GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNL 114300

114301 SELISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRP 114480

114481 RQMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 114663

       DLFNGGSETTATTLKWIMAELIRNPRVM 114840

114841 QKAQDEVRQVLGKHHKVTEEALRNLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFH 115020

115021 VPQGTMILVNMWAISRDPMYWDQAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIA 115200

115201 FGLVNLELVLASLLYHFNWELPDETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 115383

 

#287

>AC118346.1c Gene 3 $P CYP71K13P pseudogene 56% to AC118346.1 genes 1, 2 no ortholog

117756 DAAREVMRTHDLAFATRPSTRVMQLVFLEGSQ 117661

117553 GDRFTPYGDIWRNLRRSAPLAVSAKRVQFFRPIHQEEVCRLLQAVAVASPA 117395

117394 RGPPETLTSSFRPTWATLQCAP**GARLRDRDKSLMLLYRGVKPIRHARACQIFTQSIAL 117215

117214 ADLIIKSLSPMRRASYPMSNLLDIIFK 117134

117108 SDNHMDLTLVAFLLRFHKKGACPLSFCYIRKQFG*AF 116998

 

#172

>aaaa01005413.1 $FI CYP71P1 (indica cultivar-group) ortholog to AL713951.1

5862 MSLALLVLSAAYVLVALRRSRSSSLKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELARTMRAPLFRMRL

     GSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAGPYHRMARR

     VVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLANDVLCRVAFG

     RRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCLADLREACD

     VIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0) 4972

4875 DMFVAGTDTTFATLEWVMTELVRHPRILKKAQEE

4773 VRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPARTR 4594

4593 VFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGYTFA 4414

4413 LATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFKGEE 4234

4233 LSEV* 4219

 

>AL713951.1 $F CYP71P1 chromosome 12 clone Monsanto- 39% to 83B1

AF088221 BI305808.1 49% to 76C6 mRNA

44616 MSLALLVLSAAYVLVALRRSRSSSSKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELART 44437

44436 MRAPLFRMRLGSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAG 44257

44256 PYHRMARRVVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLAND 44077

44076 VLCRVAFGRRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCL 43897

43896 ADLREACDVIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0)

      DMFVAGTDTTFATLEWVMTELVRHPRILKKA 43537

43536 QEEVRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPA 43357

43356 RTRVFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGY 43177

43176 TFALATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFK 42997

42996 GEELSEV* 42973

 

#264

>aaaa01009895.1 $PI CYP71P2P (indica cultivar-group) orth of AP003544.1

6273 GSMPAMVISKPNLARPALTTNDAVLASRQHLLNG*FLSFG frameshift

     CSDVTFAPAGPYHRM frameshift

     QMAR 6094

6093 GVEVSELLSAHHVAMYGVVRVKELQRLLAHLTNNTSSAKPIDLSECFLNLANDVLCRVAF 5914

5913 GRRFPRDEGDKLSAVLANAQDL 5848 frameshift

5848 LAGFTISDFFLELEPVASTVTGLCHRLKKCLADLYEACDVIVDVHISGNRQRIPSDREED 5669

5668 FVDVLLRVQ 5642

 

>AP003544.1 $P CYP71P2P chr 6 clone P0599C12 same as AP003686.1 8668-8037 pseudogene of AL713951.1

this gene matches a barley EST BF255745.2 75% and a sorghum EST BE354971.1 79%

so there must be a functional copy of this gene in rice 

107880 GSMPAVVISKPNLARPALTTNDAVLASRQHLLNG*FLSF 107764 frameshift

107762 GCSDVTFAPAGPYHRM 107715 frameshift

107713 QMARGVEVSELLSAHHVAMYGVVRVKELQRLLAHLTKNTSSAKPIDLSECFLNLANDVLCRVAF 107521

107520 GRRFPRDEGDKLSAVLANAQDLL 107452 frameshift

107452 AGFTISDFFLELEPVASTVTGLCHRLKKCLADLCEACDVIVDVHISGNRQRIPSDREEDFVDVLLRVQ 107249

 

 

#127

>aaaa01003512.1 i CYP71Q1 (indica cultivar-group) ortholog to AP004346.1a 95%

6414 SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVHSFAYARTAEVARLVDTLAASPPGVPF 6593

6594 DISCTLYQLLDGIIGTVAFGKVYGAAQWSTERAVFQDVLSELLLVLGSFSFEDFFPSSAL 6773

6774 ARWGDALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQEDMVDALVRMWREQQDR 6953

6954 PSGVLTREHIKAILM 6998

8541 NTFAGGIDTTAITAIWIMSELMRNPRVMQKAQAEVRNTVKNKPLVDEEDIQNLKYLE 8711

8712 MIIKENFRLHPPGTLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRDPMIWDNPEEFYP 8891

8892 ERFEDRNIDFRGSHFELVPFGSGRRICPGIAMAVASLELVVANLLYCFDWKLPKGM*EED 9071

9072 IDMEEIGQLSFHRKVELFIVPVKHEQCEP*DQLMGH 9179

 

>AP004346.1a $F CYP71Q1 two genes and a pseudogene

71B like 47% to AC092559.2 75% to AP004346.1b

22020 MADDFLSSQPQPW 22058

22059 PPLLQLSAAVLFFLLPLLYLLFLRGSNGEVRGRQGNSASAPSLPGPCRQLPVLGNLLQIG 22238

22239 SRPHRYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRPPSPG 22405 (2)

26427 SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVRSFAYARAAEVARLVDTL 26577

26578 AASPPGVPVDLSCALYQLLDGIIGTVAFGKGYGAAQWSTERAVFQDVLSELLLVLG 26745

26746 SFSFEDFFPSSALARWADALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQED 26919

26920 MVDALVKMWREQQDRPSGVLTREHIKAILM 27009 (0)

28586 NTFAGGIDTTAITAIWIMSEIMRNPRVMQKARAEVRNTVKNKPLVDEEDSQNLKYLEMIIKEN 28774

28775 FRLHPPGNLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRGPMIWDNPEEFYPERFE 28948

28949 DRNMDFRGSNFELVPFGSGRRICPGVAMAVTSLELVVANLLYCFDWKLPKGMKEEDIDM 29125

29126 EEIGQISFISFRRKVELFIVPVKHEQYQLMGHIN* 29221

 

#127

>aaaa01043764.1 CYP71Q1 (indica cultivar-group) orth AP004346.1a $F 97%

see aaaa01003512.1 for ortholog

905 LSAAVLFFFLLPFLYLLFLRGSNGEVRGRQGNSASAPSPPGPCRQLPVLGNLLQIGSRPH 726

725 RYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRP 585

 

#121

>aaaa01017559.1 CYP71Q2 (indica cultivar-group) orth AP004346.1b $F 98%

see aaaa01003239.1b above

3049 DCCLHPVCTRFFSPYSAYWREMRKLLVIELTSIRRVQSFAYARAAEVAR 3195

3196 LVDTLAASLAGVPVDLSSALYTFSDGVIGTVAFGKVYGSAAWSSSEWGGSFQEAMDETM 3372

3373 QVLGSFSFEDFFPSSALARWADALTGAAGQRRRVFHRIDGFFDAVIDKHLEPERLSAGV 3549

3550 QEDMVDATVKVWREQKDEAFGLTCDHIKAIL 3642

 

#121

>aaaa01025743.1 CYP71Q2 (indica cultivar-group) orth AP004346.1b $F 99%

see aaaa01003239.1b for ortholog

1225 FLLLPLVYLLFFKGDGNGGVMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRY 1404

1405 GPVVQVQLGSIRTVVVHSPEAAKDVLRTNDLQCCSRPSS 1521

 

#121

>aaaa01003239.1b i CYP71Q2 (indica cultivar-group) exon 2 ortholog to AP004346.1b 95%

17640 LQDAFVGGIDTTAVTTTWIMSELMRNPRVVQKA*AEVHNIVKNKSKVCKEDIQNMKYLKM 17461

17460 IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNICDNPEQFYPE 17281

17280 RFEDKGIDFRGSHFELLPFGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDI 17101

17100 DMDEIGQLAFRK 17065

 

>AP004346.1b $F CYP71Q2 two genes and a pseudogene 48% to AC092559.2 75% to AP004346.1a

69192 MATELLASQLLPWQPLVQLLAAGLFLLPLVYLLFFKGDGNGG 69317

69318 VMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRYVPVVQVQLGSIRTVVVHS 69494

69495 PEAAKDVLRTNDLQCCSRPSSPG 69565 (2)

72047 NYNYLDVAVSPYS 72083 (frameshift)

72085 YWREMRKLLVIELTSIRRVQSFAYARAAEVARLVDTLAASPAGVPVDLSSALYTF 72249

72250 SDGVIGTVAFGKVYGSAAWSSWEWGASFQEAMDETMQVLGSFSFEDFFPSSALARWADALTGA 72438

72421 AGRRRRVFHRIDGFFDAVIDKHLEPERLSAGVQEDMVDAMVMVWREQKDEAFGLTRDHIKAILL 72630 (0)

84351 DAFVGGIDTTAVTVTWIMSELMRNPRVMQKAQAEVHNIVKNKSKVCEEDIQNMKYLKM 84524

84525 IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNIWDNPEQFY 84698

84699 PERFEDKGIDFRGSHFELLPFGSGRRICPGIAMGVANVELVVANLLYCFNWQLPKGMKEE 84878

84879 DIDMDEIGQLAFRKNFLF* 84935

 

note there is another gene on AP004346.1

 

#120

>aaaa01003239.1a $PI CYP71Q3 (indica cultivar-group) exon 2 ortholog to AP004346.1c 96%

2517 DAFAGGIDTTVVTTTWIMSELMRNPTVMQ 2431 frameshift

2432 KAQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCKLHPPGTLLIPRHTMKTCTIGGYN 2253

2252 VPSKTRIYVNVWAMWRDPNIWDNPEQFYLERFEDKGIDFRGSHFELLT 2109 (?)

1820 FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGTKEEDIDMDEIG*LAFRK 1659

 

>AP004346.1c $P CYP71Q3 two genes and a pseudogene

probable pseudogene 89% to AP004346.1b

92865 DAFAGGIDTTVVTTTWIMSELMRNPRVMQK 92954 (frameshift)

92956 AQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCRLHPPGTLLIPRHTMKTCTIGGYSV 93132

93133 PSKRRIYVNVWAMWRDPNIWDNLEQFYLERFEDKGIDFRGSHFELLT 93273 (insertion)

93561 FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDTDMDEIG*LAFRKKLPLFI 93740

93741 VPMKH* 93758

 

#84

>aaaa01002200.1 $PI CYP71Q4P (indica cultivar-group) ortholog of AC087599.11 $P 94% PERF region resembles CYP71Q sequences

2930 PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSPEEFWPERFLASREAMD 3109

3110 FQGNNYQLILFITDRRICPDINFAVPVLETALVGLLHPTNELLGGGGGLMWLQRSCSRAR 3289

3290 RLRSTGHRRHRSGTHPAAAVAAAAT 3364

 

>AC087599.11 $P CYP71Q4P chromosome 10 clone OSJNBa0057L21, pseudogene fragment like 71A1 44% to AAAA01006105.1b

16812 GGGGRWTETLEWIMAELTANTRVMAKLQDEISRAADGK 16925 24 aa deletion and frameshift

16931 PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSSEEFWPEQFLASREAVD 17110

17111 FQGNNYQLILFITDRRIFPDINFAVPVLETALVGLLHPTNELLGGG 17248

17249 GGLMWLQRSCSRARRPRSTAHRRHRSGTHPAAIAAAAAT* 17368

 

#145

>aaaa01004200.1 CYP71R1 (indica cultivar-group)

16322 MAAVQLDFGLLVGFLFLATCLAVAIRSYLRSGGAAIPSPPALPVIGNLHQLGR 16480

16481 GRHHRALRELARRHGPLFQLRLGSVRALVVSSAPMAEAVLRHQDHVFCGRPQQRTARGTL 16660

16661 YGCRDVAFSPYGERWRRLRRVAVVRLLSARRVDSFRALREEEVASFVNRIRAASGGGV 16834

16835 VNLTELIVGLTHAVVSRAAFGKKLGGVDPAKVRETIGELADLLETIAVSDMFPRLRWVDW 17014

17015 ATGLDARTKRTAAKLDEVLEMALRDHEQSRGDDDDGGGGDGEPRDLMDDLLSMANDGGGD 17194

17195 HGHKLDRIDVKGLILV (1)

      NMFIA (frameshift)

      GTDTIYKSIEWT 17374

17375 MAELIKNPAEMAKVQAEVRHVAAAAHGDEDEDTVAVVREQQLGKMTLLRAA 17527

17528 MKEAMRLHPPVPLLIPREAIEDTVLHGHRVAAGTRVMINAWAIGRDEAAWEGAAEFRPGR 17707

17708 FAGGGDAAGVEYYGGGDFRFVPFGAGRRGCPGVAFGTRLAELAVANMACWFEWELPDGQ 17884

17885 DVESFEVVESS 17917

 

aaaa01004200.1 no ortholog in japonica 9/6/02

 

#214

>aaaa01007242.1 $PI CYP71R2P (indica cultivar-group) ortholog of AP003575.1 99%

11280 MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAAITSPPALPVIGNLHQLGR 11459

11460 GRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRHQDHVFCGRPQQHTARGTL 11639

11640 YGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEEVASFVNRIRAASGGGGGV 11819

11820 VNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGELADLLGTIAVSDMFPRLRWVDW 11999

12000 ATGLDARTKRTAAKLDEVLEMVLRDHEQSRGDDDDDDGDGEARDLMDDLLSMANGGDDHG 12179

12180 YKLDRIDVKGLLILV (0)

      DMFAAGTDTVYKSIE frameshift

      MAEL 12359

12360 IKNPAEMAKVQAEVRHVVAAAHGGEGDEDAVVIVKEEQASS frameshift

12482 LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 12661

12662 EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 12841

12842 WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYAVETT 12976

 

>AP003575.1 $P CYP71R2P chromosome 6 clone P0528B02, similar to 71A24 one in frame stop codon 395 to 71A14

54816 MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAA

54690 ITSPPALPVIGNLHQLGRGRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRH 54511

54510 QDHVFCGRPQQHTARGTLYGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEE 54331

54330 VASFVNRIRAASGGGGGVVNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGEL 54160

54159 ADLLGTIAVSDMFPRLRWVDWATGLDARTKRTAAKLDEVLEMVLRDHEPSRGDDDDDDGD 53980

53979 GEARDLMDDLLSMANGGDDHGYKLDRIDVKGLLIL 53875 (0)

      DMFAAGTDTVYKSIE*TMAELIKNPAEMAKVQAEVRHVVAAAHGGEGDEDA (0) may be incorrect joint

53613 LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 53434

53433 EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 53254

53253 WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYVVQTTRM* 53110

 

#448

>aaaa01059584.1 CYP71R3 (indica cultivar-group) 59% to CYP71R1

596 VAAKTRVIINTWAIGRDSIIRENAEEFLPERFIDNGIDYNSKDFSFIPFGAGRRGCPGIAFATRLA 793

794 ELALANLMYHFDWELQEGQDLESFQLVSPSVIQTWGSS 907

 

no japonica ortholog found 9/12/02

 

#362

>aaaa01016223.1 CYP71S1 (indica cultivar-group) orth AL606614.1b $F chr 4 96%

1585 PRPRGLPLIGNLHQVGALPHRSLAALAARHATPLMLLHLGSVPTLVVSTADAARALFRDN 1764

1765 DRALSGRPALYAATRLSYGQKNISFAPDGAYWRAARRACMSALLGAPRVRELRDAREREA 1944

1945 AALIAAVAAAGASPVNLSDMVAATSSRIVRRVALGDGDGDESMDVKAVLDETQA 2106

2107 LLGGLWVADYVPWLRWVDTLSGMRRRLELRFHQLDALYERVIDDHLNNRKHASDEE 2274

2275 DDLVDVLLRLHGDPAHRSTFGSRSHIKGIL 2364

2699 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHY 2872

2873 LRLVIKETLRLHPAAPLLVPREMTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAE 3052

3053 RFVPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWR 3223

3224 APPGREVDVEEENGLVVHKKNPLVLI 3301

 

>AL606614.1b $F CYP71S1 chromosome 4 clone OSJNBb0011N17 40% to 71A25

23233 MSMASLQAPEFLASCLLLATILFFKQLLAPSSKQRAASPSLPRPRGLPLIGNLHQVGALPHRSLAALAAR 23096

23095 HAAPLMLLRLGSVPTLVVSTADAARALFRDNDRALSGRPALYAATRLSYGQKSISFAPD 22919

22918 GAYWRAARRACMSELLGPPRVRGLRDAREREAAALVAAVAAAGASPVNLSDMVAATSSR 22742

22741 IVRRVAFGDGDGDESMDVKAVLNETQALLGGLWVADYVPWLRWVDTLSGKRWRLERRFRQ 22562

22561 LDALYERVIDDHLNKRKHASDEEDDLVDVLLRLHGDPAHRSTFGSRSHIKGILT 22400 (0)

22059 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHYLR 21889

21888 LVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAERF 21709

21708 VPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWRAP 21538

21537 PGREVDVEEENGLAVHKKNPLVLIATKSKRNTGGH* 21427

 

#362

>aaaa01040889.1 CYP71S1 (indica cultivar-group) orth AL606614.1b $F chr 4 100%

see aaaa01016223.1 for ortholog

822 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHY 986

987 LRLVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAE 1166

1167RFVPERHRD 1193

 

#298

>aaaa01011971.1 CYP71S2 (indica cultivar-group) orth AL606614.1a $F chr 4 96%

987  DMFIAGSDTSAVTVQWAMTELVRNPDVLAR 1085

     AQHEVRRVVAAAGGGDKDGAMVREADLPELHYLRLVIKETLRLHPASPLVQR 1240

1241 ETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWGPDAERFVPERHRAHDADGGQQHDGF 1420

1421 ALVPFGIGRRSC 1456

1450 ELLLANLLFCFDWSAPPGREVDVEEENGLAVRKKNPLVLI 1569

 

>AL606614.1a $F CYP71S2 chromosome 4 clone OSJNBb0011N17 90% to AL606614.1b

19270 MASLQAPEFLASCLLLLATILLFKQLLAPSSKKRAASPSLPRPKGLPLIGNLHQVGALPHRSLAAL 19073

19072 AARHAAPLMLLRLGSVPTLVVSTADAARALFRNNDRALSGRPALYAATRLSYGQKNISF 18896

18895 APDGAYWRAARRACMSALLGAPRVCELRDAREREAAALIAAVAAAGASPVNLSDMVAAT 18719

18718 SSRIVRRVAFGDGDGDESMDVKAVLDETQSLLGGLWVADYVPWLRWVDTLSGMRRRLERR 18539

18538 FRQLDAFYERVIDDHINKRKHASDEEDDLVDVLLRLHGDPAHRSMFGSRTHIKGILT (0)18368

17337 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVIAGGGGGDKDGAMVREADL 17149

17148 PELHYLRLVIKETLRLHPASPLVQRETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWG 16969

16968 PNAERFLPERHRAHDADGEQQHEHDGFALVPFGIGRRSCPGVHFAAAAAELLLANLLFCF 16789

16788 DWRALPGREVDVEEENGLAVRKKNPLVLIATKSKSNRDAH* 16666

 

#24

>aaaa01000559.1 $FI CYP71T1 (indica cultivar-group) 98% to AP003434.1a

22050 MELSSSLAAVLHSPLFLLAALLLLPVFTLLSFSSAKKPGDGGGRRLPLPPSPRGVPFLGH 21871

21870 LPLLGSLPHRKLRSMAEAHGPVMLLWFGRVPTVVASSAAAAQEAMRARDAAFASRARVSM 21691

21690 AERLIYGRDMVFAPYGEFWRQARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGV 21511

21510 RGGGETVNLSDMLMSYANGVISRAAFGDGAYGLDGDEGGGKLRELFANFEALLGTATVGE 21331

21330 FVPWLAWVDKLMGLDAKAARISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDH 21151

21150 RDFVDVLLDVSEVEEGAGAGEVLLFDAVAIKAIIL 21046

20444 DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELRL 20265

20264 LRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRPXAAWGDRAEE 20085

20084 FVPERWLDGGGGGEAVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLL 19917

      YHFDWELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV 19769

 

>AP003434.1a $F CYP71T1 chromosome 1, PAC clone:P0452F10, complete 41% to 71A24 = C98812

C98812 52% identical to D48413 43% to 71A13, 44% to 71B10

34853 MELSSSLAAVLHSPLFLLAAL

34916 LLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGHLPLLGSLPHRKLRSMAEAHGP 35095

35096 VMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRMAERLIYGRDMVFAPYGEFWR 35272

35273 QARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGVRGGGETVNLSDLLMSYANGV 35452

35453 ISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGEFVPWLAWVDKLMGLDAKAA 35629

35630 RISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDHRDFVDVLLDVSEVEEGAG

      AGEVLLFDTVAIKAIIL (0)

36458 DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELR 36634

36635 LLRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRDAAAWGDRAE 36814

36815 EFVPERWLDGGGEEVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDW 36994

36995 ELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV* 37129

 

#76

>aaaa01002066.1 $PI CYP71T2 (indica cultivar-group) ortholog of AP003434.1b $F 99%

1728 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLPLPPSPPGVPLLGH 1549

1548 LPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRTRDLAFASRPRVRM 1369

1368 SERLFYGRDM 1339 frameshift and deltion

     DFVDVMLDVSEAEEGAGAGAGGVLLDTVAIKAVIL 1233

 

>AP003434.1b $F CYP71T2 chromosome 1, PAC clone:P0452F10, complete = AA754300

AA754300      42% IDENTICAL TO 71A14   1/98 I-HELIX 43% to 703A2

39698 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP

39839 LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018

40019 RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195

40196 VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375

40376 DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552

40553 VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)

42074 DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169

42170 QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349

42350 DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529

42530 RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709

42710 VRLKADLNLVAKPWSPGAS* 42769

 

note there are 4 sequences on AP003434.1

 

#206

>aaaa01006724.1 CYP71T3 (indica cultivar-group) orth AP003434.1c $F chr 1 99% also AU163704.1

7723 RRRLPPSPPWGLPLLGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEE 7544

7543 VMRTRDLEFASRPRVAMAERLLYGGRDVAFAPYGEYWRQTRRICVVHLLSARRVLSFRRV 7364

7363 REEEAAALVARVRAAGGAVDLVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRVLR 7193

7192 KLFDDFVELLGQEPMGELLPWLGWVDALNGMEVKVQRTFEALDGILEKVIDDHRRRRR 7019

7018 EVGRQMDDGGGGDHRDFVDVLLDVNETDMDAGVQLGTIEIKAII 6887

5268 DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 5095

5094 AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPARTRIVINAWTIGRDQATWGEHAEEFI 4915

4914 PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 4735

4734 EFGTSSLDMSEMNGLSVHLKYGLPLIAI 4651

 

>AP003434.1c $F CYP71T3 AU163704.1 chromosome 1, PAC clone:P0452F10, complete 44% 71A14

48011 MAVSLVVVVVV

48044 VIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHLLGALPHRALRS 48223

48224 LAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAERLLYGGRDVAFA 48403

48404 PYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVDLVEHLTAY 48574

48575 SNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLGWVDALN 48748

48749 GMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVNETDMD 48925

48926 AGVQLGTIEIKAIIL 48970 (0)

51142 DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 51312

51313 AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAWTIGRDQATWGEHAEEFI 51492

51493 PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 51672

51673 EFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP* 51771

 

#204

>aaaa01006398.1c $FI CYP71T4 (indica cultivar-group) AP003434.1d 95%

probably orthologs since 6398 and 3434 have only 3 nuc diffs and two

1 nuc indels in the intron.

9740  MAVSLLVVLLVVLAIVVPLLYLVLLPAGNTTRNGAARWEDDGGDGRRRRRLPPSPRGLPL 9919

9920  LGHLHLLGALPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDVEFASRP 10096

10097 RMAMAELLLYGGRDVAFAPYGEYWRQAPRICVVHLLSARRILSFRRVREEEAAALVGRV 10273

10274 RAAAADVVDLSDLLIAYSNTVLTRIAF GDESARGGGGGDRGRELRKVFDDFARL 10435

10436 LGTEPMGELLPWFWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRLMDDDGGG 10615

10616 DHRDFVDVLLDVNETDKDAGIQLGTVEIKAIIM (0) 10711

11174 DMFVGGSDTTTTMIAWTMAELINHPRAMHKAQNEIRAVVGNTSHVTKDHVDKLPYLKAVF 11353

11354 KETLRLHPPLPLLIPREPLADAQILGYTIPAHTRVVINAWAIGRDPAAWGQQPDEFSPEK 11533

11534 FLNGAIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWE 11680

11681 AAATDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 11812

 

>AP003434.1d $F CYP71T4 chromosome 1, PAC clone:P0452F10, complete like 71A

58119 MAVSLLPAVL

58149 VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328

58329 LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508

58509 GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685

58686 LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859

58860 FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036

59037 VNETDKDAGIQLGTVEIKAIIM 59102 (0)

59562 DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717

59718 LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897

59898 PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077

60078 TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200

 

#205 = #203 = #445 reduce gene count by 2

>aaaa01006398.1d $PI CYP71T5 (indica cultivar-group) seq gap before 12414

first exon has two frameshifts and part is missing (1205) no ortholog

      GDESARG (fs) RALRKLFENFARLLGTEPMGELLPWLGWVDAV (fs)

      WLDGKVQRTFEALDSIIEKVIDDHRRRRRRREVGRQMDSDDDGGGGG

      DHRDFVDVLLDVNETDKDAGIRLGTIEIKAIIL (0)

12924 DMFAAGTDTTTTAMEWAMAELITHRDAMHKVQDEIRAVVGVTGCVTEDHIDRLPYLKAVL 13103

13104 KETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPATWGEHAEKFIPER 13283

13284 FLNNNVDYKGQDFGLVPFGAGRRGCPGMGFAVPTIEMALASLLYNFSWETRPVDRRCKSG 13463

13464 TSSLDMSEMNGISVRLKYGLPLIAKSHFP* 13553

 

#203 = #205 = #445 reduce gene count by 2

>aaaa01006398.1b $FI CYP71T5 (indica cultivar-group) no ortholog

4560 MAVSLLPAVLVLLAIVAPLLYLVLLPAVKYTTSNGAARWEDDDGGDGRRRRRLPPSPRGLPLLGHLHLLGAL 4775

4776 PHRALRSLAAAHGPVLLLRLGRVPAVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLYG 4955

4956 GRDVAFAPYGEYWRHARRICVVHLLSARRVLSFRRVREEEAAALVARVRAAARAPGAR 5129

5130 GAVDLVEHLTAYSNTVVSRAVFGDESARGLYGDVDRGRALRKLFDDFVELLGQEPMGELL 5309

5310 PWLGWVDAVRGLDGKVQRTFEALDSIIEKVIDDHRRRRRRHEVGRQMDSDDDGG 5471

5472 GGGDHRDFVDVLLDVNETDKDAGIRLGTIEIKAIIL (0)

     DMFAAGTDTTTTAMEWAMAELITHRDAMHKVQDEIRAVVGVTG 5831

5832 CVTEDHIDRLPYLKAVLKETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIG 6011

6012 RDPVTWGEHAEKFIPERFLNNNVDYKGQDFGLVPFGAGRRGCPGMGFAVPTIEMALASLL 6191

6192 YNFSWETRPVDRRCKSGTSSLDMSEVNGISVHLKYGLPLMAKFYSS* 6332

 

aaaa01006398.1b no ortholog found in japonica 9/7/02

 

#445 = #203 = #205 reduce gene count by 2

>aaaa01054542.1 CYP71T5 (indica cultivar-group) 76% to AP003434.1c $F

96% to aaaa01006398.1d $PI >99% over 970 bp eve outside the coding region

581 DMFAAGTDTTTTAMEWAMAELITHRNAMHKVQDEIRAVVGVTGCVTEDH 408

407 IDRLPYLKAVLKETLRLHPPNPLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPATW 228

227 GEHAEKFIPERFLNNYVDYKGQDYGLVPFGAGRRGCPGMGFAVPTIEMALASLLYISAWE 48

47  TRPVDR 30

 

no japonica ortholog found 9/12/02

 

#202

>aaaa01006398.1a CYP71T6 (indica cultivar-group) (partialI)

1854 MVVVVVVVAIAIVVPLLYLVLLPPARRGGGDSARRRLPPSPRGLPLLGHLHLLGALP 2024

2025 HRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLY 2198

2199 GGRDVAFAPYGEYWR 2243 sequence gap here

 

aaaa01006398.1a no ortholog found in japonica 9/7/02

 

#89

>aaaa01002288.1 $FI CYP71T7 (indica cultivar-group) 13560 MDISLASLVLVLLAFVLPLLYLLLQLPGKKSGGGGGDGPRLPPSPAGCLPLLGHLHL 13390

13389 LGPLPHVALRSMAAAHGPVLRLRLGRVPTVVVSSAAAAEEVLRARDAAFSSRPRSAMAER 13210

13209 ILYGRDIAFAPYGEYWRQARRVCVVHLLSAQRVSSFRRVREEEAAALADAVRAAGRGGG 13033

13032 RAFDLSGLIVAYASAVVSRAAFGDESARGMYGGADGGRAVRKAFSDFSHLFGTKPVSDYL 12853

12852 PWLGWVDTLRGRERKARRTFEALDGVLDKVIDDHRRRRDSGRRQTGDADAGHRDFVDVL 12676

12675 LDVNEMDNEAGIHLDAIEIKAIIM 12604

12529 DMFVAGSDATSKPMEWAMAELVSHPRHMRRLQDEIRAVVGGGRVTEDHVDKLPYLRAAL 12353

12352 KEALRLHAPLPLLVARETVADTEIMGYHVAARTRVVINGWAIGRDTAVWGETAEEFMPER 12173

12172 FLAGGNGGGAAAADYKVQGFEMLPFGGGRRGCPGVTFGMATVE 12044

12041 SAVASLLYHFDWEAAAADGKGGREGTPLLDMSETSGISMGLKHGLPLVAKPRFP 11880

 

aaaa01002288.1 $FI has no ortholog in nr or HTGS on 9/5/02

 

#33

>aaaa01000893.1 CYP71T8 (indica cultivar-group) Nterm 49% to AP003434.1

33765 MSSYVVVAAALLVFVVVVVAAIKNLGKGKLPPSPPSLPFVGHLHLVGELPH 33917

33918 RSLDALHRRYGSDGGLMFLRLGRAGALVVSTAAAAADLYRGHDLAFASRPPSHSAERLFY 34097

34098 GGRNMSFAPLGDAWRRTKKLAVAHLLSPRRARPRRRGR 34211

 

aaaa01000893.1 may not have an ortholog

 

#437

>aaaa01042159.1 CYP71T9 (indica cultivar-group) 60% to AP003434.1b

58% to wheat AL821861 This seq may belong in another family or

subfamily like CYP703

DIMGAATDTSFVTLEWIMTELIRNTQVMSKLQNEIIQVTGS 3

 

no japonica ortholog found 9/12/02

 

#275 = #377 reduce gene count by 1

>aaaa01010398.1 CYP71T10 (indica cultivar-group) not an exact match 71 like pseudogene fragment 75% to 71T5 in a small region

2295 LMYLVLLPDVNRSNRPERWEDSDGWQRLPP*PRRLPLLRYLHLLSVPLHQAFHPLPR 2465

2466 HMAWCCYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY* 2612

2613 S*ECRICVLFFRCIREEEVAVLVKHVRHPCR 2705

 

no japonica ortholog found 9/10/02

 

#377 = #275 reduce gene count by 1

>aaaa01017833.1 CYP71T10 (indica cultivar-group) N-term pseudogene fragment

3692 LMYLVLLPDVNRSNRPERWEDGDGWQRLPP*PRRLPLLRYLHLLGAPLHQAFHPLPR 3862

3863 HMAWCYYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY* 4009

4010 S*ECRICVLFFRCIREEEVAVLVKHVRHPCR 4102

 

No japonica ortholog found 9/11/02

 

#179

>aaaa01005635.1 $PI CYP71U1P (indica cultivar-group) one stop codon, one fs

62% to AAAA01000843.1

8796 MDELSAGSLYLVVLGTLALALAFKRVLRGKETGVKLPPGPWNLPIIGSLHHLVGAHLPHRALLRVSR 8596

8595 RQGPLMLLRLGEVPAVVVSSPEAAMEFLRTRDPVFASRPRGALRSTSSASAVK 8437

8436 GSSWRHTASTGGRCARSAWWSCSAQGRCSGWSLSGRRRGGVPPRRGHRHDIA 8281

8280 CYSQHRHDPDASGAQ*RHHREGGVRRQVPTAGLRYLRVLKVVATLAGSFN 8131

8130 MVDLFPSSRLVRWLSCVERRLREHHAQTVRIVDSIIQDRKENEASASPGASAEDDDNDDL 7951

7950 LDVLLRLQREDNLTFPITAEIIGALIS (0) 7852

7202 DIFGAATDTTGSTLEWAMAELMRNPRTMEKAKQEVQNALGQGRAMVTGADIGDLHYLQMV 7023

7022 IKETLR (fs) 7005

7000 LHPSIPLIVRASEESTLVMGYDIPQGTNIFINAFAVARDPRYWKDADEFMPERFEKN 6830

6829 GDDIKATTVHMGFIPFGAGR (deletion of 18 aa heme signature region in seq gap) 6770

6669 NLLYHFDWTLINGESPESLDMGEVWGISIHRRSDLRLHAALSVSSGFLRHSDRDS* 6496

 

aaaa01005635.1 no ortholog in japonica on 9/7/02

 

#31

>aaaa01000843.1 $FI CYP71U2 (indica cultivar-group) 63% to AAAA01005635.1 37% to 71B2

92% to AP004872.1 and AP005536.1 (00843 is best indica match for these seqs)

21357 MDELSIENHSPISMDELSFG

21297 SLCLVAMATLALALALMVVMGAHRRGGEKGATTGAKNLPPGPWNLPVTGSLHHLLGASP 21121

21120 PPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEGWVLKAHDPAFADRARSTTVDAV 20941

20940 SFGGKGIIFAPYGEHWRQARRVCLAELLSARQVRRLESIRQEEVSRLVGSIAGSSNA 20770

20769 AAVDMTRALAALTNDVIARAVFGGKCARQEEYLRELGVLTALVAGFSMADLFPSSRV 20599

20598 VRWLSRRTERRLRRSHAQMARIVGSIIEERKEKKASDDGVGAKDEDDDLLGVLLRLQEED 20419

20418 SLTSPLTAEVIGALVI (0) 20371

17791 DIFGAATDTTASTLEWVMVELMRNPRAMEKAQQEVRNTLGHEKGKLIGTDISELHYLRMV 17612

17611 IKETLRLHPSSALILRQS (fs) 17558

17558 QGNCRVMGYDIPQATPVLINTFAVARDAKYWDNAEEFKPERFENSGADIRTSTAHLGFVP 17379

17378 FGAGCRQCPGALFATTTLELILANLLYHFDWALPDGVSPESLDMSEVMGITLHRSSSLHL 17199

17198 HATLSRLGFVSHSGQ* 17151

 

aaaa01000843.1 $FI may not have an ortholog

 

#41

>AP004872.1 $F CYP71U3 (japonica cultivar-group) chr 2 = AP005536.1  92% to aaaa01000843.1 (best match in indica) low percent for an ortholog

98325 MDELSIENHSPISMDELSFGSLCMVAMATLALALALMVMGAHRRGGEKGATTGAKNLPP 98149

98148 GPWNLPVIGSLHHLLGASPPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEVL 97978

97977 KARDPAFADRARSTTVDAVSFGGKGVIFAPYGEHWRHARRVCLAELLSARQVRRLESIRQ 97798

97797 EEVSRLVDSIIAGSSNAAAVDMTRALAALTNDVIARAVFGGKCARQEEYRRELGVLTTLV 97618

97617 AGYSMVDLFPSSRVVRWLSRRTERRLRRSHAEMARIVGSIIEERKEKKGSDAGVGAKDED 97438

97437 DDLLGVLLRLQEEDGLTSPLTAEVIAALV 97351

94360 XDIFGAATDTTASTLEWIMVELMRNPRAMDKAQQEVRNTLGHEKGKLIGIDISELHYLCMV 94181

94180 IKETLRLHPASALILRQSRENCRVMGYDIPQATPVLINTFAVARDPKYWDNAEEFKPE 94007

94006 RFENSGADIRTSIAHLGFIPFGAGCRQCPGALLATTTLELTLANLLYHFDWALPDGVSPK 93827

93826 SLDMSEVMGITLHRRSSLHLHTTLTRSGFFSHSGR 93722

 

#486 incorrectly labeled as #200

>aaaa01025826.1 CYP71V1 (indica cultivar-group) orth AC096855.1 $F chr 3 98%

604 YFFFLQSLLLCIAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHRALRD 783

784 LAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGWADILFS 963

964 PSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPV 1116

1672 DMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFHGKAVVMEADLQASNLRYL 1851

1852 KLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVNVWAIGRHPKYWDDAEEFK 2031

2032 PERFDDGAIDFMGGSYKFIPFGSGRRMCPGFNYGLASMELVLVAMLYHFDWSLPVGVKEV 2211

2212 DMEEAPGLGVRRRSPLLL 2265

 

>AC096855.1 $F CYP71V1 chromosome 3 clone OJ1365_D05 54% to AC087550 frameshift before PERF?

= AQ326032 AQ329780 73% to AF321860 Lolium rigidum similar to CYP71D sequences

87309 MDDYFFLQSLLLCVAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHR 87136

87135 AMRDLAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGW 86962

86961 ADILFSPSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPVNLS 86782

86781 VLFHSTTNDIVARAAFGRKRKSAPEFMAAIKAGVGLSSGFKIPDLFPTWTTALAAVTGMK 86602

86601 RSLRGIHKTVDAILQEIIDERRCVRGDKINNGGAADDQNADENLVDVLIALQEKGGF 86431 (1)

86339 GKSVTTPWVIVTHMICTLDVQDMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFH 86157

86156 RKAVVTEADLQASNLRYLKLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVN 85977

85976 VWAIGRDPK Y*Y*E DAEEFKPEQFDDDAIDFMGGSYEFIPFGSGRRMCPGFNYGLASMEL 85797

85796 VLVAMLYHFDWSLLVGVKEVDMEEAPGLGVRRRSPLLLCATPFVPAAVSADY* 85638

 

#200

>aaaa01006345.1 CYP71V2 (indica cultivar-group) 77% to AC096855.1 $F

no orth 9/15/02

10409 VLQLLKLLLVRHRRPRTPPGPWRLPVIGSMHHLVNVLPHRKLRELAAVHGPLMMLQLGET 10230

10229 PLVVATSKETARAVLKTHDTNFATRPRLLAGEIVGYEWADILFSPSGDYWRKLRQLCAAE 10050

10049 ILSPKRVLSFRHIREDE 9999

9730 VNLSVMFHSVTNSIVSRAAFGKKRKNAAEFLAAIKSGVGLASGFNIPDLFPT 9575

9574 WTGILATVTGMKRSLRAIYTTVDGILEEIIAERKGIRDEKISGGAENVDENLVDVLIGL 9398

9397 QGKGGFGFHLDNSKIKAIILQDMFA 9218

9217 GGTGTSASAMEWGMSELMRNPSVMKKLQAEIREVLRGKTTVTEADMQAGNLRYLKMVI 9044

9043 REALRLHPPAPLLVPRESIDVCELDGYTIPAKSRVIINAWAIGRDPKYWDNPEEFRPERF 8864

8863 EDGTLDFTGSNYEFIPFGSGRRMCPGFNYGLASMELMFTGLLYHFDWSLPEGVNEVDMAE 8684

8683 APGLGVRRRSPLMLCATPFVPVV 8615

 

#139

>aaaa01004037.1  CYP71V3 (indica cultivar-group) ortholog to AL732378.3 99%

12867 LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARD 12697

12696 ILKTHDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHI 12517

12516 REDEVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIM 12337

12336 ASGFYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNL 12169

12168 VDVLLSLKDKGDFGFPITRDTIKAIVL 12088

11880 DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 11701

11700 VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNAWAISRDPRYWEDAEEFKPE 11521

11520 RFAEGGIDFYGSNYEYTPFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEV 11347

11346 DMTEAPGLGVRRKTPLLLCAAPYVASPI 11263

 

>AL732378.3 $F CYP71V3

      MAWLDDVLSLCNNNTRMCNALVLSVVVVSFLQLLKHVLLTPSRLP

64951 LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARDILK 64772

64771 THDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHIRED 64592

64591 EVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIMASG 64412

64411 FYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNLVDVLLSL 64232

64231 KDKGDFGFPITRDTIKAIVL 64172

63964 DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 63785

63784 VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNSWAISRDPRYWEDAEEFKPE 63605

63604 RFAEGGIDFYGSNYEYTQFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEVDM 63425

63424 TEAPGLGVRRKTPLLLCAAPYVASHIYA* 63338

 

#193

>aaaa01006105.1a $FI CYP71V4 (indica cultivar-group) similar to Lolium rigidum AF321859

MDELLYRALLLSVLAVALLQIIEAFLIIIRAKPAAPPLPPGPWRLPVIGSMHHLAGKLPHRALRD 3196

LAAAHGPLMMLRLGETPLVVASSREMAREVLRTHDANFATRPRLLAGEVVLYGGADILF 3019

SPSGEYWRRLRQLCAAEVLGPKRVLSFRHIREQE (0) 2914

MESQVEEIRAAGPSTP

VDLTAMFSFLVISNVSRASFGSKHRNAKKFLSAVKTGVTLASGFKIPDLFPTWRKVLAAV 1750

TGMRRALEDIHRVVDSTLEEVIEERRSAREDKARCGMVGTEENLVDVLIGLHEQGGC 1579

LSRNSIKSVIFDMFTAGTGTLSSTLGWGMSELMRSPMVMSKLQGEIREVFYGKATVGEED 1399

IQASRLTYLGLFIKETLRLHPPVPLLVPRESIDTCEIKGYMIPARSRIIVNAWAIGRDPR 1219

YWDDAEEFKPERFEKNIVDFTGSCYEYLPFGAGRRMCPGVAYGIPILEMALVQLLYHFDW 1039

SLPKGVVDVDMEESSGLGARRKTPLLLCATPFVVPVL* 925

 

aaaa01006105.1a  no japonica ortholog found 9/7/02

 

#439

>aaaa01045745.1 CYP71V4 (indica cultivar-group) 64% to AC096855.1 $F

98% to aaaa01006105.1a $FI 2 diffs

2   ESIDTCEIKGYMIPARSRIIVNAWAIGRDPRYWDDAEEFKPKRFEKNMVDFTGSCYEYLP 181

182 FGAGRRMCPGVAYGIPILEMALVQLLYHFDWSLPKGVVDVDMEESSGLGARRKTPLLL 355

 

no japonica ortholog found 9/12/02

 

#194

>aaaa01006105.1b $FI CYP71V5 (indica cultivar-group)

6903 MDGLLYQALLLSALAVAVLQIVKLAVVNRGKKQAAAAAPTPPGPWRLPVIGSMHHLAGKLAHRALRD 6703

6702 LAAVHGPLMMLQLGETPLVVVSSREVAREVLRTHDANFATRPRLLAGEVVLYGGADILF 6526

6525 SPSGEYWRKLRQLCAAEVLGPKRVLSFRHIREQE (0) 6421

     MASRVERIRAVGPSVP

5914 VDVSALFYDMAISIVSCASFGKKQRNADEYLSAIKTGISLASGFKIPDLFPTWRTVLAAV 5735

5734 TGMRRALENVHRIVDSTLEEVIEERRGAARECKGRLDMEDNEENLVDVLIKLHEQGG 5564

5563 HLSRNSIKSVIFDMFTAGTGTLASSLNWGMSELMRNPRVMTKLQGEIREAFHGKATV 5393

5392 GEGDIQVSNLSYLRLFIKETLRLHPPVPLLVPRESIDMCEVNGYTIPARSRIVVNAWAIG 5213

5212 RDPKYWDDPEEFKPERFEGNKVDFAGTSYEYLPFGAGRRICPGITYALPVLEIALVQLIY 5033

5032 HFNWSLPKGVTEVDMEEEPGLGARRMTPLLLCATPFVVPVL* 4907

 

aaaa01006105.1b no japonica ortholog found 9/7/02

 

#195

>aaaa01006105.1c $PI CYP71V6P (indica cultivar-group) 79% to AAAA01006105.1b

but no Nterm exon present in 2000bp to end of clone

12642 (0) IASRVDLICAVGPLTL

12594 VDVSALFYDITISIASCASFGKKHRNVDEYLSSIKTRVSLASRFKIPDLFPSWRTMLAMV 12415

12414 TGMRRALEEVHGIVDSTLEDVIEERQGEKEDKTRPDMVDTKENLVDVLIGLHENGA 12247

12246 HLSRDSIKAVIFDMFTAGTGTLASALNWGMSKLMRNPRVMTKLQGEIRKAFHGKVTVG 12073

12072 EDDIQAANLPYIRLFIEETLLLHPVVPLLVPRESIDVCEVNGYTILARSRIVVNAWAIGR 11893

11892 DPKYWDNPEEFKPEWFEGNIVDFPGSSYEYLPFGAG*RMCPGIAYGLPVLEMALVQLLYH 11713

11712 FD*SLPNGVMKVDMEEEPGLGARRKTPLLLNLFVIPVLQGQQ*  11578

 

aaaa01006105.1c no japonica ortholog found 9/7/02

 

#406

>aaaa01023722.1 $FI CYP71W1 (indica cultivar-group

71  MELTTLLLLALISFFFLVKLIARYASPSGRESALRLPPGPSQLPLIGSLHHLLLSRYGDL 250

251 PHRAMRELSLTYGPLMLLRLGAVPTLVVSSAEAAAEVMRAHDAAFAGRHLSATIDILSC 427

428 GGKDIIFGPYTERWRELRKVCALELFNHRRVLSFRPVREDEVGRLLRSVSAASAEGGA 601

602 ACFNLSERICRMTNDSVVRAAFGARCDHRDEFLHELDKAVRLTGGINLADLYPSSRLVRR 781

782 LSAATRDMARCQRNIYRIAESIIRDRDGAPPPERDEEDLLSVLLRLQRSGGLKF 943

944 ALTTEIISTVIF (0) 979

1125 DIFSAGSETSSTTLDWTMSELMKNPRILRKAQSEVRETFKGQDKLTEDDVAKLSYLQLVI 1304

1305 KETLRLHPPAPLLIPRECRETCQVMGYDVPKGTKVFVNVWKIGREGEYWGDGEIFRPERF 1484

1485 ENSTLDFRGADFEFIPFGAGRRMCPGIALGLANMELALASLLYHFDWELPDGIKSEELDM 1664

1665 TEVFGITVRRKSKLWLHAIPRVPYYSTY* 1751

 

no japonica ortholog found 9/12/02

 

#330

>aaaa01013880.1b CYP71W2 (indica cultivar-group) orth of AC120537.1a

stops are even in the same location

4325 ARRAQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCAR 4507

     gap

4868 ETFFNMDNLRTHDTYRKKNHSGNSQHCTASFIVFSFSELQLKMTIWQSHHYKLPINLRK 5044

5045 YFQQGARQLNDTLVGNI*ASEKYPQVMQKAQTEVREKFR 5161

     G*DKLIKDDMNRLSYLHLVIQE 5226

5227 TLRLH 5241

 

>AC120537.1a CYP71W2 chromosome 3 clone pseudogene fragment

2543 ARRVQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCA 2364

2363 RRDEFLHVQARGLRQARGRVQLGRPVPIVVASELAQRRAAVGRPSVAAGAFARCGRPAET 2184

2183 FFNMDNLRTHDTYRKKNHSGNSQHCTAFSALSFSELQLKMTIWQSHHYKLPINLREIFS  

2006 SAGSETLNDTLVGNI*ANEKYPQVMQKAQTEVREKFRG*DKLIKDDMNRLSYLHL 1846

1845 VIQETLRLH

 

#329

>aaaa01013880.1a $FI CYP71W3 (indica cultivar-group) ortholog of AC120537.1b AQ869247.1

 801 MEVSLPLLIGVVLAFLLLFVLVNVKNSCRSWWPPPEKEKKKLRLPPGPWRLPLVGSLHHVLLS (fs) 989

 991 RHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYLTPTLA 1170

1171 VLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHVREDEAARLVRSVAAECAG 1347

1348 RGGAAVVSVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLYPSSW 1527

1528 LARRLSCAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPL 1689

1690 TTDLITNVVL (0)

2513 DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEVMDKL 2671

2672 SYLRLVIRETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPE 2851

2852 VFKPERFENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRD 3031

3032 RNDEIDLSETFGITAKRKSKLMVYATQRIPCLG* 3133

 

>AC120537.1b $F CYP71W3 chromosome 3 clone

AQ869247.1 nbeb0034D08r CUGI Rice BAC genomic Length = 447 53% to 99A1

AZ130570.1 OSJNBb0104D19r CUGI Rice BAC genomic Length = 327

80150 MEVSLPLLIGVVLAFLLLFVLVNIKNSCRSWWPPPEKEKKKLRLPPGPWQLPLVGSLHHV 80329

80330 LLSRHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYL 80503

80504 TPTLAVLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHGREDEAARLVRSVAA 80683

80684 ECAARGGAAVVNVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLY 80863

80864 PSSWLARRLSGAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPLTT 81043

81044 DLITNVVL (0) 81067

81865 DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEMMDKLSYLRLVI 82044

82045 RETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPEVFKPERF 82224

82225 ENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRDRNDEIDL 82404

82405 SETFGITAKRKSKLMVYATQRIPCLG 82482

 

#392

>aaaa01021177.1 $FI CYP71W4 (indica cultivar-group) ortholog to AC120537.1c

AAAA01039974.1 (indica cultivar-group) Nterm 132 aa

396 MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERRLRLPPGPWRLPLVGSLHHVLLSR 217

216 HGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAEAAREVLKTHDACFASRHMTPTLAV 37

36  FTRGGRDILF 7

3930 SPYGDLWRQLRRICVLELFSARRVQSLRHVREDEAARLVRAVAEECAIGGGGGAVVPIGD 3751

3750 MMSRMVNDSVVRSAIGGRCARRDEFLRELEVSVRLTGGFNLADLYPSSSLARWLSGALRE 3571

3570 TEQCNRRVRAIMDDIIRERAAGKDDGDGEDDLLGVLLRLQKNGGVQCPLTTDM 3412

3411 IATVIM (0) 3394

2892 EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLSYLHLVI 2713

2712 RETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEIFKPERF 2533

2532 NANLVDFKGNYFEYIPFGSGRRVCPGITLGLTSMELVLASLLYYFDWELPGGKRCEEIDM 2353

2352 SEAFGITVRRKSKLVLHATPRVPCLH* 2272

 

>AC120537.1c $F CYP71W4 chromosome 3 clone OSJNBb0042N11

AQ573952 nbxb0083G09r 60% to AQ259669 52% to 99A1 51% to 71B23

BM039053 clone V013G04.Length = 527

103769 MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERR

103880 LRLPPGPWRLPLVGSLHHVLLSRHGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAE 104056

104057 AAREVLKTHDACFASRHMTPTLAVFTRGGRDILF SPYGDLWRQLRRICVLELFSARRVQS 104236

104237 LRHVREDEAARLVRAVAEECAIGGGGGAVVPIGDMMSRMVNDSVVRSAIGGRCARRDEFL 104416

104417 RELEVSVRLTGGFNLADLYPSSSLARWLSGALRETEQCNRRVRAIMDDIIRERAAGKDDG 104596

104597 DGEDDLLGVLLRLQKNGGVQCPLTTDMIATVIM (0) 104695

105190 EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLS 105351

105352 YLHLVIRETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEI 105531

105532 FKPERFNANLVDFKGNDFEYIPFGSGRRVCPGITLGLTSMELVLASLLYHFDWELPGGKR 105711

105712 CEEIDMSEAFGITVRRKSKLVLHATPRVPCLH* 105810

 

#393

>AC078894.1 $P CYP71W5P chromosome 10 clone OSJNBa0096G08 12 unordered pieces starts 123

probable pseudogene fragment 47% to 71B1 2 diffs with AP004175.1 $P

80% to 71W4

51119 LRRICVLELFSAHRV*SLHHVREEEAAPLVRVVADIRSPLGP 50994

 

#394

>AP004175.1 $P CYP71W6P chromosome 2 clone OJ1006_B12 pseudogene fragment

94% to AC078894 51119-50994

64520 LRRICMLELFSAHRV*SLHHVREEEAARLVRVVA 64419

 

#311

>aaaa01012657.1 CYP71X1 (indica cultivar-group) orth AP003990.1g $P chr 2 99%

7170 FTPLFLLAVLPLKLTNGGDGV*LPPGPWRLPVIGSMHHLMGESLVHRAMADLARRLDAPL 7349

7350 MYLKLGEVPVVLASSPCAAREIMRVHDVAFASRP 7451

7490 RQLRKICVVELLSARRVRTFRRVREEE 7570

6081 DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 5908

5907 VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 5782

5782 AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCPGLAFAEAIMDLLFST 5603

5602 LLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPIL 5483

 

>AP003990.1g $P CYP71X1 chromosome 2 clone OJ1073_F05 pseudogene

42530 MDHVLACVGILVAFTPLFLLAVLPLKLTNGGDGVKLPPGPWRLPVIGSMHHLMGESLVHRAMAD 42721

42722 LARRLDAPLMYLKLGEVPVVLASSPCAAREIMRAHDVAFASRPLSPTVRRMR 42877

42878 PPPPRRRQLRKICVVELLSARRVRTFRRVREEEVARLVGALVCLAHVA 43021 gap

      AMIGARFERRDEFLE

      missing mid region

43862 DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 44035

44036 VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 44161 frameshift

44161 AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCSGLAFAEAIIDLLFS 44337

44338 TLLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPILRVPQTQTSSALLF* 44502

 

 

#222

>aaaa01007431.1b CYP71X2 (indica cultivar-group) orth AP003990.1f

10785 MYDAVACVVAVVVVVIFAMLRVKLARSGDGGGGGGGGVR

10668 LPPGPWRLPVIGSLHHVVGDRLLHRSMARIARRLGDAPLVYLQLGEVPVVVASSPGAARE 10489

10488 VTRTHDLAFADRALNPTARRLRPGGAGVALAPYGALWRQLRKICVVELLSARRVRSFRRV 10309

10308 REEEAGRLVGALAAAAASPGEEAAVNFTERIAEAVSDAALRAMIGDRFERRDEFLQ 10141

10140 ELTEQMKLLGGFSLDDLFPSSWLASAIGGRARRAEANSRKLYELMDCAIRQHQQQRAE 9967

9966  AAVVDGGAGVEDDKNQDLIDVLLNIQKQGELETPLTMEQIKAVIL 9790

9595 DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 9422

9421 IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNVWAIGRDPKYWDDAEEFRPE 9242

9241 RFEHSTVDFKGVDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMVASEL 9062

9061 DMTEEMGITVRRKNDLHLRPXXXXXXXXXXXXXXXRERERHFV 8936

 

>AP003990.1f $F CYP71X2 chromosome 2 clone OJ1073_F05

38238 MYDAVACVVAVVVVVVFAMLWVKLARSGDGGGGGSGGVRLPPGPWRLPVIGSLHHVVGDRLLHRSMA 38438

38439 RIARRLGDAPLVYLQLGEVPVVVASSPGAAREVTRTHDLAFADRALNPTARRLRPGGAGV 38618

38619 ALAPYGALWRQLRKICVVELLSARRVRSFRRVREEEAGRLVGALAAAAASPGEEA 38783

38784 AVNFTERIAEAVSDAALRAMIGDRFERRDEFLQELTEQMKLLGGFSLDDLFPSSWLASAI 38963

38964 GGRARRAEANSRKLYELMDCAIRQHQQQRAEAAVVDGGAGVEDDKNQDLIDVLLNIQKQG 39143

39144 ELETPLTMEQIKAVIL 39191 (0)

39428 DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 39601

39602 IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNAWAIGRDPKYWDDAEEFRPE 39781

39782 RFEHSTVDFKGIDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMAASE 39958

39959 LDMTEEMGITVRRKNDLHLRPHPPCVVRSNFRSFVERERERHFV* 40093

 

#221

>aaaa01007431.1a CYP71X3 (indica cultivar-group) orth AP003990.1e $F chr 2 99%

lower case does not match japonica seq, but matches seq b

4252 NLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADAAR 4431

4432 EIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSFHG 4611

4612 VREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERREDFLEV 4773

4774 LPEIVKLASGFSLDDLFPSSwlagaiggsrrRGEAVNRASYELVDSAFRQRQQQKEAM 4947

4948 AAPPPDIAKEEEDDLMDELIRIHKEGSLEVPLTAGNLKAVI  5070

5313 ELFCAGSETSSNAIQWAMSELVRNPRVMEKAQNEVRSILKGKPTVTEADMVDLTY 5486

5487 VKMIVKETHRLHPVLPLLTPRVC*QTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 5666

5667 KPERFEDSEIDLKGTNYEFIPYGAGRRICPGLALAQVSIEFILTTLLYHFNWELPKGAAP 5846

5847 KELDMTEDMGLTIRRKNDLYLLPTL 5921

 

>AP003990.1e $F CYP71X3 chromosome 2 clone OJ1073_F05

33613 MEQVSCFAAAAAAVLVVLSLARMLLAPRREWD

33709 GLNLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADA 33888

33889 AREIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSF 34068

34069 HGVREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERRE 34221

34222 DFLEVLPEIVKLASGFSLDDLFPSS 34296 check joint

      GSPAPSAARGEAVNRASYELVDSAFRQRQQQKEAMAAPPPDIAKEE

      EDDLMDELIRIHKEGSLEVPLTAGNLKAVIL 34528 (0)

34777 ELFCAGSETSSNAIQWAMSELVRNPKVMEKAQNEVRSILKGKPTVTEADMVDLTY 34941

34942 VKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFIKSWAIMRDPKHWDDAETF 35121

35122 KPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILTMLLYHFNWELPNGAA 35298

35299 PEELDMTEDMGLTIRRKNDLYLLPTLRVPLTA* 35397

 

#221

>aaaa01070587.1 CYP71X3 (indica cultivar-group) orth AP003990.1e $F chr 2 96%

see aaaa01007431.1a for ortholog

84  YQVKVSHMLHFGIV*ELFCAGSETSSNAIQWAMSELVRNPRVMEKAQNEVRSILKGKP 257

258 TVTEANMVDLTYVKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFINSWTIM 437

438 RDPKHWDDAETFKPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILATLLY 617

618 HFNWELPNGAAPKELDMTEDMGLTIRRKNDLYLLPTL 728

 

#376

>AP003990.1d $F CYP71X4 chromosome 2 clone OJ1073_F05 no ortholog

27391 MEQVSCFAAAAAVVVVVLLLARMLLAPRGEWDGLNLPPSPPRLPFIGSFHLLRRSPLVHRALADVARQL 27597

27598 GSPPLMYMRIGELPAIVVSSADAAREVMKTHDIKFASRPWPPTIRKLRAQGKGIFFEPYG 27777

27778 ALWRQLRKICIVKLLSVRRVSSFHGVREEEAGRLVAAVAATPPGQAVNLTE 27930

27931 RIEVVIADTTMRPMIGERFERREDFLELLPEIVKIASGFSLDDLFPSSWLACAIGGSQRR 28110

28111 GEASHRTSYELVDSAFRQRQQQREAMAASPPDIAKEEEDDLMDELIRIHKEGSLEVPLTA 28290

28291 GNLKAVIL 28314 (0)

28577 DLFGAGSETSSDALQWAMSELMRNPRVMEKAQNEVQSILKGKPSVTEADVANLKY 28741

28742 LKMIVKETHRLHPVLPLLIPRECQQTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 28921

28922 KPERFEDGEIDLKGTNYEFTPFGAGRRICPGLALAQASIEFMLATLLYHFDWELPNRAA 29098

29099 PEELDMTEEMGITIRRKKDLYLLPTLRVPLTA* 29197

 

#375

>aaaa01017763.1 CYP71X5 (indica cultivar-group) orth AP003990.1c $F chr 2 97%

1306 FLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 1485

1486 APLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSPTTRRLRCDGEGVVFATYGAL 1665

1666 WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERITAVITDAT 1842

1843 MRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAEANH 2007

2008 RRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGNI 2181

2182 KAIIL

2562 DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKY 2735

2736 LKLVIKETLRLHPVLPLLLPRECQEACNVIGYDVPKYTTVFINVWAINRDPKYWDMAEMF 2915

2916 KPERFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYHFDWELPSGMSP 3095

3096 EELDMTEDMGLSVRRKNDLYLHPTV 3170

 

>AP003990.1c $F CYP71X5 chromosome 2 clone OJ1073_F05 one in frame stop at W

AQ259669 61% identical to AQ328148 53% to 76C2 also has stop at W

AQ690680.1 nbxb0082B18f CUGI Rice BAC genomic clone Length = 768

AQ579195 nbxb0084A11f AQ509836 nbxb0094K16f 72% identical to AQ259671

16507 MEKVAWCACFLLLALMVVRLTAKRRGDNGAERLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 16710

16711 APLMSLRLGEVPVVVASSADAAREIMRTHDVAFATRPWNPTTRRLRCDGEGVVFATYGAL 16890

16891 WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERI 17043

17044 TAVITDATMRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAE 17223

17224 ANHRRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGN 17403

17404 IKAIIL 17421 (0)

17795 DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKYLKL 17968

17969 VIKETLRLHPVLPLLLPRECREACNVIGYDVPKYTTVFINV*AINRDPKYWDMAEMFKPE 18148

18149 RFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYYFDWELPSGMSPEE 18325

18326 LDMTEDMGLSVRRKNDLYLHPTVCVPL* 18409

 

#75

>aaaa01002047.1c $PI CYP71X6 (indica cultivar-group) ortholog of AP003990.1b 99%

20381 MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRT 20560

20561 MADLARRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSEGVG 20740

20741 LVFAPYGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNVSERI 20920

20921 AALVSDAAVRTIIG 20962

20979 VAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMA 21125 aa 452 out of sequence

21126 DLFPSSRLASFIGGTTRRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDI 21299

21300 VDVLLRIQKEGSLQVPLTMGNIKAVVL 21380

22004 DLFSAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKLII 22183

22184 KETLRLHPVVPLLLPRECQETCKVMDYDIPIGTIVLVNVWVIGRDPKYWD 22333

22334 DAKTFRLERFEDGHVDFKGMNFEYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFP 22513

22514 DGILPAKMDMMEVMGSTV*KKNDLYLVPNAHVPVAP 22621

 

>AP003990.1b $P CYP71X6 chromosome 2 clone OJ1073_F05

11020 MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLA 11214

11215 RRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 11394

11395 YGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNV 11547

11548 SERIAALVSDAAVRTIIGDRFERRDEFLEGLAEGIKITSGFSLGDLFPSSRLASFIGGTT 11727

11728 RRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDIVDVLLRIQKEGSLQVPLT 11907

11908 MGNIKAVVL 11934 (0)

12556 DLFGAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKL 12729

12730 IIKETLRLHPVVPLLLPRE 12786 frameshift

      CQETCKVMDYDVPIGTIVLVNMWVIGRDPKYWEDAKTFRPERFEDGHIDFKGMNF 12955

12956 EYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFPDGISPAKMDMMEVMGSTVRKKN 13135

13136 DLYLVPNAHVPVAP* 13180

 

note cluster continues on AP003990.1 to sequence j

 

#74

>aaaa01002047.1b $FI CYP71X7 (indica cultivar-group) ortholog of AP003990.1a 99%

16718 MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 16897

16898 LVHRTMAGLARGLGDAPLLSLRLGEVPVVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 17077

17078 MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAATRRPG 17257

17258 EAAVNVGERLTVLITDIAMRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPSSRLAS 17437

17438 FVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRIQKEGG 17617

17618 LEVPLTMGVIKGVIR 17662

17911 DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKYLK 18081

18082 LVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETFIP 18261

18262 ERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVAPSN 18441

18442 LDMEEEMGITIRRKNDLYLVPKVHVPL 18522

 

>AP003990.1a $F CYP71X7 chromosome 2 clone OJ1073_F05 42% to 71B24

AQ259671 323-379 region I-helix 55% to 71B4

AQ691116.1 nbxb0088K01f CUGI Rice BAC genomic clone Length = 544

7359 MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 7538

7539 LVHRTMAGLARGLGDAPLLSLRLGEVPIVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 7718

7719 MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAA 7883

7884 TRRPGEAAVNVGERLTVLITDIAVRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPS 8063

8064 SRLASFVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRI 8243

8244 QKEGGLEVPLTMGVIKGVIR 8303 (0)

8551 DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKY 8715

8716 LKLVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETF 8895

8896 IPERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVA 9072

9073 PSNLDMEEEMGITIRRKNDLYLVPKVRVPL* 9165

 

#72

>aaaa01002047.1a $FI CYP71X8 (indica cultivar-group)

ortholog of AP004000.1a 99%

2682 MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMG 2861

2862 GPLVHRTMADLARRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWSSTIRV 3041

3042 LMSDGVGLVFAPYGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQP 3221

3222 VNVSERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVG 3401

3402 GTTRRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGN 3581

3582 IKAVVL 3599

4057 ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKLII 4236

4237 KETLRLHPVVPLLLPRECRETCEVMGYDIPIGTIVLVNVWAIGRDPKYWEDAETFIPERF 4416

4417 EDGHIDFKGTNFEFIPFGAGRRMCPGMVFAEVIMELALASLLYHFDWELPDGISPTKVDM 4596

4597 MEELGATIRRKNDLYLIPAVRVPLSTVL 4680

 

>AP004000.1a $F CYP71X8 chromosome 2 clone OJ1115_B01

95316 MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLA 95101

95100 RRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWTSTIRVLMSDGVGLVFAP 94921

94920 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQPVNV 94768

94767 SERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVGGTT 94588

94587 RRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGNIKA 94408

94407 VVL 94399 (0)

93945 ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKL 93772

93771 IIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTVLVNVWAIGRDPKYWEDAETFIPE 93592

93591 RFEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTK 93415

93414 VDMMEELGATIRRKNDLYLIPTVRVPLSTVL* 93319

 

#72

>aaaa01027906.1 CYP71X8 (indica cultivar-group) orth AP004000.1a $F chr 2 99%

see aaaa01002047.1a for ortholog

1930 LFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKLI 1757

1756 IKETLRLHPVVPLLLPRECRETCEVMGYDIPIGITVLVNVWAIGRDPKYWEDAETFIPER 1577

1576 FEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTKMD 1397

1396 MMEELGATIRRKNDLYLIPAV 1334

 

#72

>aaaa01012191.1 CYP71X8 (indica cultivar-group) orth AP004000.1a $F chr 2 98%

see aaaa01002047.1a for ortholog

762 KPRLPPGPWRLPVIGNLHQIMVGGPLVHRTMADLARRLDAPLMSLRLGELRVVVLYYRFI 583

582 *IPALSPFYLATRPWSSTIRVLMSDGVGLVFAPYGALWRQLRKIAVVELLSARRVQSF 409

408 RRIREDEVGRLVAAVAAAAAASAAQPVNVSERIAALISDSAVRTIIGDRFERRDEF 241

240 LEGLAEGIKITSGFSLGDLFPS 175

411 TARRKADLHLRPCL 370

 

#73

>aaaa01030108.1 CYP71X9P orth of AP004000.1b

1684 RVIASSTGAACREFTETHDVKFATRPWSSTVRVLMADGLG 1565

1556 GLVFAPYGALWRQLRKIAMVELLSARRVQSHRRYRRRGDAAR 1431

 

>AP004000.1b $P CYP71X9P chromosome 2 clone OJ1115_B01 pseudogene fragment

3 aa diffs with AAAA01030108.1

101503 RVVASSTDAACREFTKTHDVKFATRPWSSTVRVLMADGLG 101393

 

#414

>aaaa01025401.1 CYP71X10 (indica cultivar-group) orth AP004000.1c $F chr 2 98%

2324 KPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLARRLDAPLMSLRLGELRVVVASSADA 2145

2144 AREITKTHDVAFATRPWSPTIRVLMSDGVGLVFAPYGALWRQLRKIAMVELLSARRVQSF 1965

1964 RRIREDEVGRLVADVAAAQPGEAVNVSERITALISDSAVRTIMGDRFEK 1818

917 LFGAGSETSASTLHWAMTELIMNPKVI 837

832 DELSNVIKGKQTISEDDLVELRYLKLVIKETLRLHPVVPLLLPRECRETCEVMGY 659

658 DIPIGTTMLVNVWAIGRDPKYWEDAETFRPERFEDGHIDFKGTDFEFIPFGAGRRKCPGM 479

478 AFAEAIMELVLASLLYHFDWELPDGISPTKVDMMEELGATIRKKNDLYLVPTV 320

 

>AP004000.1c $F CYP71X10 chromosome 2 clone OJ1115_B01

109813 MAMVQYVTGYLCLLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRL

       PVIGNLHQVAMGGPLVHRTMADLA 109595

109594 RRHDAPLMSLRLGELRVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 109415

109414 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVCRLVAAVAAAQPGEAVNV 109262

109261 SERITALISDSAVRTIMGDRFEKRDEFLEGLAEGDRIASGFSLGDLFPSSRLASFVGGTT 109082

109081 RRAEANHRKNFGLIECALRQHEERRAAGAVDDDEDLVDVLLRVQKEGSLQVPLTMGNIKAVIL 108893 (0)

107479 ELFGAGSETSASTLHWAMTELIMNPKVMLKAQDELSNVIKGKQTISEDDLVELRYLKL 107306

107305 VIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTMLVNVWAIGRDPKYWEDAETFRPE 107126

107125 RFEDGHIDFKGTDFEFIPFGAGRRMCPGMAFAEAIMELVLASLLYHFDWELPDGISPTK 106949

106948 VDMMEELGATIRKKNDLYLVPTVRVPMSTAL* 106853

 

#106

>aaaa01002996.1a $FI CYP71X11 (indica cultivar-group) ortholog of AP004000.1d >99%

13279 MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPL 13458

13459 VHRALADLARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTA 13638

13639 DGEGLVFAPYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 13818

13819 NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 13998

13999 AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 14178

14179 TMGIIKAVIL 14208

14345 DLFSAGSETSATTIQWAMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLADLNYLKLII 14524

14525 KETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVFVNAWAIGRDPKYWDDPEEFKPERF 14704

14705 EDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSELDM 14884

14885 TEEMGITVRRKNDLYLHAVVRVPLHATTP 14971

 

>AP004000.1d $F CYP71X11 chromosome 2 clone OJ1115_B01

123850 MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPLVHRALAD 124050

124051 LARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTADGEGLVF 124230

124231 APYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 124389

124390 NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 124569

124570 AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 124749

124750 TMGIIKAVIL 124779 (0)

124917 DLFSAGSETSATTIQW AMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLTDLNYLKL 125090

125091 IIKETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVLVNAWAIGRDPKYWDDPEEFKPE 125270

125271 RFEDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSE 125447

125448 LDMTEEMGITVRRKNDLYLHAVVRVPLHATTP* 125546

 

#107

>aaaa01002996.1b $FI CYP71X12 (indica cultivar-group) ortholog of AP004000.1e 99%

16497 MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPQ 16676

16677 VHRAMADLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMA 16856

16857 DGKGLTFARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAVV 17036

17037 NVSERAAVLVTDTTVRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFP 17186

17187 SSRLASLVSGTARRAAASHRKMFELMDCAIRHHQERKAAMDADEDILDVLLRMQKEGGHD 17366

17367 APLTMGDVKDTIL 17405

17534 DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKLVI 17713

17714 KETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPDRC 17893

17894 ENNKYDFRGTDFEYIPFGSRRKICPCPAFTHAILELALAALLYHFDWELPCGVAQ 18058 frameshift

18055 SGEVDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT 18174

 

>AP004000.1e $F CYP71X12 chromosome 2 clone OJ1115_B01

AP004066.1 chromosome 2 clone OJ1572_F02, 55% to 71B17 aa 342-511 runs off beginning

contig of AA751324 and AQ327456 54% IDENTICAL TO 71B24   1/98 K-HELIX

58% identical to AQ328148

127153 MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPHVHRAMA 127350

127351 DLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMADGEGLA 127530

127531 FARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAV 127689

127690 VNVSERAAVLVTDTX 127731 frameshift

127734 VRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFPSSRLASLVSGTARRAAASHRKMFE 127913

127914 LMDCAIRHHQERKAAMDADEDILDVLLRIQKEGGHDAPLTMGDVKDTIL 128060 (0)

128189 DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKL 128362

128363 VIKETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPD 128542

128543 RCENNKYNFRGTDFEYIPFGSRRKICPGPAFTHAILELALAALLYHFDWELPCGVAPGE 128719

128720 VDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT* 128833

 

#96

>aaaa01002645.1a $PI CYP71X13P (indica cultivar-group) sequence gap at 950 sequence

similarity stops at 236 (80% identical to AAAA01002645.1b)

ortholog to AP005385.1b $P 99%

949 PLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRLRPHREGVVFATYGAM 773

772 WRQLRKVCIVEMLSARRVRSFRRVREEEAASLAAAVAASLSSPPARRDAVNVSALVALAV 593

592 ADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFPSSRIAAAVGGMTRRAEASHR 413

412 KGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLLRIQKEGALDMPLTMDNIKAVI 236

 

>AP005385.1b $P CYP71X13P (japonica cultivar-group) chr 2 = aaaa01012992.1

146254 MDQVACWSICAFLALLLLVRIGGKRGRGGDGARLRQPPPGPWRLPVIGNLHQLMLRGP 146427

146428 LVHRTMADLARGLDDAPLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRL 146607

146608 RPHREGVVFAPYGAMWRQLRKVCIVEMLSARRVRSFRRVREEEAANLAAAVAASLSSPPA 146787

146788 RRDAVNVSALVAAAVADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFP 146952

146953 SSRIAAAVGGMTRRAEASHRKGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLL 147126

147127 RIQKEGALDMPLTMDNIKAVI 147189

147557 DIFGAGSDTSSNIIQW

       FS and Small deletion 19aa

147612 RNTLQGKHPVKEDDLVNIKYLKLIIKETLRLHPVVPLLLPRECLHACKVMGYDVPKGTTV 147791

147792 FVNIWAINRDPKHWDDPEVFKPERFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVE 147971

147972 LMLATLLYHFKWELLEGVAPNELDMTEEIGINVGRKNPLWLCPIVRVPLQ* 148124

 

#96

>aaaa01012992.1 CYP71X13P (indica cultivar-group) 80% to AAAA01002645.1b

runs off end of clone (partialI) orth of AP005385.1b

see aaaa01002645.1a for ortholog

175 DIFGAGSDTSSNIIQWAMSELMRNPKVMQKAQVELRNTLQGKHPVKEDDLVNIKYLKL 348

349 IIKETLRLHPMVPLLLPRECLHACKVMGYDVPKGTTVFVNIWAINRDPKHWDDPEVFKPE 528

529 RFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVELMLATLLYHFKWELLEGVAPNEL 708

709 DMT 717 (fs)

717 EEIGINVGRKNPLWLCPIVRVPLQ* 791

 

#97

>aaaa01002645.1b $FI CYP71X14 (indica cultivar-group) no introns 40% to 71B23

ortholog to AP005385.1a $F 99%

2048 MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTMADLA 2239

2240 RGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREGVVF 2416

2417 APYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGAAP 2593

2594 AVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAAAV 2773

2774 GGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKEDNL 2953

2954 DVPLTTGNIKAVLLDIF 3133

3134 GARSDTSSHMVQWVLSELMRNPEAMHKAQTELRSTLQGKQMVSEDDFASLTYLKLVIKET 3313

3314 LRLHPMVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFHSG 3493

3494 KIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMTEE 3673

3674 MGITVGRKNALYLHPIVRVSLEQASMS* 3757

 

>AP005385.1a CYP71X14 (japonica cultivar-group) chr 2

142591 MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTM 142770

142771 ADLARGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREG 142950

142951 VVFAPYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGA 143130

143131 APAVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAA 143310

143311 AVGGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKED 143490

143491 NLDVPLTTGNIKAVLL (0)

143668 DIFGAGSDTSSHMVQWVLSELMRNPEAMHKAQIELRSTLQGKQMVSEDDLASLTYLKLVIK 143850

143851 ETLRLHPVVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFH 144030

144031 SGKIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMT 144210

144211 EEMGITVGRKNALYLHPIVRVPLEQATMS 144297

 

#237

>aaaa01008333.1a $FI CYP71X15 (indica cultivar-group) very similar to AP003990.1

6914 MAMVQDATGYLSLFLALLSITLVLHKVARKASGDGAGKPRLPPGPWRLPVIGNLHQIAMGG 6732

6731 PLVHRTMADLARRHDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTVRVL 6552

6551 MSDGVGLVFAPYGALWRQLRKIAMVELLSARRVQSFRGIREDEVGRLVAAVAAASAAQ 6378

6377 PGEAVNVSERIAVLIA DSVVRALMGDRFDRRDEFLDQLAERVKITSGFSLGDLFPSSRL 6201

6200 ASFIGGTTRRAEANHRKNFELIECALRQHEERRAARAGAAAAGAVDDDEDLVDVLLRIQK 6021

6020 EGKLEVPLTMGNINAVIY (0) 5967

5378 DLFGAGSETSANTLQWVMSELILNPRVMLKLQAELRGILQGKQRVTEDDLVELKYLKLVI 5199

5198 KETLRLHPVVPLLLARECQDTCKIMGYDIPVGTIVFVNVWVICRESKYWKDAETFRPERF 5019

5018 ENVCVDFKGTHFEYIPFGAGRRMCP 4944

4945 PGVAFAEASMELVLASLLYHFDWKLPNDILPTKLDMTEEMGLSIRRKNDLYLIPTICVPPLAA* 4754

 

no japonica ortholog on 9/7/02

 

#238

>aaaa01008333.1b CYP71X16 (indica cultivar-group) runs off end of clone (partialI)

like AP004000 exon 2

11117 EMFGAGSETSANTLQWLMSELILNPRVMSKAQVELSDTLRGKQTVTEDDLAGLKYLKLII 10938

10937 KENLRLHPVVPLLLPRECQKTCKVMMYDVPVGTTVLVNVWSINRDPKYWEDPETFKPERF 10758

10757 EDGHIDFKGTDFEFIPFGAGRRMCPGITFAEAIMELALASLLYHFDWKLLGNGISSTKLD 10578

10577 MTEELGATVRRKNDLYLVPTIRVPLPADS* 10488

 

no japonica ortholog on 9/7/02

 

#68

>aaaa01001712.1 $PI CYP71X17P (indica cultivar-group) missing C-terminal exon

not found in 20000bp of seq.

6693 MAMAQDVTGYLCLFVALLVLLKVVRKASGNGAAGRLRLPPGPWRLPVIGNLHQVAMGG 6866

6867 PLVHRTMADMARRLDAPLMSLRLGEIPVVVASSADAAREITKTHDVAFATRPLSSTIRVM 7046

7047 VSDGEGLVFTPYGALWRRLRKIAMLELLSARRVQSFRRVREEEVGRLVAAVAAAAAAR 7220

7221 PGEAVNLSQLIAELISDTAARTIIGDRFEKRQELLEGLTEGIRISSGFSLGDLFPSSRL 7397

7398 ANLIGGTTRRAEANHRKNLALIECALRQHEERRAAGDEEDDEDLVDVLLRVQKEGG 7565

7566 GEVPLTMGNVKVVIR (0) 7610

 

aaaa01001712.1 $PI no ortholog yet, no match in nr or HTGS 9/5/02

 

#413

>aaaa01025223.1 CYP71Y1 (indica cultivar-group) orth? AP003571.1g $F chr 6 95%

1835 QRLPPGPWMLPAIGSLHHLAGKLPHRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREV 1656

1655 MKTHDTAFATRPLSATLRVLTNGGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIR 1476

1475 EEEVAAVLRAVAVAAGTVEMRAALSALVSDITARTVFGNRCKDRGEFLFL 1326

1325 LDRTIEFAGGFNPADLWPSSRLAGRLSGVVRRAEECRNSVYKILDGIIQEHQER 1164

1163 TGAGGEDLVDVLLRIQKEGELQFPLAMDDIKSII 1062

991 QDIFSAGSETSATTLAWAMAELIRNPTAMHKATPEVRRAFAAAGAVSEDALGELPYLH 818

817 LVIRETLRLHPPLPLLLPRECREPCRVLGYDVPRGTQVLVNAWAIGRDERCWPGGSPEEF 638

637 RPERF 623

588 RGADFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFDWEVPGLADPAKLDMTEAFGI 412

411 TARRKADLHLRPCL 370

 

>AP003571.1g $F CYP71Y1 chromosome 6 clone P0458E02

139036 MEDATHGYVYVGLALVSLFVVLLARRRRSPPPAAHGDGGLRLPPGPWTLPIIGSLHHLVGQIP 139224

139225 HRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREVTKTHDTAFAMRPLSATLRVLTN 139398

139399 GGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIREEEVAALLRAVAVA 139554

139555 AGTVEMRAALSALVSDITARTVFDNRCKDRGEF

       LVLLERTIEFAGGFNPADLWPS 139719 (?) bad exon boundary

140222 SRLAGRLSSVVRRAEECRNSVYKILDGIIQEHQERTSAGGEDLVDVLLRIQKEGG 140386

140387 LQFPLAMDDIKSIIF 140428 (0)

       DIFSAGSETSATTLAWAMAELIRNPTAMHKVMAEVRRAFAAAGAVSEDALGE 140655

140656 LRYLQLVIRETLRLHPPLPLLLPRECREPCRVLGYDVTRGTQVLVNAWAIGLDERYWPGG 140835

140836 SPEEFRPERFEDGEATAAVDFRGTDFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFD 141015

141016 WEVPGLADPAKLDMTEAFGITARRKADLHLRPCLLVSVPGV* 141141

 

#474

>aaaa01101459.1 CYP71Y2P (indica cultivar-group) orth AP003571.1f $P chr 6 97%

463 LDFRGADFELLPFGXARRMCPGMAFGLANVELPLSSLLFHFDWEVPGMADPTKLDMTEAF 284

283 GITSRRKENLHLRPLL 236

 

>AP003571.1f $P CYP71Y2P chromosome 6 clone P0458E02 pseudogene fragment

136961 VSEDALGELRYLQLVIRETLRLHPPLPLLLPRECTIGR 137074

137075 DERYWPGGSPEEFRPERFDDGEATAAVDFRGADFELLPFGGGRRMCPGMAFGLANVELPL 137254

137255 SSLLFHFDWEVPGMADPTKLDMTEAFGITSRRKENLHLRPLLRVSVPG 137398

 

#215

>aaaa01007286.1 CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6 100%

4839 YFYLGLALASLLVVLFARRRRSAAHGDGGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDL 4660

4659 ARRHGPVMMLRLGEVPTLVVSSRDAAREVMRAHDAAFASRPLSATVRVLTSGGRGIIFAP 4480

4479 YGGSWRQLRKIAVTELLTARRVASFRAIREEEVAAMLRAVAAAAAAGRAVELRAALSALV 4300

4299 AETTVRAVIGDRCKDRDVFLRKLQRTIELSAGFNPADLWPSSRLAGRLGG 4150

4149 AVREAEECHDTVYGILDGIIQEHMERTSSGSCGAGDGDGDGEDLLDVLL 4003

 

>AP003571.1e $F CYP71Y3 chromosome 6 clone P0458E02

128926 MADDYFYLGLALASLLVVLFARRRRSAAHGDGGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 129120

129121 RRHGPVMMLRLGEVPTLVVSSRDAAREVMRAHDAAFASRPLSATVRVLTSGGRGIIFAP 129297

129298 YGGSWRQLRKIAVTELLTARRVASFRAIREEEVAAMLRAVAAAAAAGRAVELRAA 129462

129463 LSALVAETTVRAVIGDRCKDRDVFLRKLQRTIELSAGFNPADLWPSSRLAGRLGGA 129630

129631 VREAEECHDTVYGILDGIIQEHMERTSSGSCGAGDGDGDGEDLLDVLLRIQKEGGLEFPV 129810

129811 DMLAIKQVIF 129837 (0)

132964 DIFGAGSETSATTLEWVMAELIRNPKAMRKATAEVRRAFAADGVVLESALGKLHYMHLVI 133143

133144 RETFRLHTPLPLLLPRECREPCRVLGYDVPRGTQVLVNVWAIGRDERYWPGGSPEEFRPE 133323

133324 RFEDGEAAAAVDFRGADFELLPFGAGRRMCPGLAFGLANVELALASLLFHFDWEAPDVAD 133503

133504 PAEFDMTEGFGITARRKADLPLRPTLRVPVLVSVG* 133611

 

#215

>aaaa01048884.1 CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6

97% 3 diffs see aaaa01007286.1 for ortholog

1024 VRLHTPLPLLLPRECREPCRVLGYDVPRGSQVLVNVWAIGRDERYWPGGSPEEFRPERFE 845

844  DGEAAAAVDLRGADFELLPFGAGRRMCPGLAFGLANVELALASLLFHFDWEAPDVADPAE 665

664  FDMTEGFGITARRKANLPLRPTL 596

 

#215

>aaaa01040160.1 CYP71Y3 (indica cultivar-group) orth AP003571.1e $F chr 6 96%

see aaaa01007286.1 for ortholog

667 MQDIFGAGSETSATTLEWVMAELIRNPKAMRKATAEVRRAFAANGVVSESALGKLHYL 840

841 HLVIRETFRLHTPLPLLLPRECREPCRVLGYDVPRGSQVLVNVWAIGRDERYWPGGSPEE 1020

1021FRPERFED 1044

1064 LDFRGADFELLPFGAGRWMCPGFGVRARQRG 1156

 

#301

>aaaa01012291.1 CYP71Y4 (indica cultivar-group) orth AP003571.1d $F chr 6 99%

6215 DAGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLARRHGPVMMLRLGEVPTLVVSSRDAA 6394

6395 REVMRTHDAAFASRPLSASVRAATKGGRDIAFAPYGDYWRQLRKIAVTELLSARRVLSFR 6574

6575 PIREEEVGRSPATLQPGQHAASGRTVELRAALCALVADSTVRAVVGERCAGLDVF 6739

6740 LRQLDRAIELAAGLNVADLWPSSRLAGRPSQRRRAPGREVRDTMFGVLDGII 6895

6896 QAHLEKTGGAGEDILDVLLRIHKEGGLEFPLDMDAVKCV

     DVISGGCETSATTLGWAFAELIRNPAAMK 7249

7250 KATAEVRRDFEAAGAVSESALSVGELPYLRLVVRETLRLHPPLPLLLPRECREPCRVLGY 7429

7430 DVPRGAQVLVNAWAIGRDERYWPGGSPEEFRPERFGDGEAAAAVDFKGADFELLPFGGGR 7609

7610 RMCPGMAFGLANVELPLASLLFHFDWEASGVADPTEFDMTEAFGITARRKANLLLRPIL 7786

 

>AP003571.1d $F  CYP71Y4 chromosome 6 clone P0458E02

118418 MADGYFYLGLALVSLLVVLFARRRRSAAAAHGDAGLRLPPGPWQLPVIGSLHHLAGKLPHRAMRDLA 118618

118619 RRHGPVMMLRLGEVPTLVVSSRDAAREVMRTHDAAFASRPLSASVRAATKGGRDIAFAP 118795

118796 YGDYWRQLRKIAVTELLSARRVLSFRPIREEEVAATLRAVAAAAADGRTVELRAA 118960

118961 LCALVADSTVRAVVGERCAGLDVFLRQLDRAIELAAGLNVADLWPSSRLAGRLSGAVRQ 119137

119138 AERCRDTMFGVLDGIIQAHLEKTGGAGEDILDVLLRIHKEGGLEFPLDMDA 119290

119291 VKCVVV 119308 (0)

119451 DVISGGCETSATTLGWAFAELIRNPAAMKKATAEVRRDFEAAGAVSESALAVGELPYLRL 119630

119631 VVRETLRLHPPLPLLLPRECREPCRVLGYDVPRGAQVLVNAWAIGRDERYWPGGSPEEFR 119810

119811 PERFGDGEAAAAVDFKGADFELLPFGGGRRMCPGMAFGFANVELPLASLLFHIDWEASGV 119990

119991 ADPTEFDMTEAFGITARRKANLLLRPILRVPVPGV* 120098

 

#390

>aaaa01020516.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 95%

3   IVFAPYGDYWRQLRKITVTELLSARRVASFRAIREEEVAAMLRAVAASAAAGRAVEMRPL 182

183 LSALVSDSTVRAVMGDQFPHRDVFLRELDRSIELVAGFNPADLWPSSRLAGCLT 344

345 GTMRQAKKCWDTMSSVLESTIQEHLQKNGSSGGGAGATDEDLIDVLLRIQKEGGLQFP 518

519 FDMDVIKSVI 548

 

>AP003571.1c $F CYP71Y5 chromosome 6 clone P0458E02

AQ328148 49% identical to C72289 58% to AQ327456 61% to AQ259669

56% to 71B3

106935 MADLHTYLYLGLALVSLLAVQLARRRRSSAAHGSGALRLPPGPWQLPVIGSLHHLVGKL 107111

107112 PHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTKTHDVSFATRPLSSTTRVFS 107285

107286 NGGRDIVFAPYGDYWRQLRKITVTELLSARRVASFRAIREEEVAAMLRAVGGYAA 107450

107451 AGCAVEIRPLLAALVSDSTVRAVMGDRFPHRDVFLRELDRSIELTAGFNPADLWPSSRL 107627

107628 AGCLTGTIRQAKKCWDTMSSVLESTIQEHLQKNGSSGGGAGATDEDLIDVLLRIQKEGGL 107807

107808 QFPFDMDVIKSVIH 107846 (0)

110756 NVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYLHLVI 110935

110936 KETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEEFRPE 111115

111116 RFGDGEPAAALDFKGTDYELLPFGAGRRMCPGLAFGLANVELPLASLLFHFDWEVPGMAD 111295

111296 PTKLDMTEAFGIGVRRKADLIIRPILRVPVPGV* 111397

 

#390

>aaaa01021346.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 98%

1 diff see aaaa01020516.1

158 LPPGPWQLPIIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSS 6

 

#390

>aaaa01083019.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 99%

see aaaa01020516.1 for ortholog

501 MQNVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYL 328

327 HLVIKETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEE 148

147 FRPERFGDGEPAAA*DFKGTDYELLTFGAGRRMCPGLAFGLANVELPL 4

 

#390

>aaaa01032282.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6

100% see aaaa01020516.1 for ortholog

297 NVFGAGSETSATTLGWAIAELIRNPMAMKKATAEVRQAFAAAGVVSEAALSELRYL 470

471 HLVIKETLRLHPPGPLLLPRECREQCKVLGYDVPRGTQVLVNVWAIGRDPRYWPGGSPEE 650

651 FRPERFGDGEPAAALDFKGTDYELLPFGAGRRMCPGLAFGLANVELPLASLLFHFDWEVP 830

831 GMADPTKLDMTEAFGIGVRRKADLIIRPIL 920

 

#390

>aaaa01032612.1 CYP71Y5 (indica cultivar-group) orth AP003571.1c $F chr 6 100%

see aaaa01020516.1 for ortholog

1464 LPPGPWQLPVIGSLHHLVGKLPHQAMRDLARRHGPVMMLRLGEVPTLVVSSPEAAREVTK 1643

1644 THDVSFATR 1670

 

#444

>aaaa01053818.1 CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6 100%

600 LLTKRSRKATAQRLPPGPWQLPVIGSLHHLAGKLPHHAMRDLARRHGPVMMLRLGEVPTL 779

780 VVSSPEAAQEVMRTHDAVFATRALSATVRAATMGGRDIAFAPYGDRWRQLRKIAATQLLS 959

960 ARRVASF 980

 

>AP003571.1b $F CYP71Y6 chromosome 6 clone P0458E02

80945 MEDASHGYVYLAMAVVALLGVLLTKRSRKATAQRLPPGPWQLPVIGSLHHLAGKLPHHAMRDLARRHG 81148

81149 PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAATMGGRDIAFAPYGDR 81325

81326 WRQLRKIAATQLLSARRVASFRAIREEEVATMLRAVAAAAADGRAVEMRAALCVV 81490

81491 VADSTARAMVGESCQERDAFLREIDRSMELVSGFNPEDLWPSSRLAGRLSGAVRKIEAS 81667

81668 LHTVLGILDRIIQKRLQEKIGGAGAAAASEDILDVLLRIHKDGGAGGLQVPLDMDDITLV 81847

81848 IT 81853 (0)

83437 SMQLQNAHACALFLTIVSSTYSYSLFNDPPSLHMQDLFSGGGETVATLLVWAMAELIRN 83613

83614 PMAMQKATAEVRRAFALPGVVSEGEGALGELRYLHLVIRETFRLHPPGPLLLPRECSEPC 83793

83794 QVLGYDVPRGTQVLVNVWAIGRDERCWPAAAGGGSPEEFWPERFEDGAEAVDLRGNNFEL 83973

83974 LPFGAGRRMCPGVAFALANIELTLASLLFHFDWEVPGMADPAKLDMAEALGITARRKGDL 84153

84154 LLRPVLRMPVPGV* 84195

 

#444

>aaaa01076398.1 CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6

99% see aaaa01053818.1 for ortholog

725 QLLSARRVASFRAIREEEVATMLRAVAAAAADGRAVEMRAALCVVVADSTARAMVGESCQ 546

545 ERDAFLREIDRSMELVSGFNPEDLWPSSRLAGRLSGAVRKIEASLHTVLGILDRIIQKRL 366

365 QEKIGGAGAAAASEDILDVLLRIHKDGGAGGLQVPLDMDDITLVITVSDQL 213

 

#444

>aaaa01098934.1 CYP71Y6 (indica cultivar-group) orth AP003571.1b $F chr 6 99%

see aaaa01053818.1 for ortholog

131 MQDLFSGGGETVATLLVWAMAELIRNPMAMQKATAEVRRAFALPGVVSEGEGALGELRYL 310

311 HFVIRETFRLHPPGPLLLPRECSEPCQVLGYDVPRGTQVLVNVWAIGRDERCWPAAAGGG 490

491 SPEEFWPERFED 526

 

#310

>aaaa01012578.1 CYP71Y7 (indica cultivar-group) orth AP003571.1a $F chr 6 99%

5308 LVALLGVLLTKRSRTATAQRRLPPGPWQLPVIGSLHHLIGKLPHHAMRDLTRRHGPVMML 5487

5488 RLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAGTMGGRDIAFAPYGDYWRQLRK 5667

5668 IAATELLSAPRVASFRAIREEEVAATLRTVAAAAADGRAVELRAALCALVTDSTSRAVVG 5847

5848 DRCKESDALIRAFDRSMELASGFNPADLWPSSRLAGLLSGGVREIEANLHTVFGIL 6015

6016 DRLIEKRLQQKKTAPSSAAGEDILDALLRIHKEGGGLQFPLDMDSIKLII 6165

 

>AP003571.1a $F CYP71Y7 chromosome 6 clone P0458E02

67847 MADVLSQGYVYLAMALVALLGVLLTKCSRTATAQRRLPPGPWQLPVIGSLHHLIGKLPHHAMRDLTRRHG 68056

68057 PVMMLRLGEVPTLVVSSPEAAQEVMRTHDAVFATRALSATVRAGTMGGRDIAFAPYGDY 68233

68234 WRQLRKIAATELLSAPRVASFRAIREEEVAATLRTVAAAAADGRAVELRAALCAL 68398

68399 VTDSTSRAVVGDRCKESDALIRAFDRSMELASGFNPAADLWPSSRLAGLLSGGVREIEA 68575

68576 NLHTVFGILDRLIEKRLQQKKTAPSSAAGEDILDALLRIHKEGGGLQFPLDMDSIKLIIA 68755 (0)

73915 DLFSGGGETVATLLVWAMAELIRNPMAMQKATTEVRRAFALAGAVSEGKGALGELRYLHL 74094

74095 VIKEASRLHPPAPLLLPRECSEPCQVLGYDVPRGTQVLVNAWAIGRDERCWTGGSGDGSS 74274

74275 PEEFRPERFEDGAEAVDLRGNNFELLPFGAGRRMCPGMAFALANIELTLASLLFHFDWEV 74454

74455 PDMADPAKLDMTETLGITARRKGDLLLRPVLRMPVPGVY* 74574

 

#289

>aaaa01011521.1b $FI CYP71Y8 (indica cultivar-group)

6071 MADTSHGYVYIGLALVSLFVVLLDRRRRSPPPPAAH

6179 GDGGLRLPPGPWTLPIIGSLHHLVGKLPHHAMRDLARRHGPVMLLRIGQVPTLVVSSRDA 6358

6359 AREMMKTHDMAFATRPLSATLHVITCDGRDLVFAPYGDYWRQLRKIAVTELLTARRVNS 6535

6536 YRAIREEEVAAMLRAVAAAAEGSGAAAGTVEMRAALTALSTDITARAVFGNRCKDREEYL 6715

6716 AQVDHTIELTAGFNPADLWPSSRLAGRLSGIVRRAEECRDTAFKILDRIIQERLE 6880

6881 MARSDGAAGEYLIDVLLRIQKEGGLQFPLAMDDIKANIF (0) 6994

7066 DIFGAGSETSGTALAWAMAELIRNPTVMRKATAEVRRAFAAAGAVSEDGLGELPYLHLVI 7245

7246 RETFRLHPPLPLLLPRECREPCRLLGYDVPRGTQVLVNAWALGRDERYWPGGSPEEFRPE 7425

7426 RFEDGEATAAVNFRGADFEFLPFGGGRRMCPGIAFALATVELPLASLLFHFDWEVPGMAD 7605

7606 PTKLDMTEAFGITARRKADLHLRPLLRVSVPGV* 7707

 

no japonica ortholog found 9/10/02

 

#288

>aaaa01011521.1a $PI CYP71Y9P (indica cultivar-group)

1377 MADASDGYVYVG

1413 LAVVSLFVVLLAWRSRSPAAHGVGDGGLRLPPGPWTLPVIGSLHHLAGQLPHRAMRDLAR 1592

1593 RHGPLMLLRIGEVPTLVVSSRDAAREVMKTHDMAFATRPLSATLRVITCDGRDLVFAPY 1769

1770 GDYWRQVRKIAVTELLTVRRVSSFRSIREEEVAAVLRAVAAAAAVEEATPAMATVEMRAA 1949

1950 LSALVTDITARTAFGNRCKDREEYLVLLERIVEIAGGFNPADLWPSSRLAGRLKRCRAPR 2129

2130 RGVPQLGVILDGIIQEERTGAGSEDLVDVLLRIQKEGELQFPLAMDD 2270

2271 IKSIDIFNAGIETSGTTLQWAMAELIRNPTVM 2450

2451 HKATAEVRHAFAAAGDVSEDALGELRYLQL 2540 (deletion of about 104 aa)

2539 FDWEVPGMADLTKLDMTEAFGITARRKENLHLRPLLRVSVPAASS 2673

2674 RLRWTTTAFSICCHDTHLV*

 

no japonica ortholog found 9/10/02

 

#285

>aaaa01011369.1 CYP71Z1 (indica cultivar-group) orth AL606625.1 $F chr 4 99%

8999 LWFGEVGTVFASSPEAAREVLRSHDLAFADRHLTAAAAAFSFGGRDVVLSPYGERWRQLR 8820

8819 KLLTQELLTASRVRSFRRVREEEVARLMRDLSAAATAGAAVNLSEMVTRMVNDTVLRCSV 8640

8639 GSRCEHSGEYLAALHAVVRLTSGLSVADLFPS 8544

5576 KSLFQDMFAGGTDTSSTTLIWAMAELIRSPRVMAKVQSEMRQIFDGKNTITEDDLVQL 5403

5402 SYLKMVIKETLRLHCPLPLLAPRKCRETCKIMGYDVPKGTSAFVNVWAICRDSKYWEDAE 5223

5222 EFKPERFENNDIEFKGSNFEFLPFGSGRRVCPGINLGLANMEFALANLLYHFDWKLPNRM 5043

5042 LHKDLDMREAPGLLVYKHTSLNVCPVTH 4959

 

>AL606625.1 $F CYP71Z1 chromosome 4 clone OSJNBa0032I19 similar to 71B28 = AQ858445.1

AQ858445.1 nbeb0013M22r CUGI Rice BAC genomic Length = 824 54% to 71B23

82576 MGASILLVVVVSKLMISFAAKPRLNLPPGPWTLPLIGSIHHVVSSRESVHSAMRRLARRHGAPLM 82770

82771 QLWFGEVGTVVASSPEAAREVLRSHDLAFADRHLTAAAAAFSFGGRDVVLSPYGERWRQL 82950

82951 RKLLTQELLTASRVRSFRRVREEEVARLMRDLSAAATAGAAVNLSEMVTRMVNDTVLRCS 83130

83131 VGSRCEHSGEYLAALHAVVRLTSGLSVADLFP 83226

83227 SSRLAAMVSAAPRAAIANRDKMVRIIEQIIRERKAQIEADDRAADSKSC 83373

83374 ACSLDDLLRLQKEGGSPIPITNEVIVVLLM 83463 (0)

84970 DMFAGGTDTSSTTLIWAMAELIRSPRVMAKVQSEMRQIFDGKNTITEDDLVQLSY 85134

85135 LKMVIKETLRLHCPLPLLAPRKCRETCKIMGYDVPKGTSAFVNVWAICRDSKYWEDAEEF 85314

85315 KPERFENNDIEFKGSNFEFLPFGSGRRVCPGINLGLANMEFALANLLYHFDWKLPNGMLH 85494

85495 KDLDMREAPGLLVYKHTSLNVCPVTHIASSCA* 85593

 

#88

>aaaa01002274.1a CYP71Z2 (indica cultivar-group) ortholog of AP003805.1 $F 100%

a duplicate of 1b

2088 MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVI 1909

1908 GKLAREHGPVMQLWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVV 1729

1728 MAQYGERWRHLRKLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVN 1549

1548 RLVNDTVLRCSVGSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLA 1369

1368 NRNKVERIIEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPI 1189

1188 TNQVITVLLW 1159

 

>aaaa01002274.1b CYP71Z2 (indica cultivar-group) ortholog of AP003805.1 $F 100%

duplicate of 1a count only once

22179 MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVI 22358

22359 GKLAREHGPVMQLWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVV 22538

22539 MAQYGERWRHLRKLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVN 22718

22719 RLVNDTVLRCSVGSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLA 22898

22899 NRNKVERIIEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPI 23078

23079 TNQVITVLLW 23108

 

>AP003805.1 $F CYP71Z2 chromosome 7 clone OJ1080_F08, similar to AC087550.2

39% to 71B23

10416 MEDKVLLAVGASLVLVFLSKLISSYAKKPRLNLPPGPWTLPLIGSAHHLVSWSESVHSVIGKLAREHGPVMQ 10201

10200 LWLGEVPTVVASSPEAAQEILRDHDLIFADRHLTSTTAAITFGGTDVVMAQYGERWRHLR 10021

10020 KLLTQELLTVARVRSFRRVREEEVARLVRDLSAAAASGATVNLTDMVNRLVNDTVLRCSV 9841

9840  GSRCKYREEFLAALHAILHQTSALSVADLFPSSKLASMVATGPRNVLANRNKVERI 9673

9672  IEEIIQERKNQIETDMMSGNDDVGDKAAVESKSCSLDVLLRLQKEGGTPIPITNQVITVLLW (0)9487

3785  DMFGAGTDTSSTTLIWTMAELMRSPRVMAKVQAEMRQAFQGKNTITEDDLAQLSYLKMVL 3606

3605  KESFRLHCPVPLLSPRKCRETCKIMGYDVPKGTSVFVNVWAICRDSMYWKNAEEFKPERF 3426

3425  EDNDIELKGSNFKFLPFGSGRRICPGINLGWANMEFALANLLYHFDWNLPDGMLHKDLDM 3246

3245  QESPGLVAAKCSDLNVCPVTHISSSCA* 3162

 

#13

>aaaa01000275.1 CYP71Z3 (indica cultivar-group) orth AC087550.2 $F chr 10, 100%

same as AAAA01002847.1 $FI see that accession below for ortholog

42371 MEDKRPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHRSMRALAE 42171

42170 KHGRHHLMQISLGEVFAVVVSSPEAAEEILR 42078

 

#13

>aaaa01002847.1a $FI CYP71Z3 (indica cultivar-group) ortholog to AC087550.2a 99%

also aaaa01000275.1 part

14507 MDDKLLQLLLLALAVSVVSSIVTISKLVYRATNKPRLNLPPGPWTLPVIGSLHHLVMRSP 14328

14327 SIHRSMRALAEKHGPLMQVWLGEVPAVVVSSTEAAEEVLKNQDARFADRFITTTLGAI 14154

14153 TFGGGDLAFAPYGERWRHLKMLCTQQLLTAARVRSFRRIREEEVARLVRDLAASAGGGSE 13974

13973 VAVNLSERVARLVNDIMVRCCVGGRSKHRDEFLGALCTALSQTSWLTVADLFPSSRLARM 13794

13793 LGTAPRRALASRKKMELILEQIIQEREEMTTDRSGDGEAGPTNECFLDVLLRLQK 13629

13628 EGDTPIPITMELIVMLLF 13575

12405 DIVSGGTETSTIVLNWTMAELIRTPRVMAKAHAEVRQTFQAKSTITEDDDISGL 12244

12243 TYLKMVIKESLRMHCPVPLLGPRRCRETCKVMGYDILKDTTVFVNVWAMCRSSIYWNDAE 12064

12063 EFKPERFENKCIDYKGSNFEFVPFGSGRRMCAGMNLGMADVEFPLASLLYHFDWKLPDGM 11884

11883 SPEDIDMQEAPGLFGGRRTSLILYPITRVAPSDLQVI 11773

 

>AC087550.2a $F CYP71Z3 chromosome 10 clone nbeb0016G17 74% to AC087554 seq 14167

132002 MDDKLLQLLLLALAVSVVSIVTISKLVYRATNKPRLNLPPGPWTLPVIGSLHHLVMRSPS 131823

131822 IHRSMRALAEKHGPLMQVWLGEVPAVVVSSTEAAEEVLKNQDARFADRFITTTLGAITF 131646

131645 GGGDLAFAPYGERWRHLKMLCTQQLLTAARVRSFRRIREEEVARLVRDLAASAAGGGEVA 131466

131465 VNLSERVARLVNDIMVRCCVGGRSKHRDEFLGALCTALSQTSWLTVADLFPSSRLARML 131289

131288 GTAPRRALASRKKMELILEQIIQEREEMTTDRSGDGEAGPTNECFLDVLLRLQK 131127

131126 EGDTPIPITMELIVMLLF 131073 (0)

       DIVSGGTETSTIVLNWTMAELIRTPRVMTKAHAEVRQTFQAKSTITEDDDISGL 129741

129740 TYLKMVIKESLRMHCPVPLLGPRRCRETCKVMGYDILKDTTVFVNAWAMCRSSIYWNDAE 129561

129560 EFKPERFENKCIDYKGSNFEFIPFGSGRRMCAGMNLGMADVEFPLASLLYHFDWKLPDGM 129381

129380 SPEDIDMQEAPGLFGGRRTSLILCPITRVAPSDLQVIV* 129264

 

#101

>aaaa01002847.1b $FI CYP71Z4 (indica cultivar-group) = aaaa01000275.1

ortholog to AC087550.2b >99% 1 diff

21763 MEDKRPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHR 21584

21583 SMRALAEKHGRHHLMQISLGEVFAVVVSSPEAAEEILRNQDVTFADRFLSTTIGVITFGG 21404

21403 NDMAFAPYGERWRQLRKLCTLELLSAARVRSFRRIREEEVARLVRDLAASAAAGEAVNLS 21224

21223 GRIAKLINDVVVRCCVGGRSEHRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAPR 21044

21043 KALASRKKIEHILEQIIQERKRIMDRSSHGGDGDGEAMNTSECFLDVLLRLQKDGNTPIP 20864

20863 ITNEVIVVLLF 20831

18797 DMFSGGSETSSSTLIWTMAELIRKPKVMAKAHVEVRQAFQGKNTITEDDGVNELTYLKMV 18618

18617 IKESLRMHCPVPLLGPRKCRETCKVMGYDIPKDTTVFVNAWAICRDPKYWDDAEEFQPER 18438

18437 FENKSIDFKGSNFEFLPFGSGRRMCAAMNLGIANVELPLASLLYHFDWKLPDGMMPEDVD 18258

18257 MQDAPGILVGKRSSLIMCPVTRVAPSNPQVIAS 18159

 

>AC087550.2b $F CYP71Z4 chromosome 10 clone nbeb0016G17 same as seq on AC087544 from 1-3082

AQ330340 nbxb0046P18r 60% to D48250 65% to 76C4 almost identical to AC087550.2

139422 MEDKLPLALTVLSVSVLIAVVISKLVSYATKPRLNLPPGPWKLPVIGSLHHLVGSHAIHR 139243

139242 SMRALAEKHGRHHLMQISLGEVFAVVVSSPEAAEEILRNQDVTFADRFLSTTIGVITFGG 139063

139062 NDMAFAPYGERWRQLRKLCTLELLSAARVRSFRRIREEEVARLVRDLAASAAAGEAVNLS 138883

138882 GRIAKLINDVVVRCCVGGRSEHRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAP 138706

138705 RKALASRKKIEHILEQIIQERKRIMDRSSHGGDGDGEAMNTSECFLDVLLRLQKDGNT 138532

138531 PIPITNEVIVVLLF (0)

136455 DMFSGGSETSSSTLIWTMAELIRKPKVMAKAHVEVRQAFQGKNTITEDDGVNELTYLKMV 136276

136275 IKESLRMHCPVPLLGPRKCRETCKVMGYDIPKDTTVFVNAWAICRDPKYWDDAEEFQPER 136096

136095 FENKSIDFKGSNFEFLPFGSGRRMCAAMNLGIANVELPLASLLYHFDWKLPDGMMPEDVD 135916

135915 MQDAPGILVGKRSSLIMCPVTRVAPSNPQVIAS* 135814

 

#184

>aaaa01005737.1 $FI CYP71Z5 (indica cultivar-group) orth of AP004790.1 >99%

14185 MEDKTILLSLALSMLLAILLSKLVSISKKPRLNLPPGPWTLPVIGSIHHLASNPNTHRALRALSQK 13988

13987 HGPLMQLWLGEVPAVVASTPEAAREILRNQDLRFADRHVTSTVATVSFDASDIFFSPY 13814

13813 GERWRQLRKLCTQELLTATRVRSFSRVREDEVARLVRELAGGGGAAVDLTERLG 13652

13651 RLVNDVVMRCSVGGRCRYRDEFLGALHEAKNQLTWLTVADLFPSSRLARMLGAAPRRGLA 13472

13471 SRKRIERIIADIVREHEGYMGSGGGGGDEAAAAAAGKDCFLSVLLGLQKEGGTPIPITNEIIVVLLF (0) 13271

10674 DMFSGGSETSATVMIWIMAELIRWPRVMTKVQAEVRQALQGKVTVTEDDIV 10522

10521 RLNYLKMVIKETLRLHCPGPLLVPHRCRETCKVMGYDVLKGTCVFVNVWALGRDPKYWED 10342

10341 PEEFKPERFENSDMDYKGNTFEYLPFGSGRRICPGINLGIANIELPLASLLYHFDWKLPD 10162

10161 EMASKDLDMQEAPGMVAAKLTSLCVCPITRVAPLISA* 10048

 

>AP004790.1 $F CYP71Z5 (japonica cultivar-group) chr 2

51668 MEDKTILLSLALSMLLAILLSKLVSISKKPRLNLPPGPWTLPVIGSIHHLASNPNTHRAL 51847

51848 RALSQKHGPLMQLWLGEVPAVVASTPEAAREILRNQDLRFADRHVTSTVATVSFDASDIF 52027

52028 FSPYGERWRQLRKLCTQELLTATRVRSFSRVREDEVARLVRELAGGGGAAVDLTERLGRL 52207

52208 VNDVVMRCSVGGRCRYRDEFLGALHEAKNQLTWLTVADLFPSSRLARMLGAAPRRGLASR 52387

52388 KRIERIIADIVREHEGYMGSGGDGGDEAAAAAAGKDCFLSVLLGLQKEGGTPIPITNEII 52567

52568 VVLLF 52582

55179 DMFSGGSETSATVMIWIMAELIRWPRVMTKVQAEVRQALQGKVTVTEDDIVRLNYLK 55349

55350 MVIKETLRLHCPGPLLVPHRCRETCKVMGYDVLKGTCVFVNVWALGRDPKYWEDPEEFMP 55529

55530 ERFENSDMDYKGNTFEYLPFGSGRRICPGINLGIANIELPLASLLYHFDWKLPDEMASKD 55709

55710 LDMQEAPGMVAAKLTSLCVCPITRVAPLISA 55802

 

#16

>aaaa01000393.1 $FI CYP71Z6 (indica cultivar-group) 89% to AP005114.1b

9756 MEDKLILALCLSALFVVVLSKLVSSAVKPRLNLPPGPWTLPLIGSLHHLAMTKSPQTHRSLRALS 9562

9561 EKHGPIMQLWMGEVPAVVVSSPAVAEEVLKNQDLRFADRHLTATTEEIFFGGRDVIFGP 9385

9384 YGERWRHLRKICMQELLTAARVRSFRGVREGEVARLVRELAASAAGAGAGAVGAAAGVNL 9205

9204 NERISKLANDIVMVSSVGGRCSHRDEFMEALEVAKKQITWLSVADLFPSSKLARMVAVAP 9025

9024 RKGLASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVP 8848

8847 VTDEIIVVLLF (0) 88315

4681 DMISGASETSPTVLIWTLAELMRNPRIMAKAQAEVRQAVAGKTTITEDDIVG 4526

4525 LSYLKMVIKETLRLHPPAPLLNPRKCRETSQVMGYDIPKGTSVFVNMWAICRDSRYWEDP 4346

4345 EEYKPERFENNSVDYKGNNFEFLPFGSGRRICPGINLGVANLELPLASLLYHFDWKLPNG 4166

4165 MAPKDLDMHETSGMVAAKLITLNICPITHIAPSSA* 4058

 

aaaa01000393.1 has no ortholog in nr or HTGS 9/2/02

 

#328

>aaaa01013736.1 $FI CYP71Z7 (indica cultivar-group) ortholog of BI811079.1 AP005114.1b

6446 MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSLRALSE 6252

6251 KHGPIMQLWMGEVPAVVVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVTFAPYS 6072

6071 ERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNERISKLA 5892

5891 NDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKGLASRK 5712

5711 RMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDEIIVVL 5532

5531 LF (0) 5526

4289 DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 4110

4109 KEALRLHSPAPLLNPRKCRETTQVIGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 3930

3929 ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNEMLPKDLDM 3750

3749 QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 3654

 

>AP005114.1b $F CYP71Z7 (japonica cultivar-group) chromosome 2

BI811079.1 clone K015D02.Length = 347 57% to AC087550.2 C-helix

41% to 71B11

120645 MEDNKLILALGLSVLFVLLSKLVSSAMKPRLNLPPGPWTLPLIGSLHHLVMKSPQIHRSL 120824

120825 RALSEKHGPIMQLWMGEVPAVIVSSPAVAEEVLKHQDLRFADRHLTATIEEVSFGGRDVT 121004

121005 FAPYSERWRHLRKICMQELLTAARVRSFQGVREREVARLVRELAADAGAGGDAGVNLNER 121184

121185 ISKLANDIVMVSSVGGRCSHRDEFLDALEVAKKQITWLSVADLFPSSKLARMVAVAPRKG 121364

121365 LASRKRMELVIRRIIQERKDQLMDDSAAGAGEAAAGKDCFLDVLLRLQKEGGTPVPVTDE 121544

121545 IIVVLLF (0) 121565

122803 DMFTGASETSPTVLIWILAELMRCPRVMAKAQAEVRQAAVGKTRITENDIVGLSYLKMVI 122982

122983 KEALRLHSPAPLLNPRKCRETTQVMGYDIPKGTSVFVNMWAICRDPNYWEDPEEFKPERF 123162

123163 ENNCVDFKGNNFEFLPFGSGRRICPGINLGLANLELALASLLYHFDWKLPNGMLPKDLDM 123342

123343 QETPGIVAAKLTTLNMCPVTQIAPSSAEDAS* 123438

 

#29

>aaaa01000805.1a CYP71Z8 partial (indica cultivar-group) 100% to AC087544.2

4553 MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLNLPPGPWTLPVIGSIHHLVGSHPIHRS 4374

4373 MRALAEKHGRDLMQVWLGELPAVVVSSPEAARDVLRSQDLAFADRYVSTTIAAIYLGGRD 4194

4193 LAFAPYGERWRQLRKLCTQRLLTAARVRSFRCVREEEVARLVRDLAASAAAGEAVDLTAR 4014

4013 VAELVNDVVVRCCIGGRRSRYRDEFLDALRTALDQTTWLTVADVFPSSKLARMLGTAPRK 3834

3833 ALASRKKMERILEQIIQERKQIKERSTGAGAGADDEAAAAGNECFLDVLLRLQKEGDTPI 3654

3653 PITNETMMLLLH 3618 sequence gap

 

#29

>aaaa01000805.1b CYP71Z8 partial (indica cultivar-group) 100% to AC087544.2

duplicate of first 46 aa probably an assembly error. Count only once.

18349 MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLNLPPGPWTLPVIG 18212

 

>AC087544.2 $F CYP71Z8 chromosome 10 clone nbxb0046P18,

47% to CYP71D7

AZ131846.1 OSJNBb0111D08r CUGI Rice BAC Length = 377 59% to 71B9

AZ132319.1 OSJNBb0062F12r CUGI Rice BAC genomicLength = 683

14167 MEDKLLLLLALAVSVLVAVVISKLVSYATKPRLN

14065 LPPGPWTLPVIGSIHHLVGSHPIHRSMRALAEKHGRDLMQVWLGELPAVVVSSPEAARDV 13886

13885 LRSQDLAFADRYVSTTIAAIYLGGRDLAFAPYGERWRQLRKLCTQRLLTAARVRSFRCVR 13706

13705 EEEVARLVRDLAASAAAGEAVDLTARVAELVNDVVVRCCIGGRRSRYRDEFLDALRTALD 13526

13525 QTTWLTVADVFPSSKLARMLGTAPRKALASRKKMERILEQIIQERKQIKERSTGAGAGAD 13346

13345 DEAAAAGNECFLDVLLRLQKEGDTPIPITNETMMLLLH 13232 (0)

10760 NMFSAGSETSSTTLNWTMAELIKSPRVMAKVHDEVRQAFQGKNTITDDDVAKLSYLKMVT 10581

10580 KESLRMHCPVPLLGPRRCRETCKVMGYDVPKGTIVFVNAWAICRDSKYWKSAEEFKPERF 10401

10400 ENISIDYNGNNFEFLPFGSGRRICPGITLGMANVEFPLASLLYHFDWKLPNQMEPEEIDM 10221

10220 REAPGLVGPKRTSLYLHPVTRVAPSSV* 10119

 

#29

>aaaa01011405.1 CYP71Z8 (indica cultivar-group) orth AC087544.2 $F chr 10 99%

see aaaa01000805.1a = aaaa01000805.1b for ortholog

2160 SPRVMAKVHDEVRQAFQGKNTITDDDVAKLSYLKMVTKESLRMHCPVPLLGPRRCRET 2333

2334 CKVMGYDVPKGTIVFVNAWAICRDSKYWKSAEEFKPERFENISIDYNGNNFEFLPFGSGR 2513

2514 KICPGITLG 2540

 

#468

>aaaa01092069.1 CYP71Z9 (indica cultivar-group) 61% to AC087544.2 frag = 623bp

623 AIAMAFRQTSVLTLADLFPSSRLMQALGTAPRKVLACRDKIQRILEQVIQEKAQEMGRGDEATAGNEGFV 414

413 GVLLRLQKEGSTPVQLTNDTI 351

207 DMFSAGSETSSTTLNWCMTELVRSPVVMAKAQAELRDAFKGKNTITENDLEGLSYLKLVI 28

27  KEALRMHAP 1

 

no japonica ortholog found 9/12/02

 

#466

>aaaa01088222.1 CYP71Z10 i  not an exact match 64% to AP005114.1b $F

651bp frag. N-term runs off the end

247 MEENKALLAAVSLSILLVILSKLKSFLATKPKLNLSPGPWTLPVIG

SLHHLVRSPNIYRAMRALAQKHGQLMTLRLGEVQCM 2

 

no japonica ortholog found 9/12/02

 

#431

>aaaa01035499.1 CYP71Z11 (indica cultivar-group) 53% to AP005114.1b (partialI)

no ortholog in known set might be a new subfamily

3   TMPTTIQGYHIPAKTIAFINVWAIGRDPAAWDTPDEFRPERFMGSAVDFRGNDYKFIPFG 182

183 AGRRLCPGIILALPGLEMVIASLLYHFDWELPDGMDVQDLDMAEAPGLTTPPMNPVWLIP 362

363 RCRTI* 380

 

no japonica ortholog found 9/12/02

 

#475

>BI808626.1 CYP71Z12 (partial) clone D005B07.Length = 538 EST with numerous frameshifts similar to AAAA01035499.1 and 71Bs no ortholog found in indica

no extensions in htgs nr gss or est sections of Genbank 8/3/02

66% to AY104083.1 Zea mays

11 LFVHAWAIGRDPXAWXXPEEFRPDRFLXXSVDFRGNDYQLVPFGAAPRICPGISFXX

PVLEMALFALLHHFDWELPAGMXXXXXDMSEAPGLTTPLRVPLRLVPKRKARLPRHIYKRNVIGE*

 

#93

>aaaa01002599.1a CYP71AA1P (indica cultivar-group) orth of AP004326.2d 100%

even the frameshifts are the same

8731 FFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 8552

8552 QETLRLHPPVPLLLPRLWSEPCKIMGYDIP

     KNTAIFVNTWALGR

     KIKNTGLMQVSSG 8382

8381 LKYSRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSN 8202

8201 KLDMTEANGITTHRRIDIWLEATPFVPR 8118

 

>AP004326.2d $P CYP71AA1P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence Gene 4 pseudogene

81031 DFFFCGLETTTTATIWAI*EFIKNPHAVEKAQSDIRKILGGKSIVEEADIEGQLHYFQMVN 81213 frameshift

81213 QETLRLHPPVPLLLPRLWSEPCKIMGYDIP 81302 frameshift

81304  KNTAIFVNTWALGR 81345 frameshift

81344 KIKNTGLMQVSSGLKY

81393 SRMGIVDFNGLDFRFLPCGAGRRICLGLMFELSDIELTLASLLYHSSWRLPTRSYSNK 81566

81567 LDMTEANGITTHRRIDIWLEATPFVPR 81647

 

#94

>aaaa01002599.1b $FI CYP71AA2 (indica cultivar-group) ortholog of AP004326.2c $F 99%

12298 MAGIMDSTTASYYTTLLCGALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHC 12119

12118 LLGSLPHHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTAS 11939

11938 ILTYGARDIVFAPFGKHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASAS 11759

11758 SAVNVSELVKIMTNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARV 11579

11578 LGGRSLRTTKRVHEKLHQITEAIIQGHGIKDTVGDEHHECEDIL 11447

11446 DVLLRFQRDGGLGITLTKEIVSAVLF 11369

11213 DLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHYLQLV 11034

11033 IKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKNVNEFRPE 10854

10853 RFKDDIVDFSGTDFRFIPGGSGRRMCPGLTFGVSNIEIALVTLLYHFDWKLPSETDTHEL 10674

10673 DMRETYGLTTRRRSELLLKATPSY 10602

 

>AP004326.2c $F CYP71AA2 genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence Gene 3 39% to 71B11

77487 MAGIMDSTTASYYTTLLCG

77544 ALLLAAVVFKLKTAAAFSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRRY 77717

77718 GPVMLLRLGHVRTLVLSSPEAAREVMKTHDVAFANRAVTPTASIDIVFAPFG 77873

77874 KHLRELRKLCALELLSPRRVRSFRHVREEEAARLARSVAAAASASSAVNVSELVKIM 78044

78045 TNDVTMRAIIGDRCPQREEYLEALDKTMDLLAGFNLVDLFPGSPLARVLGGRSLRTTKRV 78224

78225 HEKLHQITEAIIQGHGIKDTVGDEHHECEDILDVLLRFQRDGGLGITLTKEIVSA 78389

78390 VLF 78398 (0)

78554 DLFAGGSETTSTTILWAMSELMRSPHVMEQAKYEIRQVLQGKAMVSEADIEGRLHYLQLV 78733

78734 IKETLRLHPPVPIVIPRLCSKPNSKIMGYDIPQGTSVLVNVSAIGRDEKIWKDVNEFRPE 78913

78914 RFKDDIVDFSGTDFRFIPGGSGRRMRPGLTFGVSNIEIALVTLLYHFDWKLPSETDTHEL 79093

79094 DMRETYGLTTRRRSDLLLKATPSYARLGWSTNMQIYSVKCLVYE* 79228

 

#95

>aaaa01002599.1c $FI CYP71AA3 (indica cultivar-group) ortholog of AP004326.2b $F >99%

22237 MAGIVDTAAFCTLLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLP 22058

22057 HHAMRELSRRYGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGA 21878

21877 RDIVFAPFSKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSEL 21698

21697 VKIMANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRA 21518

21517 TKRVHQKLHQITDTIIQGHEIIEDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLLRFHR 21338

21337 DGGLGITLTKEIVSAVLF 21284

20770 DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIHQVLQGKTVVSEADIEGRLHYLQLV 20591

20590 IRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEFRPE 20411

20410 RFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASSCKL 20231

20230 DMRETHGVTARRRTELLLKATPLYT 20156

 

>AP004326.2b $F CYP71AA3 genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Gene 2 no good matches in NR 79% to AP004326.2c

71860 MAGIVDTAAFCT

71896 LLCLLLTLVVFKLKTATSSRHNAGVNLPPGPWALPVIGSIHCLLGSLPHHAMRELSRR 72069

72070 YGPVMLLRLGHVRTLVLSSPEAAREVMKTHDAAFATRAVTPTASILTYGARDIVFAPF 72243

72244 SKHLRELRKLCTLELLSPRRVRSFRHVRDEEAARLARSVAAAAPAVVNVSELVKI 72408

72409 MANNIIMTAIIGDTCPQRDEYLEALDKTMDLMNGFNLIDLFPGSRLARVLGARSLRATKR 72588

72589 VHQKLHQITDTIIQGHEIIKDGSVGDDTIQETVGTHNMHGHGHKCEDILDVLL 72747

72748 RFHRDGGLGITLTKEIVSAVLF 72813 (0)

73327 DLFAAGSETTSTTIIWAMSELVRTPHVMERAQSEIRQVLQGKTVVSEADIEGRLHYL 73497

73498 QLVIRETLRLHPPVPFLIPRLCSEANSKIMRYNIPQGAMVLVNISAIGRDEKIWKNANEF 73677

73678 RPERFKDDMVDFSGTDFRFIPGGAGRRMCPGLTFGLSNIEIALASLLYHFDWKLPNDASS 73857

73858 CKLDMRETHGVTARRRTELLLKATPLYT* 73944

 

cluster continues on AP004326.2 seq a

 

#334

>aaaa01014066.1 CYP71AA4P (indica cultivar-group) orth AP004326.2a $P

chr 1 100%

5245 YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 5424

5425 F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 5604

5605 RMELDMTESAGLT 5643

 

>AP004326.2a $P CYP71AA4P genomic DNA, chromosome 1, BAC clone:OJ1294_F06, complete sequence

Length = 102983

4 genes 71B like

Gene 1 pseudogene 71 family

67989 LPPVPWPLPVIGSMH*LLGSLPHH 68060 frameshift with deletion

68060 RPACAVELLSPRRARSFRRVREAEPARLVRAVAASPAWPLVNVVGGEHVAAMMTAV 68227

68228 GARP 68239 frameshift with small deletion

68238 RCPRQEEYLEELGKVAKLAAGFNLVDLFPESRLVRAAQAAHGKIHSIMDAMVQ 68396

68397 DHLKAMEERREEVADGVVDDGDGDGADRDEELLSILLRFQRDGGLGITLTNGNHQRDS 68570 (0)

68886 GILAGGSDTTTTTVMWAMSELLRCPRAMQ 68972 frameshift with deletion

69023 YMQLVIKETLRLHLPFPLLFPRLCTETCKIMGFDVPKGTIVIVNNWAISRDERCWEDAED 69202

69203 F*PERFEHDDTDYNGTYFQFLSGGFGRRMFPDFIFAQFNIEIALANLLYHFDWELPCSEN 69382

69383 RMELDMTESAGLTASRLTDLFG* 69451

 

#442

>aaaa01051575.1 CYP71AA5 (indica cultivar-group) 69% to AP004326.2b

855 DVFAAGSETTATATIWAMSELVRTPRLMERAQAEIRQLLQGKTRVAEEDIQGRLPYLQMV 676

675 IKETLRLHPPAPLILPRLCAESTKILGFDVPEGTTVFVNAWALGRDDKSWVDANEFKPER 496

495 FEDDDRVDFSGADFRFIPGGSGRRMCPGLTFGLANIETTLANLLYHFDWKLPGGANPYEL 316

315 DMAESYGITARRTTDLLLEATPYVPHGSVS* 223

 

no japonica ortholog found 9/12/02

 

#273

>aaaa01010273.1 $FI CYP71AB1 (indica cultivar-group) ortholog of AC113337.1

6315 MANLIYYSLLIILPFLFLIKFYKAMFSSRKQARRLPPCPWQLPIMGSIHHLIGDLPHRAL 6494

6495 RDLSRRYGPVMLLKFGQVPFIIVSSPEAAKDIMKTHDSIFATRPQSEIMKIITKRGQGLV 6674

6675 FAPYDDQWRQLRKICIRELLCAKRVQSFCAIREEEAARLVKSISSDQAHLVNLSKKLADY 6854

6855 ATDAAIRIITGTRFENQE VRDKFQYYQDEGVHLAASFCPANLCPSLQLGNTLSRTAHKA 7031

7032 EIYREGMFAFIGGIIDEHQERRAQDMSHKEDLIDVLLRIQQEGSLESPVSMETIKFLIF (0) 7208

7297 DILAGGSETVTTVLQWAMAELMRNPTVMSKVQDEVREVFKWKEMVSNDDINKLTYLQFVI 7476

7477 KETLRLHTPGPLFMRECQEQCQVMGYDMPKGTKFLLNLWSISRDPKYWDDPETFKPERF 7653

7654 EDDARDFKGNDF EFISFGAGRRMCPGMLFGLANIELALANLLFYFDWSLPDGVLPSELDM 7833

7834 TENFGVTVRKKEDLLLHASLYAQLSC* 7914

 

>AC113337.1 $F CYP71AB1 (japonica cultivar-group) cultivar Nipponbare clone OSJNBa0061H20,

from chromosome 10

AC074355.2 Oryza sativa clone OSJNBa0071I20, gene 1 43% to 71A13

AQ288798 65-164 region C-helix 54% to 71A12 same as AC074355.2

AQ840770.1 nbxb0071I20f CUGI Rice BAC genomic cloneLength = 754

AQ840078.1 nbxb0051B18f CUGI Rice genomic cloneLength = 694

similar to lotus 71D

AQ865944.1 nbeb0026D10f BAC genomic Length = 473 59% to 99A1 69% to AP004000.1

23542 MANLIYYSLLIILPFLLLINFYKAMFSSRKQAGRLPPCPWQLPIMGSIHHLIGDLPHRSL 23721

23722 HDLSRRYGPVMLLKFGQVPFIIVSSPEAAKDIMKTHDSIFAMRPQSEIMKIITKRGQGLV 23901

23902 FAPYDDQWRQLRKICIRELLCAKRVQSFCAIREEEAARLVKSISSDQAHLVNLSKKLADY 24081

24082 ATDAAIRIITGTRFENQEVRDKFQYYQDEGVHLAASFCTANLCPSLQLGNTLSRTARKAE 24261

24262 IYREGMFAFIGGIIDEHQERRAQDMYHKEDLIDVLLRIQQEGSLESPVSMETIKFLIF (0)

      DILAGGSETVTTVLQWAMTELMRNPTVMSKAQ 24621

24622 DEVREVFKWKKMVSNDDINKLTYLQFVIKETVRLHTPGPLFMRECQEQCQVMGYDVPKGT 24801

24802 KFLLNLWSISRDPKYWDDPETFKPERFENDARDFKGNDFEFIPFGAGRRMCPGMLFGLAN 24981

24982 IELALANLLFYFDWSLPDGVLPSELDMTENFGVTVRKKEDLLLHASLYAQLSC* 25143

 

#263

>aaaa01009869.1 CYP71AB2 (indica cultivar-group) orth AP004684.1b $F chr6 98%

2834 LPLVHYLITLFLHGSRDSDLRLPPGPWRLPLIGSLHHLFFGALPHRALRDLARRHGPLML 3013

3014 LAFGDAPVVVVASTAAAAREILRTHDDNFSSRPLSAVVKACTRRGAGITFAPYGEHWRQV 3193

3194 RKICRLELLSPRRILAFRAIREEEAARLVRAIGVASPPLVTNLSQLLGNYVTDTTVHIV 3370

3371 MGERFRERDALLRYVDEAVRLAGSLTMADLFPSSRLAHAMSSTTLRRAEAFVES 3532

3533 LMEFMDRVIREHLEKKRSCQGGEREEDLIDVLLRLQAEGSLHFELTMGIIRAVIF

     DLFSGGSETATTT 3886

3887 LQWAMAELMRNPGVMSRAQAEVREAYKDKMEVTEEGLTNLTYLQCIIKETLRLHTPGP 4060

4061 LALPRECQEQCRILGYDIPKGATVLVNVWAICTDTEFWDESEKFMPERFEGSTIEHKGNN 4240

4241 FEFIPFGAGRRICPGMQFGIANIELALANLLFHFDWTLPEGTIHSDLDMTETMGITARRK 4420

4421 EDL 4429

 

>AP004684.1b $F CYP71AB2 chromosome 6 clone P0012H03, Length = 163117

New seq similar to AP004000 57% to AP003523.1 78% to AP004688.1 

36% to 41% with 71A and 71B sequences possibly new subfamily in 71

117908 MDAAVFCCLLALLPLLHYLITLFLHGSRDSDLRLPPGPWRLPLIGSLHHLF 118060

118061 FGALPHRALRDLARRHGPLMLLAFGDAPVVVVASTAGAAREILRTHDDNFSSRPLSAV 118234

118235 VKVCTRRGAGITFAPYGEHWRQVRKICRLELLSPRRILAFRAIREEEAARLVRAIGVASP 118414

118415 PLVTNLSELLGNYVTDTTVHIVMGERFRERD ALLRYVDEAVRLAGSLTMADLFPSSRLAR 118594

118595 AMSSTTLRRAEAFVESLMEFMDRVIREHLEKKRSCQGGEREEDLIDVLLRLQAEGSLHFE 118774

118775 LTMGIIRAVIF 118807 (0)

118932 DLFSGGSETATTTLQWAMAELMRNPGVMSRAQAEVREAYKDKMEVTEEGLTNLTYLQCII 119111

119112 KETLRLHTPGPLALPRECQEQCQILGYDIPKGATVLVNVWAICTDNEFWDESEKFMPERF 119291

119292 EGSTIEHKGNNFEFIPFGAGRRICPGMQFGIANIELALANLLFHFDWTLPEGTLHSDLDM 119471

119472 TETMGITARRKEDLYVHAIPFVQLP* 119549

 

#71

>aaaa01002000.1 CYP71AB3 (indica cultivar-group) ortholog of AP004688.1 $F 98%

6046 METADLCCLLALLPLVYCLLTLFHGSRESDLRLPPGPWRLPLIGSLHHLFGRTLPHHALR 5867

5866 DLARLHGPLMLLSFGQASPVVIASTAIAAREIMRTHDDNFSTRPLSTVLKVCTRYGAGMT 5687

5686 FVPYGEHWRQVRKICSLELLSPRRILKFRSIREEEVARLVLAIASSSTPTPTPPAPVNLS 5507

5506 KLLSNYMTDATVHIIMGQCFRDRDTLVRYVDEAVRLASSLTMADLFPSWRLPRVMCATTL 5327

5326 HRAEVFVESVMEFMDRVISEHLEKRSCQGGDREEDLIDVLLRLQAEGNLEFELTTSIIKA 5147

5146 IIF 5138

     ELLAGGSEAPITTLQWAMAELMRNPDVMSRAQAEVREAYKEKMKV 4911

4910 TEEGLTNLPYLHCIIKETLRLHTPGPFVLPRECQEQCQILGYDVPKRATVVVNIWAICRD 4731

4730 AEIWDEPEKFMPDRFEGSAIEHKGNHFEFIPFGAGRRICPGMNFALANMELALASLLFYF 4551

4550 DWSLPEDVLPGDLDMTETMGLTARRKEDLYVCAIPFVQLP 4431

 

>AP004688.1 $F CYP71AB3 chromosome 6 clone P0036C11, Length = 137929

New seq similar to AP004000 37% to 71B23 52% to AP003523.1 78% to AP004684.1b 

57304 METAELCCLLALLPLVYCLLTLFHGSRESDLRLPPGPWRLPLIGSLHHLFGRTLPHRA 57477

57478 LRDLARLHGPLMLLSFGQAAPVVIASTAIAAREIMRTHDDNFSTRPLSTVLKVCTRYGA 57654

57655 GMTFVPYGEHWLQVRKICSLELLSPRRILKFRSIREEEVARLVLAIASSSTPTPTPPAPV 57834

57835 NLSKLLSNYMTDATVHIIMGQCFRDRDTLVRYVDEAVRLASSLTMADLFPSWRLPRVMCA 58014

58015 TTLHRAEVFVESVMEFMDRVISEHLEKRSCQGGDREEDLIDVLLRLQAEGNLEFELSTSI 58194

58195 IKAIIF 58209 (0)

58290 ELLAGGSEAPITTLQWAMAELMRNPDVMSRAQAEVREAYKEKMKVTEEGLTNLPY 58469

58470 LHCIIKETLRLHTPGPFVLPRKCQEQCQILSYDVPKRATVVVNIWAICRDAEIWDEPEKF 58649

58650 MPDRFEGSAIEHKGNHFEFIPFGAGRRICPGMNFALANMELALASLLFYFDWSLPEDVLP 58829

58830 GDLDMTETMGLTARRKEDLYVCAIPFVQLP* 58919

 

#267

>aaaa01010030.1b CYP71AC1 (indica cultivar-group) orth AP003523.1b $F chr 6  99%

6579 QDMFAGGSESTSTTLEWALSELVRNPHVMQKAQAEIRHALQGRTRVTEDDLINLKYPK 6406

6405 NVIKETLRLHPVAPLLVPKECQESCKILGYDVPKGTIMFVNAWAIGRDPRYWNDAEVFMP 6226

6225 ERFEKVAVDFRGTNFEFIPFGAGRRMCPGITFANATIEMALTALLYHFDWHLPPGVTPDG 6046

6045 LDMEEEFGMSVSRKRDLYLRPTLH 5974

 

>AP003523.1b $F CYP71AC1 chromosome 6 clone P0416A11 six different genes

54-58% with genes in AP003090 and AP004000 group

38% to 41% with 71A and 71B sequences possibly new subfamily in 71

118132 MDLMKSNPLQGSPWSL

118084 LNLLVLIIVAAMICGELCRRRRRRRGDENGGATRLPPGPWRLPFVGSLHHLAVMRPRGVV 117905

117904 VHRALAELARRHDAPVMYLRLGELPVVVASSPEAAREVLKTHDAAFATRAMSVTVRESIG 117725

117724 DKVGILFSPYGKKWRQLRGICTLELLSVKRVRSFRPIREEQVARLVDAIAAAAASS 117557

117556 TAEAAAVNISRQITGPMTDLALRAIMGECFRWREEFLETLAEALKKTTGLGVADMFPSSR 117377

117376 LLRAVGSTVRDVKLLNAKLFELVECAIEQHREQIRAAHDNGGDDDDAHGHGDKECFLNTL 117197

117196 MRIQKEGDDLDD 117161 (frameshift) LTMATVKAVIL (0)

       DMFAGGSESTSTTLEWALSELVRNPHVMQKAQAEIRHALQGRTRV 115967

115966 TEDDLINLKYPKNIIKETLRLHPVAPLLVPKECQESCKILGYDVPKGTIMFVNAWAIGRD 115787

115786 PRYWNDAEVFMPERFEKVAVDFRGTNFEFKPFGAGRRMCPGITFANATIEMALTALLYH 115610

115609 FDWHLPPGVTPDGLDMEEEFGMSVSRKRDLYLRPTLHMGLETI* 115478

 

#267

>aaaa01031277.1 CYP71AC1 (indica cultivar-group) orth AP003523.1b $F chr 6 99%

see AP003523.1b above for ortholog

1393 LPPGPWRLPFVGSLHHLAVMRPRGVVVHRALAELARRHDAPVMYLRLGELPVVVASSPEA 1572

1573 AREVLKTHDAAFATRAMSVTVRESIGDKVGILFSPYGKKWRQLRGICTLELLSVKRVRSF 1752

1753 RPIREEQVARLVDAIAAGA 1809

 

#25

>aaaa01000575.1 $FI CYP71AC2 (indica cultivar-group) 74% to AP003523.1

same seq as aaaa01002303.1 $FI orth to AP005610.1 AP005192.1

31724 MDMEMGKLLHRPWKWSLNSPLL

31558 LLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVIGSLHHLAMNPKAVHRALADLAR 31379

31378 RCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAFATRAMSVTVRDSIGDTVGILF 31202

31201 SPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLVGAIAAAAAAPGGDQPPPVNVS 31022

31021 WQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKASRFGVADLFPSSRLLRAVGSTA 30842

30841 VRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGGDDDARDDNECLLNTLMRIQKE 30662

30661 GGGTLSMSTVKAVIL (0) 30617

28748 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSY 28584

28583 PKNIIKETLRLHPVAPLLGXKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVF 28404

28403 LPERFEEITVDFGGTNYEFIPFGGGRRICPGITFAHATLEWALTALLYHFDWHLPPSVTP 28224

28223 DGLDMEEEFGMNVRRKRDLHLHPVIHVGVEKGIMS* 28116

 

#25

>aaaa01002303.1 $FI CYP71AC2 (indica cultivar-group) same seq as AAAA01000575.1

except 2 aa diffs and one short frameshifted region

see AAAA01000575.1 for ortholog

3994 MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAVGTRLPPGPWRLPVI 4173

4174 VQSAPPRHEPEGGARALADLARRCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 4353

4354 ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 4533

4534 GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 4713

4714 RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 4893

4894 DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 5001

6869 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 7048

7049 KETLRLHPVAPLLGXKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPERF 7228

7229 EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLEWALTALLYHFDWHLPPSVTPDGLDM 7408

7409 EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 7498

 

>AP005610.1 $F CYP71AC2 (japonica cultivar-group) chr 6 = AP005192.1

115277 MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVI 115456

115457 GSLHHLAMNPKAVHRALADLARRCGGGGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 115636

115637 ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 115816

115817 GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 115996

115997 RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 116176

116177 DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 116284

118153 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 118332

118333 KETLRLHPVAPLLMPKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPKRF 118512

118513 EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLELALTALLYHFDWHLPPSVTPDGLDM 118692

118693 EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 118782

 

>AP005192.1 $F CYP71AC2 (japonica cultivar-group) chr 6

83856 MDMEMGKLLHRPWKWSLNSPLLLLLIVPVMIHVQLKLRRRRKNAAAGTRLPPGPWRLPVI 83677

83676 GSLHHLAMNPKAVHRALADLARRCGGXGGVMYLRLGELPVVVASSRDAAREVLRTHDAAF 83497

83496 ATRAMSVTVRDSIGDTVGILFSPYGERWRRLRGICSLELLNARRVRSFRPIREEQVARLV 83317

83316 GAIAAAAAAPGGDQPPPVNVSWQIAGALTDLTLRAIMGECGFRWREEFLETLGEAQRKAS 83137

83136 RFGVADLFPSSRLLRAVGSTAVRDVRALNAKLFELVDRAIEQHREAAATTAAGGDHDDGG 82957

82956 DDDARDDNECLLNTLMRIQKEGGGTLSMSTVKAVIL 82849

80980 DMFAGGSETTSTILEWAMSELVKNPQVMQKAQAQIRLALQGRSRITEDDLINLSYPKNII 80801

80800 KETLRLHPVAPLLMPKECQESCKILGYNIPKGSIMLVNVWAIGRDHRYWDDAEVFLPKRF 80621

80620 EEITVDFGGTNYEFIPFGGGRRICPGITFAHATLELALTALLYHFDWHLPPSVTPDGLDM 80441

80440 EEEFGMNVRRKRDLHLHPVIHVGVEKGIMS 80351

 

#208

>aaaa01007044.1 $PI CYP71AC3P (indica cultivar-group) seq gap at 5222 no Nterm exon

might be in this gap but also one frameshift so probably a pseudogene of an

AP003523 like gene.

6146 DMFAGGSETTSTTLEWA 6196 (frameshift)

6196 PEVMQKAQAEIRHALQGKSRVTEDDLINLKYPKNIIKETMRLHPLASLLVPRKCQESCKI 6375

6376 LGYDIPKGTILIMNVWTIGRDHRYWDDAEVFIPERFEDTTIDFKGTHFEFIPFGAGRRMC 6555

6556 LGMTFAHATIELALTALLYHFDWHLPHGVTHDGMDMEEQFSVTVSRKRDLYLHPIQHVGVEEI* 6747

 

aaaa01007044.1 no japonica ortholog found 9/7/02

 

#427

>aaaa01034252.1 CYP71AC4 (indica cultivar-group) 77% to AP003523.1b

may be a pseudogene

1319 R*VVHRALADLVRRCDDLAPLMYLCLSELRVVVASTPDAAREVLKTHDAAMSTVVSAN 1146

1122 FAPYGKRWRHLRGICTLELLSAKRVRSFRPIREEQDARLVGAVVAAAAPSGESVNVRRLI 943

942  GGPMTDLALRAIMGE 898

 

no japonica ortholog found 9/12/02

 

#37

>aaaa01021566.1 CYP71AC5P (indica cultivar-group) orth of AL606658.1 2 diffs

lone pseudogene fragment

1048 SALNVSRQITGTLTDLTLRAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLP 1227

1228 AVGSRSGD 1251

 

>AL606658.1 $P CYP71AC5P chromosome 4 clone OSJNBb0016D16 lone pseudogene fragment

72% to AP003523 118132-115478

120987 SALNVSWQITGTLTDLTLHAIMGECGFRWHEEFLETLGEAQKKATRFGVADLFPSSRLLPAVGSRSGD 120784

 

#38

>aaaa01013200.1 $PI CYP71AC6P (indica cultivar-group) 3 diffs with AL606658.1 95%

94% to AP004571.1 and AP004327.1 lone pseudogene fragment

7465 ALNVSRQITGTLTDLTLHAIMGECGFRWREEFLETLGEAQKKATRFGVADLFPSSRLLPA 7644

7645 VESRSGD 7665

 

>AP004571.1 $P CYP71AC6P (japonica cultivar-group) chr 6 94% to AAAA01013200.1

identical to AP004327.1 lone pseudogene fragment

60465 ALNVSRQITGTLTDLTLRAIMWECGFRWREEFLETLGEAQKKATRFGVADLFLSSRLLPA 60286

60285 VGSRSGD 60265

 

>AP004327.1 $P CYP71AC6P (japonica cultivar-group) chr 6 94% to AAAA01013200.1 4 diffs

identical to AP004571.1 lone pseudogene fragment

105764 ALNVSRQITGTLTDLTLRAIMWECGFRWREEFLETLGEAQKKATRFGVADLFLSSRLLPA 105943

105944 VGSRSGD 105964

 

#39

>aaaa01017762.1 $PI CYP71AC7P (indica cultivar-group) 89% to AL606658.1

92% to AAAA01013200.1 lone pseudogene fragment

4196 ALNVSRQITGTLTDLTLRAIMGECGFRWREEFLETLGEAQKKATRFGVADLFPLSRLLPV 4017

4016 IRSRSGD 3996

 

#290

>aaaa01011555.1 CYP71AD1 (indica cultivar-group) orth AC109595.1 $F chr 5 >99%

8324 NARRRLAPAPRGLPVIGNLHQVGALPHRALRALAAATGAPHLLRLRLGHVTALVASSPAA 8145

8144 AAAVMREHDHVFATRPYFRTAEILTYGFKDLVFAPYGEHWRHARRLCSEHVLSAARSH 7971

7970 RY 7965

7950 QEVALLVNAIRTEAAAAAVDVSKALYAFTNAVICRAVSGRLSREDEGRSELFRELIEE 7777

7776 NATLLGGFCVGDYFPALAWADAFLSGFAARACRNLRRWDELLEEVIAEHEARLRGGDDG 7600

7599 GGEEHREEDFVDVLLALQEESQRHDGSFKLTRDIIKSLLQDMFAAGTDTSFITLEWAMSE 7420

7419 LVKNPAAMRKLQDEVRRGGGATTAATPYLKAVVKETLRLHPPVPLLVPREC 7267

7266 ARDTDDDATVLGYHVAGGTRVFVNAWAIHRDAGAWSSPEEFRPERFLPGGGEAEAVDLRG 7087

7086 GHFQLVPFGAGRRVCPGMQFALATVELALASLVRLFDWEIPPPGELDMSDDPGFTVRRR 6910

6909 IPLRLV 6892

 

>AC109595.1 $F CYP71AD1 chr 5 39% to 71As 40% to 71Bs

44% to 71Cs clone OJ1212B02, Length = 126962

72095 MEIELSPVLLLLPFLLLGFLYLTGGVLRSGGNARRRLAPAPRGLPVIGNLHQVGALP 71925

71924 HRALRALAAATGAPHLLRLRLGHVTALVASSPAAAAAVMREHDHVFATRPYFRTAEILTY 71745

71744 GFKDLVFAPYGEHWRHARRLCSEHVLSAARSHRYGPMREQEVALLVNAIRTEAAAAAV 71571

71570 DVSKALYAFTNAVICRAVSGRLSREDEGRSELFRELIEENATLLGGFCVGDYFPALAWA 71394

71393 DAFLSGFAARACRNLRRWDELLEEVIAEHEARLRGGDDGGGEEHREEDFVDVLLALQE 71220

71219 ESQRHDGSFKLTRDIIKSLLQDMFAAGTDTSFITLEWAMSELVKNPAAMRKLQDEVRRGG 71040

71039 GATTAATPYLKAVVKETLRLHPPVPLLVPRECARDTDDDATVLGYHVAGGTRV 70881

70880 FVNAWAIHRDAGAWSSPEEFRPERFLPGGGEAEAMDLRGGHFQLVPFGAGRRVCPGMQFA 70701

70700 LATVELALASLVRLFDWEIPPPGELDMSDDPGFTVRRRIPLRLVAKPVGSEDDK* 70536

 

#383

>aaaa01019060.1 $FI CYP71AE1 (indica cultivar-group) one stop in exon 2

4042 MASLATVPNLPLLLLLHYALATFTASRARKNNKDRLPPSPLALLVIGHLLHLMGSLPRTSPSAASPHG 3839

3838 TGPTCSSGLAPCRCSLRRRRVPAAEAILRTHDHVFASRPRTVLLANIVFYRSRDVRFAPY 3659

3658 GDHWRQARKLVTTHLLSAKKVRSLRLAREEE (0) 3584

2413 VSLVMTKISKAATASAVVDIGQILRSFTNDMICRTVSGKCPRDDR*KRIFQELANETSLL 2234

2233 LGGFDIEEYFPVLARVGLVGKMMCLKAERLKKRWDELLEELINDHENDDHSCNLISDQND 2054

2053 EDFVDILLSVRQEYGFTREHVKAIL (0) 1979

1622 DVFFGGIDTSALVLEFTIAELMQRPRMLKKLQDEVRACIPKGQKIVSEVDINNMAYLRAV 1443

1442 IKEGIRLHPVAPVLAPHISMDDCNIDGYMIPSGTRVLVNVWAIGRDPRFWEDAEEFVPER 1263

1262 FIDSMSSAAANVNFTENDYQYLPFGYGRRMXPGMKFGIAVVEIMLANLMWKFDWTLPPG 1086

1085 TEIDMSEVFGLSVHRKEKLLLVPNNMSSC* 996

 

No japonica ortholog found 9/11/02

 

#382 = #425 = #40 reduce gene count by 2

>AQ573853 nbxb0085A03r CYP71AE2 (partial)   50% to 71A24 AQ691042 nbxb0086M20r

AQ795917.1 nbxb0058F03f CUGI Rice BAC genomic clone Length = 684

No indica ortholog found

MAARTWLWLLLSPLILLLLHYALALLTARRARKNPLPPSPPALPFIGHLHLIGALPHVSLCCLAT

KHAPDLMFLRLGTSLPVLVASSPCAAEAILRTHDDVFASRPRTVLADIIFYGSRDIGFAPYGEDWRQAR

 

#40 = #382 = #425 reduce gene count by 2

>AQ573853 nbxb0085A03r CYP71AE2 (partial)   50% to 71A24 AQ691042 nbxb0086M20r