Rice P450 sequences from japonica strain.  Some supporting accessions

like ESTs may be from indica. Use this file in conjunction with

Rice_annotation.P450s.xls

 

David Nelson

Sept. 18, 2007

 

412 rice sequences from japonica, plus three from indica and two duplicates, one with alternative exons. 417 total sequences.  317 genes and 95 pseudogenes.

 

>CYP51G1  

MMDLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIP

AAPLVGGLLRFMRGPIPMIREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAE

MSQQEVYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYF

SKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIF

PYLPIPAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGE

ITGLLIAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILA

EMDVLYRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRL

PHIFKNPDSYDPDRYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLL

RNFEFELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN

 

>CYP51G3  

MDLTTGAIWLFLAQLFVAATMLSKIATRERTRTTGTKFSRPPPPPLARGAPLVGVLPSLLANGPVEFIRH 182

183 HYEKMGSVFTVSLLQQKVTFLVGSEASSHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVD 362

363 YATRHEQFRFFGDIMKPAKLRTYVDLMVAEVE (0)

    GYFARWGQSGTVNMKQEFEQLVTLIASR 542

543 CLLGEEVRDKMFDEVSTLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVR 722

723 SRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHTSSSTST 902

903 WAGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKETLRLHPPALML 1082

1083LRHARRSFVVRGGSGEREYEVPEGHTVASPLLLHNALPRVYRDPGEFDPGRFGAGRE 1253

1254EGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQLVSPFPETDWTVVMPGPK 1433

1434GKVMVTYNRRKLT* 1475

 

>CYP51G4P       

54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD 54191 (intron no boundaries)

54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0) missing 20 aa

56119 VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292

56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421 frameshift

56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift

56510 LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698

56699 RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878

56879 DHAYTVFGGGRHACVGE 56929 frameshift

56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL* 57108

 

>CYP51H1  

55155 MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 55331

55332 RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 55511

55512 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ

57812 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS 57985

57986 TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 58150

58151 ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 58330

58331 DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 58510

58511 MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 58690

58691 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK* 58834

 

>CYP51H2P

52199 MDHLTSS (frameshift)

TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG 52375

52376 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 52480

 

>CYP51H3   also frameshifted in AACV01016110.1 may have a non-traditional C-term

not frameshifted in indica

12878 MDQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAAPPPVVQGVGLVRFVRAMAR 13063

13064 DGPLEAIREQQAKLGSVFTASAPLGLFKVTFLIGSEVSSHFYVASYSEISMGRLYEFTVP 13243

13244 IFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE 13372

13617 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 13730

13727 ESVPGKLCELFGELDNGLHLISGLLPYLPIPAHRRRDRARQRLGEIITEVIRLRRNSSR 13903

13904 GAAGTDENNDDMLQCLINSRYKDGCAMTDAEIAGLVVALMFAGKHTSSGVSIWTGVHLLS 14083

14084 NPNHLAAVVAEQDRLMASCPGRTDDYHRLDYDTVQEMRSLHCCVKEALRLHPPVAAVRQA 14263

14264 YKHFTVQTKEDKEYTIPGGHMVVSTILVNHYLPHIYK

DPHVFDPQRFAPGREEDKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLLSN 14540

14541 FEIKMVSPFPET 14576 (frameshift)

QWSTVIPEPKGKVMVSYRRRTAPK* 14649 P450 conserved end

MEHGYSRA* frameshifted end

 

>CYP51H4  

78968 MDHIFSTNAWLTIALVFIITLAAKVVRSSVTLPAEKTSKPRPPPEAKGAPLV 79123

79124 GIIPAVLRRGLQAVIREQHRALGSVFTLRSLGLAVTFLVGPECSDHFFHAPELEIAID 79297

79298 GLYEVTVPIFGKEVGYDIDLDTRNEQHRFFAKMLRPAKLRGHVLPMVCEIEX 79450

79551 YFGKWGECGVVDLMQEVDHVLMLIASRCLLGKEVRENMFDEVASLFHELMGGMHLISMFF 79730

79731 PYLPTPGHRRRDKARAKLGEIFSQIVKTRKMSGRVEDDMLQDLIDSTYGDGRATT 79895

79896 DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 80075

80076 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL 80255

80256 ASPLVVNTLLPNIYKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAY 80435

80436 QQIKVILSHLVSNFELKLESPFPETEDMLSMRPKGKAIVSYKRRTLS 80576

 

>CYP51H5

78158 MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH

77972 SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805

77804 IKPINLRGHVDSMVHEVE 77751 (0)

76666 GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484

76483 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI 76304

76303 AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124

76123 MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944

75943 YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764

75763 ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665

 

>CYP51H6  

70310 MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489

70490 LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669

70670 EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE 70813 (0?)

71263 DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV 71439

71440 FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)

71910 YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083

72084 KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV 72263

72264 PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443

72444 YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL 72593

 

>CYP51H7  

80009 MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176

80177 ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI 80356

80357 AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)

84741 DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917

84918 FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097

85098 TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277

85278 ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457

85458 PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637

85638 AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766

 

>CYP51H8  

122577 MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735

122736 LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915

122916 VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE 123050 (0?)

123296 DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF 123436

123437 HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616

123617 SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796

123797 HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976

123977 SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156

124157 VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285

 

>CYP51H9  

50920 MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL

51082 PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)

GGFYSRPE 51261

51262 SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)

52114 EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299

52300 SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479

52480 NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT 52653

52654 TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833

52834 YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013

53014 ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124

 

>CYP71C12

50394 MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD

50298 LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH 50161

50160 LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981

49980 DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801

49800 HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621

49620 RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441

49440 GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261

49260 EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS 49081

49080 YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910

48909 RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790

 

>CYP71C13P      

54948 MAQMLGALLLFQDSLMSTMTRMSY

54876 SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742

54741 HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574

54573 TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394

54393 THMA IDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214

54213 ARLPVRRL LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE 54037

54036 YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857

53856 EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677

53676 SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506

53505 YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389

 

>CYP71C14P       also frameshifted in AACV01017100.1, but not in indica

frameshift exist in 3 ESTs CI183363.1, CI183362.1 and CI018604.1

so this is probably a real pseudogene and not a seq error

58316 MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM

58217 ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086

58085 IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906

57905 YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726

57725 EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY 57546

57545 FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366

57365 IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186

57185 EIVTEEQLGRMPY 57153 frameshift

57147 LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF 56968

56967 IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797

56796 DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692

 

>CYP71C15

68223 MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387

68388 PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH 68567

68568 DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747

68748 RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927

68928 LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD 69107

69108 SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)

69331 DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510

69511 IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690

69691 FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE 69858

69859 EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939

 

>CYP71C16

78935 MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111

79112 LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291

79292 ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM 79471

79472 ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651

79652 LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828

79829 LSLQDEYGFTRDHIKAISI 79885 (0)

80359 DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV 80538

80539 IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718

80719 FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895

80896 EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982

 

>CYP71C17

63826 MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII

63733 LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV 63581

63580 SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401

63400 SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221

63220 ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR 63041

63040 RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861

62860 RQQEYNLTRHNIHAILM 62810 (0)

62626 DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453

62452 VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER 62273

62272 FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093

62092 KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994

 

>CYP71C18P       also stop and fs in AACV01000601.1, AK100062 cDNA

56483 MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP 56650

56651 PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815

56816 THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995

56996 E 56998 (0)

57929 VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105

58106 VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285

58286 DQDEMDFVDVLLLQERGITRDHLKAILV 58369 (0)

58462 DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641

58642 VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821

58822 RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift

58899 RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078

59079 VEYKGSVQDSAVIL* 59123

 

>CYP71C19

21862 MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP

21694 PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR 21518

21517 THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344

21343 E 21335 (0)

      VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN

      SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS

      KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAIL (0)

20004 DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825

19824 VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645

19644 RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA 19465

19464 IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339

 

>CYP71C20

MAQMLAAFLLDGLISHEHGHESLGAPPQAGTMAWYSLVLMTS

79980 LLFPLLVLLVMRCYVTRSGAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRDLATKH 79810

79809 SPDMMLLHLGAVPTLVVSSSRVAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNE 79630

79629 YWRQIKKITTTHLLTMKKVRSYVSARQREVRIVMARITEAASKHVVVDLTEMLSCYSNN 79453

79452 IVCHAVCGKFSLKEGWNQLLRELVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKA 79276

79275 HNINKRWDQLLEKLIDDHTTKHIRSSSMLNHYDEEAGFIDVLLSIQHEYGLTK 79117

79116 DNIKANLAAMLMAGMDTSFIELEYAMAELMQKPHVMGKLQAEVRRVMPKGQDIVTEEQLG 78937

78936 CMPYLKAVIKETLRLHPPAPLLMPHLSISDCNINGYTIPSGTRVIVNVWALARDSN 78769

78768 YWENADEFIPERFIVNTLGDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLV 78601

78600 YRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPHLHLR* 78472

 

>CYP2C22P

61469 EQESDFVDILLDHQQEYNLTRHNIHAILM 61383

 

>CYP71C23P      

103765 FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLAVHRKEKLLLVSWLPQD* 103595

 

>CYP71E4  

96529 MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLR

96388 LPPGPARLPVLGNLLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLR 96212

96211 THDADCCSRPSSPGPMRLSYGYKDVAFAPYDAYGRAARRLFVAELFSAPRVQAAWRARQDQ 96017 (0)

94678 VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDVMDMLASFSAEDFFPNA

94454

94453 AAARLFDHLTGLVAHRERVFQQLDAFFEMVIEQHLDSDSSNAGGGGGNLVGALIGL 94286

94285 WKQGKQYGDRRFTRENVKAIIF 94220 (0)

94119 DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAY 93946

93945 LKMVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVF 93766

93765 DPDRFEAKRVEFNGGHFELLPFGSGRRICPGIAMGAANVEFTLANLLHCFDWALPVGMAP 93586

93585 EELSMEESGGLVLHRKAPLVLVPTRYIQL* 93496

 

>CYP71E5  

31348 MAISLITSLLFSLPQQWQPVVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLA 31169

31168 GPQPHRALRDLARVHGPVMRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRV 30989

30988 TYGMKNVAFAPYGAYWREVRKLLMVELLSARRVKAAWYARHEQ 30860

30546 VEKLLSTLRRAEGKPVALDEHILSLSDGIIGTVAFGNIYGSDKFSQNKNFQHALDDV 30376

30375 MEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSFFEMVIEQHLDPNRAPP 30196

30195 ENGGDLVDVLIGHWKKNEPRGTFSFTKDNVKAIIF 30091

29601 STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRKVV 29422

29421 KETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNPER 29242

29241 FEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDNVC 29062

29061 LEEEGRLVCHRKTPLVLVPTVYRHGLE 28981

 

>CYP71E6  

2204 MAASLLLELLPQQWQLSITSLIL

2273 LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG 2437 (fs)

2437 LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 2532

2533 AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 2658

8170 VMLPDYYCCM 8199

8599 VEKLIEKLTRNGRNAVAINEHIFSTVDGIIGTFALGETYAAEEFKDISETMDLLSSSSAE 8778

8779 DFFPGSVAGRLVDRLTGLAARREAIFRKLDRFFERIVDQHAAADDDGPAAARRKADDKGS 8958

8959 AGSDLVHELIDLWKMEGNTKQGFTKDHVKAMLL 9057

9159 DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLKMVV 9338

9339 KETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNA*AIGRDPNIWKDPEEFIPERF 9518

9519 EEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKEDIDM 9698

9699 EEAGKLTFHKKIPLLLVPTPNKAPN* 9776

 

>CYP71K1  

MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWAL

PVIGHLHHVAGALPHRAMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAF

ATRPITPTGKVLMADSVGVVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGR

LLRAVAAAAAVAALTTPGATAAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLER

RMKLLPAQCLPDLFPSSRAAMLVSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAE

EDLLDVLLRIQSQDKTNPALTNDNIKTVIIDMFVASSETAATSLQWTMSELMRNPRVM

RKAQDEVRRALAIAGQDGVTEESLRDLPYLHLVIKESLRLHPPVTMLLPRECRETCRV

MGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFAPERFEGVGAADFKGTDFEYIPFGAGR

RMCPGMAFGLANMELALAALLYHFDWELPGGMLPGELDMTEALGLTTRRCSDLLLVPA

LRVPLRDHER

 

>CYP71K3  

66403 MATELTEYLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHR 66582

66583 AMRDMARRHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEG 66762

66763 VIFAPYGDGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAASSSS 66924

66925 SPVNLTGMISAFVADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAML 67104

67105 LSRVPAKIERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 67281

67282 IKSILI 67299 (0)

67380 DMFGAGSETSATTLQWAMAELMRNPAVMRRAQDEVRRELAVAGNDRVTEDTLPSLHYL 67553

67554 RLVIKETLRLHPPAPLLLPRECGGACKVFGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEF 67733

67734 SPERFERCERDFRGADFELIPFGAGRRICPGMAFGLAHVELALAALLFHFDWRLPGGMA 67910

67911 AGEMDMTEAAGITVRRRSDLLVFAVPRVPVPAQ* 68012

 

>CYP71K4  

68586 MPLVVLLLATIPLLFFTIKRSAQRRGGGGGGEGRLPPGPWALPVIGHLHHLAGDLPHRA 68762

68763 LSALARRHGALMLLRLGEVQAVVASSPDAARDIMRTHDAAFASRPLSPMQQLAYGRDAEG 68942

68943 VIFAPYGDGWRHLRKICTAELLSARRVQSFRPVREAELGRLLRSVAEATSSSSSA 69107

69108 SLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRMLQDGLKIVPGMTLPDLFPSSRLALF 69287

69288 LSRVPGRIEHHRQGMQRFIDAIIVEHQEKRAAAAANDDDDEDEDFLDVLLKLQKEMGSQH 69467

69468 PLTTANIKTVML (0)

      DMFGAGSESSATVLQWT 69647

69648 MAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRLVIKETLRLHPPAPLLLP 69821

69822 RKCGSTCKILGFDVPEGVMVIVNAWAIGRDLTYWDKPEEFVPERFEHNGRDFKGMDFEF 69998

69999 IPFGAGRRICPGITFGMAHVELVLSALLYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNL 70178

70179 LVRPIHRVSVPVE* 70220

 

>CYP71K5  

70828 MAGELAFYLLLVGLVAVPLLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPHRA 71007

71008 MRDLARRHGPLMLLRLGEVEAVVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGL 71187

71188 VFAPYGEAWRRLRRVCTQELLSHRRVQSFRPVREDELGRLLRAVDAAAAAGT 71343

71344 AVNLTAMMSTYVADSTVRAIIGSRRLKDRDAFLRMLDELFTIMPGMSLPDLFPSSRLAML 71523

71524 VSRAPGRIMRYRRRMRRIMDSIIHEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGA 71703

71704 QYPLTTENIKTVMM 71745 (0)

71837 DIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLSYL 72016

72017 KLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 72193

72194 EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVELALAALLFHFDWSLPG 72370

72371 GMAADELDMAESSGLTTRRRLPLLVVARPHAALPTKYCN* 72490

 

>CYP71K6   not frameshifted in AACV01014660.1 or AACV01014659.1

and phase 0 boundary is correct, so this is not a pseudogene

128476 MAAELVHLLRYLFSVPM

128425 LFFIVPLLFLVCSPRRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAMRDIARRHGPL 128246

128245 VLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGVIFAPYGETWR 128066

128065 QLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELMSA 127913

127912 YAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMPRRMKRH 127733

127732 RERMTAYLDAIIEEHQESRASREDDEDLLDVLLRI 127628 frameshift

       QREGDLEVSRESIRSTIG bad exon boundary should be phase 0

126439 DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGY 126275

126274 MNLVIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEF 126095

126094 IPERFENAGINFKGTNFEYMPFGAGRRMCPGMAFGLATLELALASLLYHFDWKLPDGV 125921

125920 EIDMKEQSGVTTRRVHDLMLVPIIRVPLPV* 125828

 

>CYP71K7P

138301 MAEVVQLHHLILLLPLFILPFLLLRSSRRRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIA 138152

138151 RRHGPLVLLRLGELPVVVASSADAARDVMKTHDLAFATRPITRMMRLVFPEGSEGIIFSP 137972

137971 YGETWRQLRKICTVELLSARRVNSFRSVREEEVNRLLRAVAAAAASATSPAKTVNL 137804

137803 SELMSAYAADSSVRAMIGRRCKDRDKFLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMP 137624

137623 RRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCME 137480 frameshift

       SPLLSTESIRTTIG bad exon boundary should be phase 0

136562 DLFNGGSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYL 136389

136388 VIKEALRLHPPRPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILE 136209

136208 RFEHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKL 136056

136039 GDLDMTQERGATTRRLHDLLLVPVIRVPLPLDSRS* 135942

 

>CYP71K8  

151229 MAGFPVYLLFLAALIILPMANLIRSARHRRLAGARRPPPGPWALPVIGHLHHLLAGKLPH 151408

151409 HHKLRDLAARHGPLMLLRFGELPVVVASSADAAREIAKAHDLAFATRPVTRTARLTLPEG 151588

151589 GEGVIFAPYGDGWRQLRKICTLELLSARRVLSFRAVREQEVRCLLLAVASPSPEGTTAT 151765

151766 ASVVNLSRMISSCVADSSVRAIIGSGRFKDRETFLRLMERGIKLFSCPSLPDLFPSSR 151939

151940 LAMLVSRVPGRMRRQRKEMMEFMETIIEEHQAARQASMELEKEDLVDVLLRVQRDGSLQF 152119

152120 SLTTDNIKAAIA 152155 (0)

166133 DLFIGGSETAATTLQWAMSELLNNPKVMQKAQDEIRQVLYGQERITEETISSLHYLHL 166306

166307 VIKETLRLHPPTPLLLPRECREPCQILGFDVSKGAMVLINAWSIGRDPSNWHAPEKFMPE 166486

166487 RFEQNNIDFKETSFEYIPFGAGRRICPGMTFRLANIELLLASLLYHFDWELPYGMQAGD 166663

166664 LDMTETLAVTARRKADLLVVPVVRVPIVG* 166753

 

>CYP71K9  

168538MAAAASSVLAYLLVVALLAIVPLVYFGWVARRRGEGGRLPPSPWGLPVIGHLHHLAGALPHHAMRDLA168741

168742 RRHGPLMLLRLGELPVVVASSAEAAREVMRTRDIEFATRPMSRMTRLVFPAGTEGIIFAP 168921

168922 YGDEWRELRKVCTVELLSARRVQSFRAVREEEVGRLLRAVAATSSSPSPAQAAVNL 169089

169090 SALLSAYAADSAVHAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMWLSRMP 169269

169270 RRMMQHRREAYAFTDAIIREHQENRAAGAGDGDGDDKEDLLDVLLRIQREGDLQFPLSTE 169449

169450 RIKTTVG 169470 (0)

169893 DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 170066

170067 VIKEALRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 170246

170247 RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFNWQLPDGMDTAD 170423

170424 LDMTEEMVVSARRLHDLLLVPVVHVPLPVASS* 170522

 

>CYP71K10

145371 MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPH 145201

145200 VAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEG 145024

145023 GEGIIFAPYGDRWRELRKICTVELLSGRRVQSFRPVREEEAGRLLRAVAAASPG 144862

144861 QAVNLSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAM 144685

144684 LLSRMPRRMKQHHRDMVAFLDAIIQEHQENRSAAADDDNDLLDVLLRIQREGDLQFPLS 144508

144507 SESIKATIG 144481 (0)

144297 DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEVRRELIGHRKVTEDTLCRLNYMHM 144124

144123 VIKEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPE 143944

143943 RFEHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENL 143764

143763 DMTEEMRFTTRRLHDLVLIPVVHVPLPTI* 143674

 

>CYP71K11

94448 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 94627

94628 LPPHHAMRDIALRHGPLVRLRLGGLQVI 94711

94712 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0)

      GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLL 95071

95072 RAVAAASPARRAVNLSELISAYSADSTMRALIGSRFKDRDRFLMLLERGVKLFATPSLPD 95251

95252 LYPSSRLAELISRRPRQMRRHRDEVYAFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQR 95431

95432 KGDFPLSTDNIKTTIG (0) 95479

95574 DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 95747

95748 VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 95927

95928 RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 96107

96108 DMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 96200

 

>CYP71K12

113634 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGA 113813

113814 LPPQHAMRNIALRHGPLVRLRLGGLQVI 113897

113898 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0) 114008

114127 GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNL 114300

114301 SELISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRP 114480

114481 RQMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 114663

       DLFNGGSETTATTLKWIMAELIRNPRVM 114840

114841 QKAQDEVRQVLGKHHKVTEEALRNLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFH 115020

115021 VPQGTMILVNMWAISRDPMYWDQAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIA 115200

115201 FGLVNLELVLASLLYHFNWELPDETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 115383

 

>CYP71K13P      

117756 DAAREVMRTHDLAFATRPSTRVMQLVFLEGSQ 117661

117553 GDRFTPYGDIWRNLRRSAPLAVSAKRVQFFRPIHQEEVCRLLQAVAVASPA 117395

117394 RGPPETLTSSFRPTWATLQCAP**GARLRDRDKSLMLLYRGVKPIRHARACQIFTQSIAL 117215

117214 ADLIIKSLSPMRRASYPMSNLLDIIFK 117134

117108 SDNHMDLTLVAFLLRFHKKGACPLSFCYIRKQFG*AF 116998

 

>CYP71P1  

44616 MSLALLVLSAAYVLVALRRSRSSSSKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELART 44437

44436 MRAPLFRMRLGSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAG 44257

44256 PYHRMARRVVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLAND 44077

44076 VLCRVAFGRRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCL 43897

43896 ADLREACDVIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0)

      DMFVAGTDTTFATLEWVMTELVRHPRILKKA 43537

43536 QEEVRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPA 43357

43356 RTRVFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGY 43177

43176 TFALATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFK 42997

42996 GEELSEV* 42973

 

>CYP71P2P

107880 GSMPAVVISKPNLARPALTTNDAVLASRQHLLNG*FLSF 107764 frameshift

107762 GCSDVTFAPAGPYHRM 107715 frameshift

107713 QMARGVEVSELLSAHHVAMYGVVRVKELQRLLAHLTKNTSSAKPIDLSECFLNLANDVLCRVAF 107521

107520 GRRFPRDEGDKLSAVLANAQDLL 107452 frameshift

107452 AGFTISDFFLELEPVASTVTGLCHRLKKCLADLCEACDVIVDVHISGNRQRIPSDREEDFVDVLLRVQ 107249

 

>CYP71Q1  

22020 MADDFLSSQPQPW 22058

22059 PPLLQLSAAVLFFLLPLLYLLFLRGSNGEVRGRQGNSASAPSLPGPCRQLPVLGNLLQIG 22238

22239 SRPHRYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRPPSPG 22405 (2)

26427 SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVRSFAYARAAEVARLVDTL 26577

26578 AASPPGVPVDLSCALYQLLDGIIGTVAFGKGYGAAQWSTERAVFQDVLSELLLVLG 26745

26746 SFSFEDFFPSSALARWADALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQED 26919

26920 MVDALVKMWREQQDRPSGVLTREHIKAILM 27009 (0)

28586 NTFAGGIDTTAITAIWIMSEIMRNPRVMQKARAEVRNTVKNKPLVDEEDSQNLKYLEMIIKEN 28774

28775 FRLHPPGNLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRGPMIWDNPEEFYPERFE 28948

28949 DRNMDFRGSNFELVPFGSGRRICPGVAMAVTSLELVVANLLYCFDWKLPKGMKEEDIDM 29125

29126 EEIGQISFISFRRKVELFIVPVKHEQYQLMGHIN* 29221

 

>CYP71Q2   also frameshifted in AACV01015820.1

69192 MATELLASQLLPWQPLVQLLAAGLFLLPLVYLLFFKGDGNGG 69317

69318 VMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRYVPVVQVQLGSIRTVVVHS 69494

69495 PEAAKDVLRTNDLQCCSRPSSPG 69565 (2)

72047 NYNYLDVAVSPYS 72083 (frameshift)

72085 YWREMRKLLVIELTSIRRVQSFAYARAAEVARLVDTLAASPAGVPVDLSSALYTF 72249

72250 SDGVIGTVAFGKVYGSAAWSSWEWGASFQEAMDETMQVLGSFSFEDFFPSSALARWADALTGA 72438

72421 AGRRRRVFHRIDGFFDAVIDKHLEPERLSAGVQEDMVDAMVMVWREQKDEAFGLTRDHIKAILL 72630 (0)

84351 DAFVGGIDTTAVTVTWIMSELMRNPRVMQKAQAEVHNIVKNKSKVCEEDIQNMKYLKM 84524

84525 IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNIWDNPEQFY 84698

84699 PERFEDKGIDFRGSHFELLPFGSGRRICPGIAMGVANVELVVANLLYCFNWQLPKGMKEE 84878

84879 DIDMDEIGQLAFRKNFLF* 84935

 

>CYP71Q3P

92865 DAFAGGIDTTVVTTTWIMSELMRNPRVMQK 92954 (frameshift)

92956 AQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCRLHPPGTLLIPRHTMKTCTIGGYSV 93132

93133 PSKRRIYVNVWAMWRDPNIWDNLEQFYLERFEDKGIDFRGSHFELLT 93273 (insertion)

93561 FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDTDMDEIG*LAFRKKLPLFI 93740

93741 VPMKH* 93758

 

>CYP71Q4P

16812 GGGGRWTETLEWIMAELTANTRVMAKLQDEISRAADGK 16925 24 aa deletion and frameshift

16931 PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSSEEFWPEQFLASREAVD 17110

17111 FQGNNYQLILFITDRRIFPDINFAVPVLETALVGLLHPTNELLGGG 17248

17249 GGLMWLQRSCSRARRPRSTAHRRHRSGTHPAAIAAAAAT* 17368

 

>CYP71R1  

  56 MAAVQLDSGLLVGFLFLATCLAVAIRSYLRSGGAAIPSPPALPVIGNLHQLGRGRHHRAL 235

 236 RELARRHGPLFQLRLGSVRALVVSSAPMAEAELRHQDHVFCGRPQQRTARGTLYGCRDVA 415

 416 FSPYGERWRRLRRVAVVRLLSARRVDSFRALREEEVASFVNRIRAASGGGVVNLTELIVG 595

 596 LTHAVVSRAAFGKKLGGVDPAKVRETIGELADLLETIAVSDMFPRLRWVDWATGLDARTK 775

 776 RTAAKLDEVLEMALRDHEQSRGDDDDGGGGDGEPRDLMDDLLSMANDGGGDRGHKLDRID 955

 956 VKGLILDMFIAGTDTIYKSIEWTMAELIKNPAEMAKVQAEVRHVAAAAH 1102

1103 GDEDEDTVAVVREEQLGKMTLLRAAMKEAMRLHPPVPLLIPREAIEDTVLHGHRVAAGTR 1282

1283 VMINAWAIGRDEAAWEGAAEFRPGRFAGGGAAAGVEYYGGGDFRFVPFGAGRRGCPGVAF 1462

1463 GTRLAELAVANMACWFEWELPDGQDVESFEVVESS 1567

 

>CYP71R2P

54816 MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAA

54690 ITSPPALPVIGNLHQLGRGRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRH 54511

54510 QDHVFCGRPQQHTARGTLYGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEE 54331

54330 VASFVNRIRAASGGGGGVVNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGEL 54160

54159 ADLLGTIAVSDMFPRLRWVDWATGLDARTKRTAAKLDEVLEMVLRDHEPSRGDDDDDDGD 53980

53979 GEARDLMDDLLSMANGGDDHGYKLDRIDVKGLLIL 53875 (0)

      DMFAAGTDTVYKSIE*TMAELIKNPAEMAKVQAEVRHVVAAAHGGEGDEDAVVIVKEEQAS 53616 (fs)

53613 LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 53434

53433 EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC 53254

53253 WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYVVQTTRM* 53110

 

>CYP71S1  

23233 MSMASLQAPEFLASCLLLATILFFKQLLAPSSKQRAASPSLPRPRGLPLIGNLHQVGALPHRSLAALAAR 23096

23095 HAAPLMLLRLGSVPTLVVSTADAARALFRDNDRALSGRPALYAATRLSYGQKSISFAPD 22919

22918 GAYWRAARRACMSELLGPPRVRGLRDAREREAAALVAAVAAAGASPVNLSDMVAATSSR 22742

22741 IVRRVAFGDGDGDESMDVKAVLNETQALLGGLWVADYVPWLRWVDTLSGKRWRLERRFRQ 22562

22561 LDALYERVIDDHLNKRKHASDEEDDLVDVLLRLHGDPAHRSTFGSRSHIKGILT 22400 (0)

22059 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHYLR 21889

21888 LVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAERF 21709

21708 VPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWRAP 21538

21537 PGREVDVEEENGLAVHKKNPLVLIATKSKRNTGGH* 21427

 

>CYP71S2  

19270 MASLQAPEFLASCLLLLATILLFKQLLAPSSKKRAASPSLPRPKGLPLIGNLHQVGALPHRSLAAL 19073

19072 AARHAAPLMLLRLGSVPTLVVSTADAARALFRNNDRALSGRPALYAATRLSYGQKNISF 18896

18895 APDGAYWRAARRACMSALLGAPRVCELRDAREREAAALIAAVAAAGASPVNLSDMVAAT 18719

18718 SSRIVRRVAFGDGDGDESMDVKAVLDETQSLLGGLWVADYVPWLRWVDTLSGMRRRLERR 18539

18538 FRQLDAFYERVIDDHINKRKHASDEEDDLVDVLLRLHGDPAHRSMFGSRTHIKGILT (0)18368

17337 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVIAGGGGGDKDGAMVREADL 17149

17148 PELHYLRLVIKETLRLHPASPLVQRETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWG 16969

16968 PNAERFLPERHRAHDADGEQQHEHDGFALVPFGIGRRSCPGVHFAAAAAELLLANLLFCF 16789

16788 DWRALPGREVDVEEENGLAVRKKNPLVLIATKSKSNRDAH* 16666

 

>CYP71T1  

34853 MELSSSLAAVLHSPLFLLAAL

34916 LLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGHLPLLGSLPHRKLRSMAEAHGP 35095

35096 VMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRMAERLIYGRDMVFAPYGEFWR 35272

35273 QARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGVRGGGETVNLSDLLMSYANGV 35452

35453 ISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGEFVPWLAWVDKLMGLDAKAA 35629

35630 RISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDHRDFVDVLLDVSEVEEGAG

      AGEVLLFDTVAIKAIIL (0)

36458 DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELR 36634

36635 LLRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRDAAAWGDRAE 36814

36815 EFVPERWLDGGGEEVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDW 36994

36995 ELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV* 37129

 

>CYP71T2  

39698 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP

39839 LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018

40019 RDLAFASRPRVRMSERLFYGRDM AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195

40196 VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA 40375

40376 DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552

40553 VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)

42074 DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169

42170 QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE 42349

42350 DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529

42530 RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709

42710 VRLKADLNLVAKPWSPGAS* 42769

 

>CYP71T3  

48011 MAVSLVVVVVV

48044 VIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHLLGALPHRALRS 48223

48224 LAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAERLLYGGRDVAFA 48403

48404 PYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVDLVEHLTAY 48574

48575 SNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLGWVDALN 48748

48749 GMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVNETDMD 48925

48926 AGVQLGTIEIKAIIL 48970 (0)

51142 DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 51312

51313 AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAWTIGRDQATWGEHAEEFI 51492

51493 PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 51672

51673 EFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP* 51771

 

>CYP71T4  

58119 MAVSLLPAVL

58149 VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328

58329 LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY 58508

58509 GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685

58686 LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859

58860 FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036

59037 VNETDKDAGIQLGTVEIKAIIM 59102 (0)

59562 DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717

59718 LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897

59898 PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077

60078 TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200

 

>CYP71T6 indica AAAA02000630.1

7617 MVVVVVVVAIAIVVPLLYLVLLPPARRGGGDSARRRLPPSPRGLPLLGHLHLLGALPHR 7441

7440 ALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLYG 7270

7269 GRDVAFAPYGEYWR 7228

7226 QARRICVVHLLSARRVLSFRRVREEESAALVARVRAAAGGAVDLVEHLTAYSNTVVSRAV 7047

7046 FGDESARGLYGDVDRGRALRKLFDDFVELLGQEPMGELLPWLGWVDAVRGLDGKVQRTFE 6867

6866 ALDSIIEKVIDDHRRRRRRHEVGRQMDSDDDGGGGGDHRDFVDVLLDVNETDKDAGIRLG 6687

6686 TIEIKAIIL (0) 6660

6536 DMFAAGTDTT 6507

6506 TTAMEWAMAELITHRDAMHKVQDEIRAVVGVTGCVTEDHIDRLPYLKAVLKETLRLHPPN 6327

6326 PLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPVTWGEHAEKFIPERFLNNNVDYKG 6147

6146 QDFGLVPFGAGRRGCPGMGFAVPTIEMALASLLYNFSWETRPVDRRCKSGTSSLDMSEVN 5967

5966 GISVHLKYGLPLMAKSYFS* 5907

 

>CYP71T7  

10977  MDISLASLVLVLLAFVLPLLYLLLQLPGKKSGGGGGDGPRLPPSPAGCLPLLGHLHQLGP  11156

11157  LPHVALRSMAAAHGPVLRLRLGRVPTVVVSSAAAAEEVLRARDAAFSSRPRSAMAERILY  11336

11337  GRDIAFAPYGEYWRQARRVCVVHLLSAQRVSSFRRVREEEAAALADAVRAAGRGGGRAFD  11516

11517  LSGLIVAYASAVVSRAAFGDESARGMYGGADGGRAVRKAFSDFSHLFGTKPVSDYLPWLG  11696

11697  WVDTLRGRERKARRTFEALDGVLDKVIDDHRRRRDSGRRQTGDADAGHRDFVDVLLDVNE  11876

11877  MDNEAGIHLDAIEIKAIIM (0) 11933

12008  DMFVAGSDATSKPMEWAMAELVSHPRHMRRLQDEIRAVVGGGRVTEDHVDKLPYLRA  12178

12179  ALKEALRLHAPLPLLVARETVADTEIMGYHVAARTRVVINGWAIGRDTAVWGETAEEFMP  12358

12359  ERFLAGGNGGGAAAADYKVQGFEMLPFGGGRRGCPGVTFGMATVEMAVASLLYHFDWEAA  12538

12539  AADGKGGREGTPLLDMSETSGISMGLKHGLPLVAKPRFP  12655

 

>CYP71T8   revised by ESTs CT848324.1, CK038296.1

869 MSSYVVVAAALLVFVVVVVAAIKNLGKGKLPPSPPSLPFVGHLHLVGELPHRSLDALHRR 690

689 YGSDGGLMFLRLGRAGALVVSTAAAAADLYRGHDLAFASRPPSHSAERLFYGGRNMSFAP 510

509 LGDAWRRTKKLAVAHLLSPRRA

AALAAPARAAEAAALVARARRAAEAARAVQLRELLYAYTN

GVITRVAAGGSGATAERFRKMMADTSELLAGFQWVDRLPEAAGWAARKLTGLNK

KLDDMADESDRFLGEILAAHDDEKAEGEEEDFVDVLLRLRRQGAAAAGGLELAEDNVKAIIK (0)

DIMGAATDTSFVTLEWIMTELIRNTQVMSKLQNEIIQVT

GSKPTVTEEDLTKLDYLKAVIKEVLRLHPPAPLLIPHHSTMPTTIQGYHIPAKTIAFINV

WAIGRDPAAWDTPDEFRPERFMGSAVDFRGNDYKFIPFGAGRRLCPGIILALPGLEMVIA

SLLYHFDWELPDGMDVQDLDMAEAPGLTTPPMNPVWLIPRCRTI*

 

>CYP71T10P      

       DVVVLLVLDIIVA

33536  LMYLVLLPDVNRSNRPERWEDNDGWQRLPP*PRRLPLLRYLHLLSAPLHQAFHPLPRHMA  33357

33356  WCYYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY*S*ECRICVLFFRCI  33177

33176  REEEVAVLVKHVRHPCR  33126

 

>CYP71U3  

98325 MDELSIENHSPISMDELSFGSLCMVAMATLALALALMVMGAHRRGGEKGATTGAKNLPP 98149

98148 GPWNLPVIGSLHHLLGASPPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEVL 97978

97977 KARDPAFADRARSTTVDAVSFGGKGVIFAPYGEHWRHARRVCLAELLSARQVRRLESIRQ 97798

97797 EEVSRLVDSIIAGSSNAAAVDMTRALAALTNDVIARAVFGGKCARQEEYRRELGVLTTLV 97618

97617 AGYSMVDLFPSSRVVRWLSRRTERRLRRSHAEMARIVGSIIEERKEKKGSDAGVGAKDED 97438

97437 DDLLGVLLRLQEEDGLTSPLTAEVIAALV 97351

94360 XDIFGAATDTTASTLEWIMVELMRNPRAMDKAQQEVRNTLGHEKGKLIGIDISELHYLCMV 94181

94180 IKETLRLHPASALILRQSRENCRVMGYDIPQATPVLINTFAVARDPKYWDNAEEFKPE 94007

94006 RFENSGADIRTSIAHLGFIPFGAGCRQCPGALLATTTLELTLANLLYHFDWALPDGVSPK 93827

93826 SLDMSEVMGITLHRRSSLHLHTTLTRSGFFSHSGR 93722

 

>CYP71V1P stops also in AACV01007891.1, but not indica

stops in EST CI222916

87309 MDDYFFLQSLLLCVAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHR 87136

87135 AMRDLAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGW 86962

86961 ADILFSPSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPVNLS 86782

86781 VLFHSTTNDIVARAAFGRKRKSAPEFMAAIKAGVGLSSGFKIPDLFPTWTTALAAVTGMK 86602

86601 RSLRGIHKTVDAILQEIIDERRCVRGDKINNGGAADDQNADENLVDVLIALQEKGGF 86431 (1)

86339 GKSVTTPWVIVTHMICTLDVQDMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFH 86157

86156 RKAVVTEADLQASNLRYLKLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVN 85977

85976 VWAIGRDPK Y*Y*E DAEEFKPEQFDDDAIDFMGGSYEFIPFGSGRRMCPGFNYGLASMEL 85797

85796 VLVAMLYHFDWSLLVGVKEVDMEEAPGLGVRRRSPLLLCATPFVPAAVSADY* 85638

 

>CYP71V2  

3269  MDELFYQSLLLSVAAVTVLQLLKLLLVRHRRPRTPPGPWRLPVIGSMHHLVNVLPHRKLR  3448

3449  ELAAVHGPLMMLQLGET  3499

3501  PLVVATSKETARAVLKTHDTNFATRPRLLAGEIVGYEWVDILFSPSGDYWRKLRQLC  3671

3672  AAEILSPKRVLSFRHIREDE  3731

4000  VNLSVMFHSVTNSIVSRAAFGKKRKNAAEFLAAIKSGVGLASGFNIPDLFPTWTGILATV  4179

4180  TGMKRSLRAIYTTVDGILEEIIAERKGIRDEKISGGAENVDENLVDVLIGLQGKGGFGFH  4359

4360  LDNSKIKAIIL (0) 4392

4491  DMFAGGTGTSASAMEWGMSELMRNPSVMKKLQAEIREVLRGKATVTEADMQAGNLR  4658

4659  YLKMVIREALRLHPPAPLLVPRESIDVCELDGYTIPAKSRVIINAWAIGRDPKYWDNPEE  4838

4839  FRPERFEDGTLDFTGSNYEFIPFGSGRRMCPGFNYGLASMELMFTGLLYHFDWSLPEGVN  5018

5019  EVDMAEAPGLGVRRRSPLMLCATPFVPVVSAN*  5117

 

>CYP71V3  

MAWLDDVLSLCNNNTRMCNALVLSVVVVSFLQLLKHVLLTPSRLP

64951 LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARDILK 64772

64771 THDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHIRED 64592

64591 EVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIMASG 64412

64411 FYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNLVDVLLSL 64232

64231 KDKGDFGFPITRDTIKAIVL 64172

63964 DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 63785

63784 VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNSWAISRDPRYWEDAEEFKPE 63605

63604 RFAEGGIDFYGSNYEYTQFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEVDM 63425

63424 TEAPGLGVRRKTPLLLCAAPYVASHIYA* 63338

 

>CYP71V4

6936  MDELLYRALLLSVLAVALLQIIKAFLIIIRAKPAAPPLPPGPWRLPVIGSMHHLAGKLPH  7115

7116  RALRDLAAAHGPLMMLRLGETPLVVASSREMAREVLRTHDANFATRPRLLAGEVVLYGGA  7295

7296  DILFSPSGEYWRRLRQLCAAEVLGPKRVLSFRHIREQE (0)  7409

8349  MESQVEEIRAAGPSTPVDLTAMFSFLVISNVSRASFGSKHRNAKKFLSAVKTGVTLASGF  8528

8529  KIPDLFPTWRKVLAAVTGMRRALEDIHRVVDSTLEEVIEERRSAREDKARCGMVGTEENL  8708

8709  VDVLIGLHEQGGCLSRNSIKSVIFDMFTAGTGTLSSTLGWGMSELMRSPMVMSKLQGEIR  8888

8889  EAFYGKATVGEEDIQASRLTYLGLFIKETLRLHPPVPLLVPRESIDTCEIKGYMIPARSR  9068

9069  IIVNAWAIGRDPRYWDDAEEFKPKRFEKNMVDFTGSCYEYLPFGAGRRMCPGVAYGIPIL  9248

9249  EMALVQLLYHFDWSLPKGVVDVDMEESSGLGARRKTPLLLCATPFVVPVL*  9401

 

>CYP71V5

3420  MDGLLYQALLLSALAVAVLQIVKLAVVNRGKKQAAAAAPTPPGPWRLPVIGSMHHLAGKL  3599

3600  AHRALRDLAAVHGPLMMLQLGETPLVVVSSREVAREVLRTHDANFATRPRLLAGEVVLYG  3779

3780  GADILFSPSGEYWRKLRQLCAAEVLGPKRVLSFRHIREQE (0)  3899

4362  MASRVERIRAVGPSVPVDVSALFYDMAISIVSCASFGKKQRNADEYLSAIKTGISLASGF  4541

4542  KIPDLFPTWRTVLAAVTGMRRALENVHRIVDSTLEEVIEERRGAARECKGRLDMEDNEEN  4721

4722  LVDVLIKLHEQGGHLSRNSIKSVIFDMFTAGTGTLASSLNWGMSELMRNPRVMTKLQGEI  4901

4902  REAFHGKATVGEGDIQVSNLPYLRLFIKETLRLHPPVPLLVPRESIDMCEVNGYTIPARS  5081

5082  RIVVNAWAIGRDPKYWDDPEEFKPERFEGNKVDFAGTSYEYLPFGAGRRICPGITYALPV  5261

5262  LEIALVQLLYHFNWSLPKGVTEVDMEEEPGLGARRMTPLLLFATPFVVPLL*  5417

 

>CYP71W1  

537   MELTTLLLLALISFFFLVELIARYASPSGRESALRLPPGPSQLPLIGSLHHLLLSRYGDL  716

717   PHRAMRELSLTYGPLMLLRLGAVPTLVVSSAEAAAEVMRAHDAAFAGRHLSATIDILSCG  896

897   GKDIIFGPYTERWRELRKVCALELFNHRRVLSFRPVREDEVGRLLRSVSAASAEGGAACF  1076

1077  NLSERICRMTNDSVVRAAFGARCDHRDEFLHELDKAVRLTGGINLADLYPSSRLVRRLSA  1256

1257  ATRDMARCQRNIYRIAESIIRDRDGAPPPERDEEDLLSVLLRLQRSGGLKFALTTEIIST  1436

1437  VIF  1445

1591  DIFSAGSETSSTTLDWTMSELMKNPRILRKAQSEVRETFKGQDKLTEDDVAKLSYLQLVI  1770

1771  KETLRLHPPAPLLIPRECRETCQVMGYDVPKGTKVFVNVWKIGREGEYWGDGEIFRPERF  1950

1951  ENSTVDFRGADFEFIPFGAGRRMCPGIALGLANMELALASLLYHFDWELPDGIKSEELDM  2130

2131  TEVFGITVRRKSKLWLHAIPRVPYISTY*  2217

 

>CYP71W2P

2543 ARRVQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCA 2364

2363 RRDEFLHVQARGLRQARGRVQLGRPVPIVVASELAQRRAAVGRPSVAAGAFARCGRPAET 2184

2183 FFNMDNLRTHDTYRKKNHSGNSQHCTAFSALSFSELQLKMTIWQSHHYKLPINLREIFS 

2006 SAGSETLNDTLVGNI*ANEKYPQVMQKAQTEVREKFRG*DKLIKDDMNRLSYLHL 1846

1845 VIQETLRLH

 

>CYP71W3  

80150 MEVSLPLLIGVVLAFLLLFVLVNIKNSCRSWWPPPEKEKKKLRLPPGPWQLPLVGSLHHV 80329

80330 LLSRHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYL 80503

80504 TPTLAVLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHGREDEAARLVRSVAA 80683

80684 ECAARGGAAVVNVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLY 80863

80864 PSSWLARRLSGAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPLTT 81043

81044 DLITNVVL (0) 81067

81865 DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEMMDKLSYLRLVI 82044

82045 RETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPEVFKPERF 82224

82225 ENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRDRNDEIDL 82404

82405 SETFGITAKRKSKLMVYATQRIPCLG 82482

 

>CYP71W4  

103769 MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERR

103880 LRLPPGPWRLPLVGSLHHVLLSRHGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAE 104056

104057 AAREVLKTHDACFASRHMTPTLAVFTRGGRDILF SPYGDLWRQLRRICVLELFSARRVQS 104236

104237 LRHVREDEAARLVRAVAEECAIGGGGGAVVPIGDMMSRMVNDSVVRSAIGGRCARRDEFL 104416

104417 RELEVSVRLTGGFNLADLYPSSSLARWLSGALRETEQCNRRVRAIMDDIIRERAAGKDDG 104596

104597 DGEDDLLGVLLRLQKNGGVQCPLTTDMIATVIM (0) 104695

105190 EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLS 105351

105352 YLHLVIRETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEI 105531

105532 FKPERFNANLVDFKGNDFEYIPFGSGRRVCPGITLGLTSMELVLASLLYHFDWELPGGKR 105711

105712 CEEIDMSEAFGITVRRKSKLVLHATPRVPCLH* 105810

 

>CYP71W5P       

51119 LRRICVLELFSAHRV*SLHHVREEEAAPLVRVVADIRSPLGP 50994

 

>CYP71W6P

64520 LRRICMLELFSAHRV*SLHHVREEEAARLVRVVA 64419

 

>CYP71X1P

42530 MDHVLACVGILVAFTPLFLLAVLPLKLTNGGDGVKLPPGPWRLPVIGSMHHLMGESLVHRAMAD 42721

42722 LARRLDAPLMYLKLGEVPVVLASSPCAAREIMRAHDVAFASRPLSPTVRRMR 42877

42878 PPPPRRRQLRKICVVELLSARRVRTFRRVREEEVARLVGALVCLAHVA 43021 gap

      AMIGARFERRDEFLE

      missing mid region

43862 DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL 44035

44036 VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 44161 frameshift

44161 AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCSGLAFAEAIIDLLFS 44337

44338 TLLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPILRVPQTQTSSALLF* 44502

 

>CYP71X2  

38238 MYDAVACVVAVVVVVVFAMLWVKLARSGDGGGGGSGGVRLPPGPWRLPVIGSLHHVVGDRLLHRSMA 38438

38439 RIARRLGDAPLVYLQLGEVPVVVASSPGAAREVTRTHDLAFADRALNPTARRLRPGGAGV 38618

38619 ALAPYGALWRQLRKICVVELLSARRVRSFRRVREEEAGRLVGALAAAAASPGEEA 38783

38784 AVNFTERIAEAVSDAALRAMIGDRFERRDEFLQELTEQMKLLGGFSLDDLFPSSWLASAI 38963

38964 GGRARRAEANSRKLYELMDCAIRQHQQQRAEAAVVDGGAGVEDDKNQDLIDVLLNIQKQG 39143

39144 ELETPLTMEQIKAVIL 39191 (0)

39428 DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 39601

39602 IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNAWAIGRDPKYWDDAEEFRPE 39781

39782 RFEHSTVDFKGIDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMAASE 39958

39959 LDMTEEMGITVRRKNDLHLRPHPPCVVRSNFRSFVERERERHFV* 40093

 

>CYP71X3  

33613 MEQVSCFAAAAAAVLVVLSLARMLLAPRREWD

33709 GLNLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADA 33888

33889 AREIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSF 34068

34069 HGVREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERRE 34221

34222 DFLEVLPEIVKLASGFSLDDLFPSS 34296 check joint

      GSPAPSAARGEAVNRASYELVDSAFRQRQQQKEAMAAPPPDIAKEE

      EDDLMDELIRIHKEGSLEVPLTAGNLKAVIL 34528 (0)

34777 ELFCAGSETSSNAIQWAMSELVRNPKVMEKAQNEVRSILKGKPTVTEADMVDLTY 34941

34942 VKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFIKSWAIMRDPKHWDDAETF 35121

35122 KPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILTMLLYHFNWELPNGAA 35298

35299 PEELDMTEDMGLTIRRKNDLYLLPTLRVPLTA* 35397

 

>CYP71X4  

27391 MEQVSCFAAAAAVVVVVLLLARMLLAPRGEWDGLNLPPSPPRLPFIGSFHLLRRSPLVHRALADVARQL 27597

27598 GSPPLMYMRIGELPAIVVSSADAAREVMKTHDIKFASRPWPPTIRKLRAQGKGIFFEPYG 27777

27778 ALWRQLRKICIVKLLSVRRVSSFHGVREEEAGRLVAAVAATPPGQAVNLTE 27930

27931 RIEVVIADTTMRPMIGERFERREDFLELLPEIVKIASGFSLDDLFPSSWLACAIGGSQRR 28110

28111 GEASHRTSYELVDSAFRQRQQQREAMAASPPDIAKEEEDDLMDELIRIHKEGSLEVPLTA 28290

28291 GNLKAVIL 28314 (0)

28577 DLFGAGSETSSDALQWAMSELMRNPRVMEKAQNEVQSILKGKPSVTEADVANLKY 28741

28742 LKMIVKETHRLHPVLPLLIPRECQQTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 28921

28922 KPERFEDGEIDLKGTNYEFTPFGAGRRICPGLALAQASIEFMLATLLYHFDWELPNRAA 29098

29099 PEELDMTEEMGITIRRKKDLYLLPTLRVPLTA* 29197

 

>CYP71X5  

16507 MEKVAWCACFLLLALMVVRLTAKRRGDNGAERLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD 16710

16711 APLMSLRLGEVPVVVASSADAAREIMRTHDVAFATRPWNPTTRRLRCDGEGVVFATYGAL 16890

16891 WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERI 17043

17044 TAVITDATMRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAE 17223

17224 ANHRRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGN 17403

17404 IKAIIL 17421 (0)

17795 DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKYLKL 17968

17969 VIKETLRLHPVLPLLLPRECREACNVIGYDVPKYTTVFINV*AINRDPKYWDMAEMFKPE 18148

18149 RFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYYFDWELPSGMSPEE 18325

18326 LDMTEDMGLSVRRKNDLYLHPTVCVPL* 18409

 

>CYP71X6P indica also has the stop codon, but not the frameshift, EST CI150239 has both

11020 MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLA 11214

11215 RRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 11394

11395 YGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNV 11547

11548 SERIAALVSDAAVRTIIGDRFERRDEFLEGLAEGIKITSGFSLGDLFPSSRLASFIGGTT 11727

11728 RRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDIVDVLLRIQKEGSLQVPLT 11907

11908 MGNIKAVVL 11934 (0)

12556 DLFGAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKL 12729

12730 IIKETLRLHPVVPLLLPRE 12786 frameshift

      CQETCKVMDYDVPIGTIVLVNMWVIGRDPKYWEDAKTFRPERFEDGHIDFKGMNF 12955

12956 EYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFPDGISPAKMDMMEVMGSTVRKKN 13135

13136 DLYLVPNAHVPVAP* 13180

 

>CYP71X7  

7359 MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 7538

7539 LVHRTMAGLARGLGDAPLLSLRLGEVPIVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 7718

7719 MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAA 7883

7884 TRRPGEAAVNVGERLTVLITDIAVRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPS 8063

8064 SRLASFVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRI 8243

8244 QKEGGLEVPLTMGVIKGVIR 8303 (0)

8551 DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKY 8715

8716 LKLVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETF 8895

8896 IPERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVA 9072

9073 PSNLDMEEEMGITIRRKNDLYLVPKVRVPL* 9165

 

>CYP71X8  

95316 MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLA95101

95100 RRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWTSTIRVLMSDGVGLVFAP 94921

94920 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQPVNV 94768

94767 SERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVGGTT 94588

94587 RRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGNIKA 94408

94407 VVL 94399 (0)

93945 ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKL 93772

93771 IIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTVLVNVWAIGRDPKYWEDAETFIPE 93592

93591 RFEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTK 93415

93414 VDMMEELGATIRRKNDLYLIPTVRVPLSTVL* 93319

 

>CYP71X9P 101503 RVVASSTDAACREFTKTHDVKFATRPWSSTVRVLMADGLG 101393

 

>CYP71X10

109813 MAMVQYVTGYLCLLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRL

       PVIGNLHQVAMGGPLVHRTMADLA 109595

109594 RRHDAPLMSLRLGELRVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 109415

109414 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVCRLVAAVAAAQPGEAVNV 109262

109261 SERITALISDSAVRTIMGDRFEKRDEFLEGLAEGDRIASGFSLGDLFPSSRLASFVGGTT 109082

109081 RRAEANHRKNFGLIECALRQHEERRAAGAVDDDEDLVDVLLRVQKEGSLQVPLTMGNIKAVIL 108893 (0)

107479 ELFGAGSETSASTLHWAMTELIMNPKVMLKAQDELSNVIKGKQTISEDDLVELRYLKL 107306

107305 VIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTMLVNVWAIGRDPKYWEDAETFRPE 107126

107125 RFEDGHIDFKGTDFEFIPFGAGRRMCPGMAFAEAIMELVLASLLYHFDWELPDGISPTK 106949

106948 VDMMEELGATIRKKNDLYLVPTVRVPMSTAL* 106853

 

>CYP71X11

123850 MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPLVHRALAD 124050

124051 LARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTADGEGLVF 124230

124231 APYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV 124389

124390 NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 124569

124570 AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 124749

124750 TMGIIKAVIL 124779 (0)

124917 DLFSAGSETSATTIQW AMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLTDLNYLKL 125090

125091 IIKETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVLVNAWAIGRDPKYWDDPEEFKPE 125270

125271 RFEDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSE 125447

125448 LDMTEEMGITVRRKNDLYLHAVVRVPLHATTP* 125546

 

>CYP71X12

127153 MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPHVHRAMA 127350

127351 DLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMADGEGLA 127530

127531 FARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAV 127689

127690 VNVSERAAVLVTDTX 127731

127734 VRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFPSSRLASLVSGTARRAAASHRKMFE 127913

127914 LMDCAIRHHQERKAAMDADEDILDVLLRIQKEGGHDAPLTMGDVKDTIL 128060 (0)

128189 DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKL 128362

128363 VIKETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPD 128542

128543 RCENNKYNFRGTDFEYIPFGSRRKICPGPAFTHAILELALAALLYHFDWELPCGVAPGE 128719

128720 VDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT* 128833

 

>CYP71X13P

146254 MDQVACWSICAFLALLLLVRIGGKRGRGGDGARLRQPPPGPWRLPVIGNLHQLMLRGP 146427

146428 LVHRTMADLARGLDDAPLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRL 146607

146608 RPHREGVVFAPYGAMWRQLRKVCIVEMLSARRVRSFRRVREEEAANLAAAVAASLSSPPA 146787

146788 RRDAVNVSALVAAAVADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFP 146952

146953 SSRIAAAVGGMTRRAEASHRKGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLL 147126

147127 RIQKEGALDMPLTMDNIKAVI 147189

147557 DIFGAGSDTSSNIIQW

147612 RNTLQGKHPVKEDDLVNIKYLKLIIKETLRLHPVVPLLLPRECLHACKVMGYDVPKGTTV 147791

147792 FVNIWAINRDPKHWDDPEVFKPERFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVE 147971

147972 LMLATLLYHFKWELLEGVAPNELDMTEEIGINVGRKNPLWLCPIVRVPLQ* 148124

 

>CYP71X14

142591 MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTM 142770

142771 ADLARGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREG 142950

142951 VVFAPYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGA 143130

143131 APAVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAA 143310

143311 AVGGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKED 143490

143491 NLDVPLTTGNIKAVLL (0)

143668 DIFGAGSDTSSHMVQWVLSELMRNPEAMHKAQIELRSTLQGKQMVSEDDLASLTYLKLVIK 143850

143851 ETLRLHPVVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFH 144030

144031 SGKIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMT 144210

144211 EEMGITVGRKNALYLHPIVRVPLEQATMS 144297

 

>CYP71Y1P AACV01014656.1 also has this boundary.  ESTs CI595924.1, CI597178.1 also have it.

139036 MEDATHGYVYVGLALVSLFVVLLARRRRSPPPAAHGDGGLRLPPGPWTLPIIGSLHHLVGQIP 139224

139225 HRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREVTKTHDTAFAMRPLSATLRVLTN 139398

139399 GGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIREEEVAALLRAVAVA 139554

139555 AGTVEMRAALSALVSDITARTVFDNRCKDRGEF

       LVLLERTIEFAGGFNPADLWPS 139719 (?) bad exon boundary

140222 SRLAGRLSSVVRRAEECRNSVYKILDGIIQEHQERTSAGGEDLVDVLLRIQKEGG 140386

140387 LQFPLAMDDIKSIIF 140428 (0)

       DIFSAGSETSATTLAWAMAELIRNPTAMHKVMAEVRRAFAAAGAVSEDALGE 140655

140656 LRYLQLVIRETLRLHPPLPLLLPRECREPCRVLGYDVTRGTQVLVNAWAIGLDERYWPGG 140835

140836 SPEEFRPERFEDGEATAAVDFRGTDFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFD 141015

141016 WEVPGLADPAKLDMTEAFGITARRKADLHLRPCLLVSVPGV* 141141

 

>CYP71Y2P

136961 VSEDALGELRYLQLVIRETLRLHPPLPLLLPRECTIGR 137074

137075 DERYWPGGSPEEFRPERFDDGEATAAVDFRGADFELLPFGGGRR