Rice P450 sequences from japonica strain. Some supporting accessions
like ESTs may be from indica. Use this file in
conjunction with
Rice_annotation.P450s.xls
David Nelson
Sept. 18, 2007
412 rice sequences from japonica, plus three from indica
and two duplicates, one with alternative exons. 417 total sequences. 317 genes and 95 pseudogenes.
>CYP51G1
MMDLADPNHRLIAGAALLVATLAFIKLLLSSAGGGKKRLPPTIP
AAPLVGGLLRFMRGPIPMIREEYARLGSVFTVPILSRKITFLIGPEVSAHFFKGNEAE
MSQQEVYKFNVPTFGPGVVFDVDYSVRQEQFRFFTEALRANKLRSYVDQMVVEAEEYF
SKWGESGTVDLKYELEHLIILTASRCLLGREVREKLFDDVSSLFHDLDNGMQPVSVIF
PYLPIPAHRRRDRARQRLKEIFATIIKSRKASGRAEEDMLQCFIDSKYKSGRSTTEGE
ITGLLIAALFAGQHTSSITSTWTGAYMLRFKQYFAAAEEEQKEVMKRHGDKIDHDILA
EMDVLYRCIKEALRLHPPLIMLLRQSHNDFSVTTKDGKEFDIPKGHIVATSPAFANRL
PHIFKNPDSYDPDRYAPGREEDKAAGAFSYISFGGGRHGCLGEPFAYLQIKAIWTHLL
RNFEFELVSPFPETNWKAMVVGIKDEVMVNFKRRKLVVDN
>CYP51G3
MDLTTGAIWLFLAQLFVAATMLSKIATRERTRTTGTKFSRPPPPPLARGAPLVGVLPSLLANGPVEFIRH
182
183
HYEKMGSVFTVSLLQQKVTFLVGSEASSHFYKGLDSEISQDEVSQFTIPTFGPGVAFDVD 362
363 YATRHEQFRFFGDIMKPAKLRTYVDLMVAEVE (0)
GYFARWGQSGTVNMKQEFEQLVTLIASR 542
543
CLLGEEVRDKMFDEVSTLLRELNDGMRLVTILFPRLPIPAHRRRDRARARLGEIFSDIVR 722
723
SRRGSSGGGGGGGGARHDDMLQCLIDARYKDGRATTETEVAGMLVAALFAGQHTSSSTST 902
903 WAGARLLTNPDHLRAAVEEQARLLRRHGGDRVDHAALAAMDTLQRCVKETLRLHPPALML
1082
1083LRHARRSFVVRGGSGEREYEVPEGHTVASPLLLHNALPRVYRDPGEFDPGRFGAGRE
1253
1254EGAGGLAYTAFGGGRHACVGEAFAYMQIKVIWSHLLRNFELQLVSPFPETDWTVVMPGPK
1433
1434GKVMVTYNRRKLT* 1475
>CYP51G4P
54048 VRFLHRKVTFLVGPEESSHFFTGLDSEISQDEVSRFIIPTFGS*VAFD
54191 (intron no boundaries)
54642 AGYATQ*EQFRFFGDTMKPTKLRSYVSHMVYEVE 54743 (0)
missing 20 aa
56119
VVTLIATRCLFGEVRSKMLGEVPTLLRELNDSMRLITIVFPYLPIPAHRRRDSARARL 56292
56293 GEIFVEIVRSRRSSPGGGGAGHDDMLQCLIDARYKDGRATTEG 56421
frameshift
56424 AEVAGMLVSALFAGQYTSSSTSTWTGAR 56507 frameshift
56510
LLTHPEHLRAAVREQEELVLVRHRHGGDVVDHDALQRMGHLHRCVKETLRLHPPSLMLLRHAR 56698
56699
RSFVVRARGSGDAEYEVPAGHTVASPMVIHNALPHVYEDAGSFDPGRFGPAREEYRAYAA 56878
56879 DHAYTVFGGGRHACVGE 56929 frameshift
56932 VIKVIWSHLLRNFELELMLPFPETDWNDVMPGPKGKVMVLPPSHNKIISVIVGFSVQL*
57108
>CYP51H1
55155
MDHVTSSTIARGAMSWVAATVALLLTTAVILTALQKRKISSPAAAAPPVVRGAGLVRFA 55331
55332
RAMARDGPLEAIREQQAKLGSVFTAIAPFGLFKVTFLIGPEVSSHFYLAPESEMGQGSIY 55511
55512 RFTVPLFGPEVGYAVDPDTRAEQMRLFWDVLKPRSIEARVGAMAEEVQ
57812 NYFSRWGEQGTVDLKKELEQVLMLIASRCLLGREVRESMVDEVYELFRDLDNGLHLIS
57985
57986
TMLPYLPTPAHRRRDRARQRLGEIFTEVIRSRRNSGTADNGDDVLQRLIDGRYKD 58150
58151
ERDLTDVEVVGLLVALVFAGKHSSSSVSTWTGINLLSHPNHLVAVIAEQDRLMASRARTD 58330
58331
DDHDRVNYDTVQEMTTLHRCIKEALRLHPPAVAMFRQARKHFTVQTKEGKEYTIPGGHTV 58510
58511
MSTILVNHHMPNVYKDPHVFDPSRFARGRGEDKAAGPFSFLAFGAGRHSCAGESFAYTQI 58690
58691 KVIWSHLLRNFELKMVSPFPETSWRMVTPEPKGTVMISYRRRNLTCK*
58834
>CYP51H2P
52199 MDHLTSS (frameshift)
TISDTMWLTTVALLLTTVVIWTALQKRKRGEACPAAVAPPPIVQGTALVRFLRAMARDG
52375
52376 PLEVIREQLAKLGSVFMASAPLGLFKVTFLVGAEL 52480
>CYP51H3 also frameshifted in AACV01016110.1 may have a non-traditional C-term
not frameshifted in indica
12878
MDQLTSSTVFWLTTAVAFLLINTVILRALQKRKSSPAAAAAAAPPPVVQGVGLVRFVRAMAR 13063
13064 DGPLEAIREQQAKLGSVFTASAPLGLFKVTFLIGSEVSSHFYVASYSEISMGRLYEFTVP
13243
13244 IFGPGVLYGVDLETRKEQIRFNWDILKPRSLKASVGAMAEEVE 13372
13617 NYFSRWGDQGTVDLKHELEQVLMLTASRCLLGKELRER 13730
13727
ESVPGKLCELFGELDNGLHLISGLLPYLPIPAHRRRDRARQRLGEIITEVIRLRRNSSR 13903
13904 GAAGTDENNDDMLQCLINSRYKDGCAMTDAEIAGLVVALMFAGKHTSSGVSIWTGVHLLS
14083
14084
NPNHLAAVVAEQDRLMASCPGRTDDYHRLDYDTVQEMRSLHCCVKEALRLHPPVAAVRQA 14263
14264 YKHFTVQTKEDKEYTIPGGHMVVSTILVNHYLPHIYK
DPHVFDPQRFAPGREEDKVAGRFSFLSFSAGRHACAGESFSYTQIKVLWSYLLSN
14540
14541 FEIKMVSPFPET 14576 (frameshift)
QWSTVIPEPKGKVMVSYRRRTAPK* 14649 P450 conserved end
MEHGYSRA* frameshifted end
>CYP51H4
78968
MDHIFSTNAWLTIALVFIITLAAKVVRSSVTLPAEKTSKPRPPPEAKGAPLV 79123
79124
GIIPAVLRRGLQAVIREQHRALGSVFTLRSLGLAVTFLVGPECSDHFFHAPELEIAID 79297
79298 GLYEVTVPIFGKEVGYDIDLDTRNEQHRFFAKMLRPAKLRGHVLPMVCEIEX
79450
79551
YFGKWGECGVVDLMQEVDHVLMLIASRCLLGKEVRENMFDEVASLFHELMGGMHLISMFF 79730
79731
PYLPTPGHRRRDKARAKLGEIFSQIVKTRKMSGRVEDDMLQDLIDSTYGDGRATT 79895
79896
DTEVTGLLVALLFAGHHTSSTVAVWTALRLLTHPEHLRAVRAEQERLVAAAEQQRSHHGG 80075
80076 GGGGGIDYGVLLQMDVLHRCIKEALRLHPVTPMILRRARRGFTVRDKEGGEYSVPAGRLL
80255
80256
ASPLVVNTLLPNIYKDPHVFDPDRFAAGRAEDKAVAGARDLAYLSFGAGKHACMGEGYAY 80435
80436 QQIKVILSHLVSNFELKLESPFPETEDMLSMRPKGKAIVSYKRRTLS
80576
>CYP51H5
78158 MAVLFVATKMIQQRPRTLCLYEKENKEEELLLPPVMSVVSVLTAYLPTLIAKGLPAVIHDLH
77972
SRLVTLLVTAHFFQASESEIRQSNIYKVTVPVFGRGVLYDVDLATRSRQISFCTDS 77805
77804 IKPINLRGHVDSMVHEVE 77751 (0)
76666
GYFAQWGEDGVVDIKYEMGNLILLIANRCLLGKQFGESKLEQVSTLLHELFDNGFHLISLF 76484
76483 FPYLPTPQHRRRDKARAMLGEMIHEAVRSRRNSGVAEDDVLQKFLDSKYINGRCMTENEI
76304
76303
AGLLICMMFAAQHTSSSTSTWTGACLLSHGHRSYLAAAIQEQKRIIQQHGDRINWGILLQ 76124
76123
MTTLTHCIKEALRLHPPANLLIRHASKSFSVQTRQGHRYQIPKGHTLATCTTVGNRLPYI 75944
75943
YKDPNVYDPSRFGPGREEDKVGGKFSYTPFSAGRHVCLGEDFAYMQIKVIWSHLLRNFDL 75764
75763 ELISPFPEEEWEKFIPGPKGKVMVTYKRRRLL* 75665
>CYP51H6
70310
MELTSSSAMWLAMAILAITAALTKIALGGGRRRCLSESSDLTCKTPPPPPVVNCIALLGL 70489
70490
LPALFRGDVPATMQQLYAKFGSVFTVSVAGLLKATFLVGPEVSAHFFQGLESEVSHGDLF 70669
70670 EFTVPMFGKEVGHGVDNATRIEQGRFFAEALKPVRLRIHVDPMVQEVE
70813 (0?)
71263 DYFAKWGQHGTVDLKHELEQLLLLISGRCLLGKEVMGTKFDEVCNLFRDIEGGVNLMSV
71439
71440
FFPYTPLIPSNRRRDMARERLHAIFSDIVRSRKQQQGDQEEVNDKDVLQSFIDSR 71604 (2?)
71910
YKADGRATTEAEVAGLITGVLFAAKHTSTHTSVWTGARLLTHEKFLAAAVDEQDQIVR 72083
72084 KHGIINGRIVTDHYGFLMEMHMLHICIKETLRLHPPAPMIVRTALRQFTVRTREGHEYCV
72263
72264
PAGHTMASPIVISNRVPYIYKDAHLYDPDRFGPRREEDKVGGKFSYTSFGGGRNSCVGEN 72443
72444 YAYMQIKAIWSHLLRNFELKLLSPFPKTDWSKLVPEPQGKVMVSYKRRQL
72593
>CYP51H7
80009
MAIILAITAAVTKIARGGRRRSATDPTCKMPPPPPVVNSIALLRLLPTLFRSGLPA 80176
80177 ILHELYTKFGSVFTINLAGLLKMTFLVGPEVSAHFFQGLESEISHGNLLEFTVPMFGKEI
80356
80357 AHGVDSATRNEQARFFVDALKPARLRIHVDPMVQEVE 80467 (0?)
84741
DYFAKWGQHGTVDLRRELEQLLLLISGRCLLGKEVMGTMFDEVCNLFRDIEGGVNLMSV 84917
84918
FFPYTPLIPSNRRRDMARKRLHAIFSDIVRSRKQREGDNVDKDVLQSLIDSRYKADGRAT 85097
85098
TEA*VAALMICLLFAAKHTSAYTSVWTGARLLSHERFLTAAVDEQDKIAREHSNINGGGR 85277
85278
ITDDRYGSLMEMRTLHSCIKETLRLHPPVPMLVRTAHKQFTVRTREGHEYAVPAGHTIAS 85457
85458
PIVISNQVPYIYKDGHLYDPDRFGPAGREEDKVGGKFSYASFGGGRTGCVGEGYAYMQIK 85637
85638 AIWSHLLRNFELRLLSPLPKSDFTKFVPEPHGELMVSYKRRQL 85766
>CYP51H8
122577
MDLVSISIYMWAMAALCTIITAMVTTKLARVRRPITLNPKSKRPLPPVVNVIA 122735
122736
LLEHLPRLCTKGVIPVYKGSVFTVSLFGLKATFLVGPEVSAHFYQGMDSEISQGDLYEFT 122915
122916 VPLFGKGVGFDIDNATRTEHLRFFIDAIKTSKLRNHVNSMVQEVE
123050 (0?)
123296 DYFAKWGENGIVDIKHEFEKLLMLISGHCLLGKEVRDNMFDEVFSLF
123436
123437
HELDSGVGLGSVIFPYIPIPSHIRCDKAHAKLAKIFSKIVRSRRDSNRPAEQDVLQYLID 123616
123617
SKHRDGNSTTEQEVTGWIISMVFAGKHTSTNSTTWTGACLLTHDKFLTEALDEQKHMIQK 123796
123797
HGDHIDYNVLLDMDILHCCIKEALRMHPVAPIIYRKAQKSFVVRTREGDAYDIPEGHNLL 123976
123977
SPMIFNNRLPYIYKDPHMYDPDRFAPKREEDKVGGMFSYTSFGGGRHICIGEAYAYMQIK 124156
124157 VIWSHLLRNFELKLESPFPKTNWSKILLEPWGKVMVSYKRRRL 124285
>CYP51H9
50920
MDMAAAAAVWFSAIAAVLLAASTIAVVVVAKMTGKRNGGAAAAAAAAAEAELPL
51082 PPVVSGVSLIIPVITRGPMAVADELYVKLGSVFT (0)
GGFYSRPE 51261
51262
SEVHQGGTYRMTVPMFGRGVMYDVDVATRSEQIAVCFEALRPTKLRSSTVTMVRETE 51432 (0)
52114
EYFAKWGEQGTVDLKRELDLLILTIASRVLLGKEVRETMFADVVASFHELMDNSMHLI 52299
52300
SLCFPNLPIPRHRRRDTASARLKELFSRAIQLRRGSGRAEDDVLQRFLESRYRDGRAMSD 52479
52480 NEITGMLIALVVAGQHMSSSASTWTGAFLLRDPKHLAAAVDEQRRLIGDDRVDYDALT
52653
52654
TGMSTLHRCIKEALRMHPPAPALVRTVRRGFAVWTREGKEYRMPAGHSVVSYAAFNHRLG 52833
52834
YVYRDPDEYDPERFGPERKEDRVAGKFSFTAFGGGRHACLGEHYAFLKMKVIWSYLLRNF 53013
53014 ELELVSPFPEVELNNIMLGPRGEVMVRYKRRKLTST* 53124
>CYP71C12
50394 MAQMLDGLRHDEQASLHAPQEASTMPTMSCSD
50298 LLLAMMCPLILLLIIFRCYAYATRSGGMLSRVPSPPGRLPVIGHMH
50161
50160
LISSLPHKSLRDLATKHGPDLMLLHLGAVPTLVVSSARTAQAILRTHDRVFASRPYNTIA 49981
49980
DILLYGATDVAFSPYGDYWRQIKKIVTMNLLTIKKVHSYGQTRQQEVRLVMAKIVEEAAT 49801
49800
HMAIDLTELLSCYSNNMVCHAVSGKFFREEGRNQLFKELIEINSSLLGGFNLEDYFPSLA 49621
49620
RLPVVRRLLCAKAYHVKRRWDQLLDQLIDDHASKRRSSMLDNNDEESDFIDVLLSIQQEY 49441
49440
GLTKDNIKANLVVMFEAGTDTSYIELEYAMAELIQKPQLMAKLQAEVRGVVPKGQEIVTE 49261
49260 EQLGRMPYLKAVIKETLRLHPAAPLLVPHVSMVDCNVEGYTIPSGTRVIVNAWAIARDPS
49081
49080
YWENAEEFMPERFLSNTMAGYNGNNFNFLPFGTGRRICPGMNFAIAAIEVMLASLVY 48910
48909 RFNWKLPIDQAANGGIDMTETFGITIHLKEKLLLVPHLP* 48790
>CYP71C13P
54948 MAQMLGALLLFQDSLMSTMTRMSY
54876 SLLLPILCPLILLLLFRCYAYATRSGGMLDKLPSPPGRLPLIGHM 54742
54741
HLIGFFPHMSLRDLATDLMLLHLGTVPTLVVSSSRMAQVILRTHDRVFASR*QSAI 54574
54573
TDILF*GATDVAFSPYGDYWRQIKKIVTTNLLTIKKIRSYSQTRQQEVRLVMAKIVEEAA 54394
54393 THMA
IDLTELLSCYSNNMVCHAVSGMFFCEEGRNQLFKELI*INSSLLGGFNIEDYFPSL 54214
54213 ARLPVRRL LYAKAYDVKKRWDQLLDKLIDDHSSKHRSSLLDNNDVESDFIDVLLSIQQE
54037
54036
YGLTKDNIKANLAIMFEAGTDTSFIELEYSMAELM*KPQMIAKLQAEVRGVVSKGQDIVT 53857
53856
EEHLGRMPYLKAVIKETLRLHPAAPLLAPHVSVVDCNVEGYTIPSGTRVIVNAWAIARDP 53677
53676
SYWENAEEFMPERFLSNTMADYNGNNFNFLPFRTGRRICPGINFAITTIEIMLASLV 53506
53505 YRFDWKLFTSRIDMTETFGATIHLKEKLFLVPHLPQDN* 53389
>CYP71C14P also
frameshifted in AACV01017100.1, but not in indica
frameshift exist in 3 ESTs
CI183363.1, CI183362.1 and CI018604.1
so this is probably a real
pseudogene and not a seq error
58316 MAVMLVPIPLLLLHQHHNHEHEHPSPVAPQPTM
58217 ASYYTLLLALLCPLLLLLIKLCRAKTRDDELFDKLPSPPGRLPV 58086
58085
IGHLHLIGSLPYVSFRELAIKHGPDLMLLRLGTVPTLVVSSARAAQAILRTNDHVFASRT 57906
57905
YSAVTDILFYGSSDVAFSPYGEYWRQVKKIATTHLLTNKKVRSYSRARQQEVRLVMARIN 57726
57725 EAAVARTTVDLSELLNWFTNDIVCHAVSGKFFREEGQNQMFWELIQANSLLLGGFNLEDY
57546
57545
FPNLARVTTVRRLLCAKAHNVNKRWDQLLDKLIDDHATKRSSSVLDLDNEESDFIDVLLS 57366
57365
IQHEYGLTRDNVKAILVIMFEGGTDTAYIELEYAMAELIRKPQLMAKLQAEVRSVVPRGQ 57186
57185 EIVTEEQLGRMPY 57153 frameshift
57147 LKAVIKEMLRLHLAGPLLVPYLSIAECDIEGYTIPSGTRVFVNAWALSRDPSFWENAEEF
56968
56967
IPERFLNSIAPDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLVYRFDWEIPA 56797
56796 DQAAKGGIDMTEAFGLTVHRRRSSSLFLGSHKIK* 56692
>CYP71C15
68223
MELNNTEPLTASRAQAAAVFLLLPVALLLLLLRFARATTMAGDRNSELLLSKLPS 68387
68388 PPLRLPVIGHMHLVGSLPHVSLRDLAAKHGRDGLMLVHLGSVPTLVVSSPRAAEAVLRTH
68567
68568
DLAFASRPRAMVPDIITYGATDSCYGPYGDHFRKVRKAVTVHLLNSHKVQAYRPAREEEV 68747
68748
RLVIAKLRGAAAMAGAPVDMTELLHSFANDLICRAVSGKFFREEGRNKLFRELIDTNASL 68927
68928 LGGFNLEDYFPSLARTKLLSKVICVRAMRVRRRWDQLLDKLIDDHATRLVRRHDHDQQQD
69107
69108 SDFIDILLYHQEEYGFTRDNIKAILV 69185 (0)
69331
DMFEAGTDTSYLVLESAMVELMRKPHLLAKLKDEVRRVIPKGQEVVNEDNIVDMVYLKAV 69510
69511
IKETLRLHPPAPLYIPHLSREDCSISGYMIPTGIRVFVNAWALGRDAKFWDMPDEFLPER 69690
69691 FMDSNIDFKGHDFHYLPFGSGRRMCPGIHSATVTLEIMLANLMYCFNWKLPAGVKE
69858
69859 EDIDMTEVFGLTVHRKEKLFLVPQAA* 69939
>CYP71C16
78935
MELILQLEAKTAAQAVVTVFFFFLLPLALLFYFARAAISSRDSKTRELILSKLPSPPFK 79111
79112
LPVIGHMHLIGPLPYVSLRDLAAKHGRDGLMLVRLGSVPTLVVSSPRAAEAVLRTHDLAF 79291
79292 ASRPRSMVTDIIMYGALDSCFAPYSDHFRSVKKVVTVHLLNSKRVQAYRHVREEEVRLVM
79471
79472
ARLRGAAAAAAAVDLSQTLQFFANDLICRAVSGKFLCEQGRNKVFRDLMEANSNLLGGFN 79651
79652
LEAYFPGLARMPLISKLICARAIRIRRRWDQLLDMLIDDHVASARDRAKNDDDDFIHVL 79828
79829 LSLQDEYGFTRDHIKAISI 79885 (0)
80359 DMFEAGTDTSHLVLEYAMVELTRKPHILTKLQDEVRRITPKGQHMVTEDDIVGMVYLKAV
80538
80539
IKETLRLHAPGGFTIPHLAREDCNVDGYMIPAGTRVLINLWALSRDANYWDKPDEFLPER 80718
80719
FMDGSNKNTDFKGQDFQFLPFGSGRRMCPGIHSGKVTLEIMLANLVYCFNWKLPSGMKK 80895
80896 EDIDMTDVFGLAIHRKEKLFLVPQIANY* 80982
>CYP71C17
63826 MVVQLMLFFHDKFMAPMAEEPLPFVLIMIII
63733 LLLLVLLHYYLSASTRRSSAASKSNDDVLPPSPPRLPVIGHMHLVGSNPHV
63581
63580
SLRDLAEKHAADGFMLLQLGQVRNLVVSSPRAAEAVLRAHDHVFASRPRSAIADILAYGS 63401
63400
SNISFSPYGDYWRKARKLVAAHLLSPKKVQSLRRGREEEVGIAVAKLHEAAAAGAAVDMR 63221
63220 ELLGSFTNDVLCRAVCGKSSFRREGRNKLFMELAAGNADQYAGFNLEDYFPSLAKVDLLR
63041
63040
RVVSADTKKLKEKWDSVLGDIVSEHEKKSSLRRDDQVQMDDDRDDDQEEQESDFVDILLD 62861
62860 RQQEYNLTRHNIHAILM 62810 (0)
62626
DMFAAGTDTSYIALEFAMSELIRKPHLMTKLQDEVRKNTTTQMVSEDDLNNMPYLKAV 62453
62452 VKETLRLHPPVPLLLPRLSMAQCNANGYTIPANTRVIINVWALGRDAKCWENSEEFMPER
62273
62272
FMDSGDTIDNVDFKGTDFQFLPFGAGRRICPGMNFGMASVELMLSNLMYCFDWELPVGMD 62093
62092 KDDVDMTDQFALTMARKEKLYLIPRSHVIKIT* 61994
>CYP71C18P also stop
and fs in AACV01000601.1,
AK100062 cDNA
56483 MEQAAGLVYQLFQNEMFPWILALFPFLLLALHYLATNHRTPTTCKETRNHLSPPSP
56650
56651
PRLPIIGHLHLIGDLPHVSLRELAHRYGPNLMLLHLGQVQNLVVSSPHAAEAVLR 56815
56816
THDHVFASRPHSLIGDILLYGPSDVGLSPYGEQWRRSRRIVTTHLLTNKKVRSYHVAREE 56995
56996 E 56998 (0)
57929
VHKVMTKVHELSTKGMGVDMFELFSTYSNDLICRLVSGKNFQGDKGRNKMFRQLFKANY 58105
58106
VLLAGFNLEDYYPGLARLKAVSWVMCAKARNTRKLWDELLDEIINDRMSKQPCEHDRGND 58285
58286 DQDEMDFVDVLLLQERGITRDHLKAILV 58369 (0)
58462
DMFQAGTETTSVVLVFAMAELMQKPHLMAKLQAELRTNIPK*GRELITECDQTNMTYLKA 58641
58642
VIKETLRLHPPSPLLVPHLAMADCDIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 58821
58822 RFVDGGSAANVDFIGTDFQFLPFGAX 58896 frameshift
58899
RRICPGINFASASMEIILANLLYHFDWDVSAEAAIDKDGIDMAEAFGLSVQLKEKLLLVP 59078
59079 VEYKGSVQDSAVIL* 59123
>CYP71C19
21862
MEQAAGLVYQLFQHEMFPWTFSVLALFPFLLLVLHYLATNHRTPTTCKETKNHHPP
21694 PPSPPRLPIIGHLHLIGGLLHVSLRELAHRYGPDLMLLHLGQVPNLIVSSPRAAEAVLR
21518
21517
THDLVFASRPYSLIADILLYGPSDVGLSPYGE*WRRRIITTHLLTNKKVRSYRVAREE 21344
21343 E 21335 (0)
VHKVMAKVHELSTKGMAVDMTELFSTFSNDLICRLVSGKNFQGEGRNKLFRQLFKAN
SVLLAGFNLKDYYPGLARLKAVSMVMCAKARNTRKLWDELLDEIIDERMS
KQQCEHDEGNDQDEMNFVNVLLLQEQGITREHLKAIL (0)
20004
DMYQAGTETSSVVLVFAMAELMQKPHLMAKLQAELRTTIPKQGHELITERDLTDMTYLKA 19825
19824
VIKETLRLHPPTPLLLPHLAMADCNIDGYTVRSGTRVIVNAWAIGRNSESWEAAEEFLPE 19645
19644 RFVDDGSAANVDFIGTDFQFLPFGAGRRICPGINFASASMEIILANLLYHFDWDVSAEAA
19465
19464 IDKDGIDMAEAFGLSVQLKEKLLLVPVDYKDGMQDSAVILL* 19339
>CYP71C20
MAQMLAAFLLDGLISHEHGHESLGAPPQAGTMAWYSLVLMTS
79980
LLFPLLVLLVMRCYVTRSGAKLLDKLPSVPGRLPVIGHLHLIGSLPHISLRDLATKH 79810
79809 SPDMMLLHLGAVPTLVVSSSRVAQSILHTHDDIFASRPYSPIANILFYGATDVGFSPYNE
79630
79629
YWRQIKKITTTHLLTMKKVRSYVSARQREVRIVMARITEAASKHVVVDLTEMLSCYSNN 79453
79452
IVCHAVCGKFSLKEGWNQLLRELVKVNTSLLGGFNIEDYFPSFTRLAAVRRLLLSCAKA 79276
79275
HNINKRWDQLLEKLIDDHTTKHIRSSSMLNHYDEEAGFIDVLLSIQHEYGLTK 79117
79116 DNIKANLAAMLMAGMDTSFIELEYAMAELMQKPHVMGKLQAEVRRVMPKGQDIVTEEQLG
78937
78936
CMPYLKAVIKETLRLHPPAPLLMPHLSISDCNINGYTIPSGTRVIVNVWALARDSN 78769
78768
YWENADEFIPERFIVNTLGDYNGNNFHFLPFGSGRRICPGINFAIATIEIMLANLV 78601
78600 YRFDWELPADQAAKGGIDMTETFGVAVHRKEKLLLIPHLHLR* 78472
>CYP2C22P
61469 EQESDFVDILLDHQQEYNLTRHNIHAILM 61383
>CYP71C23P
103765
FAIATIEIMLANLVYRFDWEILVDQAAKGGIDMTEAFGLAVHRKEKLLLVSWLPQD* 103595
>CYP71E4
96529 MATLLSKLLALPQQWQLLLLLLLLPIASLLLVIGRNTGGRRRRRHLR
96388
LPPGPARLPVLGNLLQLGALPHRSLRDLARRHGPVMMLRLGAVPAVVVSSPEAAQEVLR 96212
96211
THDADCCSRPSSPGPMRLSYGYKDVAFAPYDAYGRAARRLFVAELFSAPRVQAAWRARQDQ 96017 (0)
94678
VEKLIGKLTRPEPEPVELNDHIFALTDGIIGAVAFGSIYGTERFAGGGRKRFHHLLDDVMDMLASFSAEDFFPNA
94454
94453
AAARLFDHLTGLVAHRERVFQQLDAFFEMVIEQHLDSDSSNAGGGGGNLVGALIGL 94286
94285 WKQGKQYGDRRFTRENVKAIIF 94220 (0)
94119
DAFIGGIGTSSVTILWAMAELMRSPRVMRKVQAEIRATVGDRDGGGMVQPDDLPRLAY 93946
93945
LKMVVKETLRLHPPATLLMPRETMRDVRIGGYEVAARTRVMVNAWAIGRDAARWEEAEVF 93766
93765
DPDRFEAKRVEFNGGHFELLPFGSGRRICPGIAMGAANVEFTLANLLHCFDWALPVGMAP 93586
93585 EELSMEESGGLVLHRKAPLVLVPTRYIQL* 93496
>CYP71E5
31348
MAISLITSLLFSLPQQWQPVVLTGLLPVIVSLVLLARKGRLKMPPGPEQVPLLGNLHQLA 31169
31168
GPQPHRALRDLARVHGPVMRLRLGKASAVVLTSAEAAWEALRGHDLDCCTRPVSAGTRRV 30989
30988 TYGMKNVAFAPYGAYWREVRKLLMVELLSARRVKAAWYARHEQ 30860
30546 VEKLLSTLRRAEGKPVALDEHILSLSDGIIGTVAFGNIYGSDKFSQNKNFQHALDDV
30376
30375
MEMLSGEGSSAEDLQLPAAVGRLVDRLTGFAARRERIFRQLDSFFEMVIEQHLDPNRAPP 30196
30195 ENGGDLVDVLIGHWKKNEPRGTFSFTKDNVKAIIF 30091
29601
STFVAGIDTNAATILWAMSELARKPRVLKKVQAEIRAAVGVNGRVQPDDITKLSYLRKVV 29422
29421 KETLRLHPPTPLLLPRETMRHIQISGYDVPAKTRIYVNAWAIGRDPASWPDEPEEFNPER
29242
29241
FEANEIDFKGEHPELMPFGTGRRICPGMAMAMANVEFTLANLLFAFQWSLPEGTTPDNVC 29062
29061 LEEEGRLVCHRKTPLVLVPTVYRHGLE 28981
>CYP71E6
2204 MAASLLLELLPQQWQLSITSLIL
2273 LAVSVVLIFWSRRRRNPSSRLKLPPGPTRLPIIGNLHQIGRLPHRSLEALAGWHG
2437 (fs)
2437 LGTVSVVVLSSPKAAREALKVHDPECCSRSPS 2532
2533 AGPRMLSYGYKDVAFSPYSNYVRNMRKLFVVELLSMRRVQAA 2658
8170 VMLPDYYCCM 8199
8599
VEKLIEKLTRNGRNAVAINEHIFSTVDGIIGTFALGETYAAEEFKDISETMDLLSSSSAE 8778
8779 DFFPGSVAGRLVDRLTGLAARREAIFRKLDRFFERIVDQHAAADDDGPAAARRKADDKGS
8958
8959 AGSDLVHELIDLWKMEGNTKQGFTKDHVKAMLL 9057
9159
DTFVGGITTTSVTLHWAMSELIRNPRVMKKAQDEIRAVVGEKERVQHHDMPKLKYLKMVV 9338
9339
KETFRLHPPATLLVPRETTRHFKVGGYDIPEKTKVIVNA*AIGRDPNIWKDPEEFIPERF 9518
9519 EEMDIDFNGAHFELVPFGSGRRICPGLAMGVANIEFILASMLFCFDWELPHGVRKEDIDM
9698
9699 EEAGKLTFHKKIPLLLVPTPNKAPN* 9776
>CYP71K1
MAELPLYLLLLALLVAVPFLCLTRWSLRHGGGGGGRLPPSPWAL
PVIGHLHHVAGALPHRAMRDLARRHGPLMLLRLCELRVVVACTAEAAREVTKTHDLAF
ATRPITPTGKVLMADSVGVVFAPYGDGWRTLRRICTLELLSARRVRSFRAVREEEVGR
LLRAVAAAAAVAALTTPGATAAVNLSERISAYVADSAVRAVIGSRFKNRAAFLRMLER
RMKLLPAQCLPDLFPSSRAAMLVSRMPRRMKRERQEMMDFIDDIFQEHHESRAAAGAE
EDLLDVLLRIQSQDKTNPALTNDNIKTVIIDMFVASSETAATSLQWTMSELMRNPRVM
RKAQDEVRRALAIAGQDGVTEESLRDLPYLHLVIKESLRLHPPVTMLLPRECRETCRV
MGFDVPEGVMVLVNAWAIGRDPAHWDSPEEFAPERFEGVGAADFKGTDFEYIPFGAGR
RMCPGMAFGLANMELALAALLYHFDWELPGGMLPGELDMTEALGLTTRRCSDLLLVPA
LRVPLRDHER
>CYP71K3
66403
MATELTEYLLLLPLLVVPLLYLAASSSRRSGRLRLPPGPWALPVIGHLHHLALAGAPTHR 66582
66583
AMRDMARRHGPLMLLRFCELPVVVASSPDAAREIMRTHDVAFASRPIGPMLRLVFQGAEG 66762
66763
VIFAPYGDGWRQLRKICTVELLSHRRVHSFRPVRADELGRLLRAVADQAASSSS 66924
66925
SPVNLTGMISAFVADSTVRAIIGSRSRHRDTFLRLVEDGLKIMPGMSLPDLFPSSRLAML 67104
67105
LSRVPAKIERRRRGMMGFIDTIIQEHQESRAAAEDEDLLDVLLRLQKDMDSQYPLTTMN 67281
67282 IKSILI 67299 (0)
67380 DMFGAGSETSATTLQWAMAELMRNPAVMRRAQDEVRRELAVAGNDRVTEDTLPSLHYL
67553
67554
RLVIKETLRLHPPAPLLLPRECGGACKVFGYDVPAGTMVLVNAWAIGRDAAAWGAAAEEF 67733
67734
SPERFERCERDFRGADFELIPFGAGRRICPGMAFGLAHVELALAALLFHFDWRLPGGMA 67910
67911 AGEMDMTEAAGITVRRRSDLLVFAVPRVPVPAQ* 68012
>CYP71K4
68586
MPLVVLLLATIPLLFFTIKRSAQRRGGGGGGEGRLPPGPWALPVIGHLHHLAGDLPHRA 68762
68763
LSALARRHGALMLLRLGEVQAVVASSPDAARDIMRTHDAAFASRPLSPMQQLAYGRDAEG 68942
68943
VIFAPYGDGWRHLRKICTAELLSARRVQSFRPVREAELGRLLRSVAEATSSSSSA 69107
69108 SLVNLTELISAFVADSTVRAIIGSRFEHRDAYLRMLQDGLKIVPGMTLPDLFPSSRLALF
69287
69288
LSRVPGRIEHHRQGMQRFIDAIIVEHQEKRAAAAANDDDDEDEDFLDVLLKLQKEMGSQH 69467
69468 PLTTANIKTVML (0)
DMFGAGSESSATVLQWT 69647
69648
MAELMRNPRVMQKAQDEVRRALAGHDKVTEPNLTNLPYLRLVIKETLRLHPPAPLLLP 69821
69822 RKCGSTCKILGFDVPEGVMVIVNAWAIGRDLTYWDKPEEFVPERFEHNGRDFKGMDFEF
69998
69999
IPFGAGRRICPGITFGMAHVELVLSALLYHFDWELPQGMAAKDLDMTEDFGVTTQRRSNL 70178
70179 LVRPIHRVSVPVE* 70220
>CYP71K5
70828
MAGELAFYLLLVGLVAVPLLILLGSERRTAARTRLPPGPWALPVVGHLHHLAGGLPPHRA 71007
71008 MRDLARRHGPLMLLRLGEVEAVVASSPDAAREIMRTHDVAFASRPVGPMSRLWFQGADGL
71187
71188
VFAPYGEAWRRLRRVCTQELLSHRRVQSFRPVREDELGRLLRAVDAAAAAGT 71343
71344
AVNLTAMMSTYVADSTVRAIIGSRRLKDRDAFLRMLDELFTIMPGMSLPDLFPSSRLAML 71523
71524
VSRAPGRIMRYRRRMRRIMDSIIHEHQERRAAADAAGDDDDDDDEDLVDVLLRLQKEVGA 71703
71704 QYPLTTENIKTVMM 71745 (0)
71837
DIFGAASETSSTTLEWVMAELMRSPSAMRKAQDEVRRALAAGAAGHDTVTEDILPNLSYL 72016
72017
KLVVKETLRLHPPAPLLAPRRCDSPREVLVLGHDVPAGATVLVNAWAIGRDTAAWGGAA 72193
72194
EEFSPERFERCERDFRGADFELIPFGAGRRMCPGMAFGLVHVELALAALLFHFDWSLPG 72370
72371 GMAADELDMAESSGLTTRRRLPLLVVARPHAALPTKYCN* 72490
>CYP71K6 not frameshifted in AACV01014660.1 or AACV01014659.1
and phase 0 boundary is
correct, so this is not a pseudogene
128476 MAAELVHLLRYLFSVPM
128425
LFFIVPLLFLVCSPRRRRGRGSCRLPPSPWALPVVGHLHHLAGALQHRAMRDIARRHGPL 128246
128245
VLLRLGRLPVVVASSADAAREVMRTSDVAFAARPVNRMIRVVFPEGSEGVIFAPYGETWR 128066
128065
QLRKICTAELLSARRVHSFRSVREEEAGRMLRAVASAAAQTTVNLSELMSA 127913
127912
YAADSSARAMIGRRLKDRDTFLAMVERGIKLFGEQSLPNLYPSSRLAVLLSTMPRRMKRH 127733
127732 RERMTAYLDAIIEEHQESRASREDDEDLLDVLLRI 127628
frameshift
QREGDLEVSRESIRSTIG bad exon
boundary should be phase 0
126439
DMFIGGSEPPAITLQWIMAELIRNPEVMQKVQDEVRQLLVGQHRVTEESLSKLGY 126275
126274
MNLVIKETLRLHPPGPRLLLRVCRTTCQVLGFDVPKGTMVLVNMWAINRDPKYWSQAEEF 126095
126094 IPERFENAGINFKGTNFEYMPFGAGRRMCPGMAFGLATLELALASLLYHFDWKLPDGV
125921
125920 EIDMKEQSGVTTRRVHDLMLVPIIRVPLPV* 125828
>CYP71K7P
138301
MAEVVQLHHLILLLPLFILPFLLLRSSRRRRGACGRLPPSPWALPVIGHLHHLAGALPHRAMRDIA 138152
138151
RRHGPLVLLRLGELPVVVASSADAARDVMKTHDLAFATRPITRMMRLVFPEGSEGIIFSP 137972
137971
YGETWRQLRKICTVELLSARRVNSFRSVREEEVNRLLRAVAAAAASATSPAKTVNL 137804
137803
SELMSAYAADSSVRAMIGRRCKDRDKFLAMLERGIKLFVTPSLPDLYPSSRLAMVVSRMP 137624
137623 RRMRRHREEVFAFLDAIIAEHQENRASGEDEEDLLDVLLRIQREGCME
137480 frameshift
SPLLSTESIRTTIG bad exon
boundary should be phase 0
136562
DLFNGGSETTATTLQWIMAKLMRNPRVMQKAQDEVQRVFIGQHKVTEENLSNLSYMYL 136389
136388
VIKEALRLHPPRPPLLPRECRTTCQVLGFDVPKGTIVLVNMWAINRDPKYWDQSEEFILE 136209
136208
RFEHVDINFKGMNFEYMPFGAGRRMCPGMAFGLVNLELVLASLLYHFDWKL 136056
136039 GDLDMTQERGATTRRLHDLLLVPVIRVPLPLDSRS* 135942
>CYP71K8
151229
MAGFPVYLLFLAALIILPMANLIRSARHRRLAGARRPPPGPWALPVIGHLHHLLAGKLPH 151408
151409
HHKLRDLAARHGPLMLLRFGELPVVVASSADAAREIAKAHDLAFATRPVTRTARLTLPEG 151588
151589 GEGVIFAPYGDGWRQLRKICTLELLSARRVLSFRAVREQEVRCLLLAVASPSPEGTTAT
151765
151766
ASVVNLSRMISSCVADSSVRAIIGSGRFKDRETFLRLMERGIKLFSCPSLPDLFPSSR 151939
151940
LAMLVSRVPGRMRRQRKEMMEFMETIIEEHQAARQASMELEKEDLVDVLLRVQRDGSLQF 152119
152120 SLTTDNIKAAIA 152155 (0)
166133 DLFIGGSETAATTLQWAMSELLNNPKVMQKAQDEIRQVLYGQERITEETISSLHYLHL
166306
166307
VIKETLRLHPPTPLLLPRECREPCQILGFDVSKGAMVLINAWSIGRDPSNWHAPEKFMPE 166486
166487
RFEQNNIDFKETSFEYIPFGAGRRICPGMTFRLANIELLLASLLYHFDWELPYGMQAGD 166663
166664 LDMTETLAVTARRKADLLVVPVVRVPIVG* 166753
>CYP71K9
168538MAAAASSVLAYLLVVALLAIVPLVYFGWVARRRGEGGRLPPSPWGLPVIGHLHHLAGALPHHAMRDLA168741
168742
RRHGPLMLLRLGELPVVVASSAEAAREVMRTRDIEFATRPMSRMTRLVFPAGTEGIIFAP 168921
168922
YGDEWRELRKVCTVELLSARRVQSFRAVREEEVGRLLRAVAATSSSPSPAQAAVNL 169089
169090
SALLSAYAADSAVHAIIGSRFKDRDKYLMLLERGLKLFARHTLPDLYPSSRLAMWLSRMP 169269
169270
RRMMQHRREAYAFTDAIIREHQENRAAGAGDGDGDDKEDLLDVLLRIQREGDLQFPLSTE 169449
169450 RIKTTVG 169470 (0)
169893
DMFAGGSETAGTALQWIMAELIRNPRVMHKVQDEVRQTLAGRDRVTEDAISNLNYMHL 170066
170067
VIKEALRLHPPVPLLLPRECRNTCQVLGFDVPKGAMVLVNAWAISRDPQYWDEPEEFIPE 170246
170247
RFEDSNIDFKGTNFEYTPFGAGRRMCPGIAFGLANVELMLASLLYHFNWQLPDGMDTAD 170423
170424 LDMTEEMVVSARRLHDLLLVPVVHVPLPVASS* 170522
>CYP71K10
145371
MAAFLVYVLVLVPLAVVPFVYFNRVARRRGGDVRLPPSPWGLPVIGHLHHLVGALPH 145201
145200 VAMRDLARRHGPLMLLRLGELPVVVASSAEAAREVMKTRDLDFATRPMSRMARLVFPEG
145024
145023
GEGIIFAPYGDRWRELRKICTVELLSGRRVQSFRPVREEEAGRLLRAVAAASPG 144862
144861
QAVNLSELLSAHAADSSVRAIMGDRFRDRDAFLAMLERGLKLFAKPALPDLYPSSRLAM 144685
144684
LLSRMPRRMKQHHRDMVAFLDAIIQEHQENRSAAADDDNDLLDVLLRIQREGDLQFPLS 144508
144507 SESIKATIG 144481 (0)
144297
DMLVGGSETAATTLHWIMAELVRNPKVMQKAQDEVRRELIGHRKVTEDTLCRLNYMHM 144124
144123
VIKEALRLHPPGSLLLPRECRRTCQVLGYDIPKGATVFVNVSAIGRDPKYWDEAEEFIPE 143944
143943
RFEHSDVDFKGTHFEYTPFGAGRRMCPGMAFGLANVELTLASLLYHFNWELPSGIHAENL 143764
143763 DMTEEMRFTTRRLHDLVLIPVVHVPLPTI* 143674
>CYP71K11
94448
MADQLVHLPQQLLVLLLFIAPFFFFFLIRSMRRRDGGSVRLPPSPWALPVIGHLHHLMGA 94627
94628
LPPHHAMRDIALRHGPLVRLRLGGLQVI 94711
94712
LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0)
GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLL 95071
95072
RAVAAASPARRAVNLSELISAYSADSTMRALIGSRFKDRDRFLMLLERGVKLFATPSLPD 95251
95252
LYPSSRLAELISRRPRQMRRHRDEVYAFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQR 95431
95432
KGDFPLSTDNIKTTIG (0) 95479
95574
DLFNGGSETTATTLKWIMAELVRNPRVMQKAQDEVRRALGKHHKVTEEALKNLSYLHL 95747
95748
VIKEGLRLHPPGLPLLLRESRTTSQVLGFDVPQGTMILVNMWAISRDPMYWDQAEEFIPE 95927
95928
RFEHVNIDYYGTDVKYMPFGVGRRICPGIAFGLVNLELVLASLLYHFDWELPDGTELGNL 96107
96108
DMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 96200
>CYP71K12
113634 MADQLVHLPQQLLVLLLFIAPFFFFFLIRSIRRRDGGSVRLPPSPWALPVIGHLHHLMGA
113813
113814 LPPQHAMRNIALRHGPLVRLRLGGLQVI 113897
113898 LASSVDAAREVMRTHDLAFATRPSTRVMQLVFPEGSQ (0) 114008
114127
GIVFTPYGDSWRNLRKICTVELLSAKRVQSFRPIREEEVGRLLRAVAAASPARRAVNL 114300
114301
SELISAYSADSTMRALIGSRFKDRDKFLMLLERGVKLFATPSLPDLYPSSRLAELISRRP 114480
114481
RQMRRHRDEVYEFLDIIIKEHQENRSSSDDQEDLDLVDVLLRIQRKGDFPLSTDNIKTTIG (0) 114663
DLFNGGSETTATTLKWIMAELIRNPRVM
114840
114841
QKAQDEVRQVLGKHHKVTEEALRNLSYLHLVIKEGLRLHPPGLPLLLRESRTTSQVLGFH 115020
115021 VPQGTMILVNMWAISRDPMYWDQAEEFIPERFEHVNIDYYGTDVKYMPFGVGRRICPGIA
115200
115201
FGLVNLELVLASLLYHFNWELPDETELGNLDMKEEMGAIARRLHDLSLVPVIRHPLPVDM* 115383
>CYP71K13P
117756 DAAREVMRTHDLAFATRPSTRVMQLVFLEGSQ 117661
117553
GDRFTPYGDIWRNLRRSAPLAVSAKRVQFFRPIHQEEVCRLLQAVAVASPA 117395
117394 RGPPETLTSSFRPTWATLQCAP**GARLRDRDKSLMLLYRGVKPIRHARACQIFTQSIAL
117215
117214 ADLIIKSLSPMRRASYPMSNLLDIIFK 117134
117108 SDNHMDLTLVAFLLRFHKKGACPLSFCYIRKQFG*AF 116998
>CYP71P1
44616
MSLALLVLSAAYVLVALRRSRSSSSKPRRLPPSPPGWPVIGHLHLMSGMPHHALAELART 44437
44436 MRAPLFRMRLGSVPAVVISKPDLARAALTTNDAALASRPHLLSGQFLSFGCSDVTFAPAG
44257
44256
PYHRMARRVVVSELLSARRVATYGAVRVKELRRLLAHLTKNTSPAKPVDLSECFLNLAND 44077
44076
VLCRVAFGRRFPHGEGDKLGAVLAEAQDLFAGFTIGDFFPELEPVASTVTGLRRRLKKCL 43897
43896
ADLREACDVIVDEHISGNRQRIPGDRDEDFVDVLLRVQKSPDLEVPLTDDNLKALVL (0)
DMFVAGTDTTFATLEWVMTELVRHPRILKKA
43537
43536
QEEVRRVVGDSGRVEESHLGELHYMRAIIKETFRLHPAVPLLVPRESVAPCTLGGYDIPA 43357
43356
RTRVFINTFAMGRDPEIWDNPLEYSPERFESAGGGGEIDLKDPDYKLLPFGGGRRGCPGY 43177
43176
TFALATVQVSLASLLYHFEWALPAGVRAEDVNLDETFGLATRKKEPLFVAVRKSDAYEFK 42997
42996 GEELSEV* 42973
>CYP71P2P
107880 GSMPAVVISKPNLARPALTTNDAVLASRQHLLNG*FLSF 107764
frameshift
107762 GCSDVTFAPAGPYHRM 107715 frameshift
107713
QMARGVEVSELLSAHHVAMYGVVRVKELQRLLAHLTKNTSSAKPIDLSECFLNLANDVLCRVAF 107521
107520 GRRFPRDEGDKLSAVLANAQDLL 107452 frameshift
107452
AGFTISDFFLELEPVASTVTGLCHRLKKCLADLCEACDVIVDVHISGNRQRIPSDREEDFVDVLLRVQ 107249
>CYP71Q1
22020 MADDFLSSQPQPW 22058
22059
PPLLQLSAAVLFFLLPLLYLLFLRGSNGEVRGRQGNSASAPSLPGPCRQLPVLGNLLQIG 22238
22239 SRPHRYFQAVSRRYGPVVQVQLGGVRTVVVHSPEAAEDVLRTNDVHCCSRPPSPG
22405 (2)
26427 SYNYLDVAFAPYSDYWREMRKLFVVELTSVSRVRSFAYARAAEVARLVDTL
26577
26578
AASPPGVPVDLSCALYQLLDGIIGTVAFGKGYGAAQWSTERAVFQDVLSELLLVLG 26745
26746
SFSFEDFFPSSALARWADALAGVERRRRRIFRQVDGFLDSVIDKHLEPERLSAGVQED 26919
26920 MVDALVKMWREQQDRPSGVLTREHIKAILM 27009 (0)
28586
NTFAGGIDTTAITAIWIMSEIMRNPRVMQKARAEVRNTVKNKPLVDEEDSQNLKYLEMIIKEN 28774
28775
FRLHPPGNLLVPRQTMQPCLIGGYNVPSGTRVFINIWAMGRGPMIWDNPEEFYPERFE 28948
28949
DRNMDFRGSNFELVPFGSGRRICPGVAMAVTSLELVVANLLYCFDWKLPKGMKEEDIDM 29125
29126 EEIGQISFISFRRKVELFIVPVKHEQYQLMGHIN* 29221
>CYP71Q2 also frameshifted in AACV01015820.1
69192 MATELLASQLLPWQPLVQLLAAGLFLLPLVYLLFFKGDGNGG 69317
69318
VMDSASAPSPPGPPRQLPVLGNLLQIGSRPHRYFQAVARRYVPVVQVQLGSIRTVVVHS 69494
69495 PEAAKDVLRTNDLQCCSRPSSPG 69565 (2)
72047 NYNYLDVAVSPYS 72083 (frameshift)
72085
YWREMRKLLVIELTSIRRVQSFAYARAAEVARLVDTLAASPAGVPVDLSSALYTF 72249
72250
SDGVIGTVAFGKVYGSAAWSSWEWGASFQEAMDETMQVLGSFSFEDFFPSSALARWADALTGA 72438
72421
AGRRRRVFHRIDGFFDAVIDKHLEPERLSAGVQEDMVDAMVMVWREQKDEAFGLTRDHIKAILL 72630 (0)
84351
DAFVGGIDTTAVTVTWIMSELMRNPRVMQKAQAEVHNIVKNKSKVCEEDIQNMKYLKM 84524
84525
IIKENFRLHPPGTLLIPRQTMKTCTIGGYSVPSETRIYVNVWAMGRDPNIWDNPEQFY 84698
84699
PERFEDKGIDFRGSHFELLPFGSGRRICPGIAMGVANVELVVANLLYCFNWQLPKGMKEE 84878
84879 DIDMDEIGQLAFRKNFLF* 84935
>CYP71Q3P
92865 DAFAGGIDTTVVTTTWIMSELMRNPRVMQK 92954 (frameshift)
92956
AQAEVHNIVKNKSKVYEENIQNMKYLKMIIKENCRLHPPGTLLIPRHTMKTCTIGGYSV 93132
93133 PSKRRIYVNVWAMWRDPNIWDNLEQFYLERFEDKGIDFRGSHFELLT
93273 (insertion)
93561 FGSGQRICPGIAMGVANVELVVANLLYCFDWQLPKGMKEEDTDMDEIG*LAFRKKLPLFI
93740
93741 VPMKH* 93758
>CYP71Q4P
16812 GGGGRWTETLEWIMAELTANTRVMAKLQDEISRAADGK 16925 24 aa
deletion and frameshift
16931
PAPILVPRQSTATAVVQG*EILAKTSLFINVWAMGRYPAAWNSSEEFWPEQFLASREAVD 17110
17111 FQGNNYQLILFITDRRIFPDINFAVPVLETALVGLLHPTNELLGGG
17248
17249 GGLMWLQRSCSRARRPRSTAHRRHRSGTHPAAIAAAAAT* 17368
>CYP71R1
56
MAAVQLDSGLLVGFLFLATCLAVAIRSYLRSGGAAIPSPPALPVIGNLHQLGRGRHHRAL 235
236
RELARRHGPLFQLRLGSVRALVVSSAPMAEAELRHQDHVFCGRPQQRTARGTLYGCRDVA 415
416
FSPYGERWRRLRRVAVVRLLSARRVDSFRALREEEVASFVNRIRAASGGGVVNLTELIVG 595
596
LTHAVVSRAAFGKKLGGVDPAKVRETIGELADLLETIAVSDMFPRLRWVDWATGLDARTK 775
776
RTAAKLDEVLEMALRDHEQSRGDDDDGGGGDGEPRDLMDDLLSMANDGGGDRGHKLDRID 955
956
VKGLILDMFIAGTDTIYKSIEWTMAELIKNPAEMAKVQAEVRHVAAAAH 1102
1103 GDEDEDTVAVVREEQLGKMTLLRAAMKEAMRLHPPVPLLIPREAIEDTVLHGHRVAAGTR
1282
1283
VMINAWAIGRDEAAWEGAAEFRPGRFAGGGAAAGVEYYGGGDFRFVPFGAGRRGCPGVAF 1462
1463 GTRLAELAVANMACWFEWELPDGQDVESFEVVESS 1567
>CYP71R2P
54816 MAAVQLDSGLLIGFLFLATCLVVAVRSYLRSGGADGGRGGAA
54690 ITSPPALPVIGNLHQLGRGRHHRALRELARRHGPLFQLRLGSVRALVVSSASMAEAVLRH
54511
54510
QDHVFCGRPQQHTARGTLYGCRDVAFSPYGERWRRLRRVAVVHLLSARRVDSFRALREEE 54331
54330
VASFVNRIRAASGGGGGVVNLTELIVGLTHAVVSRAAFGKKLGGVEPAKVRETVGEL 54160
54159
ADLLGTIAVSDMFPRLRWVDWATGLDARTKRTAAKLDEVLEMVLRDHEPSRGDDDDDDGD 53980
53979 GEARDLMDDLLSMANGGDDHGYKLDRIDVKGLLIL 53875 (0)
DMFAAGTDTVYKSIE*TMAELIKNPAEMAKVQAEVRHVVAAAHGGEGDEDAVVIVKEEQAS 53616 (fs)
53613
LGKMTLLRAAMKEAMRLHPPLPLLIPREAIQDTVLHGHRVAAGTRVMINAWAIGRDEAAW 53434
53433 EDAGEFRPGRFADGGDNAGVEYYGGGGDFRFMPFGAGRRGCPGMAFATRLAELVVANMAC
53254
53253 WFEWELPDGQDVQSFEPSRSSSREVCRLGSSSGKIGNHVYVVQTTRM*
53110
>CYP71S1
23233
MSMASLQAPEFLASCLLLATILFFKQLLAPSSKQRAASPSLPRPRGLPLIGNLHQVGALPHRSLAALAAR 23096
23095
HAAPLMLLRLGSVPTLVVSTADAARALFRDNDRALSGRPALYAATRLSYGQKSISFAPD 22919
22918 GAYWRAARRACMSELLGPPRVRGLRDAREREAAALVAAVAAAGASPVNLSDMVAATSSR
22742
22741
IVRRVAFGDGDGDESMDVKAVLNETQALLGGLWVADYVPWLRWVDTLSGKRWRLERRFRQ 22562
22561
LDALYERVIDDHLNKRKHASDEEDDLVDVLLRLHGDPAHRSTFGSRSHIKGILT 22400 (0)
22059 DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVVAAGDKVREADLPELHYLR
21889
21888
LVIKETLRLHPAAPLLVPRETTEPFRTAHGVEIPARTRVVVNAMAIHTDPGVWGPDAERF 21709
21708
VPERHRDDADGCAQQHDGFALVPFGIGRRRCPGVHFAAAAVELLLANLLFCFDWRAP 21538
21537 PGREVDVEEENGLAVHKKNPLVLIATKSKRNTGGH* 21427
>CYP71S2
19270 MASLQAPEFLASCLLLLATILLFKQLLAPSSKKRAASPSLPRPKGLPLIGNLHQVGALPHRSLAAL
19073
19072
AARHAAPLMLLRLGSVPTLVVSTADAARALFRNNDRALSGRPALYAATRLSYGQKNISF 18896
18895
APDGAYWRAARRACMSALLGAPRVCELRDAREREAAALIAAVAAAGASPVNLSDMVAAT 18719
18718
SSRIVRRVAFGDGDGDESMDVKAVLDETQSLLGGLWVADYVPWLRWVDTLSGMRRRLERR 18539
18538
FRQLDAFYERVIDDHINKRKHASDEEDDLVDVLLRLHGDPAHRSMFGSRTHIKGILT (0)18368
17337
DMFIAGSDTSAVTVQWAMTELVRNPDVLAKAQHEVRRVIAGGGGGDKDGAMVREADL 17149
17148
PELHYLRLVIKETLRLHPASPLVQRETTEPFRTAHGVEIPARTRVVINAMAIHTDPGVWG 16969
16968 PNAERFLPERHRAHDADGEQQHEHDGFALVPFGIGRRSCPGVHFAAAAAELLLANLLFCF
16789
16788 DWRALPGREVDVEEENGLAVRKKNPLVLIATKSKSNRDAH* 16666
>CYP71T1
34853 MELSSSLAAVLHSPLFLLAAL
34916
LLLPVFTLLSFSSAKKPGDGGGWRLPLPPSPRGVPFLGHLPLLGSLPHRKLRSMAEAHGP 35095
35096 VMLLWFGRVPTVVASSAASAQEAMRARDAAFASRARVRMAERLIYGRDMVFAPYGEFWR
35272
35273
QARRVSVLHLLSPRRIASFRGVREQEVAALLDRVRRRCGVRGGGETVNLSDLLMSYANGV 35452
35453
ISRAAFGDGAYGLDGDEGGEKLRELFANFEALLGTATVGEFVPWLAWVDKLMGLDAKAA 35629
35630
RISAELDGLLERVIADHRERRRLSQPDGGDGDGDGDENVDHRDFVDVLLDVSEVEEGAG
AGEVLLFDTVAIKAIIL (0)
36458
DMIAAATDTTFTTLEWAMAELINHPPVMRKLQCEIRAAVGVPGASGGAEVTEDHLGELR 36634
36635
LLRAVVKETLRLHAPVPLLVPRETVEDTELLGYRVPARTRVIINVWAIGRDAAAWGDRAE 36814
36815
EFVPERWLDGGGEEVEYAAQLGQDFRFVPFGAGRRGCPGAGFAAPSIELALTNLLYHFDW 36994
36995 ELPPHADGAAAATAARLDMGELFGLSMRMKTTLNLVAKPWSSDV* 37129
>CYP71T2
39698 MELSSLAALLHSPLLLAVLLLVFSWLIVSSTKKRPPPPCGDGGRRLP
39839
LPPSPPGVPLLGHLPLLGTLPHRKLRSMAEAHGPVMLLRLGRVPAVVASSAAAAEEVMRT 40018
40019 RDLAFASRPRVRMSERLFYGRDM
AFAPYGEFWRQARRVTVLHLLSPRRVLSFRGVREQE 40195
40196 VAALLDRVRRRCGGGGETVNLSDLLMSYAHGVISRAAFGHGGAHGFDGDEGGEKLRKLFA
40375
40376
DFEGLLGTMTVGEFVPWLAWVDKLTGLDAKVARTSAAMDGLLERVIADHRERRRSRGQA 40552
40553 VGDGEADADHR DFVDVMLDVSEAEEGAGAGAGGVLFDTVAIKAVIL (0)
42074 DMMAAGTDSSFTTTEWVMAELINHPRVMRKL 42169
42170 QDEIRAVVGTSSASAAAAATGGGQVTEDHLGELPFLRAVIKEMLRLHAPGPLLLPRETVE
42349
42350
DTELLGYRIPARTRVIINVWAIGRDAAAWGDSAEEFVPERWLDGGGGGGVEYAQQLGKDS 42529
42530
RFVPFGAGRRGCPGAGFAALSVELALANLLYHFDWELPPPAASGIMATTRLDMDELFGLS 42709
42710 VRLKADLNLVAKPWSPGAS* 42769
>CYP71T3
48011 MAVSLVVVVVV
48044 VIAIVVPLLYLVLLPAWKPARRDDGDGGMRRRLPPSPPWGLPLLGHLHLLGALPHRALRS
48223
48224
LAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRTRDLEFASRPRVAMAERLLYGGRDVAFA 48403
48404
PYGEYWRQTRRICVVHLLSARRVLSFRRVREEEAAALVARVRAAGGAVDLVEHLTAY 48574
48575 SNTVVSRAVFGDESARGLYGDVDRGRVLRKLFDDFVELLGQEPMGELLPWLGWVDALN
48748
48749
GMEVKVQRTFEALDGILEKVIDDHRRRRREVGRQMDDGGGGDHRDFVDVLLDVNETDMD 48925
48926 AGVQLGTIEIKAIIL 48970 (0)
51142
DMFAAGTDTTTTVIEWAMAELITHPDAMRNAQDEIKAVVGITSHITEDHLDRLPYLK 51312
51313 AVLKETLRLHPPLPLLVPHEPSSDTKILGYSIPACTRIVINAWTIGRDQATWGEHAEEFI
51492
51493
PERFLESGLDYIGQDFVLVPFGAGRRGCPGVGFAVQAMEMALASLLYNFDWETRVVDRRS 51672
51673 EFGTSSLDMSEMNGLSVRLKYGLPLIAISRFP* 51771
>CYP71T4
58119 MAVSLLPAVL
58149
VLLAIVAPLLYLVLLPAVKYTTRNGAARWEDDGGDGRRRRRLPPSPRGLPLLGHLHLLGA 58328
58329 LPHRALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRSRDMEFASRPRMAMAELLLY
58508
58509
GGRDVAFAPYGEYWRQARRICVVHLLSARRVLSFRRVREEEAAALVGRVRAAAADVVVD 58685
58686
LSDLLIAYSNTVLTRIAFGDESARGGGGGGDRGRELRKVFDDFARLLGTEPMGELLPW 58859
58860
FWWVDALRGIDGKVQRTFEALDGILERVIDDHRRRREGGRRMDDDGGGDHRDFVDVLLD 59036
59037 VNETDKDAGIQLGTVEIKAIIM 59102 (0)
59562
DMFVGGSDTTTTMMAWTMAELINHPRAMRKAQNEIWAVVGNTSHVTKDHVDK 59717
59718
LPYLKAVFKETLRLHPPLPLLIPREPPADTQILGYTIPAHTRVVINAWAIGRDAAAWGQQ 59897
59898
PDEFSPEKFLNSTIDYKGQDFELLPFGAGRRGCPGIVFGVSAMEIALASLLYHFDWEAAA 60077
60078 TDHRRRGSQAWALPVDMSEVNGIAVHLKYGLHVVAKPRMP* 60200
>CYP71T6
indica AAAA02000630.1
7617
MVVVVVVVAIAIVVPLLYLVLLPPARRGGGDSARRRLPPSPRGLPLLGHLHLLGALPHR 7441
7440
ALRSLAAAHGPVLLLRLGRVPVVVVSSAAAAEEVMRARDVAFASRPRVAMAERLLYG 7270
7269 GRDVAFAPYGEYWR 7228
7226 QARRICVVHLLSARRVLSFRRVREEESAALVARVRAAAGGAVDLVEHLTAYSNTVVSRAV
7047
7046
FGDESARGLYGDVDRGRALRKLFDDFVELLGQEPMGELLPWLGWVDAVRGLDGKVQRTFE 6867
6866
ALDSIIEKVIDDHRRRRRRHEVGRQMDSDDDGGGGGDHRDFVDVLLDVNETDKDAGIRLG 6687
6686 TIEIKAIIL (0) 6660
6536 DMFAAGTDTT 6507
6506
TTAMEWAMAELITHRDAMHKVQDEIRAVVGVTGCVTEDHIDRLPYLKAVLKETLRLHPPN 6327
6326
PLLVPHVPLADTEILGYTVPTHTRVLINAWTIGRDPVTWGEHAEKFIPERFLNNNVDYKG 6147
6146
QDFGLVPFGAGRRGCPGMGFAVPTIEMALASLLYNFSWETRPVDRRCKSGTSSLDMSEVN 5967
5966 GISVHLKYGLPLMAKSYFS* 5907
>CYP71T7
10977
MDISLASLVLVLLAFVLPLLYLLLQLPGKKSGGGGGDGPRLPPSPAGCLPLLGHLHQLGP 11156
11157
LPHVALRSMAAAHGPVLRLRLGRVPTVVVSSAAAAEEVLRARDAAFSSRPRSAMAERILY 11336
11337
GRDIAFAPYGEYWRQARRVCVVHLLSAQRVSSFRRVREEEAAALADAVRAAGRGGGRAFD 11516
11517 LSGLIVAYASAVVSRAAFGDESARGMYGGADGGRAVRKAFSDFSHLFGTKPVSDYLPWLG 11696
11697
WVDTLRGRERKARRTFEALDGVLDKVIDDHRRRRDSGRRQTGDADAGHRDFVDVLLDVNE 11876
11877 MDNEAGIHLDAIEIKAIIM (0) 11933
12008
DMFVAGSDATSKPMEWAMAELVSHPRHMRRLQDEIRAVVGGGRVTEDHVDKLPYLRA 12178
12179 ALKEALRLHAPLPLLVARETVADTEIMGYHVAARTRVVINGWAIGRDTAVWGETAEEFMP 12358
12359
ERFLAGGNGGGAAAADYKVQGFEMLPFGGGRRGCPGVTFGMATVEMAVASLLYHFDWEAA 12538
12539 AADGKGGREGTPLLDMSETSGISMGLKHGLPLVAKPRFP 12655
>CYP71T8 revised by ESTs CT848324.1,
CK038296.1
869 MSSYVVVAAALLVFVVVVVAAIKNLGKGKLPPSPPSLPFVGHLHLVGELPHRSLDALHRR
690
689
YGSDGGLMFLRLGRAGALVVSTAAAAADLYRGHDLAFASRPPSHSAERLFYGGRNMSFAP 510
509 LGDAWRRTKKLAVAHLLSPRRA
AALAAPARAAEAAALVARARRAAEAARAVQLRELLYAYTN
GVITRVAAGGSGATAERFRKMMADTSELLAGFQWVDRLPEAAGWAARKLTGLNK
KLDDMADESDRFLGEILAAHDDEKAEGEEEDFVDVLLRLRRQGAAAAGGLELAEDNVKAIIK
(0)
DIMGAATDTSFVTLEWIMTELIRNTQVMSKLQNEIIQVT
GSKPTVTEEDLTKLDYLKAVIKEVLRLHPPAPLLIPHHSTMPTTIQGYHIPAKTIAFINV
WAIGRDPAAWDTPDEFRPERFMGSAVDFRGNDYKFIPFGAGRRLCPGIILALPGLEMVIA
SLLYHFDWELPDGMDVQDLDMAEAPGLTTPPMNPVWLIPRCRTI*
>CYP71T10P
DVVVLLVLDIIVA
33536
LMYLVLLPDVNRSNRPERWEDNDGWQRLPP*PRRLPLLRYLHLLSAPLHQAFHPLPRHMA 33357
33356
WCYYSSSNACRWWLSSFAATSRPKSAMAEQLLYGCDVAFAPYGEY*S*ECRICVLFFRCI 33177
33176 REEEVAVLVKHVRHPCR
33126
>CYP71U3
98325 MDELSIENHSPISMDELSFGSLCMVAMATLALALALMVMGAHRRGGEKGATTGAKNLPP
98149
98148
GPWNLPVIGSLHHLLGASPPHRALLRLSRRHGPLMLVRLGEVPTVIVSGSDAAMEVL 97978
97977
KARDPAFADRARSTTVDAVSFGGKGVIFAPYGEHWRHARRVCLAELLSARQVRRLESIRQ 97798
97797
EEVSRLVDSIIAGSSNAAAVDMTRALAALTNDVIARAVFGGKCARQEEYRRELGVLTTLV 97618
97617
AGYSMVDLFPSSRVVRWLSRRTERRLRRSHAEMARIVGSIIEERKEKKGSDAGVGAKDED 97438
97437 DDLLGVLLRLQEEDGLTSPLTAEVIAALV 97351
94360
XDIFGAATDTTASTLEWIMVELMRNPRAMDKAQQEVRNTLGHEKGKLIGIDISELHYLCMV 94181
94180
IKETLRLHPASALILRQSRENCRVMGYDIPQATPVLINTFAVARDPKYWDNAEEFKPE 94007
94006
RFENSGADIRTSIAHLGFIPFGAGCRQCPGALLATTTLELTLANLLYHFDWALPDGVSPK 93827
93826 SLDMSEVMGITLHRRSSLHLHTTLTRSGFFSHSGR 93722
>CYP71V1P stops also in AACV01007891.1, but not indica
stops in EST CI222916
87309 MDDYFFLQSLLLCVAAVALLQLAKVAATMRRRPRTPPGPWRLPVIGSMHHLVNALPHR
87136
87135
AMRDLAGVHGPLMMLRLGETPVVVASSRGAARAVLKTHDANFATRPRLLAGEIVGYGW 86962
86961
ADILFSPSGDYWRKLRQLCAAEILSPKRVLSFRHIREDEVTARVEEIRAAAAPSTPVNLS 86782
86781
VLFHSTTNDIVARAAFGRKRKSAPEFMAAIKAGVGLSSGFKIPDLFPTWTTALAAVTGMK 86602
86601 RSLRGIHKTVDAILQEIIDERRCVRGDKINNGGAADDQNADENLVDVLIALQEKGGF
86431 (1)
86339
GKSVTTPWVIVTHMICTLDVQDMFAGGTGTSASALEWAMSELMRNPAVMKKLQGQIREAFH 86157
86156
RKAVVTEADLQASNLRYLKLVIKEALRLHPPAPLLVPRESIDTCELDGYTIPAKSRVIVN 85977
85976 VWAIGRDPK Y*Y*E DAEEFKPEQFDDDAIDFMGGSYEFIPFGSGRRMCPGFNYGLASMEL
85797
85796
VLVAMLYHFDWSLLVGVKEVDMEEAPGLGVRRRSPLLLCATPFVPAAVSADY* 85638
>CYP71V2
3269
MDELFYQSLLLSVAAVTVLQLLKLLLVRHRRPRTPPGPWRLPVIGSMHHLVNVLPHRKLR 3448
3449 ELAAVHGPLMMLQLGET
3499
3501
PLVVATSKETARAVLKTHDTNFATRPRLLAGEIVGYEWVDILFSPSGDYWRKLRQLC 3671
3672 AAEILSPKRVLSFRHIREDE 3731
4000
VNLSVMFHSVTNSIVSRAAFGKKRKNAAEFLAAIKSGVGLASGFNIPDLFPTWTGILATV 4179
4180
TGMKRSLRAIYTTVDGILEEIIAERKGIRDEKISGGAENVDENLVDVLIGLQGKGGFGFH 4359
4360 LDNSKIKAIIL (0) 4392
4491 DMFAGGTGTSASAMEWGMSELMRNPSVMKKLQAEIREVLRGKATVTEADMQAGNLR 4658
4659
YLKMVIREALRLHPPAPLLVPRESIDVCELDGYTIPAKSRVIINAWAIGRDPKYWDNPEE 4838
4839
FRPERFEDGTLDFTGSNYEFIPFGSGRRMCPGFNYGLASMELMFTGLLYHFDWSLPEGVN 5018
5019 EVDMAEAPGLGVRRRSPLMLCATPFVPVVSAN* 5117
>CYP71V3
MAWLDDVLSLCNNNTRMCNALVLSVVVVSFLQLLKHVLLTPSRLP
64951
LPPGPRNLPVVGSAHRLVNTLAHRVLRDLADVHGPLMHLRVGQVPVVVVTSKELARDILK 64772
64771
THDANFATRPKLVAGGIVAYDWTDILFSPSGDYWRKLRRLCIQEILSAKRILSFEHIRED 64592
64591
EVRMLADEIRAVGPSVAVDLSARLHRITNTIVSRAAFGNKRSNAADFLVAIKQSVIMASG 64412
64411 FYVPDLFPRFSVLLCWLTGMRRTLHGIRDTIDSILEEIISEKEEAKQQQDNNLVDVLLSL
64232
64231 KDKGDFGFPITRDTIKAIVL 64172
63964
DIFAGGSGTSANAMEWAMSELMMNPRVMNKVQAEIRDAFHGKQSIGEADLRARDLKYLKL 63785
63784
VMKETLRLHPPAPLLVPRESIDACEINGYMIPAKARVIVNSWAISRDPRYWEDAEEFKPE 63605
63604 RFAEGGIDFYGSNYEYTQFGSGRRMCPGYNYGLASMELTLAQLLHSFDWSMPDGATEVDM
63425
63424 TEAPGLGVRRKTPLLLCAAPYVASHIYA* 63338
>CYP71V4
6936
MDELLYRALLLSVLAVALLQIIKAFLIIIRAKPAAPPLPPGPWRLPVIGSMHHLAGKLPH 7115
7116
RALRDLAAAHGPLMMLRLGETPLVVASSREMAREVLRTHDANFATRPRLLAGEVVLYGGA 7295
7296 DILFSPSGEYWRRLRQLCAAEVLGPKRVLSFRHIREQE (0) 7409
8349
MESQVEEIRAAGPSTPVDLTAMFSFLVISNVSRASFGSKHRNAKKFLSAVKTGVTLASGF 8528
8529
KIPDLFPTWRKVLAAVTGMRRALEDIHRVVDSTLEEVIEERRSAREDKARCGMVGTEENL 8708
8709
VDVLIGLHEQGGCLSRNSIKSVIFDMFTAGTGTLSSTLGWGMSELMRSPMVMSKLQGEIR 8888
8889
EAFYGKATVGEEDIQASRLTYLGLFIKETLRLHPPVPLLVPRESIDTCEIKGYMIPARSR 9068
9069
IIVNAWAIGRDPRYWDDAEEFKPKRFEKNMVDFTGSCYEYLPFGAGRRMCPGVAYGIPIL 9248
9249
EMALVQLLYHFDWSLPKGVVDVDMEESSGLGARRKTPLLLCATPFVVPVL* 9401
>CYP71V5
3420 MDGLLYQALLLSALAVAVLQIVKLAVVNRGKKQAAAAAPTPPGPWRLPVIGSMHHLAGKL 3599
3600
AHRALRDLAAVHGPLMMLQLGETPLVVVSSREVAREVLRTHDANFATRPRLLAGEVVLYG 3779
3780 GADILFSPSGEYWRKLRQLCAAEVLGPKRVLSFRHIREQE (0) 3899
4362
MASRVERIRAVGPSVPVDVSALFYDMAISIVSCASFGKKQRNADEYLSAIKTGISLASGF 4541
4542
KIPDLFPTWRTVLAAVTGMRRALENVHRIVDSTLEEVIEERRGAARECKGRLDMEDNEEN 4721
4722
LVDVLIKLHEQGGHLSRNSIKSVIFDMFTAGTGTLASSLNWGMSELMRNPRVMTKLQGEI 4901
4902
REAFHGKATVGEGDIQVSNLPYLRLFIKETLRLHPPVPLLVPRESIDMCEVNGYTIPARS 5081
5082 RIVVNAWAIGRDPKYWDDPEEFKPERFEGNKVDFAGTSYEYLPFGAGRRICPGITYALPV 5261
5262
LEIALVQLLYHFNWSLPKGVTEVDMEEEPGLGARRMTPLLLFATPFVVPLL* 5417
>CYP71W1
537
MELTTLLLLALISFFFLVELIARYASPSGRESALRLPPGPSQLPLIGSLHHLLLSRYGDL 716
717
PHRAMRELSLTYGPLMLLRLGAVPTLVVSSAEAAAEVMRAHDAAFAGRHLSATIDILSCG 896
897
GKDIIFGPYTERWRELRKVCALELFNHRRVLSFRPVREDEVGRLLRSVSAASAEGGAACF 1076
1077
NLSERICRMTNDSVVRAAFGARCDHRDEFLHELDKAVRLTGGINLADLYPSSRLVRRLSA 1256
1257
ATRDMARCQRNIYRIAESIIRDRDGAPPPERDEEDLLSVLLRLQRSGGLKFALTTEIIST 1436
1437 VIF 1445
1591 DIFSAGSETSSTTLDWTMSELMKNPRILRKAQSEVRETFKGQDKLTEDDVAKLSYLQLVI 1770
1771
KETLRLHPPAPLLIPRECRETCQVMGYDVPKGTKVFVNVWKIGREGEYWGDGEIFRPERF 1950
1951
ENSTVDFRGADFEFIPFGAGRRMCPGIALGLANMELALASLLYHFDWELPDGIKSEELDM 2130
2131 TEVFGITVRRKSKLWLHAIPRVPYISTY* 2217
>CYP71W2P
2543 ARRVQSPRHVR*D*QAGYLVHAVVGECAPGGAGAVVPISEKISRMVNDSVVRPAIGSRCA
2364
2363
RRDEFLHVQARGLRQARGRVQLGRPVPIVVASELAQRRAAVGRPSVAAGAFARCGRPAET 2184
2183
FFNMDNLRTHDTYRKKNHSGNSQHCTAFSALSFSELQLKMTIWQSHHYKLPINLREIFS
2006 SAGSETLNDTLVGNI*ANEKYPQVMQKAQTEVREKFRG*DKLIKDDMNRLSYLHL
1846
1845 VIQETLRLH
>CYP71W3
80150
MEVSLPLLIGVVLAFLLLFVLVNIKNSCRSWWPPPEKEKKKLRLPPGPWQLPLVGSLHHV 80329
80330
LLSRHADLPHRALRELAGKYGPLMMLRFGAVPTLVVSSAEAAREVLKTYDAAFASRYL 80503
80504
TPTLAVLSRGGRDILFSPYCDLWRQLRRICVHELLSARRVQSLRHGREDEAARLVRSVAA 80683
80684
ECAARGGAAVVNVGELISRAVNDSVVRSAVGARSARRDEFVRELDESVRLSGGFNLADLY 80863
80864
PSSWLARRLSGAMRETERCNRSLMAIMDDIIREHGDGEEDLLGVLLRLQRNGDVQCPLTT 81043
81044 DLITNVVL (0) 81067
81865
DMFAAGSETSSTTLEWALTELVRNPHIMEKAQSEVREIFRGENKLTEEMMDKLSYLRLVI 82044
82045 RETLRLHLPVPFLLPRQCREPCSVMGYDIPVGTKVLVNAWAIARDNQYWDDPEVFKPERF
82224
82225
ENNRVDFKGIDFEFIPFGAGRRICPGIALGLANIELMLASLLYHFDWEFLDRDRNDEIDL 82404
82405 SETFGITAKRKSKLMVYATQRIPCLG 82482
>CYP71W4
103769 MEMTLPLLGAAVVLAAFLLFFLVKNNRCCWSPAAERR
103880 LRLPPGPWRLPLVGSLHHVLLSRHGDLPHRALRELAGRYGALMLLRFGAVPTLVVSSAE
104056
104057 AAREVLKTHDACFASRHMTPTLAVFTRGGRDILF
SPYGDLWRQLRRICVLELFSARRVQS 104236
104237
LRHVREDEAARLVRAVAEECAIGGGGGAVVPIGDMMSRMVNDSVVRSAIGGRCARRDEFL 104416
104417
RELEVSVRLTGGFNLADLYPSSSLARWLSGALRETEQCNRRVRAIMDDIIRERAAGKDDG 104596
104597 DGEDDLLGVLLRLQKNGGVQCPLTTDMIATVIM (0) 104695
105190
EIFSAGSETASTTLEWAISELVRNPKVMDKAQSEVRKLFEGQDNLTEDDMSRLS 105351
105352
YLHLVIRETLRLHAPAPFLLPRECREQCNVMGYDITEGTRVLVNAWAIARDTRYWEDPEI 105531
105532 FKPERFNANLVDFKGNDFEYIPFGSGRRVCPGITLGLTSMELVLASLLYHFDWELPGGKR
105711
105712 CEEIDMSEAFGITVRRKSKLVLHATPRVPCLH* 105810
>CYP71W5P
51119 LRRICVLELFSAHRV*SLHHVREEEAAPLVRVVADIRSPLGP 50994
>CYP71W6P
64520 LRRICMLELFSAHRV*SLHHVREEEAARLVRVVA 64419
>CYP71X1P
42530 MDHVLACVGILVAFTPLFLLAVLPLKLTNGGDGVKLPPGPWRLPVIGSMHHLMGESLVHRAMAD
42721
42722
LARRLDAPLMYLKLGEVPVVLASSPCAAREIMRAHDVAFASRPLSPTVRRMR 42877
42878 PPPPRRRQLRKICVVELLSARRVRTFRRVREEEVARLVGALVCLAHVA
43021 gap
AMIGARFERRDEFLE
missing mid region
43862 DLFSGGSETSATVIQWAMSELMKNPRVMRKVQAELRDKLAGKPRVTEDDLSDLKYLKL
44035
44036 VIKETLRLHPAAPLLVPRQCREPCKIMGYDIPKGTTVFVNAW 44161
frameshift
44161
AIGRDPNYWDDAEVFRLERFANSTIDFKGMDMEFIPFGAGRRMCSGLAFAEAIIDLLFS 44337
44338
TLLFHFDWELPCGMTASELDMIEEMALTVRRKNDLHLRPILRVPQTQTSSALLF* 44502
>CYP71X2
38238 MYDAVACVVAVVVVVVFAMLWVKLARSGDGGGGGSGGVRLPPGPWRLPVIGSLHHVVGDRLLHRSMA
38438
38439
RIARRLGDAPLVYLQLGEVPVVVASSPGAAREVTRTHDLAFADRALNPTARRLRPGGAGV 38618
38619
ALAPYGALWRQLRKICVVELLSARRVRSFRRVREEEAGRLVGALAAAAASPGEEA 38783
38784 AVNFTERIAEAVSDAALRAMIGDRFERRDEFLQELTEQMKLLGGFSLDDLFPSSWLASAI
38963
38964
GGRARRAEANSRKLYELMDCAIRQHQQQRAEAAVVDGGAGVEDDKNQDLIDVLLNIQKQG 39143
39144 ELETPLTMEQIKAVIL 39191 (0)
39428
DLFSGGSETSATTLQWAMSELIKNPMVMQKTQAELRDKLRRKPTVTEDDLSGLKYVKL 39601
39602 IIKETLRLHPVVPLLVARECRESCKVMGYDVPKGTTVFVNAWAIGRDPKYWDDAEEFRPE
39781
39782
RFEHSTVDFKGIDLEFIPFGAGRRICPGMAFAEAIMELLLAALLYHFDWELPNGMAASE 39958
39959 LDMTEEMGITVRRKNDLHLRPHPPCVVRSNFRSFVERERERHFV* 40093
>CYP71X3
33613 MEQVSCFAAAAAAVLVVLSLARMLLAPRREWD
33709 GLNLPPSPSRLPFIGSFHLLRRSPLVHRALADVVRQLGAPPLMYMEIGEVPAIVVSCADA
33888
33889
AREIMKTHDINFASRPWPPTVQKLRAQGKGIFFEPYGALWRQLRKICIVKLLSVRRVSSF 34068
34069 HGVREEEAGRLVAAVAATPPGQAVNLTERIKVAIADTTMRPMIGERFERRE
34221
34222 DFLEVLPEIVKLASGFSLDDLFPSS 34296 check joint
GSPAPSAARGEAVNRASYELVDSAFRQRQQQKEAMAAPPPDIAKEE
EDDLMDELIRIHKEGSLEVPLTAGNLKAVIL
34528 (0)
34777
ELFCAGSETSSNAIQWAMSELVRNPKVMEKAQNEVRSILKGKPTVTEADMVDLTY 34941
34942
VKMIVKETHRLHPVLPLLTPRVCQQTCQIMGYDVPQGSVIFIKSWAIMRDPKHWDDAETF 35121
35122
KPERFEDSEIDLKGTNYEFTPYGAGRRICPGLALAQVSIEFILTMLLYHFNWELPNGAA 35298
35299 PEELDMTEDMGLTIRRKNDLYLLPTLRVPLTA* 35397
>CYP71X4
27391
MEQVSCFAAAAAVVVVVLLLARMLLAPRGEWDGLNLPPSPPRLPFIGSFHLLRRSPLVHRALADVARQL 27597
27598
GSPPLMYMRIGELPAIVVSSADAAREVMKTHDIKFASRPWPPTIRKLRAQGKGIFFEPYG 27777
27778 ALWRQLRKICIVKLLSVRRVSSFHGVREEEAGRLVAAVAATPPGQAVNLTE
27930
27931
RIEVVIADTTMRPMIGERFERREDFLELLPEIVKIASGFSLDDLFPSSWLACAIGGSQRR 28110
28111
GEASHRTSYELVDSAFRQRQQQREAMAASPPDIAKEEEDDLMDELIRIHKEGSLEVPLTA 28290
28291 GNLKAVIL 28314 (0)
28577 DLFGAGSETSSDALQWAMSELMRNPRVMEKAQNEVQSILKGKPSVTEADVANLKY
28741
28742
LKMIVKETHRLHPVLPLLIPRECQQTCQIMGYDVPQGSVIFINSWAIMRDPKHWDDAETF 28921
28922
KPERFEDGEIDLKGTNYEFTPFGAGRRICPGLALAQASIEFMLATLLYHFDWELPNRAA 29098
29099 PEELDMTEEMGITIRRKKDLYLLPTLRVPLTA* 29197
>CYP71X5
16507 MEKVAWCACFLLLALMVVRLTAKRRGDNGAERLPPGPWRLPLVGNLHQVMARGPLVHRTMADLARRLD
16710
16711
APLMSLRLGEVPVVVASSADAAREIMRTHDVAFATRPWNPTTRRLRCDGEGVVFATYGAL 16890
16891 WRQLRKLCVVELLGARRVRSFRRVREEEARRLVAAVAASPRGEAVNVSERI
17043
17044
TAVITDATMRAMIGDRFGRRDEFLELLADIVKIGSGFSLDDLFPSWRLAGAIGGMARRAE 17223
17224 ANHRRTYELMDSVFQQHEQRRVHVAAPADGAMDDAEEDLVDVLFRIQKDGGLEVPLTIGN
17403
17404 IKAIIL 17421 (0)
17795
DLFNAGSETSANTLQWVMSELMRNPKVMRKAQAELRNNLQGKTTVTEDDLTNLKYLKL 17968
17969
VIKETLRLHPVLPLLLPRECREACNVIGYDVPKYTTVFINV*AINRDPKYWDMAEMFKPE 18148
18149 RFDNSMIDFKGTDFEFVPFGAGRRICPGIAFAQSNMELVLATLLYYFDWELPSGMSPEE
18325
18326 LDMTEDMGLSVRRKNDLYLHPTVCVPL* 18409
>CYP71X6P indica also has the stop codon, but not the
frameshift, EST CI150239 has both
11020
MEKVAWCACFLFLALMVVRLRTKRRGDNNGGVKLPPGPWRLPLVGNLHQVMARGPLVHRTMADLA 11214
11215
RRLDAPLMSLRLGEVPVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 11394
11395 YGALWRQLRKIAMVELLSARRVQSFRRIREDEVGRLVAAVAAAQPGEAVNV
11547
11548
SERIAALVSDAAVRTIIGDRFERRDEFLEGLAEGIKITSGFSLGDLFPSSRLASFIGGTT 11727
11728 RRAEANHRKNFELMECALKQHEEKRAAAAAAAAGAVEDDEDIVDVLLRIQKEGSLQVPLT
11907
11908 MGNIKAVVL 11934 (0)
12556
DLFGAGSETSANTLQ*AMAELIMNPRTMLKAQAELRDALQGKQIVSEYDLVKLKYLKL 12729
12730 IIKETLRLHPVVPLLLPRE 12786 frameshift
CQETCKVMDYDVPIGTIVLVNMWVIGRDPKYWEDAKTFRPERFEDGHIDFKGMNF 12955
12956 EYLPFGAGRRMCPGVAFAEAIMELALASLLYHFDWEFPDGISPAKMDMMEVMGSTVRKKN
13135
13136 DLYLVPNAHVPVAP* 13180
>CYP71X7
7359
MADLELEKVASFLLAALLPLVLFKLAAAKRGGGDGGMRLPPGPWRLPVIGNLHQIMAGGQ 7538
7539
LVHRTMAGLARGLGDAPLLSLRLGEVPIVVASSADAAREIMSRHDAKFATRPWSPTVRVQ 7718
7719 MVDGEGLAFAPYGALWRQLRKITMVELLSPRRVRSFRRVREEEVGRLVVAVATAA
7883
7884
TRRPGEAAVNVGERLTVLITDIAVRTIIGDRFERREDFLDAAAEWVKIMSGFSLGDLFPS 8063
8064
SRLASFVSGTVRRAEANHRKNFELMDYALKQHEEKRAAAAAAGAGAVEDDEDIVDVLLRI 8243
8244 QKEGGLEVPLTMGVIKGVIR 8303 (0)
8551 DLFGAGSETSANTLQWTMSELVRNPRVMQKAQTELRDCLRGKQSVSEDDLIGLKY
8715
8716
LKLVIKETLRLHPVVPLLLPRECQETCNIMGYDVPKGTNVLVNVWAICRDPRHWENAETF 8895
8896
IPERFEDSTVDFKGTDFEFIPFGAGRRMCPGLAFAQVSMELALASLLYHFDWELPSGVA 9072
9073 PSNLDMEEEMGITIRRKNDLYLVPKVRVPL* 9165
>CYP71X8
95316 MAMVQDITGYLCLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRLPVIGNLHQVAMGGPLVHRTMADLA95101
95100
RRLDAPLMSLRLGELRVVVASSANAAREITKTHDVAFATRPWTSTIRVLMSDGVGLVFAP 94921
94920 YGALWRQLRKIAVVELLSARRVQSFRRIREDEVGRLVAAVAAAPAAQPVNV
94768
94767
SERIAALISDSAVRTIIGDRFERRDEFLEGLAEAIKITSGFSLGDLFPSSRLASFVGGTT 94588
94587
RRAEANHRKNFELIECALRQHEERRAAGAVDDDEDLVDVLLRVQKDGSLQMPLTMGNIKA 94408
94407 VVL 94399 (0)
93945
ELFGAGSETSANTLQWAMTELIMNPRVMLKAQAELSNVIKGKQTISEDDLVELKYLKL 93772
93771
IIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTVLVNVWAIGRDPKYWEDAETFIPE 93592
93591 RFEDGHIDFKGTNFEFIPFGAGRRMCPGMAFAEVIMELALASLLYHFDWELPDGISPTK
93415
93414 VDMMEELGATIRRKNDLYLIPTVRVPLSTVL* 93319
>CYP71X9P 101503
RVVASSTDAACREFTKTHDVKFATRPWSSTVRVLMADGLG 101393
>CYP71X10
109813
MAMVQYVTGYLCLLSLALLLLTLVLHKVARKATGNGAGKPRLPPGPWRL
PVIGNLHQVAMGGPLVHRTMADLA
109595
109594
RRHDAPLMSLRLGELRVVVASSADAAREITKTHDVAFATRPWSSTIRVMMSDGVGLVFAP 109415
109414
YGALWRQLRKIAVVELLSARRVQSFRRIREDEVCRLVAAVAAAQPGEAVNV 109262
109261
SERITALISDSAVRTIMGDRFEKRDEFLEGLAEGDRIASGFSLGDLFPSSRLASFVGGTT 109082
109081
RRAEANHRKNFGLIECALRQHEERRAAGAVDDDEDLVDVLLRVQKEGSLQVPLTMGNIKAVIL 108893 (0)
107479
ELFGAGSETSASTLHWAMTELIMNPKVMLKAQDELSNVIKGKQTISEDDLVELRYLKL 107306
107305
VIKETLRLHPVVPLLLPRECRETCEVMGYDIPIGTTMLVNVWAIGRDPKYWEDAETFRPE 107126
107125
RFEDGHIDFKGTDFEFIPFGAGRRMCPGMAFAEAIMELVLASLLYHFDWELPDGISPTK 106949
106948
VDMMEELGATIRKKNDLYLVPTVRVPMSTAL* 106853
>CYP71X11
123850
MEEPTSCYGYYHYLALAVAVLVLVRVTRTRGGGSDGVRLPPGPWRLPVIGSLHHLAGKPLVHRALAD 124050
124051
LARRMDAPLMYLRLGEVPVVVATSPGAAREVMRTHDVAFATRPVSPTVRIMTADGEGLVF 124230
124231 APYGALWRQLRRIAILELLSARRVQSFRRVREEEAARLAAAVAAAAPHGEAAV
124389
124390
NVSERIAVLIADSAVRAMIGDRFKKRDEFLEALAEGLKLVSGFSLADLFPSSWLASFVTG 124569
124570
AARRAQENHRKNFELMDRAIEQHQERRAAAAAASGDVVEDDDLVDVLLRIQKGGGLDVPL 124749
124750
TMGIIKAVIL 124779 (0)
124917
DLFSAGSETSATTIQW AMSELMRNPRVMKRAQAELRDNLQGKPKVTEEDLTDLNYLKL 125090
125091
IIKETLRLHLPAPLLLPRESRESCKIFGYDVPKGTTVLVNAWAIGRDPKYWDDPEEFKPE 125270
125271
RFEDSKIDFKGLDFEFLPFGSGRRMCPGIMFAQPNIELALATLLYHFDWSLPAGVKPSE 125447
125448
LDMTEEMGITVRRKNDLYLHAVVRVPLHATTP* 125546
>CYP71X12
127153
MAQDVAEYLSIFLALVVAPLLLLRVARRARGNGAGRPRLPPGPWRLPVIGSLHHLMGKPHVHRAMA 127350
127351
DLARRHGAPLMYLRLGEVPFVVASSPDAAREVLRAQDANFASRPWSPTLRVMMADGEGLA 127530
127531
FARHGAHWRLRKICVLELLGPRRVRSFRRVREEEVARLLAAVAAAAAAGADAV 127689
127690
VNVSERAAVLVTDTX 127731
127734
VRAMIGDRFEMRDEYLEGVAEVGKLLLGLSLGDLFPSSRLASLVSGTARRAAASHRKMFE 127913
127914
LMDCAIRHHQERKAAMDADEDILDVLLRIQKEGGHDAPLTMGDVKDTIL 128060 (0)
128189
DLFAAGTETSTATLQWAMSEVVRNPRIMQKAQAELRNKLQGKPSVTEDDLVGLTYLKL 128362
128363
VIKETLRLHPAAPMLVPRECGESCKVLGYDVPRGTNVLINAWAIGRDPNYWDDTETFKPD 128542
128543
RCENNKYNFRGTDFEYIPFGSRRKICPGPAFTHAILELALAALLYHFDWELPCGVAPGE 128719
128720
VDMAEETGVVVRPKNDLYLRPVVRVPPGAASSGNGGT* 128833
>CYP71X13P
146254
MDQVACWSICAFLALLLLVRIGGKRGRGGDGARLRQPPPGPWRLPVIGNLHQLMLRGP 146427
146428
LVHRTMADLARGLDDAPLMRLQLGGVPVVVASSPDAAREVTCTHDAAFASRPWPPTVRRL 146607
146608
RPHREGVVFAPYGAMWRQLRKVCIVEMLSARRVRSFRRVREEEAANLAAAVAASLSSPPA 146787
146788
RRDAVNVSALVAAAVADATMRVVIGDRLERREEFLESMTEAVRSFTGFSLDDLFP 146952
146953
SSRIAAAVGGMTRRAEASHRKGNELIESAIRQHEQVRDAMAAQGGGGAMEEDLLDTLL 147126
147127
RIQKEGALDMPLTMDNIKAVI 147189
147557
DIFGAGSDTSSNIIQW
147612
RNTLQGKHPVKEDDLVNIKYLKLIIKETLRLHPVVPLLLPRECLHACKVMGYDVPKGTTV 147791
147792
FVNIWAINRDPKHWDDPEVFKPERFDDGKIDFKGANFEYIPFGAGRRSCPGVTFGHATVE 147971
147972 LMLATLLYHFKWELLEGVAPNELDMTEEIGINVGRKNPLWLCPIVRVPLQ*
148124
>CYP71X14
142591
MEQVAYWFICACAFLALVLLVRLGAARRDVVRLPPGPWRLPVVGNLHQLMLRGPLVHRTM 142770
142771
ADLARGLDDAPLMRLQLGGVPVVVASSADAAREVTRTHDLDFASRPWPPTVRRLRPHREG 142950
142951
VVFAPYGAMWRQLRKVCVVEMLSARRVRSFRRVREEEAARLVASIASSSSSSPTGHDGGA 143130
143131
APAVNVSAPIAAAVADATMRAVIGDRFERREEFLESITEAVRSFTGFSLDDLFPSSRLAA 143310
143311
AVGGMTRRAEASIRKGHQLMDSAFRQHQQLRDAMAAQPHLDDCAMEEDLLDTLLRIQKED 143490
143491
NLDVPLTTGNIKAVLL (0)
143668
DIFGAGSDTSSHMVQWVLSELMRNPEAMHKAQIELRSTLQGKQMVSEDDLASLTYLKLVIK 143850
143851
ETLRLHPVVPLLLPRECRQTCKVMGYDVPQGTTVFVNVWAINRDPRHWDEPEVFKPERFH 144030
144031
SGKIDFKGANFEYIPFGAGRRICPGMTFGHATVELMLAMLLYHFDWELPKGVAPNELDMT 144210
144211
EEMGITVGRKNALYLHPIVRVPLEQATMS 144297
>CYP71Y1P AACV01014656.1 also has this boundary. ESTs CI595924.1, CI597178.1 also have it.
139036
MEDATHGYVYVGLALVSLFVVLLARRRRSPPPAAHGDGGLRLPPGPWTLPIIGSLHHLVGQIP 139224
139225
HRAMRDLARRHGPVMLLRIGEVPTLVVSSRDAAREVTKTHDTAFAMRPLSATLRVLTN 139398
139399
GGRDLVFAPYGDYWRQVRKIAVTELLTARRVHSFRSIREEEVAALLRAVAVA 139554
139555
AGTVEMRAALSALVSDITARTVFDNRCKDRGEF
LVLLERTIEFAGGFNPADLWPS 139719 (?) bad exon boundary
140222
SRLAGRLSSVVRRAEECRNSVYKILDGIIQEHQERTSAGGEDLVDVLLRIQKEGG 140386
140387
LQFPLAMDDIKSIIF 140428 (0)
DIFSAGSETSATTLAWAMAELIRNPTAMHKVMAEVRRAFAAAGAVSEDALGE 140655
140656
LRYLQLVIRETLRLHPPLPLLLPRECREPCRVLGYDVTRGTQVLVNAWAIGLDERYWPGG 140835
140836
SPEEFRPERFEDGEATAAVDFRGTDFEFLPFGAGRRMCPGMAFGLANVELPLASLLFHFD 141015
141016
WEVPGLADPAKLDMTEAFGITARRKADLHLRPCLLVSVPGV* 141141
>CYP71Y2P
136961
VSEDALGELRYLQLVIRETLRLHPPLPLLLPRECTIGR 137074
137075 DERYWPGGSPEEFRPERFDDGEATAAVDFRGADFELLPFGGGRR