Below is a P450 collection from Phytophthora ramorum
P450s found in the genome BLAST server at JGI
D. Nelson July 27, 2004
A tree of 65 Stramenopile or Chromista P450 sequences
>CYP5014B1
scaffold 20 seq d minus strand 46% to seq 20 a
MSSVDPTSALL
YAASCSLALLVGAKVLFPEPKRAARHPDSLPLLGETWAALKYADEYYDWEAAMTEKMEGR
PWLFDVVGRPSEFVIGKPEIIEDVLRTHAESFGKGEYVHEVLSGLLGDGIIAVDGHKWAR
QRKTASNLFSLRELRESMATVVQDNVLTLNGIFQHAMDRGESLDLFQLLNRFTFEVIAEI
AFGIKFGGLATGSKHPLEAAFNCAQQRMFMRFLEPTWWWKLQRWLNVGPEGAFKKQVQII
DETCYSIISRSMKERKAKRSSHDADTLEGSASTQRQKSNIISLFLDGVSDDEAKAGDGLD
PKFLRDIVVTFMTAGRDTTASALSWFFYTLSQHPQTEEKIRQEMASKLPELANGAVSSPS
MLQANELVYLEAALKETLRLYPAVPTNIREALEDVVLCDGTVVKAGETVSWSTYALGRMP
HVWGEDAKEFKPERWVDADGKLIAVSPFKYPLFSAGPRVCLGGKLALMEMKITAASVLSK
YHFTIVPGQNVTYRIGLTFGMKNGLHVKVSEAAPAI*
&&&&&&&&&&&&&&&&&
>CYP5014C1
scaffold 20 seq c plus strand 54% to scaf 20 b +
MLSIASLKLEHPLYHALTVTSIVLLPIVFQL
SRGSSSSSTDDEERADSELERRDAD
RPPWTLPVLHNTLNWILGGDGIHEWITRNCERFKG
RPFTVKALGLPEMLVVSTPEAFEDVLKNQFMNFPKGPHVKENLQDLLGDGIFAADGVKWA
HQRDVARGLFRMRELRDCMTEAITRHTKALHDVLGKVCARNRSVDLHKLLSCFSTEAFAE
ISFGMKMGCLRANKELPFQAAFDSAQRLTAQRFVRPRWFWKMQRRLGLGAEDQLQLDIKE
IDAAVLNIVQRVLSQRALVPDDGAPKSTNMLSLFLDTIAKSPKAEEQLYDPAYLRDVVVN
FLVAGRDTTAQALSWFFYNVSQNPHVEAKLRREIYKKLPELVNSEVCVPTLQQVNRLVYL
EAVMKETLRLYPPVPMSPKYAVRDAMLSDGTFVAAGSMVCLPMYAMGRMPHVWGPDAAEF
NPERWIDPATKKIVSVSAFKFVAFNAGPRMCLGTTLAGLELKLVAASLLSRFHIHVENPE
DVTHEFSLTLPVKGPMNVRLARVQAAVA*
>scaffold 86 seq a 86112 = cyan
KIADES
MKKNRPQNSDVEAAIRDEIAEKL
PKGGRNATTSATATMQDVSQLVYLEAALKETLRLHPPVPMAPKYVVEDTTLSDGTFVKGK
SMIVLATYVMARMQEVWGPDAEVFKPERWIDPTTGKLIPVSAYKFASFNAGPRMCLGMNL
AMLEMKLVVAGLLSKFHVEVLNPEDVTYELSLTLPVKGPLNVKVSPAKLPTSPDFA*
>scaffold 86 seq b 76832 = cyan
86113 starts at MGCL = scaf 20 c +
MRELRDCMTEAITRHTKALHDVLGKVCARNRSVDLHKLLSCFSTEA
FAEISFGMKMGCLRANKELPFQAAFDSAQRLTAQRFVRPRWFWKMQRRLGLGAEDQLQLD
IKEIDAAVLNIVQRVLSQRALVPDDGAPKSTNMLSLFLDTIAKSPKAEEQLYDPAYLRDV
VVNFLVAGRDTTAQALSWFFYNVSQNPHVEAKLRREIYKKLPELVNSEVCVPTLQQVNRL
VYLEAVMKETLRLYPPVPMSPKYAVRDAMLSDGTFVAAGSMVCLPMYAMGRMPHVWGPDA
AEFNPERWIDPATKKIVSVSAFKFVAFNAGPRMCLGTTLAGLELKLVAASLLSRFHIHVE
NPEDVTHAFSLTLPVKGPMNVRLARVQAAVA*
>scaf_2029
C-term frag = scaf 86 b/ scaffold 20 c
PMYAMGRMPHVWGPDAAEFNPERWIDPATKKIVSVSAFKFVAFNAGPRMCLGTTLAGLEL
KLVAASLLSRFHIHVENPEDVTHAFSLTLPVKGPMNVRLARVQAAVA*
&&&&&&&&&&&&&&&&&&&
>CYP5014D1
scaffold 20 seq e minus strand 58% to seq 20 a
MTDKLSSSVAVAALSGLVTLPLAWYLLSTA
HGEKQLGTRKVVRPSTTKPLIGNTLDILYNLPIRHDWITSLCEEAKGEPVLLQSLGTPDM
TLLSTPGAFEDVFKNQFDNFPKGPKKSEYLRELLGEGIFAVDNEKWYRQRKTASNLFTMR
ALRDSMTSTIQRHLVVLERIFRRAAETNASIDMFRLLNRFTMEAFTEIGFGVEMNCLDSD
QEHPFQTAFDRSQQSLALRFVRPSWFWKTQRMLGLGPEGQLQQDMKVINSTICDIVAKTL
QNSARGAPKPDDKAAMDIVSLFLDDLNKSSDVDANCFDPTYLRDIVVNFIIAGRDTTAQA
LSWFFYCLSQNKQVETKIREELLAKLPDLFNGQCSPSMDAVGELTYVEAALRETLRLYPS
VPIVSKQAVQDAVLSDGTFIAAGAMAGLPMYALGRMPHVWGPDAADFKPERWVDAQTGKL
ISVSAYQFVAFNAGPRLCLGKNLAMLEMKLIVASLLSKYHVELETPKTVTYAISFTLPVK
GQLNAKISAV*
>CYP5014D2
scaffold 20 seq b plus strand (has one intron) 76831 = cyan
very similar to scaf 86 seq a
MLKWISLFDHSGVAPVHPLLIADRFDVDLVSIDLVRIQ
(0?)
RKVHRPDSTLPLLENTLDILQAARDGDIHDRTVRACRGFKGEPVLIRSIGLPDKLVVSTPEAFEDVLKTQFNNFP
KGSYMCENLRDLLGDGIFVADGDQWVHQRKTASNLFTMRALRDSMTVVIQRHAVVLYDIL
QRASDNKETLDLFRLFNRFTIEAFAEIGFGVHMGCLDSDEEHPFQKAFDRAQRALLLRFV
RPGWFWKTQQWLSVGAEGRLKQDIEVINATVLEIVEKALAKRSTVSNDEIGDEDKNIVTL
FLDSVGGFASADSQQPDPMHLRDIVVNFLIAGRDTTAQTLSWFFLNLTKNSDVEAAIRDE
IAEKLPKGGRNATTSATATMQDVSQLVYLEAALKETLRLHPPVPMAPKYVVEDTTLSDGT
FVKGKSMIVLATYVMARMQEVWGPDAEVFKPERWIDAATGKLVNVSAYKFASFNAGPRLC
LGVNLAMLEMKLVVAGLLSKFHVEVLNPEDVTYELSLTLPVKGPLNVKVSPAKLPTSPDFA*
>CYP5014D3
scaffold 20 seq a 70% to scaffold 20 seq b plus strand
MLPIAELVENSSVTIGGLALLALALSFWHKMRVDEKS
KVHRPDSTLPFLENTLDLIIHGGGKGDLHDFTTQMAKQFDAE
PVFLRALGIPLNIMLFTPEAFEDVLKTQFHNFEKGPFVCENLKDLLGEGIFAVDGDQWVH
QRKTASNLFTMRALRDSMTVVIQRHAVVLYDILQRASDNKETLDLFRLFNRFTIEAFAEI
GFGVHMGCLDSDEEHPFQKAFDRAQRALLLRFTRPGWFWKTQRWLGVGVEGQLRRDIQVI
DKTVLDIVEKALAQRAQRVTHGDGGYDNGDNKVKKVGNIVSLFMDNLANSQQFDPKYLRD
IVVNFLVAGRDTTAQALSWFFVDLSKNPHVEVAIRKEMAEKLQPAEVNRPGAVSSTMEDV
SHRVYLEAALKEALRLHPSVPVVPKQAVQDTMLSDGTFVPAGSAIGLANYAMARMPQVWG
PDAEEFKPERWIDPTTGKLIPVSAYKFASFNAGPRMCLGMNLAMLEMKLVVAGLLSKFHV
EVLNPEDVTYDLSLTLPVKGALNVKVSSIGRSASPAYA*
>CYP5014E1
scaffold 36 plus strand 45% to scaf 6 +
MEPWSHGAKAVATVCRPADIRYRANLVADPPDLKALSRTFRPGHKSIRVV
MRTADVVACIQILWPTFPPWASVLVECLVISDRNEGGGGGGINTISYPNSTSLGQVGFKLSRLISLT
MLLPLQLMGDARAPVGLLACGLLVS
AVAVVRVRSHTQQKPHDTATRVPYLPTLVPLLGNTLELATNVGRFHEWVTGHSQQRDGKP
FALRLLGKNDLVFIARPEHFEEVLKTQSRNFCKSDTTREVFDDFLGEDIVLLNGKRWQFH
RKILASLITTRALREYMTPIIQENILRLQRTLKQWSETKGSVDFHKLVRHFTIDTFAEIG
FGCKLDVLASGEEHPFEVAFNDANRISSERITSPTWIWKLKRWLNIGDERRLREAIEEMN
ELMMKLIFTAFGLLQAGADDNQQSAHQNLISIIVSSEREITPTEVRDIALSGLEAGRNTT
ADTLSWLFHALSHNPRVETTLRAEIAAKLPKLEESDSYVPSFEEIQEVPYLEATIREVLR
LYPTVPTVPYHCVNDTVLADNTFIPAGTDVFLTLYAAGRLASVWGHNATEFSPERFIDEK
TGKLCQDSPYSAFSDGPRVCLGRNLAMLQLKIVAATIISRFHLSEDPEQDVQPILDLTIG
MKDPLMMRVETTQQEEAASFS*
>CYP5014F1
scaffold 36 seq a 62% to scaffold 36 seq b
MLQSMFKSPVVPSLVTAAVLVTLYWTKSAKVSAKIKGDKVKSA
VILPSTLPVLGNAVELASNAARMHDWLADQFATTDGEAFIVRLPGKDDMMFIAKPEHLEA
VLKTQFDVFPKSEYIHDVFCDMLGDGIVVTNGETWKRQRNVVVGLFSARALREHMTPLVQ
KYTVQLVDILADAAASNAPLDVFDLLHRYTFDVFGEIGFGARMGSMKGAFQPFAEAMDEA
QFLAGKRFKQPMWYWKLRRWLNVGDEKKLRENVRVIDEHLMGIIADAIERRRIRVEQKKE
GRAAALADKDIVSIVLDTMDTKGLPVNPVEVRNIALASIIAGRDTTADCMGWLFHLLSQN
PRAEAKLREEVLAKIPQLSADKHYTPTVEDINKVPYLEACIRELLRLYPPAPLITTHCIK
DTVFPDGTFVPAKTDIGIALFSAGRLTSVWGEDALEFKPERFLDTETGEVISMTATKFSA
FSAGPRICVGQNLAFIETKIVIASIVSRFRMTPEPGQNVAYTEGISLGMMDPLMMRLESVN*
>CYP5014F2
scaffold 36 seq b 62% to scaffold 36 seq a
MLQSFFEDKLSFLGPVVPGLI
AAAVVVAVYYSTSSTHVSPIGDNEVCDGKVKPKRVRYLPSKIPVLGNAIELLSNAERMHD
WIADQIVPFDGEPFTLSLPGKDDMMFIAKPEHIEQVLKTQFDNFPKSQHIHDLFFDLLGD
GVVITNGETWKRQRRVLVNLFSARALREHMTPISQKYVVQLRKIFEEAAASKEPMDAFGL
MHRYTLDVFAEIGFGTEMKLLEGRYQPFAEAIEESQYIVSARFKQPDALWKIKRWLNIGS
EKKLRHAVQVIDEHVMGIISGAIQRRQQRVEAARSGKEAKPVDRDIVSIILDSMESNNQV
VDPVEVRNIAAAALIAGRDTTADALGWLIHVLSENPRVEAKLRSELLAHLPKLATDIDYV
PSAEELSQVHYLEATIRELLRILPAGPAIATHCVRDTVFPDGTFVPKNTDIGISFYTTGR
LSGVWGEDVSEFKPERFLDADTGEVVKVSSSKFCAFSAGPRICVGRNLAFLEMKIVIANI
LGRFHLVPEPGQKPAYTQGITLGMQTPLMMRVEAVSSQDSSVAA*
>CYP5014G1
scaffold 86 49% to scaf 6 minus
MLLWFLDSDDPLKKFGLLMLGVLIGAAVASQCLPSDTTDKKLIKQQQSVTPGAHGPVRVPFL
PSMIPVVGHAVLMAYHAGRFLDWVTDVFVSRGGAPFTLRLPFQRDMILTANPEHYEHVMK
IQYDNFLKGDHIYDLLVDLLGDSILIVEGDEWKFHRRVFVNLFSSRALREHMAPIIQKHV
RVLQGVLSNAAQSKQAIDFFTYSGRFTLDAFGEIAFGFNMSTLTLQHEHPFERAFVDAQH
ITAARLVVPTWYWKLKRWLNVGSEKNLREALTTVDQFVMDVISKTMDKRNAPPNDAEDKT
RNRDIVSLILANETVDGKPVDPILVRNVVLMALIAGRDTAADAIAWLFHLLTLNPRVEAK
LRADLLARLPKLGTDLDYVPDSQDVQGLAYLEATICEALRLFSPVGLAQKLCVHDTVFPD
GTFVPKGANIALVYHAMARMPSVWGPDAASFVPERFLDPETGDLLKVSSGKFSAFNTGPR
VCVGRKLAMLEMKMVVACVVARFHLDEVPGQDVACSGGITIGMKNPLLMRVNELLPAAKS
DEVAVGVAA*
>CYP5014H1
scaffold 6 minus strand 49% to scaf 6 plus strand
MLRQWLVKHRVLSPLGPAGLVLLAGAAAVAAYISTRSSGVEVKEFDLKEEE
STDKPRVVPYLPSKVPLLGNMLELAGNAHRFHSWMAEQCVIHNGVFKLYLPGQSDMLVTA
VPEHYEHVVKTQFEHFSKGKQQYDMFVDLMGHSVLIIEGERWKYHRRLLVRLFSARALRE
HMTPVIQRHTRMLQTVFLKAAVAKKPVDVYMLMHRFTFKAFAEMVFNNTLDSIDSEHEHP
FEQAFDEAQSIVAGRLQQPVWFWKLKRWLNVGLEHKLREDVALIDEFIMSIISTAIETRR
RRQEDLKAGRPVKPADKDIVSIVLECMEQDGDMVSPTDVRNIAVAALGAGRDTSADAMSW
LLHTLTQHPEVENKLRAELLEKLPKLAVSATYVPSMDEVHGLVYLEATIRELLRLQTPVP
FTLRECIHDTVFPDGTFVPKGTNVGMCHFGAARRTEVWGPDAAEFKPERFVDPVTGKLLH
TPMAKFNAFSGGQRVCVGKALAMLEMKLVIATLVGRFHFSEVPGQDVQYAMGITIGMRSS
LMMNVQPVIRGAAGAAA*
>CYP5014K1
scaffold 6 plus strand 50% to scaffold 36 seq b
MTPPPPHHHHGGHQTSFAFVHRPVTMLSPSAVLTHPLVVGLLTAAAVAALSSVATSYAADA
DSGDEDTKPVKPVPYLPGAHPVLGHTLLMAKNLDRFQDWLVETSVARNGEPFVLRQPGKN
DWLFSARPEDFEQILRVHFDTFIKGPQVRELLDDFMGENIVIINGHRWKFQRKALVNLFT
ARALRDHMTPIVQKCALALQRVFAQAATSGEAIDVHHVMGKFTLETFAEIEFGSQLGLLE
SGQEHAFETAIDDANHISLERFAVPMWVWKLKRWLNVGSERRLREDMAVISSFVMSCISG
AMERRKQRQEAAARGEPLGPVAKDIVSILLDSEDTIGEPVLPKDVFNISLAGVLAGKDTT
GDATSWLMHMLHENPRVENKLRTILLAEVPKLAVDESYVPSMEELDGVTYLEATIRESLR
LKPPAPCVTQHCTQDTVFPDGTFVSKGTDTTLLYHASALLPSVWGPDAAEFKPERFLDEN
DKLLVLPPLKFIAFSAGPRKCVGRKLAMIEMKVVTACLLSRFHLVQVPGQDIRGTMGISL
GMKYGMKVNVQRTPGVAIRA*
>CYP5015A1
scaffold 63 43% to 1485 early
MGESKSSLWVALPAAAAAAVLVYLLVPDERQRAIRRLPAPASTLPL
LGNTLDLMSLELPRLHDWLAEQCKAFGGRTWRLQVLGAPPLVVVSSVECFEDVLKTQFDV
FDKGARMNEIFRDVAGGGIVAVDGPQWLAQRKMLSRLFTMRAFRETISQCVHDYTLVLGR
KLNDAVRTGAPLDFADLMHRFSFDVFTDISFGLQANALEGGEHSQFMEAMGKIVHHIEMR
FHSPDWLWKLKRALKLGGEKELAQEIAILDKMVFTIINKNMERKFNPDAAANEWPPRPPR
STKDVVSLFLDAHDEQKEAGGSPLDASFLRDIAVVVLLAGKDTTAWSLSWLIIMLNRNPK
VETKLRQELREKLPRLFSDPTYVPTMDDVENLVYLDAVLRENLRLNPLVPLNAKEANRDT
TLVDGTFVKKGTRVYIPSYTLGRMKSVWGRDASKFKPERWLMEDPWTGELTIRPVSAFQF
VSFHAGPRTCLGMRFAMLEMKTVTAYMLSKYHFTTKENPKSYTYDVASLLQVKGPLVVKV
QRAG*
>CYP5015B1
scaffold 27 seq d same as 1485 early 49% to 1485 later
MLPFAIAASLVAAA
VAYLTSPNDQDRAVCELPTPRSTLPVIKNTLDLAVRQRARIYDWILEQCREHGGRPWRVR
VLGRPPAVIVSSPEAMEDILKTQFEVFVKGSAVATISKDLLGDGIFAVDGSQWKHQRKAA
SHFFSMNMIRDAMEHVVRDHSVRLTKKLSEAAVSGEVVNIKRVLDFYTMDVFTKIGFGVE
LRGLETGGNSDFMEAFERATRRIMARFQQPMCIWKLARWLSVGAERQMASDMKLINGVVY
DVIHRNLEGKKQRKQSGGTFNSGREDVISLFLEKANVEYSDDDHVKMTPTMLRDMSMVFL
FAGRDSTSLTMTWYIIEMNRHPEALANVRRELTDKLPRLGLNDAETPSMEDIDELVYLEA
AIRECIRLNPVAPVLQRTAAQDTTLYDGTFVKAGTRVILPHYAMGRLETVWGPDAEQYKP
ERWIDPDTGKLVHASPYKFAAFLAGPRMCLGMRFALAEMKLTLATVLSKFHIETVEDPFE
FTYVPSVTLQVKGPVDVRVTRPSSM*
>CYP5015B1
scaffold 1485 early = 77926, 87040 starts at DRAVC 49% to 1485 later
MLPPFAIAASLVAAAVAYLTSPNDQDRAVCELPTPRSTLPVIKNTLDLAVRQRARIYDWILEQ
CREHGGRPWRVRVLGRPPAVIVSSPEAMEDILKTQFEVFVKGSAVATISKDLLGDGIFAV
DGSQWKHQRKAASHFFSMNMIRDAMEHVVRDHSVRLTKKLSEAAVSGEVVNIKRVLDFYT
MDVFTKIGFGVELRGLETGGNSDFMEAFERATRRIMARFQQPMCIWKLARWLSVGAERQM
ASDMKLINGVVYDVIHRNLEGKKQRKQSGGTFNSGREDLISLFLEKANVEYSDDDHVKMT
PTMLRDMSMVFLFAGRDSTSLTMTWYIIEMNRHPEALANVRRELTDKLPRLGLNDAETPS
MEDIDELVYLEAAIRECIRLNPVAPVLQRTAAQDTTLYDGTFVKAGTRVILPHYAMGRLE
TVWGPDAEQYKPERWIDPDTGKLVHASPYKFAAFLAGPRMCLGMRFALAEMKLTLATVLS
KFHIETVEDPFEFTYVPSVTLQVKGPVDVRVTRPSSM*
>CYP5015C1
scaffold 27 seq c same as 1485 later 49% to 1485 early
MKLEAMTAWVSPASVAATCVAALLVYLATPSAHDRAVKHLPGPDGDLPILRN
TLEIIAAQKSGTFHDWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKTQFEAFD
KSRFVSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGDELIKRL
DGVCLKEKTADRVVSMKRLLDLYTMDIFAKVGFDVDLHGVQSNQNAELLDSFDRMSVRIL
ERIQQPMWYWKLLRWLHVGPEKQLAEDVKKLDDLVYGVMARSIEDKNRQDASQSARKDLI
SLFIDKSDVEYTKGVHTKKDLKLMRDFVISFLAAGRETTATAMSWVILMMNRYPDVLKRV
RQELNEKLPGLASGKLRAPSMEDAQKLVFLEAVVRETLRLFPVVAITGRSATRDVYLYEG
TLIKAGTRVVMPHYAMGRMSTVWGPDVDEFKPDRWIDPITGKGKVVSPFKFSVFLGGPRI
CLGKKFALAEIKISLAKLLSQFDFKTVRDPFDFTYGSSITLQIKGPLDVVVSRLH*
>CYP5015C1
scaffold 1485 later 49% to 1485 early
MKLEAMTAWVSPASVAATCVAALLVYLATPSAHDRAVKHLPGPDGD
LPILRNTLEIIAAQKSGTFHDWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKT
QFEAFDKSRFVSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGD
ELIKRLDGVCLKEKTADRVVSMKRLLDLYTMDIFAKVGFDVDLHGVQSNQNAELLDSFDR
MSVRILERIQQPMWYWKLLRWLHVGPEKQLAEDVKKLDDLVYGVMARSIEDKNRQDASQS
ARKDLISLFIDKSDVEYTKGVHTKKDLKLMRDFVISFLAAGRETTATAMSWVILMMNRYP
DVLKRVRQELNEKLPGLASGKLRAPSMEDAQKLVFLEAVVRETLRLFPVVAITGRSATRD
VYLYEGTLIKAGTRVVMPHYAMGRMSTVWGPDVDEFKPDRWIDPITGKGKVVSPFKFSVF
LGGPRICLGKKFALAEIKISLAKLLSQFDFKTVRDPFDFTYGSSITLQIKGPLDVVVSRLH*
>CYP5015D1
scaffold 27 seq a 46% to scaf 1485 early
MQACDYLQVDARYGCYHIVGRIARGGEVEGASSWLALSHLRASAQAPTMWSSLKFAIEASTTS
WGALLLCSLLVGWHLLSMRKQARAPSKFARPQSTLPVLGNTVDLMFTHQEDIHDWMLAEC
RRCKGRPWVLAALGRPMSLVLSNVEAFEDVLQRKFDQFGKRSAWLVSDVFGDGIFAADGV
SWIHQRKTASHLFSLHMMRESMEQVVREQAAVLCKTLFAHLNSNQSSSMADQHGVPVNLK
YTMDWYATNVFTRVGFGVDLDSLPSQEHDEFFRAFTRLPIAIHRRIQQPGWLWRIKRALN
LGYEKQLKLDMKRVDDVIYQVISRSMTSKSSEDPRRLPDLISLFLAKESNEYRDRDTKQE
GGAAATRSVKTTPKLIRDMAFNFTAAGRGTTSQSLQWFIIMLNRYPSVERKIREELLAKL
PQLFESNSSPPTMNDVQQLVYLEAAIKESLRLNPVAPLIGRTATQDVCFSDGTFITSGTR
VVIPTYAVARLKSIWGEDAAEFNPERWIDPQTGKLLVISPYKFLVFLAGPRSCLGAKLAM
LELKVALATVLSKFHLRVLRDPFEIGYDASISLPVKGDVLAVVEPAEMVVPPSAENVERSAGAA*
>scaffold 27 seq b introns? duplication, assembly error
366140 PDGDLPILRNTLEIIAAQK 366196
366212 DWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKTQFEAFDKSRF 366361
366362 VSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGD 366511
366512 EL 366517
XXXXXXXXXXXXXXXXPVAPVLQR
= scaf 1485 early, duplication or assembly error
TAAQDTTLYDGTFVKAGTRVILPHYAMGRLETVWGPDAEQYKPERWIDPDTGKLVHASPY
KFAAFLAGPRMCLGMRFALAEMKLTLATVLSKFHIETVEDPFEFTYVPSVTLQVKGPVDV
RVTRPSSM*
>transcript 87040 predicted transcript by JGI cyan = CYP5015C1
green = CYP5015B1
this is a fusion of three genes, the third is not a P450
DRAVCELPTPRSTLPVIKNTLDLAVRQRARIYDWILEQCREHGGRPWRVRVLGRPPAVIV
SSPEAMEDILKTQFEVFVKGSAVATISKDLLGDGIFAVDGSQWKHQRKAASHFFSMNMIR
DAMEHVVRDHSVRLTKKLSEAAVSGEVVNIKRVLDFYTMDVFTKIGFGVELRGLETGGNS
DFMEAFERATRRIMARFQQPMCIWKLARWLSVGAERQMASDMKLINGVVYDVIHRNLEGK
KQRKQSGGTFNSGREDLISLFLEKANVEYSDDDHVKMTPTMLRDMSMVFLFAGRDSTSLT
MTWYIIEMNRHPEALANVRRELTDKLPRLGLNDAETPSMEDIDELVYLEAAIRECIRLNP
VAPVLQRTAAQDTTLYDGTFVKAGTRVILPHYAMGRLETVWGPDAEQYKPERWIDPDTGK
LVHASPYKFAAFLAGPRMCLGMRFALAEMKLTLATVLSKFHIETVEDPFEFTYVPSVTLQ
VKGPVDVR PEE MKLEAMTAWVSPASVAATCVAALLVYLATPSAH DRAVKHLPGPDGDLPI
LRNTLEIIAAQKSGTFHDWALKYCRKYQGKPWRLRVVGKEPTVVLCCPEAFEDIQKTQFE
AFDKSRFVSAAMYDVLGQGIFAISGPLWQHQRKTASHLFTTQMLQYAMEVVVPEKGDELI
KRLDGVCLKEKTADRVVSMKRLLDLYTMDIFAKVGFDVDLHGVQSNQNAELLDSFDRMSV
RILERIQQPMWYWKLLRWLHVGPEKQLAEDVKKLDDLVYGVMARSIEDKNRQDASQSARK
DLISLFIDKSDVEYTKGVHTKKDLKLMRDFVISFLAAGRETTATAMSWVILMMNRYPDVL
KRVRQELNEKLPGLASGKLRAPSMEDAQKLVFLEAVVRETLRLFPVVAITGRSATRDVYL
YEGTLIKAGTRVVMPHYAMGRMSTVWGPDVDEFKPDRWIDPITGKGKVVSPFKFSVFLGG
PRICLGKKFALAEIKISLAKLLSQFDFKTVRDPFDFT SPRALLKMLRSSWRLRLSLRRFS
TATASSSPRQLQFVTSNAIDRRLLQGLTSADDDARPVNLIQDEEGARRVLDKIKALGPDH
FHACDTEVGQIDIKAVGPVGNGVVTCLSLYSGPDVDYGNGPYVWVDNLDSAEGTLQYFKT
FLESKQYRKVWHNYSFDRHVLYNHGINVQGLGGDTMHMARLWNTARFQNGGYSLEALTAD
LLLQRKKPMKELFGIPKLKKDGSKGKERLMPSVEELQRFPEFRKRWIRYSVYDAESTWFL
HRVLQHKLDQTFWFEKPPKTGEGEVEPQVGSMYDFYRQYIIPFGECLTDIERKGMHVDLE
YLAGVEKQALQDRARLERLVLKWASRYCEESERMNLFSAAQKQQLLFAPYFDQKKKKEVL
PAER
>CYP5015E1
scaffold 41 seq a 79501 89% to scaf 41 b
MKAVSELLGDRNDVAVAAAAAVAVSLGLSLLLHSSKKNATIEA
RKLPPMPKTTLPILKNILDAGGNAERFHDWLNEQSTEFDNRPWMFTIPGRPANIVLSSPE
IFEDVLKTQDDVFLRGPTGQHISYDLFGNGMVITDGDLWFYHRKTASHLFSMQMMKDVME
ATVGEKLGVFLDVLDIYHKRGKPFSIKEELSHFTMDAIAKIGFGIEMDTLKNSPDREEDH
EFLKAFNEGSVAFGVRIQSPLWLWELKKYLNIGWEKILMDNTRIMHEFINDVIV
QSMNKKAELAAKGEKLVARDLISLFMESKLRQTEEMHIEDDDATIMRDMVMTFVFAGKDSTAHSMG
WFIVNMNRYPEVLQKIREEMKDKLPGLLTGEIKVPTEQQIRDLVYLEAVVKENVRLHPST
GFIVREAMESTTLVDGTFVEKGQTMMVSSYCNARNKRTWGEDALEFKPERMIDPETGKLR
VLSPYVFSAFGSGQHVCIGQKFAQMEIKLAMATLFSKFDIKTVEDPWTLTYEFSLTIPVK
GPLNVEVTPLAPLATAASA*
>CYP5015E2
scaffold 41 seq b 89% to scaf 41 a
MKAVSELLGDRNDVAVTAAAAVAVSLGLSLLLHSSKKNATIEARKLP
PMPKTTLPILKNILDAGGNAERFHDWLNEQSTEFDNRPWMFTIPGRPANIVLSSPEIFED
VLKTQDDVFLRGPTGQHISYDLFGNGMVITDGDLWFYHRKTASHLFSMQMMKDVMEATVC
EKLSVFLDVLGVYHTRGRTFSVKQELSHFTMDVIAKIAFGIELDTLKNSPDRDDDHEFLK
AFNKACVAFGVRIQSPMWLWELKRYLNVGWERVFKENNTIIQKFIN
DVIVQSMNKKAELA
AKGEKLVARDL
ISLFMESKLRQTEEMHIEDDDATIMRDMVMSFAFAGKDSTADNMCWFIV
NMNRYPEVLQKIREEMKDKLPGLLTGEIKVPTQDQVRDLVYLEAVMKENMRLHPSTGFIV
REAMESTTLVDGTFVEKGQTLMISSYCNARNKRTWGEDALEFKPERMIDPETGKLRVLSP
YVFSGFGSGQHVCIGQKFAMMEIKLTLATLFSKFDIKTIEDPWTLTYEFSLTTPVKGGLS
VEVTPLTPAASA*
>CYP5015E3
scaffold 41 seq c 70% to scaf 41b
MKTVSELFGDRNDLAVTAAAAVAVSLGLSLLVHKSKKNKSTEARKLPPMPKTTLPIFKSLFDIGN
NLDRFHDWLLERSAEFDNKPWMYSIPGRPVTIVLTAPEYLKDVMVTQEDVFLRGPLTQYM
SEDIFGNGMIVTDGDPWFFHRKTSSHLFSMQMMKDVMETTVRDKLDVFLDVLDIYHKRGK
PFSIKKDISHFTTDAICRIGFGIELDTLRNGPDSDEDHTLLKAFDLASIAFVVRVQTPWW
IWEPKRFFNVGWEKVFKDNVKIVHDFIDEVILQSMNRKAELAAKGEKMEARDLITMFMES
NLKETEHMNFQDNQATIMRDMVTTFMFAGKDTVGHSLSWFIVHINRYPETLKKIREEMKE
KLPGLLTGEIRVPTHEQLRELIYLEAAVRENVRLFPSTGFIAREAMRDTTLVDGTFVGKG
QTIMVSSYCNTRNKKLWGEDALEFNPDRMIDPETGKLRVFSPYQYSGFGSGQHVCIGQKF
AMMQMKLAMATLLSKFDIKTVEDPWKLTYDFSLTIPVHGPLDVVVTPLTPLSSA*
>CYP5015F1
scaffold 1 55% to scaffold 21 minus
MWSSTSHDGAQQSVLMALGALTVVYASWKILNLPVPMPDPGMEQLFRPASTLPVLGNTLDVLLFNRYRMSD
WINDQTDASSGQPWILQLLFQPPWVVLSMPNDLHDVFVDQFQVFEKGGTLGDISFDVLGN
GLLNVSGDKWKQQRRAASHLFSTQSIRDVMEPVIREKTLQLRDVLAQSAGRKQTVSMKSL
LGKFTSDVFTRIGFGVELDQLGGDVFKDEQHPLDIALHAVQNRFQTPAWMWKLARFLNVG
AEKRLRESMKTVNDMVRDIMVRSISEKSSGDQKKNLLTLLMKDNAAADPRELQDTAVNFF
IAGKDTTSFSLSWLIVMMNRYPRVLQKIREEIRSVLPTLLTGEMDAPTLEDTQKLVYLDA
AVKESVRLQAVSTYRCTTRDTTLTDGAFIKKGTVVVVSKYAAARRKGVWGEDAAEYKPER
WFDEKTGEPKNITPPQFITFSTGPRKCIGMRLAMLEMKTVMAVLFSRFDIETVEDPFKIT
YDFSFVLPVKGPLAVRVRDRAPLSA*
&&&&&&&&&&&&&&&
>CYP5015F2
scaffold 21 minus strand N-term from scaf 1663 55% to scaf 1
MIEQQALPSLLATVFALLLVWWKLLNK
PSTNSQKLFRPASTLPILGNTLDLLWFQKHRLHDWMT
EQSLASGGKPWLLTGIGQLPRVVVTSP
AAYEEVFKTQFDVFVRGPGETVLEVLGEGIFNVDGDKWRRQRRVTSHLFSMHMLKDCMNA
VVREKTVKLRDVLAMCAERGDPVSMKSLLNKFTADAFTRIGFGVELNGLDDPADVDTSQP
LDAALQVVQIRLQSPVWLWKLRRFFDVGSERVMRESMQQVHDTIQHIMAKSLADKEEQAA
ASEEATRTSSHKDLMTLMLQTGDFKDTREIRDVAVN
FYAAGKDTTAFSLSWFIVMMNRHS
HVLCKVRDELRCVAPELFTGELDTPTLEHLQQLTYLEAALKESLRLNSLAVYRLANRDTT
LSDGTFVPKGARVVFSMYGSARQPGVWGPDAAEYKPERWIDETTGKLKNISSFQFVTFSA
GPRQCIGMRLAMMEMMTVLAVVYSRFDLKTVEDAFDITYDFSLVLPVKGPLAVRVHSLAAHKA*
>CYP5015F2
scaffold 1663 nearly identical to scaf 21 minus strand
MIEQQALPSLLATVFALLLVWWKLLNK
PSTNSQKLFRPASTLPILGNTLDLLWFQKHRLHDWMT
(sequence gap) 87200 = cyan
VSFYAAGKDTTAFSLSWFIVM
MNRHSHVLCKVRDELR
CVAPELFTGELDTPTLEHLQQLTYLEAALKESLRLNSLAVYRLANRDTTLSDGTFVPKGA
RVVFSMYGSARQPGVWGPDAAEYKPERWIDAKTGKLKNISSFQFVTFSAGPRQCIGMRLA
MMEMMTVLAVVYSRFDLKTVEDAFDITYDFSLVLPVKGPLAVRVHSLAAHKA*
&&&&&&&&&&&&&&&
>CYP5015G1
scaffold 11 from JGI seq a 75133 65% to scaf 11 seq b
MWGIAQHQVNER
QAVIAVGALSGLYLGYKL
LSAVYSDMKITRALDSQGLHRPKSTLPILGNTLDVMFFQKDR
LQDWMADQSQISDGKPWVLSIIGRPQTLIITSPEACEDVFKTQFDNFGRGDELVDLQHDI
FGEGVAGVDGEKWLKQRRIASHMFSMKMLRDVMDEVIIEKSKKLRDVLAACAKEGRIAPM
KSLLGKFSSDVFTKIGFGVDLHGLDGDINSEMDHPFIEAVDGYAEVFGARLQSPMWFWKL
KRFLNIGDERMLKRCIKVATDLLNDVMLKSMSNKTAEDWNSKTDLLTLFVDSTGNTDSSD
LRDAMMNFFLAGKETTSFSMAWIIVNLNRHPRVLAKLRAQIRENLPELLTGELEVPTMED
LQKIPYVEAVLKESLRLNMTGVHRTPMRSTTLSEGTFVPFGSYVVMSVYAAARVKNVWGE
DAAEFNPDRWIDEETGKVKFVNPFQFITFGGGPHQCLGMRFALLEMQTVIAVLFSRFDIK
TLEDPFKITYDYSVTLPVKGPLECAINEVAAPAF*
>CYP5015G2
scaffold 11 from JGI seq b 65% to scaf 11 seq a
MWGLAQHQGNQREAMLAVGTVSALYVSYKVLSAMYKSNSIGRAFDAQG
LHRPRSTLPLLGNTLDVMFYQKERLWDWMAEQSNLSDGKPWVLSIIGRPDALIVTSPEAC
EDVFKTQFDNFGRGADLRDVIYDIFGDGIAGVDGEEWQKQRRVASHLFSMKMLRDVMDEV
IIEKVTKLREVLAGCAKEGKVVPMKSLFGKFTSEVFTKIGFGVDLHSLESDPCCDSNNAF
IKAVDVYAEVFGARVQSPAWFWK
LKRFFSIGDEGRLKESAKVA EGLTQEVLAKSLEARRR
DSSEVKR TDLLTLFVETNTNIDPKAVHDTLMSFLLASKDTTSFSLSWVLINLNRYPAV
LA
KLRDEIREKLPGLMTGEIKVPTMEDLQKLPYLEAVVKESLRLYMAVTNR
MAKTSTTLSDG
TFVPEGCAVMVPI
YASARVKNVWGDDAEEYKPERWIDLSTGKVKPVSPFKFFTFAAGPRQ
CLGMRFALLQIQTTVAVLFSHFDLKTQENPFDLTYDFAITLPVKGPLNITVRDIVPAAF*
&&&&&&&&&&&&&&&
>CYP5015G8P
scaffold 11 seq c pseudogene? 75116 = cyan almost identical to scaf 2789
87865 = cyan 2 aa diffs 64% to CYP5015G6
305733 QHPSTNHNTFETAAGLSALYTWWRIASELYSQHIIDSTLKK*GPHIPPST 305882
305883 VPILDNTLDALVFQKEHFWDWLTEQSNLSGGKPWMLHLVGRPTTLL 306020
306020 ETLEDIFKSHFDTFERGDDL*ELIYPFFGDGIVGADGENWLKQRRAGRH 306166
306156 RDV MGAVVKEKTLQLHDVLVKFSKEGRTVDMTSLFGKFSSDTFTKIAFGV 306305
306306 DLNGLAGDVGAEAYHPFNAAVGVMAEMLGSRLLSPTWVWKLKHFLNIGDE 306455
306456 RKLKEACDIVHELTY* VMIESIQKKIGDADQ 306548
306562 LHDTVMNFLLAGKDTTTFSMTWILVNLNRHPEALAKLRVEIKENLPGLLTGE 306717
>scaffold 2789 possible pseudogene see CYP5015G8P
SALYTWWRIASELYSQHIIDSTLKKQ 913
912 GPHIPPSTVPILDNTLDALVFQKECFWDWMTEQSNLSGGKPWMLHLVGRP 763
751 ETLEDIFKSHFDTFERGDDL*ELIYPFFGDGIVGADGENWLKQRRAGRH 605
615 RDVMGAVVKEKTLQLHDVLVKFSKEGRTVDMTSLFGKFSSDTFTKIAFGV 466
465 DLNGLAGDVDAEAYHPFNAAVGVMAEMLGSRLLSPTWVWKLKQFLNIGDE 316
315 RKLKE
209 LHDTVMNFLLAGKDTTTFSMTWILVNLNRHPEALAKLRVEIKENLPGLLT 60
59 GE 54
&&&&&&&&&&&&&
>CYP5015G9P
scaffold 18 pseudogene 76458 use scaff 11 b for comparison
242453 LTTLYSKRVVDAAIERQKLHSPQSTLPFLGNTLDVLFYQKGRLWDWMAEQ 242602
242603 SKLSKGRPWVLRIVGRPAIIIFASPEAFEDIFKTKFEIFX
242724 RGPDMQELFYAFFGEGVIGADGDKWLKHRPTPSLMVTTRTLRDIVDAAAK 242873
242874 EKAVQLRDVLGE*VKQRRVVSMNSLLGMFSSDVFTKIGFSVDLNGLGGDV 243023
243024 NSETGHPFIEAVEVYAEVLGSRLQSPTWLWK 243116
95 aa gap in 4 nucleotide space = pseudogene
243121 LSKLREEVRDKLCGLITGEIDAPTFEDLQNLPYLETVIKESLRLYVTATNR 243273 frameshift
243273 CSKPLKMYASARMEDVWGEDADKYKRERWIDSEIGK*RFKFIS 243398
243399 NFIAGPRQCIGMRFALLQMRIATAVIFSRFDLQTVEDAFAIMYDVAFTLPVKGPLNITVHEIAC*
>76458
predicted transcript Cyan = P450 C-term, the rest is not P450
MYASARMEDVWGEDADKYKRERWIDSEIG
NFIAGPRQCIGMRFALLQMRIATAVIFSRFD
LQTVEDAFAIMYDVAFTLPVKGPLNIT
TATSRPTASQAPRKNGSTGPKSAKKKKPQPKSN
GKDAAGPQPVAPPPVSGPWSNPNPATSSAPPPGFAPSPAKPGFSDAHQRLLRHRALFAFR
FLVGKAVELQLVASDERYAGVLDCVDPDDFSVVLKSTRRVSSASDAKPFEDGSTVIFRRH
QLAHLVADGTANYTDGAFVAGVTAAAGGFRTDTEISGRQGEHLIGRELETASSWLDPALD
TGALEDSAPNGRRHGKNQGKPGNWNQFEANEKLFGVVSTYDENIYTTKLDKTKISTEQSR
EAERLAQEIERQSAAGNFHLQEERGQAVRGGKHANDLDEEARYSSVDRRGAPPSSGGNAY
VPPALRNAQRQSSDGKSKAKATKTPPASASPATPAPPVPTAAEPVPAATTPAALSPSKPL
SFSEAVTGRSTATAAPPKSESEEKKAAAPAQVNDKDAKPAATKPKSSPKKKENGKEKKAE
SKTEESKDSFSSSTSTTTTTTTTKQTKEEAPKAAPKKELNPKAKEFKLSAAAVEFTPTFS
VPPAVKEQVSSPYRGGSPGHMPYPHPGMGYPPPMQEDWMYDGGMGGEEGGEMGMPPYGYG
VPVGPNGVPMMYPPMMPQQNMRMMGGQRGYGGYQQQGYNPRGYYSPPNGGYPAGGVPYAG
GPGGVGPQPPLPREAPTPTDDESPSESAGVPSPGAPAPPVPETTAPEVVSTPASSKKQQP
AKNK*
&&&&&&&&&&&&&&&&&&&
>CYP5016A1
scaffold 92 33% to scaf 20 e minus
MVPTNLLNTRGLVTFSPGPLLGSTVALMLLWAMRHGRERRLSWTSTPLPPRGYRVLPT
ASTSPVLVRVGMTAEQPSEGSPAFYDWVFAITTRFHGKPWLLHRPGRPDVLVVSSPGSFE
DIQRTFAVQFEKIEGDADGLAHDVHGGATALVCTGQLRPSVNMQRQLAASVLGSPALRQQ
ASALVNQHLDSLLRVLEGAARLGTGNVSTDLDVSKMMRQFAMEVFTELGFGLQLGALRSP
CYQASKLENAIDDVQRRMAERSQRPAVSWKLERLLAVGSEAALSRSIDVVSTITLEAVHA
KRSKHRAGSGCDSPLAGSRVDMLDLLLGQKCSSKSSKDPEFLAGFVLGLVVAARDSMAHA
LTKCLQCLAQNPEEQEKLLRELKEAEEEGKDPRSIARLEAVIKEALRLHPSKPFVRRRAR
QDTVLSDGTFVAEGTEVAMDLYSMARRENVWGPGSAQFRPQRWIDATNSKLRPVSKYKFN
AFLGGPRACLGADIALAELKTVIAAVVSKVYLDAIEQSDGKDKALTRGVPCGEAMRIQVR
RRDFSRPGHYS*
>CYP5017A1
scaffold 14 30% to scaf 20 a
MKMPWHLFFDGAIYVSSVLLQYQENLLW
WSANRSLFVQITDPKDVQHILSTNVNNYVKPQGFLDAFQEIFLNSFFALNHHPQAPDGGA
RWRLQRKVAAKVFTTANFRVFTEQVFARHADKTLASAQAGAIQAEAREGDDQATGGSFCC
DMQEISASYTLQSIFDVAFGLPLGEIEGTEDFAGHMGFVNEHCAQRLFVKQYYKMFRWFM
PSERELRRHTRAIHAVAESVLLRRLQESEEKVSSRSDLLSLFICKARELAFDGKNGHDGP
EVASLLGPETLRSIILTFVFAGRDTTAECLTYAFYAIARHPRVQKRIVEELGSAKPNDDT
PFTFDEVKHLRYLEAVVYETVRLYPALPYNVKNAVKDDYLPDGTFVPAGVDIVYSPWYMG
RNGPLWGDDPLEFRPERWLEMAKRPSAYEFPAFQAGPRVCLGMKMAVLEAKLFLATTLLR
FDVAIAPGEKHERGYMLKSGLFMGGGLPLQMTPRPRNAPPA*
>CYP_un1
scaf_7 C-term frag or accidental match or pseudogene frag
WLSPPTRRRFSLSPRNCLGGHVADALRTFLVCRCLSPLPPWHHRHMPVEQQWSGVSPSDSANR
DWPWQPCWKLVQRSSKRIEK*