98 sequence pieces from moss = 71 complete seqs, 24 finished pseudogenes,
plus 3 bacterial contamination P450s.
15 sequences not from Physcomitrella moss, used to assemble moss genes.
In progress, D. Nelson, July 31, 2005, revised Jan. 31, 2006
CYP51 clan sequences (1 complete sequence and 1 finished pseudogene)
>CYP51G1 complete
ESTs BJ585158.1
BJ591215.1 BJ592754.1 BJ963427.1 BJ165255.1 BJ157286.1
BJ188328.1
BJ185333.1
MGDAEMQGAPVGESAVFDRSKVMMLLGSLVVAAIVGHFVLAWNRKRRNLPPVVD
SAVPFVGGLLKFIKGPVPLLKEEYGRLGQVFTLQMLTRNVTFLIGPEVSAHFFKAQEADLSQRE
(0)
VYQFNVPTFGPGVVFDVDYSIRMEQFRFFTEALTVKRLRSYVEMMVEEAH
(0)
LFFSKWGEEGEVDLKVELEQLIVNTASRCLLGPEIRNSHLEKVTSLFHDLDNGMLPVSVLFPYLPIPAHKRRDR (2)
ARKELAEIFSKVIKARKASGKKEPDMLQAFMDSTYRSLKRGTTEEECTGLLIAALFAGQHTSSITSTWTGAYLMK
(2)
YKQFMPAVIEEQKEIMRRHGDHLDYDVLNEMSCLHRAMKEALRLHPPLILLLRQNHTDFSVTTREGKSYTIPKG
HIVGTSPAFANRLPYVYKDPDTFNPDRFAPGNEEDVKAGQFSYIAFGGGRHGCLGETFAYMQ
(0)
VKTVWSYLLQHFELELTGPKFPEVDWNAMVVGIKGEVMVRYKRRQLTCD*
>756723207
755804088 717597240 CYP51 pseudogene 54% to 51G1 rice with multiple frameshifts, finished
TSPAFANRLP
HLHKDPDFFNPDRLDPENEETK
IGGIISCMCFGGGHHSCLRETYLH
VQGKTVWSFLLQYFVMELVSPKFPAVDWNVMVVGNKGEVLGLLFSL*
CYP71 clan sequences (41 complete sequences, 15 finished pseudogenes, 0 partial sequences)
>710927435 CYP756A1 33% to 92A8 66% to 710912672
complete
870079473
1028549191 997124596 no ESTs
MAHLHTRLSEEAEAWIATGVDSFSRWQEYAAGFGRATYIVAALGFFAVVILELHNS
RKRRLSKLPPGPFQWPYLGSLPNLLLTVGVTSSFRLREKVSELGRNH
GPLMFLQIADTQILIVSSGTAAKE (0)
VLIARDEEFNFRPQCAVGKYLGFGSSDIAFAEGRHHWYLRKLCDTRLFSADSFVSYGHIPRAEAL
KMLHSVWEASKKGNGISVRETVTAFVRNSLCGMLLGSAHLDIENVSLQFTEKTLI
TLLDETICVVGEITLSDLAPGLKRVDFHGRTRKLKELHERWEKYLRVILEDRSHRLEKSA
KPEALVDVLLSLDDADMKLSNEAIMGVLL (0)
DTLVGGVYSTSATIEWALAELVRHPGVLEKVQLEMSEVVGPYHIVEDAEISQLPYFQ
(0)
ATVKETLRLHPVVPMSLPHMNKVATSISRYQIPANTSVVIDYKAIA
RDPAAWHKPLRFDPSRFLHTSAASQIDNIFKFLPFGYGRRGCPGANFAAVLLQLALAHLI
QAFDWAPTKGQLPHDIDVKESPGLVCFRFSPLVLSSTPRLANSLYQVSP*
>710912672
CYP71B like4 CYP756A2 complete BJ165273.1
BJ581063 BJ157306 879465659
828157377
830613141 815612895 755803843
MAQAVGLGRLPPPLSEENKAWMTNQLESYSTWQMNAAGLGRALYILAAVVFSAVVCFEQYSKRKERLSKV
PPGPFQWPYLGSLPYLIRTVGLRSSTRLREKITELAMKHGPLMFLQIADTQIVVVSSGKIAKE
(0)
VLVAHDAEFYFRPRCLVGKYLGFDS
TNMAFAEGRNHWRLRQLCDKHLFLPECFASYGHVQKQEAEKMLHSVWEATKRGENISV
RHTVITFARNSLCRMLLGTAHLDIENTSLEFNKETLITLLDEAFAVAGETTLIDLTPG
LNWLDFHGREQKMKYLRRRLEKYFQEILDDRSQRLDRSEPEAFVDVLLSLDEDKTLSDEAMMGVLL
(0)
DTLVAGTYSISASVEWALTELVRNPTMLENVQNEITQVVGPHHIVEVTEFSQLSYFQ
AIVKETLRLHPAMPMSIPHMNKVATPLSSYEIPPNTSVVVDYAAIGRDSEIWPN 179
PLDFDPRRFTDTDAASQIDSFKFLPFGYGRRVCPGANLGLRLLQLGLAHLVQGFDW
IPIEGQLPHDIDIKESSGAICFKSTPLILRAIPRLASTLYEI*
>692433988 CYP73A49 complete 846079417 836296434
BJ177355.1 BJ182198.1
BJ176060.1 BJ186266.1 BJ197128.1
BJ963588.1
BJ192275.1 BJ179214.1 BJ191683.1
BJ975047 = mate pair of BJ966530.1 BJ600264 BJ611038
BJ975788 = mate pair of BJ967270.1 BJ587131 BJ593693 BQ039467
BJ582985 = mate pair of BJ177851.1 BJ581278 BJ582455 BJ584349
71% to 73A5
MGAGWKEAMALGLAGAGVARVLAISEDVVDNAAVSSFVKQF
MNLEGAVQALLVAVLLGLLIAKLRAPKLNLPPGPVALPIVGNWLQVGDDLNHRNLAEMSQ
KYGDVFLLKMGQRNLVVVSSPEVAKEVLHTQSVEFGSRTRNVVFDIFT
GNGQDMVFTVYGDHWRRMRRIMTVPFFTNKVVQQSRGAWEDEA
LRVIQDLKAKPEAS
TTGVVIRKRLQLMMYNIMYRLMFDSRFESEEDPLFLKLKALNGERS
RLAQSFEYNYGDFIPVLRPLLRGYLKVCQEIKDRRLALFKEHFLDERKKLLSTLGPRPDG
EKAAIDLILEAQKRGEINEENVLYIVENINVAA
IETTLWSIEWGVAELVNN
PEMQTRIREELDSTLGKGNLITEPDTYNNKLPYLSAFVKEVMRLH
MAIPLLVPHMNLHQAKLAGYDI
PAESKILVNAWWIANNPNHWDQPEKFIPERFLDGKIEAKGDDFRFLPFGSGRRSCPGIIIA
MPLLSIVLGRLVQSLELLPPPGTKKVDVSEKGGQFSLHIATHSTVVCKPIA*
>CYP73Ayy CYP73A48 705225761 complete 69% to 73A5 68% to upper seq
686707760 two frameshifts near end, near exact overlap with 705225761 (join)
ESTs
BJ156825 BJ601621
AW598785 AW156067 BJ960273 BJ962155
BJ173522 BJ170223
BJ955206 BJ608342 BJ603676 BJ955206
BJ605702 BJ593402
BJ607994 BJ596392 BJ609670 BJ944446
BJ606143 BJ598803
BJ583187 BQ041736 BJ597214 BJ599148
BJ607151 BJ595328
BJ163543 BJ203089 BJ199707 BJ201130
BJ197619 BJ178048
BJ195721 BJ970670 BJ193565
BJ199312
BJ206667 BJ204697
BJ200793 BJ173402 BJ185972 BJ190015
MEAASLASMFTFNNIVQGLCVAVFLGIVIMKLRAPKLKLPPGPFALPIVGNWLQVGDDLNQR
NLAEMSQKYGDVFLLKMGQRNLVVVSSPDIAKDVLHTQGVEFGSRTRNVVFDIFTGNGQDMVF
TVYGEHWRRMRRIMTVPFFTNKVVQHSRYAWEEETDYVIKDLKARPEAATSGVIIRRRLQ
LMMYNIMYRMMFNSRFETEDDPLFVKLKALNGERSRLAQSFDYNYGDFIPI
LRPFLKGYLKTCQDVKDRRLALFKEHFVDERK
(2)
KLNSVNPPKSEAEKCAIDHILEAQKKGEINEDNVLYIVENINVAAIETTLWSIEWGIAE
LVNNQEIQTRIRNELDSILGKGNLVSEPDTYNNKLPYLTAFVKEVMRLHMAIPLLVPHMN
INQAKLAGYDIPAESKILVNAWWIANNPKYWDQPEKFMPERFLDGKIEASGNDFRFLP
FGVGRRACPGIIIAMPLLAIVLGRLIQSFELLTPPGVKKIDLTETGGQFSLRIANHSTVV
ARPLAV*
>755680881 CYP73A50 76% to 710919188
1005948486 67% to 73Ayy 60% to 73A5
895358221
852123011 complete
CYP73Axx (no ESTs)
lower part from AIE to end = 755680881
exon starting with KLGNA has no exact matches in trace archive
This is probably the same seq as 755680881
710919274
759458758 61% to 73A5
MEKPTVVRGLLAIFVIGLVTGVEFKAP
STLDFVFSLDPCRQSLLLIVFTAIVINLRMKDRKMNLPPGPTALPIVGNWLK
(0)
VGNDLKHRTLAEMSERYGDVFMLKMGWRNYVVVSSPEAAKGVLHTQGEEFAC
RTRNAVFDIFAGKGQDIVFTNYGDHWRRMRRILTVPFVTAKVLQHAHFAWEEEVDEVMED
VQSRPESATAGVVLRHRFQLMMYNIVYRMMFNSRFASEYDPLYLKLKGLNGERCRLPAQN
YKYNYADFFPFLKLFIRGYLKICQEVRDKRLSLFKEHFVDERR
(2)
KLVNAVGVSGAEEKCAIDHMLEAQKKGEISEDNILYLIENINVA
(1)
AIESTLLSIEWGIAELVNHPNVQKRLQAELLEVIGEGNLVAEPDTHNNK
LPFLTAVVKETLRLHMPVPLLVPLMNMRQAKLAGYDIPPQSKVLVNAWWIGNNSKFWDQP
EKFMPERFLPAAAENQSLDFRFLPFGAGRRSCPGTAIAMPLLAIVLGRLLQKFDLLPPPG
MSKVDVAESGGQFSLHMATHSIVVLRPRN*
>710919188 CYP73A51 717596474 756814972 762526716 61% to 73A5
N-term
883857969
977890599 complete no ESTs
MGNPMGIGVLLVILVFGVVAKVDFKSRILLDFAFSSSLL
MQALLLGVFIVIVVSLRREGRNMNLPPCPAAFPIVGNWLQ
(0)
VGNDLKHRKLAEMAQKYGDVFMLRMGRRRFVVVSSPEAAKEVLHTHGVEFASRTRNAVY
DVFAGKGQDIVFTNYGDHWRRMRRIMTVPFVTHKVVKHAHFAWEDEVDHCIRDVEARSE
SATTGVVLRHRFQLMMYNIMYRMMFNSRFADEHDPLYLKLKGLNSERCRLPAQNSKYNY
ADFIPILRPFLRGYLKVCQDVRDRRLTLFKERFVDDQ
(2)
KLVNAVGAAGAGEKCAIDHILEAQKKGEISENNVLYLIENINIA (1)
AIESTLLSIEWGVAELVNHPEVQKRVQKEL
DEVLGDGHLVSEPDIHSGKLPFLTAVVKETLRLHMPIPLLVPHMNVKQAKIFGYDIPPES
KILVNAWWIGNNPKFWDQPERFMPERFLSATSEATTVDFRFLPFGAGRRSCPGSAIAVPL
LAIVLGRLVQKIDLLPPPGMTKVDVTESGGQFSLHMATHSTVVTRPRA*
>BJ163406 CYP73A52P aa 169-308 mate pair to BJ173377 finished
this seq seen in at least 6 trace archive seqs 1023200166
883856438
869788433 891417707 866058619 884943991
890407139
838586233 816063061
This seems to be an expressed pseudogene 76% to 692433988
LATS*EAGNSELGASFLKHF
MGLESAVQGLLVAVLVGLLIAKLRAPKLKLPPGTVALPIVGNWLQ
(0)
VGDDLNIGNLAEMSKKYGDVFMLKMGQRNLVMVSSPEAAKEVLHTQGTEFGSRERNVVY
DILIGNGQDMVFAVYEEDWRKMRRIMTVPFFTSKVVQQSRGTWEDDALRVVEELRTRPEAS
TTGVVIRNRLNSMMYNSIYRLMFDRRFENEEDPLFLRLKALNGGRSRLAQSFEYN*GDFI
PILHPVLRGYQRVCQEDQDRRLGVFNEHFVDVRKKLLETTGP
KPDGEKAAIYLILEAQRKGEISEDNLLYIVENINVAAI ETMLWSIEGGVAELVNNPDI
QTRVSEELDHTLGKGNLITELDTYNPK
(deletion)
HWDQPGRFMPERFSGNNIEVSGGDFRFLPFGSGRRSCRSIIIAMPLLTIVLSRLVQS
MGQLSPPGMKKVDVAEKGGQFSRHIATRSTMVCKPIA*
>755683069 CYP73A53P 710485186 72% to 73A5 with frameshift and stop codon
pseudogene, stop codon seen in 12 sequences, finished
815536458 41/47 (87%) to 73Ayy 824722097 pseudogene
816099016 87% to CYP73Ayy, suspect these join with 755683069
MATAGTIMMQAVGMASMFTFQNVVQGLIVAMVVWIFVIKLKSPKYKLPPGPVANWLE
(0)
FGDDLNHRYLAELTKKYGDIFLLKMGQRNVVVISSPEIAKDALQTQGIAFGSRPRNVVF
DIFSGKGQDMVFTPYGEHWRLMRRITTAPLFT
NKVVQQSRYAWEEEID*VSKDLKTLPE
XXTSGVIIRTKLQLMMYNIIYRMMFTSRFKDEKDPLYLKLKVLN
EDPRYLNLKVLNGEQNRMGQSFDYNSGDFIPIVRPFLRGYLKVC
QDVKDRRIALFKEYFVDERR
(2)
KLTSVNPPKSEEQKCAIDYIFEAEKLGEINEDNVLYIVENINVAAIDTTLWSIE*GIAE
LVNNPELDGIREELDAVLGKGNMVIEPDTYNNKLPLLTAFVKEVMRLHMSIPHVVHMN
LKHEKLAGYDIPAESKILVNVWWIGNNPKY*DQ
PEKFMPERFLDGNIEPSGNDFRYLPFGVGRRRCPGIFIAMPLLSIVLGRLIQSFELLPPP
GVKEVDVTEFCGQFSLRIANHSTVVVRPLL*
>710518592 CYP73A54P 755698511 44% to 73A5 C-term to PERF finished
pseudogene, stop codon seen in 11 seqs
GNFVSEPDTHNSKLQFLRAVSRKS
KVTLPSPLFSLHMNVKQVQIGGYYMFAESKISVIAW*IGNSPKLWDQLEKFMPEK
123
>CYP75B like1 CYP767A1 change to 757A1
716895728 755798807 39% to 92A11 38% to 75B1 complete
852155406
977922988 859713404 42% to CYP76C
like14 no ESTs
TIMKLRKKIGKLPPGPRALPLIGNIHQIGDFSRRNLMQMAE
KYGPIMYMRIGSKPLLVVSTAEAAHEFLKTQDKEWADRPTTTADKIFTNDHRNIVCAPYA
AHWRHLRKICTMDLFTPKRLMSFRTPRTEEINQMMTSIHEDVAAGKEVKLHVKLGHLTTNNITRMLLGKR
(2)
FFTVDEKGQMEAHRFKELVFELFRASSTPMIGDFIPWLKWVSIASGYVKYLKRVKADLDAFLQEFL
EIKKAASDQATAERAKDFVDLLLEQKTVSGDGPLEDATIRS (0)
DMLLAGTDTVSNAMEWTIAELMRHPECMRKLQQELDTVVGKSRIVSETDLPNLPYLQAVVK
EVMRFYPPAPLSLPHQSIVPTTVCGYDLPAGTQLCINLYAIQRDPKYWPNPVQFNP
DRFLNCDVDVGGTHFQLIPFGAGRRQCPGMPLGNLLLQISVARLVQAFEYSLPRGTK
RNYFMNYYSGANKLTSGIMYLI*
>CYP76C like14 CYP761E2 complete 74% to CYP76C like13 43% to 736A1
44% to 92A13 42% to 92A14
692514018 no ESTs
816061186
859980678 755804916
759452715
MYFEKTTVARMLTAGFESENGLTGRRVEFYVFLAAIFIMPLVLL
KITRRPRLKLPPSPPAYPIIGHLHLLGKLPHQSMTNLAKKYGEIYSLRLGSVPAIVISTP
EMAKEFLLTNDKIWSSRSVHMTSGYYFSYDYA (1)
GIAFAPSTPVWRSLRKICMSELFTQRRLEASKGLREEEMQYMIR
(2)
SILDDAHQGRLIDLKLKINALTANIVARMVLNK
RFTGCIDSTVETEAEAHQFKEMMEEHFLLLGVFMIGDYIPWLSPLDLGGT
EKRMKSLRKRLDAFLDDILEVHEVKRAKGPIPEEDQDVIDVLLNEMYQQDSN
ESKQLDTNNVKSTIL (0)
NLFAGGTDTSTVTIEWAMSEMLRNPTIMGKLKAELDARIGKDRRVRETDLSDLPYLQAVT
530
KETFRLHPVGPLLIPHVSTHDCEVGGYHIPTGTRLYVNVYAIGRNPKVWDRPLE 692
FDPERFMTGLNAGVDVKGKHFHLLPFGTGRRGCPALPLGLLIVQWTLATLVHALDLSL
866
PQSMEPEDVDMTEAYGLTVPXGASLYLNAKLRAADHLY*
>BJ166750
CYP754A1 BJ171256 BJ172713 BJ602502 mate = BJ205564 complete
1014495025
879457809 883995654 1014487906 862362793 BJ165056
51% to 75B like3 BJ157072 BJ157394 BJ190531 BJ170915 BJ595929
MVEESWLWVLFVGALSFSILLQWGLNRKRKLKLPPGP
TAWPIVGCVFGLPRLNPPEKLFNKLSEKYGELMLLQLGSWSIVVTSSARMAMEILKTHDN
EFANRPDVISSRLNFNNTGLIQMHSTNPLFKRTRRMFS
AEIVSPRTVLETGVIRRKQ (0)
LRTLRSIVQDFDAGRSVNFTHEMKTLAMNLSMSICFGTDYATKVNDEAEALIHTYK (0)
IMAIWTRRSLGAIFPALRWLDLDGIESGFADVELQLRTNITALIEKKKQEMSMWSAE
DIQAGANEGDVMTKFLSMEGEDRCSEDQLISVVF
TILLAGTDTVFNVVTEAMYALLMHPNFYHRAVEELDAVVGKSRLV
EEADIPKLPMIQNIIKETFRIKPAGPSLVPRKNFEACEVAGYHIPANTTVFVNCIPLMRD
PSFWDSPDEFNPDRFIDSKVTVLGSDFNYLPFGYGKRTCPGLNLGMI
TVQYILAACLQCISWKLSRPRRLDIETDDDPRKVDDVMVDGKQRVDPALLEFAPQPVK*
>CYP75B like3 CYP754B1 complete BI437153 AW509529 BJ595585.1
BJ977649.1 BJ969326
713843806
713853618 CYP71 N-term 869925250 881364275
34% to 92A sequences, no high similarity to a known family
36% to Panax ginseng seq. AB122079
yellow exon questionable
MVGDVWIWVLITMVVAVIVGVGIDKKTKRGLKLPPGPPAWPVVGCLASLPAGHPPEVMFAKLAEKHGE
LMLLWLGSKPYVVASSARMAMEFLKRHDQEFANRPMSVVREYVSFKGNSIISMSASDPKY
QRLRRTFVMELLSPKKIAATRDLRKDQ
(0)
VLKMLRAIREDLDAKHEANFTEAVLTLGMSLSIGLLFGRDYGGKVFSEEIQTLVLTFKT
(0)
MVKYLSMINISDLIPSLRWLDLQGIERGLGLGEVQLRKSIMALIEQKRLDKIRLSSDE
IESGACQRDILSKLLSLEGEDRLDDDQLMGVVF
ALMLAGSDSISRGVGRAMQELLKQPLLHQRALDELDEVVGRRRLVEESDISSLPLINNIIKETLRLHP
PAQLLIPHGNVEQCEVAGYHIPARSTVLVNLYALSRDPSFWNSPLEFAPDRFVDSNLTVQGS
DFHYIPFGYGRRGCPGLNLGMITVQYALALCLQCILWRLPAGATISETYIDWKNSPDLIV
DGDLRVDLHLLEGL*
>AB122079 CYP768A1 change to 764A1 Panax ginseng (Apiales) 41% to
92A9 40% to 75B
MFPLAYPLLFVLLGALSWWILPIISPLKRHHKLPPGPRGLPIIG
SLHTLGALPHRTLQTLAKKYGPIMSMRLGSVPTIVVSSPQAAELFLKTHDNIFASRPK
LQAAEYMSYGTKGMSFTAYGPHWRNIRKFVVLELLTPAKINSFVGMRREELGMVVKSI
KEASAANEVVDLSAKVANIIENMTYRLLLGRTKDDRYDLKGIMNEALTLAGRFNIADF
VPFLGPLDIQGLTRQFKDTGKRLDKILEFIIDEHEQNSSNGNASGDFIDDMLSLKNKP
SNTHDELSKVIDRSVIKAIMIDIISAAIDTSDTSIEWILTELIKHPRAMKKCQEEIDA
VVGVDRMVEETDLPNLEYVYMVVKEGLRLHPVAPLLGPHESMEDITINGYFIPKQSRV
IVNSWALGRDPNVWSEDADEFLPERFEGSNIDVRGRDFQLLPFGSGRRGCPGMQLGLI
TVQLVVARLVHCFDWNLPNGITPDNLDMTEKFGLTTPRVKHLLAVPKYRL
>CYP76C like1 CYP758A1 complete 715966631 AXOS23673.g1 BQ040580 AW599561 48% to 76C2
BJ609203.1
opp end = BJ201966 BQ040759.1 711801903 BJ168318 BJ166758 BJ158754
BJ160069
CYP76C like4
713876163 46% to 76C2 no ESTs one frameshift and stop codon
no exact matches in trace archive, best match = 715966631
probably a poor version of the 715966631 sequence
MATPDSSGGAFDLAKWINGLVAHWGSVAVAVVAAAVIAKFIFNSTVGRRKL
PPGPAPWPILGNIASLAGLPHRSLEKLARKYGSLMYLRLGEVPCIVISSADVAKQLFKTH
DILFSNRPGGCFFEQLTEYRNITASRYGPHWRHLRKTCVHELFTQKRLEAYQATRLE
EISISIKELFEESDKKGPVDLHAWLHRLLFNNLTRVIMNNR
YFGTDEKGMKDAMDFNNVTALMFSQAGDVVISDFLPYLGFLTRLQGKPLLYR
KTREIVLEMMRRMTNFDERKKLHAEGRSTGEPEDFVDVLLSSTLSDGTTPLPDDICLMLL
MDVLVAGTDTSATTVEWTITELLRHPEAYKRVREELNSVVGSDQLVKEEHLEHLPYLNAVLQ
356
ESFRLHPATPLGLPRESSEAFEFLGYSLPAGTRLFVNQWAIHRDPAVYEQPEEFN
521
PERFLGREALKFIGDTQFQLVPFGSGRRNCAGLPMAVIVIPLVLAHLLHSVEFSLP
DGQQPKDLDMTETFGVAAPKASPLMIYATPRESAALY*
>CYP76C like2
This is a hybrid seq of two different genes
713877140
762522648 711800153
692509705
710494763 715965478 755800431 692507811 713794646 43% to 76C2
692476632
879809357 859709904 716891783
717625521 869788525 870057181
AW599509 BJ583667 BJ579892 BJ597932 BJ166653 BJ194761 BJ946070
BJ174288
BJ190667
BJ158648 BJ174717 BJ177099 BJ178535 BJ588129 BJ586603 BJ585830
BJ179043
BJ179518 BJ171222 BJ579424 BJ582321 BJ582678 BJ584175
715978988
756810370 CYP76C like11 51% to 76C4 95% to 76C like2
no other exact matches in trace files
1025182054 = best match, same as 76C like2; assume this is the same seq
MAEGGLLFGFELADVLVAAVLISVVVLYFHAEALQRRRCPPGPWPWPVVGN
FSALGDLPYRNLHKLAGKYGGLMYLRL
(1)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNN
NDENYRNIGLAEYGPYYRKLRRLLNTELFSPRRHASYEVTRAQEI
QCMMKVLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR
(2)
FYGVSGDDGEKQ
GQDLQK MTSSVFELLGSVV
ISDFVPYLS
FITKLQGHASKFSKIRDVSDKLTADFFDLD
SHRNNYKKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ
(0)
ELLNAGTETSSNTSEWAMAE
LIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV MKENFRLHPPAPLLLPHESREPTELLGYHFPAGTELLVNAFAIHRDPSVYDNPDSF
DPDRFLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG
QTEVDMTETIGISVRKKQPLFLVPKPRFELSLESVAEN*
>713877140 CYP758F2 (no AP insert, LHK insert) complete
also 824697165 893557678
993602306
869943560 997070612 993467812
1020671150
755800431 892704920 883863821
1006335547
857898605 BJ583667 mate = BJ178535
BJ579892
mate = BJ174717
BJ597932
mate = BJ194761
BJ179518
BJ158648 BJ190667 BJ177099 BJ179043
BJ946070 mate = BJ956863
957569174 883845570 713794646
MAEGGLLFGFELADVLVAAVLISVVVLYFH
AEALQRRRCPPGPWPWPVVGNFSALGDLPHRNLHKLAGKYGGLMYLRL (1)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNNDENYRNIGLAEYGPYYRKLRRLLNTELFSPRR
HASYEVTRAQEIQCMMKVLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR
(2)
FYGVSGDDGEKQGQDLQKMTSSVFELLGSVVISDFVPYLS
FITKLQGHASKFSKIRDVSDKLTADFFDLDSHRNNY
KKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ
(0)
EMLNAGTETSSNTSEWAMAELIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV
MKENFRLHPPLLLPHESREPTELLGYHFPAGTEVLVNSFAIHRDPSVYDNPDSFDPDR
FLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG
QTEVDMTETIGLSVSKKQPLFLVPKPRFELSLESVAEN*
>830443705 CYP758F1 (AP insertion) 96% identical to 713877140 (no AP insert)
879461440 1020623392 complete
may be alternative splicing in this gene. Two exon 2s exist
863042033 870057181 881406076 833254849
1003202172
BJ579424 mate = BJ174288
BJ588129 mate = BJ182840
BJ585830 mate = BJ180571
BJ586603 mate = BJ181679
828265864 833246011 977999457
(exon2a, short)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNND
AFPSQSNNSVAVALSVDVSHLILPILSLEMHCATR
MAEGGLLFGFALADVLVAAVLISVVVLYFHAETLQRRRCPPGPWPWPVVGNFSALGDLPHRNLAGKYGGLMYLRL
(1)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNND
ENYRNIGLAEYGPYYRKLRRLLNTELFSPRRHASHEVTRAQEIQCMMK
VLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR
(2)
FYGVRVDDSEKERQDLQKMTSSVFELLGSVDLSDFVPYLS
FITKLQGHASKFSKIRDVSDKLTADFFDLDSHRNNY
KKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ
(0)
ELLNAGTETSSNTSEWAMAE
LIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV MKENFRLHPPAPLLLPHESREPTELLGYHFPAGTELLVNAFAIHRDPSVYDNPDSF
DPDRFLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG
QTEVDMTETIGISVRKKQPLFLVPKPRFELSLESVAEN*
>815741313 CYP758B1 870248614 1029254207 43% to CYP76C like2 complete
816093549 978054225
MLATAFLVGFLAWAAMILGKFILEGIQRRNLPPGPWAWPIVGSLFSLGPLPYKTLRVLAKKHGELMYLR
LGSIQSVVVSSASMAKEVVTNHDLQFAYRPTKLFGKLLFNSKDIVHASNGPAWRHLRMIC
TSQFFTKKRLASYEATRTFEIHTLMKDILRKSSSEDCVVNLPFQLRNTSTNFISQMVFNKRY
FVEGEESNVEDAKRYQKILKIHFSSYAIFVVSDYIPCLRFITKLQG
IRGKFQQIADKIHKKMDEIIDINGHERRRIDANHKQDADRKKDFVDLLLETTSHDGKGTLDHETVRG
(0)
DMLFAGAETQSSTLEWAMAFLIRNPGVMKQVQAELDGVVGTERVVQESDLEKL
PYLEAVVKEVMRVKPGAPIGINHESREPRQVAGHYLPAKTRLIFNIHAIHRDPS
VYDRPDEFDPTRFLSPGKGNVPTGQELFQLMPYGAGRRICPGMPLAIVNIPHVLAHLVHS
FDWSLPAGQDHRELDMTEKFDGVTSPRLHPLHLIPHPRKPAFLYK*
>710914928 CYP758F3P 44% to 76C1 80% to 76C like 2 finished
859944242 1000198948 756701133
pseudogene, stop codon seen in 9 seqs, cannot extend upstream
AMAKLITRPELIERAQTEMDSVIGFKRLVEESDI*QLX
LQAVMKENFRLHPPDPFLLPHESREHTELLEYQNLVGTKVLVNAFAIHVDPSVYNNPDSF
DLDRFLARPHIDHMSKSDPYELIPFEKGLRMYPEYKLANTMVALMLAKLLYVFDGSLLES
QTEVDMGETISLSVSKKQPLLLVPKPRFEISLESVTXN
>CYP76C like3 CYP758C2 715973711 692501289 755703204 complete
710499319
722465916 49% to 76C4 BJ184478.1 755703204
884001657
891382019 859936235
686708074 34% to 71B2 N-term
GAILKELGYFAPPVALVVILATKLVWDARIQSQRRKTLPPGPRPWPIIGNLSALVGDKPHRALQELAFEFGGLMYLQL
(1)
GMSSSIVVLSTAEAVREVFRSNDERILSRPKMLSFGIISDNYRSISFGPPGKLWQSMRRFCSTEL
FTNTRVASYQGRREEEVKHMLMVLVEESEKGKAVDLRSWLHDLSSNMTTRMLVNKR
(2)
FFANRGENDGKQEILKSELEHVLEGFNVNIMRNVLSDYLPSLRYFAEELHGARAALEAFRDEAATVGRKIV
ELEKHRQRAQNPSKDENYVPDFVDVLIGAPLDDGKSLSDTLLTVQVL
(0)
EFFFAGTHTSSATVEWAMAELITHPDLMKRAQAEVDGAVPADRLVRDS 549
DIPNLPYLQAVVKETFRMHPVLSLGGPRETTRPIEVNGYKIPAQTRLFVNIFAV 711
HRDPAVYTDPETFDPDRFLTQHLHTNHCSGFDSHELIPFGVGRRMCPGFHLGNTLVHLML
891
ANLLHRFHWSLPEGETIATVQASVMSEKLYGLLFHPNENLHLVPELKNGLPAS*
>755697346 CYP758C1 N-term complete 36% to 92A9 34% to 76C2 no ESTs
836315685
977938989 711868310
713876758
710485469
755791105
MMDSASYSAPFAALWDTFGRGTVVAVLVVVVVGELLLYARFQAQRRSTLPPGPRPWPILGN
FFVFSDVNHAHHDLRRLAAKFGPLMYLQL (1)
GSVPCVVVSTAEAAKELFRGHNDECLISRPKMLGLEILSDNYQLMAYAPAPGKLWHSLRKFGSMEL
FSFKRVAFYRSLREEELRHWIKFVLESREGEAMNLKSCVFELAANMMTRMLVNKR
(2)
MFDITGADTQQQLLRSEFESFMEEHYKCLMPNVISDFLPFLRFFCEKLQGWRAYIQD
HQEKSVEFWTRIIEVEKHRQRAAERQNDGSYVPDLVDFMSTAPLDDGKVLSDRNITLQIL
(0)
DFFLGGTDTTPLTLEWAMAELVTHPNFMKRAQEELDRVVGLERLVEETDFPNLP
FLQAIVKETYRLHPVGPLGGPRESTEPVEALGYKIPAKTRVILNIFAIHRDPAVYER
PDEFDPTRFLDRPLAAFDSYELMPFGVGRRMCPAFNLGNTTVHLILANLIHNFDWALADG
QNIDTFDMTERLHGVTFSLKYALSLIPTARSGILARAL*
>756700900 CYP759A1 75B like complete 83% to 76C like6, 38% to 92A14
755797279
883855058 850658210
BJ603828
BJ609557 BJ610988 BJ952601 BJ180041 BJ186479 BJ187843
BJ200343
BJ970091 BJ585378 BJ598216 BJ601033 BJ601619 BJ194512
BJ182396
BJ206478 BJ194704 BJ941950 BJ177426 BJ189078
BJ597358
BJ584496 BJ598216 BJ196301 BJ587678 BJ609929
BJ598163
BJ167822 BJ582057 BJ581628 BJ171635 BJ609821
BJ597734
BJ592849 BJ195065 BJ193723 BJ204894 BJ206483
BJ176403
BJ179692 BJ166342 BJ601824 BJ204693 BJ195012
BJ193292
BJ185416 BJ179347 BJ174872 BJ587298 BJ159549
KKLNLPPSPKGRMPIIGHLHLMDDNEAAHRTFARISEQNGPLTMIYMGNKPTLLVSTAAM
AEQVLKHNDQAFASRPFITAGKTLGFDFKSIVFAPFGNYYRRLRRIYTVELLSPKRVALSQ (0?)
VLRQHEIKHVINSVLAENQAEGRVNMTSILQEMGIDNLVRMIFAKPHMGATECLTKE
EMATLKSVVKEAVNLAGVIYVGDFIPLLDIYDFTGYKKKTNKLAAKMLDIATQLIEKHKS
DAGTGVDNDKLNLVDILLSQKGEDQLPPHAMAGILF (0)
DFIIAGSDTTSVSIEWAIAELLHYPHYLKRAQEEIDQVVGKERLVTEQDIKHMPFLQAVV
KELFRLHPAAPLGIPHCNMEETKLAGYDIPAKNTVMMNLWAIGRDPAHWDDALEFKPERF
LNKDITLLGRDFHLIPFSVGRRQCPGAGLGLAVVQLAVASLLHGFEWSTYNQKPEEIDMR
EKPGLVTPRKSDLIVTAVPRLPLHVYQGDKNGVQNGH*
>862766641 CYP759A2P 815526881
1009468231 1006200210 finished pseudogene
61% to 756700900 stop codon lost at the end
759458516 (joins with 862766641) heme signature not typical
1028764555 stop codon seen in 10 seqs. finished
MEIHALVSVTSVVVVLGVS*LFLRSSMSRKLNFRPIPK
GRMPIIWHLPLIEDKEATFARICQ*NDPPTMM*VGNKPKLLISAAGMTEQVLK
HNYQAFASRPFMTAEKTLEIDFKSIVLAPFGNYYKRLRRIYTAELLSLKRVALSH
(0)
VPRQHKVKHMVNKVLSEMQATGCENMISILQELGIDKLERMIFV*SHMVASEPLTREEM
VTLKTLVKEVVNLASVVCVGD
FLPLLNIYDFMGYKKK
MHYLSAKMNLVAQVL
I*KNQIQQRTWRRRG*LNMMDILLSQ*GENELPPNAMAVVIF
(0)
DFIIIGSDTTSVSIKMAIGELLHHSHFTRRAQEEIHILGNDSLIEEQDNNMLFV
QDVAKERFRLHPASPLCISKFNLE*NEFVGYDIPANSTVMLNSWTIGRD
PLAATPDSQEFNPVRFLNKDIKMMGPDFQLILVSVERRQCSGTGMIMEVVEVVDASLL
RGFEWRCYNHKPKEFDMSEKPGPVRPRSTDVAITQVPRIHVHVYYRYKNSVQCGH
>CYP76C like6 CYP759A3P 755805060 692477956 710486323 973548001 finished
45% to 98A3 82% to 756700900
stop codon seen in 9 sequences, pseudogene
IPLDIYDFT
(0)
GHKAKTMKLQAKMFASAMALIEKNKSNEKPVVDKDKQNLVDILLSQEGDDRLPDHALAAVLF (0)
DFIIAGSDTTSVSIEWAIAELLHYPHFMKRAQGEIDSVVGKDRLVDEQDIKNMPFLQAIIK
490
*LFRLHPAAPLGIPHFNLEQATLAGYDIPAQTTVMLNLWAIGRDPAFWTNPKEFNP
658
ERFLNKDITMFRRDFQLIPFSVGRRQCPGAGLGLAVVQLAVASLLHGFEWSTHNQKPE
832
EMDMREKPGLVTPRMTDLVVRVVPRLPLHVYHGRENGQQ*
>713808566
CYP76C like9 CYP761F1P 50% to 76C7 large deletion seen in 5 seqs.
Pseudogene finished
1003459573
816293569 no ESTs
METSQLSDYWAGSQLLGNSSFGPGVRVDSVSGSQY
FVVEFFLSAIVFTVFNLVFQRLHEPSLIPPRLSAWNFLCQT
HVLRRNPTVVLHNLVKRYGPVTHVKLWSQDLLVLSSVXAVEEFYKLHDMEFGDRPSSMN
RVTLSNSINSSCFPPLATYWKHLRFVLVSASTIPSFSSSFAFFRM
EWALEALHHHPSIVAQVSE
EVERSLGSRSHIEDSDLAKLPYLQAVVKELFRLYPPCAFSFPHESFDEYCHIFGYEVSPRTQVLINIY
TIQRDPAVWTNPNEFNPTRFITHPGIDMHGQHYQLLPFGGGR
QCPATKLAIRYVQSGLARYFHDARSSHMIPHSTCLEDDL*
>755830564
CYP76C like13 CYP761E1 complete 716051221 53% to 76C4 BJ975320.1 opp end = BJ966803 890572743
710522592 55% to 76C2 N-term 78% to 75B like 2
830395654 85% to 755830564
no other trace files match this
The best match is 755830564, probably the same sequence
MDFAKSTVARISFEGLKPEDGLSNQRVEIIVFLAAMFILPFVLLKLMRRP
KLKLPPSPPAYPIIGHLHLLGKLPHHSIANIAKTYGEIYSLRLGSVPAIVVTTPEMAKEFLLTHDKIWASRTVRD
VSGYYLSYNHTGIAFAPFTPVWRNLRKICTSELFTQKRMEASQGV
RDVEMQCMIRSILNDAN
QRRLIDLKLEVNALTANVVTRMVLNKR
FMRCVDSTAEEESRAQQFKEIMKDHFTLQGIFMIGDYIPWLRPLDLGGK
EKRMKALRKRLDAFLNEILDDHEVKRAKGPIAEEDQDMIDVLLNEMHQQDPNEPHKMDLNNIKSTIL
(0)
NMFAGGTDTATITIEWAMSELLRNPPIMAKLKAELDALIGQDRRVRETDVPNLPYLQAITKETF
372
RLHPAGPLLVPHESTHDCEVAGYRIPAGTRLFVNIYAIGRSSKAWDRPLEFDPE
210
RFMTGPDASVDTKGKHYRLLPFGTGRRGCPGMSLGLLLVQFTLAALVHALDWSLPPGMDPED
VDMTEACGLKVPREHALSLNAKPRAAAQFY*
>CYP81D like1 CYP761D1 762530079 756730662 36% to 75B1 complete
866056372
863085271 879876478 BJ164021 BJ199501 BJ605910
IIGATVFAIFIFKKFLTKHSN
LPPGPIALPVIGSMHLLGTSPHHNLQKLSTKYGPLMSIRLGQAQCVVASS
TETAMEFLKNQDSNFTSRPALRVGEAVFYGQ (1)
DLVFQNSTPLWRHLKKIFQVEFTSTKRLDTTRHVREEEIAHLTSTLPHNCEV
NLRIHLKSMIGNIISRMAVGQRLCAKPEECESEEQLREVASLREVMDNVAFCIGAVNLAD
YIPALKWLDLQGLERRFKKTFQIMNSVSG
EIIAKHQERRKLSNPTDKQKDLIDVLLDDMEKPQDGSPRVTMDSIKAVTWNAFAGATDAI
525
AMSLEWAMSEILLHPHVQAKAHAELDVVVGKNRRVEESDIQNLSYIGAIIKETLRLHPV
348
APMLAPHAALNPCKAFGFDIPGGTWVIINAWAIARDPAVWKDPTEFNPDRFMQDDP
180
NALNPRVFEMLPFGAGKRMCPGVAMANVTMQRAIAKLLHEF
WGLTSELDMSEGTMSIVVPRAVPLHAVAKPRLSSEFYT*
>755808481 CYP761B1 39% to 93F1 complete 42% to 98A like1 no ESTs
1006224824
835905148 832116318 1006223877
MPFSGGQGTFMFQ (0)
GSAIAVVAIFLLARFITTPKNIPPGPFAWPIIGSLHLIGPYPHRSLAKLA
EKYGSLMSVWFGQRLIIFATSPETALEFVKTQDANFCSRPKQQAPSVLLPH
(1)
DLTFSDVTSHSKLLRKIFQQQFTTSKKMEATQQLRANEFAHMLRTIPH
DTTVNVKFHLEVLAGNIFSQLVMSRRLLQPSSIEDTT
TDSTEKLKDLMKITADLDRIIGTFNPGDFIPAVKRFDLAGIGCKFKQFRNRMDSFVEKI
IQERLEERKSSRAPKELREKDYLDALLDEADQQKEIDLNVVKTMIW
(0)
EIFAAGMETNIASSEWAMAELVNAPHTMKKAQAELDAVVGRDRMVKESDLPNLPYIKAIA
KESLRLHPPVPFLAHQCIKSCKAFGYDIKSGTSVFVNVYGLGRLESIYPDPNTFNPDRF
LPGGSNVGLDYQGQNFELLPFGSGRRICAGMPVASLMVQTAVATXLHAFTWIAP
KDHELMEGLGAASLSKAVPLKAHATPRLPSHVYSL*
>CYP98A like1 CYP761A1 692435809 36% to 98A3 815740002 815612579 complete
717626666 836312331 710501813 34% to 76C2 no ESTs
N-term exon is a best guess
KFAVAVLGVYFVAVLIRGASRKLPPGPVGFPIIGSVHLLGPRSHVSLAQLARKYGAPLMSLYLGQ
KLFVVASSAEAAMEVLKKQDAVFCSRPPLRGFKVIFPH
DVTFADLTPESNYLRKF 964
IRLHLTTARSIEAFQHIRVDEMLQMVRSIVASPRDVVVNLRTSLEVMTANVLTRSIIGKR
784
FMGRTGLSESEKKEIMEFIHIAAEIGECLGAKNPGDLIPALKLVDWNGLDQRMKNLRR
610
KMATFLANIVRERREKSSLGTSNPPGKEMLGVLLDEMENAAAGEKITEDILNTIIW
ESFTAGMETTVLATDWTLAEVLRNPEVLQKCQAELDAVVGRNRRAQESDIPDLHYIKAVVKES
483
FRLHPVIPLLIPHYSHDPIKVLGYDIPAHTQLLINVWAIGRDPKVWADPLKFHPE
318
RFLEGPHRETEMFGKSFNLLPFGSGRRACMGITLGTLLVEASVVVLLHSFDWILP
AEGIDMTEGQGLSVRKNVPACAFATPRLPPHVYAE*
>BJ976918
mate = BJ968430 CYP78A27 BJ583325 686708553
47% to BJ601765 55% to 78A11
1023015550
828195390 complete
MDSTTCEVGGLWLFALPMLAKQGRSSLEEALGCTNFSSILCIVGVISACALLVCWASPGG
SSWGRLRCVKTIPGPRGFPVIGSLLEMGGLAHRRLAQLAVTYKATALMALSLGETRVVIASQPDTAREIL
HSTAFADRPLKQSAQQLLFGRAIGFAPYGGYWRNLRRIAANHLFAPKRIAAHGKTRLDEL
ALMLNAIQREVETTGHVLIRPHLQRASLNNIMGS
VFGRRYDFVLGSEEANELGALVKEGFELLGAFNLADHLPVLKCLDAQNILQRCAALVPRV
TAFVKKIIDEHRQRRDVRATTGESYEEDFVDVLLGLTGEEKLSEEDMIAVLW (0)
EMIFRGTDTTAILTEWIMAEMVLNPEIQCNVQRELDSAFRKKNITDFTSLESELS
773
RLPYLQAVIKETLRLHPPGPLLSWARLSTQDVCIAGHLIPKHTTAMVNMWAITHDPKLWA 594
593
NPNEFIPERFLPSHGGQDVDVRGNDLRLAPFGAGRRVCPGRALGLATVQLWVAQLLYNFK 414
413 WTAVPGCDVDLTEILKLSSEMVKPLQSVATRRLVDPSS*
>BJ608079
CYP78A28 75% to BJ976918 993690276 1006187613 976463611
complete BJ591726 BJ594024 BJ594365
BJ609075 BJ169137 BJ591774
BJ602430
BJ589265 BJ590426
MDSAPTQVGGWWWFAVPLLAKQGRSSLEEAAGYTNLNGLVIMLVLGVISACAIFV
CWISPGGSSWGRLRGKRTIPGPRGFPIIGSLLDMGGLAHRRLAQLAVAYKAMPLMALSLG
ETRVVIASQPDTAREILHSAGFADRPLKQSADQLMFSRAIGFASHGKYWRSLRRIAANHL
FSPKRIAEHEDSRVAESEFMLQSIENDLLVLGSVQIRGHLQRASLNNIMRSVF
GRRYDFVTGSEEATQLRAMVDEGFDLLGAFNWADHLPALKFLD
AQKIHQRCADLVPRVRTFVQKIIDEHRNENNSRVGADERRETDFVDVLLSLKGDEQLAD
EDMIAVLW
(0)
EMIFRGTDTTAILTEWIMAEMVLHPEIQRKVQFELDSVFPTGICNCASFENMLSRLPYLK
AVVKETLRLHPPGPLLSWARLSVQDVCVAGHTIPAGTTAMVNMWAITHDPEVWANPSVFS
PERFLPSHGGQDVDVRGNDLRLAPFGAGRRVCPGRALGLATVHLWVAQLLHNFEWTPAPVCEVDLTEVLK
LSSEMVNPLQSVATSRRVSTSG*
>BJ601765
mate = BJ204837 CYP78E1 47% to 78A9 48% to 78A4 pinus complete
BJ157387
mate = BJ165354
BJ163959
mate = BJ172406
BJ596508
mate = BJ192420
BJ164880
BJ605699 BJ604521 BJ595714 BJ601611 BJ602476 BJ582121
BJ604032
BJ603006
710548465 54% to 78A9
997125033
839338142 785867756 N-term is a little long
MAERPARLWPLTDFPIFISKGDIVCKDSCIGRFQKYQNVGRAVAKKFREFLSA
LTKSKACKPVNSVIKALAAPLILIAIAQEFSRDAVKQFLLDGFLTQPLRWLFQYISPFIQ
QVGTVDTATWTDVHASSILVFFIAAISLIISIVGWCGPGGPAWSFSRIFSPSNKLPTPNG
PRGCPVIGSWTLMQGSEMH
RELARQAWAGGPSTRNLMALSVGTTLIVLTSDANVAKEILRSAVFGERPLKQAALDLGFE
RAIGFALQGPYWRHLRKVAVTHMFSHRQIVTHSELLQRETLRMISAMVHSIRTDCVKDYR
VGLCARPFLQRAAVNNIMTIVFGRHFDFGNSCDEAEALEAMIREGFELLGGFNWADHL
PLVRHIPFLSFSRRCRNLTMKVRAFVQSILDERRRCHHQSHSATSSVLNTSFVDALLS
LEGDQKLQDEDIISILW (0)
EMVFRGTDTIAVLTEWALAEVILNQGIQARIHEELDAVVGSNRLVQQKDIENL
PYLQAVLKETLRSHPPGPLLSWARLANEDTQIAGCHIPRGTTTMVNMWAITHDSSVWPNP
EVFDPSRFLKSEGGSDLDVLGTDLRLAPFGSGRRVCPGRALGIATAQLWLASLLHHFS
WSQDLSHPIDLTDNLTLSCEMASPLHGCPTVRFPL*
>756810029 CYP758E3 complete very similar to 716896136
785857816
891367041 713807864
BJ161247
BJ196384 BJ597499 BJ605004 BJ608968 BJ606559 BJ596753 BJ164688
BJ600017
BJ603952 BJ611104 BJ610779 BJ608982 BJ604440 BJ602128 BJ169200
BJ169747
BJ172828 BJ597344 BJ601461 BJ601840 BJ603729 BJ170550 BJ606471
BJ598898
BJ596900 BJ171346 BJ167714 BJ171954 BJ595739 BJ609360 BJ164548
BJ598653
BJ604317 BJ604410 BJ166119 BJ168281 BJ198054 BJ202791 BJ606440
BQ040942 BQ040447
AW496963 BJ156788
CYP98A like2 42% to 98A3
MNLDESLDGKLYGNGMVAAAGLLVVLTVVFFSSTVVGKKKTPPGPLPWPVVGNFLDLSVLPHRALRNLATKYGGFMYLRL
()
GSVPCVVISTAAVAREFVLKNDADTAGRPMVVALAILEEDKTVASANPGPYWSQLRKLCHDQLFSPKRVASYE
SVRTEEIHLMMKLLLKDSNKGDAVNVRRWLQGVTCNYVTRMLLGKR
()
YFGNDEGNLEQEQERKEFEKFYEHIFWALGTFIIDDYIPYLSFITTL
QGWIPRLKEIRQFSDDIGAKLADLDKHRQRAQDRKIGEDYVPDFVDVLLTAKMEDGKPLPDMNIKMILM
(0)
DMLIAGIDTIANTVEWAMXELMKNPTLMKRAKDELDEVVGLN
RIVQEADIPNLPFLQAITKEALRMHPPAPLSLPHESTRPAEMFGYKLPAHTRVFYNLFAIHRDPAM
YEKPDEFNPQRFIDHPEISHLTGMDYYELIPFGAGRRMCPAFRLGNLMVSLILAHVLHSF
DWSFTEGESAETFDMSEEFKLTVSLKKPPSWIFKPRNPAFLY*
>716896136 CYP758E2 BJ191252.1 opp end = BJ610609 692435929 BJ196454
CYP98A like3 756802151 49% to 98A8 692435929
891396529
869807241 complete 39% to 92A9
MLAAGLLVAALTWIFYSNAGKRETKKPPGPRPWPVVGNLLNLSSLPHRSLRDLATKYGGFMYLRL ()
GSVPCVVISTAAAAREFVLKNDADTAGRPQVVALAILEEDKTVASANPGPYWNQLRKL
CHDQLFSPKRLASYENARTEEIHHMARLLREDAKRGEVIDVRRWLQGVTCNYVTRML
LGKR ()
YFGNGEESPEEKEEKQGFEKFYKRIFEAVGTFIIDDYVPYLSFITKL
QGWIPRLWDIRHFSDSISVKIADLDKHRQRALDRNRGEEYVPDFVDVLLTTKMENGEPLPDKNIKMVLM
(0)
NMLIAGTDTMANTVEWAMAELMVNPLHMKRAKDELDNVVGTNRLVQESDIPNLPFLQAITK
581
EALRMHPPAPLSLPHESIRPAEIMGYKFPAHTRVFYNLFAIHRDPAMYEKPDEFNP
749
QRFIDHPEVNHLTGMDYYELIPFGAGRRMCPAYQLGNLMVSLMLAHVLHSFDWLFPEGESVQTFDMSE
EFKLTVALKNPPRWIFQPRNPAFLY*
>BJ977736.1
mate = BJ969431 CYP98A34 65% to 98A3 complete
BJ592471
mate = BJ185073 846061460 692508229
BJ579946 BJ580001 BJ586442
BJ587890
BJ589491
BJ594223.1
BJ583015 BJ174825 BJ579629 BJ587496
MAVMWENTYTVAAIVAALLFMMYKSLRSSHKLPPGPRPLPVVGNLTHITPVRFK
1
CFMEWAQTYGSVLSVWMGPTLNVVVSSADAAKEMLKERDHALSSRPLTRAAARFSRNGQD 180
181
LIWADYGPHYVKVRKVCTLELFTFKRLESLKPVREDEVGAMVAALFKDCADSRPLN 348
349
LKKYVSAMAFNNITRIVFGKRFVDDKGNIDNQGVEFKEIVSQGMKLGASLKMSEHIPYLR 528
529
WMFPLQEEEFAKHGARRDNLTKAIMQEHRLQSQKNGPGHHFVDALLSMQKQYDLSETTI 705
706
IGLLWDMITAGMDTTAISVEWAIAELVRNPDVQVKAQQELDQVVGQDRVVTEADFSQLPYLQAVAK 674
673
EALRPHPPTPLMLPHKATETVKIGGYDVPKGTVVHCNVYAISRDPTVWEEPLRFRPERFL 494
493
EEDIDIKGHDYRLLPFGAGRRVCPGAQLGLNMVQLMLARLLHHFSWAPPPGVT
PAAIDMTERPGVVTFMAAPLQVLATPRLRAALYKNGSSPS*
>715983616
CYP98A like6 CYP760A2 715970277 755678777 63% to 98A like5 complete
1020670982
715970660
762555758 33% to 75B1 34% to 92A15 BJ604138
BJ175204
BJ595911
MESKIVPVTGSTLVCLAAALLFLVGMMRRCCFGEKSLPPGPTG
WPVVGSLYSLGPRNIPACRRFAALAAKYGALMFLRMGSRPTVVISDSTTAKEFFKSQDNN
FSSRPRLATGKHFGYDYSSVVFSSGEKFTEMRLIYSAELLSSTNVKKLAPVRMEEIRFLM
ADVLRRSESERLGDRPHATEGLINITSMVFKANLNLMGRIIFSQSLFGDSGTVNATPKEV
ENFKFFVKSATRLVGLFNVGDYIPALRWLDLQ (1)
GVEGDLQRLKPHQEGLLLPIIHQYRKMHHSAEGFSKQEDRRVDFIAALVAKYSTLSDENIMAVAI
(0)
DLIVGGSDSASTAVEWGMTELLRHPHYLQEVQAELDAVVGRDRLVELSDCDKLPFLDCV
570
VRETLRLHPPSPLAIPHYSSQECTLGGCRIPAKTTAYVNIHAIHRDPKVYT 723
NPNEFQPKRFKTLPSMQVMAQNCESIPFSAGRRACPGQKFAFPTVMLMLGNLLQCF
891
SWSPPPGIRGEDIDVDEAPGVVCSRLKPLVASATPRVEKSVILDHK*
>CYP98A like5 CYP760A1 complete 755781297 755685424
815607281
836327020 BJ194280 870229681 37% to 756700900 39% to 92A9
very similar to BJ175204.1 but not identical
MEDNRMGDGQVEEYSSRVMHLSALTLCAAMAVILLRRVMXSWNADKT
LPPGPKGWPIVGSLYSLGPRTIPACRR
FTTLADKYGPVMFFRLGSRPTVIVSNDKMARELLRVHDQTFASRPKLATGKHFGYNYSSV 560
VFSPSGAHFVRMKKIYTHELLSPKKVELLSALRMEEAHILLVDVLRNSGTEA
404
NGVVNITSLVFKANLNLMGRIVFSKRLFGESATISAPPREVENFKFFVKSATKLVGL
224
FNIGDYIPALRWLDLQ (1)
GVEGALLQLKPHQEGLLRPIIQEYRKMSLNLEGGMKQKEDGRVDFIAALVSNDSGLSDENIMAVAI
(0)
DVMVGGSDSTSTAVEWSITELLRHPDCLQAAQEELDSVVGRDRLVEEADCANLPFLNCIVK
544
ETLRLHPPSPLAIPHFSAEECTLGGYRIPANTTAYVNIYAIGRDAATWEN 394
PNRFNPTRFKDSKVNVYGHDFNLLPFSSGRRGCPGVHFALPTYKLELANLLHCFKWS
223
PPPGVDFKDIDTKEAVGVVCSRLNPLMASVTPRIPRHVILAK*
>713858245 CYP703B2 715976935 46% to 703A2 complete 83% to BJ976025
711802793 978013669 CYP703 like1
993680425
MDFRISELLIPAISEPVESGHLAFATCTVLVTLLSCVFLFIRSGTSVNLPPGPKGLPIL
283
GNLLQMGSHPHRTMTALHKVYGHILNIRLGCIPTVVVDSPQLIAEITKEQDNVFSSRPHM 462
463
TFTEILAYDAHDFAMAPYGPHWRHVRRICVHELLTPKRLENTRKERIEESRCMIMEVAE 639
640
RANKGEVVDLRDVLAGVSMTVMCRMLLGRREFAATGKQPKDFKHIIHELFRLMGALNPRD 819
820
FVPALGWLDLQGFERDMYK (0) 876
LRGEFDEVLDAIIQEHRDLESGKLPGGKKNDFISVLLDLPGEDGAPHLDDKTIKAVTI
DMMAGATDTSAVTNEWAMAEIIRNPQIQRKLQEEIDSVVGLERNVEESDVSNFPYLMCVVKETFRLH
494
PAGPFAIPRESMADTTLNGYL
IPKGTRVLINIYSLGRSSETWVDPLIFQPERWANENLTAIHDSGFRILPFGNGRRQCPGY
NLGTTMVLFTLARLLHGFNWSFPPGVTSDSIDMEELYGCTTPLRTRLRAIATPRLAPHLY
SQ*
>BJ976025
mate = BJ967509 CYP703B1 complete 43% to 703A3 45% to 703A2
820975853
884956080 833255812 BJ591871
MNILSPELLVPLITEWIQGGRLIFATCSVLVALLSSVF
LVAHFRTPMNLPPGPKAMPLLGNLLQMGSHPHRTMTAMHKKYGHILYIRLGCIPTVVVDS
PQLIAEITKEQDNVFSSRPHMTFTDIVAYDAHDFAMAPYGPHWRYVRRICVHELLTPKRL
EITMKERIEESRCMIMAVAEAAQKGEIVDMRDVFAGVSMTVMCRMLLGRREFAATGKKAK
DFKHLIHELFRLMGALNLRDFVPALGWLDLQGFERDMYK
(0)
LRDEFDEVFDAVIQEHRDLASGKLPGGKPNDFISVLLDLPGENGAPHLDDKTIKAITP (0)
DMMAGATDTSAVTNEWAMAEIIRNTEIQRKLQEEIDSVVGLERNVQESDIN
KLPYLMCVVKETFRLHPAGPFAIPR
ETMADTKLSGYRIPKGTRVLINIFSLGRSSETWKDPLKFQPERWANENLSAIHDMGFRIL
PFGYGRRQCPGYNLGTTMVLLTLARLLHGFKWSFPPGVTAENIDMEELYGCTTPLRTRLR
TIATPRLAPHLYSQ*
>815536265 CYP703B3 869926625
831646057 833253850 73% to BJ976025 45% to 703A2 complete
MDTLVSLELFGGVVWNWVQESRLLMTTCSVLVALVSSVFLFSRFRKPLQLPPGPKGLP
FVGNLLQLGSLPHKTVTELHKKYGHLVYLRLGSVQTIVMDSPELFREITREQDNVFSSRP
HLTFTELVAYDAHDFAMAPYGPHWRHVRKICVHELLTNKRLESTAGERKEEWRCM
VKAILEAANSGDVVDMRDVFAGVSMTVMCRMLLGTRDFSAREDNP
RDFKHLIHELFRLMGVLNLRDFVPALGWLDLQGFERDMRK
(0)
LRREFDEVFDAVIQEHRDLASGKLPGGKPNDFISVLLDLPGENGEPHLDEKTIKALLQ
(0)
DMLAGATDTSAVTNEWAMAEIIRNQEIQRKLQNELDSVVGRDRNVQESDLPNMPYLM
450
CITKETFRLHPAGPFGIPRETMADTKLNGYHVPKGTRVLMNFYSLGRNPEIWADPLEFK
273
PERWERENLAQVQDPEFRIFHFGNGRRQCPGYILGSSMVLSTLATLYHGFNWSLPPDV
ARIDMDEAFGCTLPMRTRLRAVATPRLPPYLYA*
>756814065 CYP761F2P 755834403 835931597
46% to 75B1 47% to BJ977736
stop codon seen in 12 sequences, pseudogene, finished
464 DIITAGTNSTIEVVEWAITECIHNPAVMSKA*AELHRVVGKSLRVHEAEIPNLLYLQAIC
643
644
KEVFCLHPTTPLLYPHVNQHACTVFGYDILAGTSMLVNVGAIVQDPSIWEDPLVFKPERF
LERHSHLDAQGHHIELLQFGTGWPQCPGIGLSLTMVYILTATLLHCFEWSLPQDCTETEAPLSG*
>710491797 CYP755A1 692498994 835920053
32% to 75B1, 33% to 92A11 complete
1009323251
711801827
BU052572 AW699134 BI741530 BI488241
GTAVVVGFVLLLLLLVFGYSRRVGKKKTLPPGPFAFPVIGNLFLVGKHPHVTFAKLAKQY
GNIMRLHFGAVPVVIVSDANMARELFSVQDMKFASRPIYDLMSTAYKYMNYG
TDEEVSLAISEYGPKVRDLRQLCTTELFTQRKIDMKKSVRAEEIQRMFGKIKTMIRDEEP
VEIRPIVSEFSLRISCRTTFNKAFLNFENLPWRPGALHPQAFRNMETENTKLLGEHQILD
594
MIPMLKFVLERFDVFGINARWKEVSALKEECTRPVIEWYRKHSSDDESTLDFVEVLL
423
RLSEEGKLSKTCVKSLIL (0)
ELLTAGSDTIASVLEWTLLELVRHPHGMERLSAEIDGFFGINRPVDEDEFTKLPYLQ
(0)
AVAKEVLRLHNPTTLGIPHSNMEEATLAGYHLPARTTV
LANFWAISRDPTTWGQDALTFNPDRFLACDLNVNGTNYEYLPFGAGR (2?)
RICPGRAVAMRVLAAAIGSFVHAFEWSALPGVELNANEGKDGLNIR
PETPLVLKLSPRPSAMLY*
>BJ969539
CYP761C1 884947623 890251096 863048229 839338572 692475254 complete
N-term exon is a best guess 41% to 75B6, 43% to 75B9
VYYVATVLVVILLVRRLLTWPHQ
PPGPPGLPLVGHMHFLGANPHISLWKLADKYGPLMSLRLGNKPYVVATSPETAKEFLKT
LDANFGSRHYSSQSQYLLYGGQ (1)
DVAFQESSPSWRNLKKIFTMELASPARLEASRHIREEE
MIVLLRTIHSKGELELKSQLIDMISHVISRMVINKRFDDSVESDFPTLVQT
HFRLAGAFVPGDYIPAVKWLDLGGFEAQMKKQKERMDAFIDDILVQHRERRAKGPVPMKEY
DMVHVLLDRIETKDDQIQLTDTHVKALVL (0)
DAFLGASETIILTSEWAMAELLRHPSLMAKAQAELDAVVGRDRMVTE
SDLRHLTYLNTIIKETFRLHPAAALLLPRESAQPSQAFGYNFPAKTRV
LINCYAIHRDPAIWHDPLVFNPDRFLQADLKDVDV
KGRHFQLLPFGAGRRVCPGLSMGILTVQFILASLLHSFDWSLPGDMKPEDVDMTEIYGLT
LPRAAPLPCAAKLRLPSHLLTTAQKP*
>DR061629.1 iq01h03.g1 Cycas sporophyll (w/o ovule) (NYBG) Cycas rumphii
66% to 75A1, 53% to 75A11
N-term only
METREWIVWGITWAVLYVGVGYIINNSRKSRR
LPPGPKGWPLLGSLPLLGAMP
HVSLYNLSKKYGPILYLKLGTSGMVVASSPETAKAFLKTLDTNFSNRPGNAGATYLAYEA
NDMVFAPYGPRWKMLRKVCNLHLLGGKALDDWQPVREAEMSHMLRSILHHSNRSQPVNLP
EMLNYSMANMLGQIILSKRVFESHG
>692448354 CYP752A1 993458568
1028576144 816256171 839352528 see
872833037
26% to 755686556 complete very low similarity to other P450s
MEKDDDVSFDVALPTIGLETTHSKFPQLVLSAIGLIVVCGGAYVLYNTHYLRVKIKL
PPGPPPWNVFCTSMQSKKPCQALAKICNCEYGGIMTLSLGKFPTILITSTVIATQLH
VLKRYKFKFGHPKNIPRPCEYLHPD
NYQNLKCVLPYNWTQWHKLWQIYIDHLLPVAHNMSFQSINQLDIQIMLKNLENEMTKEGGMKPFGIGLRPHLR
HASFKFIFNICFGRHVDAIAGGVGSHKDPLMMQLEALFIEVIRLGPAFIISDFVPTSLPFH
SPIDIQRAAISGTMKKYKTFYH (0?)
RANIKVPRSEPVDLLDHLVCLQKDEQLQDKEIVWLLSELILAS
TDHVSTILEWTFAHLMANPQVQAKLHQEIDIVCSKRTN (0)
ISTTEFDNMPYLVAIVKESMRVSSPIMLTIPHSTTKELNIGGFQLPMNTQIVCHLGALGQ
DANIYENPSCFDPNRFIGIGVNLNNAFEKQKNIVHLMTQQFCPGRGLEILHVYIF
LVKLLQCFEFSHLYVEIMPFKTSDTVEWGVINVLRKPLVACLNPHL*
>692452215 CYP758E5P 30% to 75B1 N-term frameshifted in five
different sequences
near KYG region
1006070983
862348410 probable pseudogene finished
686729933 80% to 692452215
note, there is no exact match to this seq in the trace archive
This is probably a poor quality version of 692452215
755690091 CYP81F like2 49% to 81F1, 4 aa diffs to 692452215
no exact matches in trace archive best matches = 692452215
MDFKSLQSCSNRLQQQDGGLHSTSTILVTAIATIAVLYVAMVYQRRKKLPPGPWPWPIVG
NLPLLLGSQKP
(frameshift) SLRKLALKYGGLMY
(bad boundary)
VQAGQKPCLVVSTAAAAKEIFRKHDATFASRPPTL
AFNILTAGAYRNLGYAPYGPFWRRLRRIANTQLFSPAVHASHEPIRRKEIHYMLKVLVED
SHKGKPIHLKSWLTSVTANNMTMMLSNKRIFEMGVDNDEKKRDFDEMLRRTFVVNGDFMI
CDYVPYLSFVTKLQGWVSEMQGLRALGASLVGNIF
QVDEHRERAQKMYPSDTDYVPDYID
VLLXTSLDDGDRLPDRDIVSL
(0)
GLLNAGTDASANTVEWAMAELMANPDIRKKAQAELDAVVQDRLVQESDIPNLPF
395
LQAIVKENYRMHPSAPLSVRHESHESCVISGCEFPAHTELIVNIFAIHRDPSVYENPDKF
575
DPTRFVRSPEVDPVAGNDFYQLMPFGAGRRMCPEQQLGNTMVTSMLANLLQCFEWETATS
755
ERESGGGEGDAVVVDMVDYYSFMSFRQKPLCLLAKPRPLASLLLQASP*
>815871008 CYP758E6P 993522654 77%
to 692452215 pseudogene finished
755690091
extends downstream, deletion before DRDIVSL
MDLKSSQPCSHWLQHQAGGIPSTSAIL
VTAVATIAVLYVAMVYQS*KKLPPGPWPWPIVGNLPLLLGSQKPKSLRKLTLEYGGLMHFQLG
VQAGQKPCLVVLTASVAKEI
FRKHDATFASRPPCRVFNILSEGTYRNMGYAPCGPFWHRLKRIVTTQLFSPAVYASHEPI
RRKEIHYMLKVLVEDSHKGKPIYLKS*LISVTANIMTMMLSNRRIFEIGADDDE*KMQLEELMYRT
FDSFASIIISDYVPYLSFVTKLQGWVSRIQGFRKSGESLIGNIFLDD
DRDIVSL (0)
GLLNAGTDASANTVEWAMAELMANPDIRKKAQAELDAVVQDRLVQESDIPNLPFLQAIVK
ENYRMHPSAPLSVRHESHEPCVISGCEFPALTELIMNIFAIHRDPSVYENPDKFDPTRFV
RSPEVDPVAGNDFYQLMRFGAGRRMCPGQQLGNTMVTSMLANLL
QCFEWETATSERESGGGGGDEVGVDIA
NHYSFINYRQKPLYLLTKPPRLASLL
SQR*P*
>755686556
CYP98 like CYP758E1 39% to 98A9 complete
863152685
876249180 815631660 BJ189287 BJ196404 BJ598039 BJ599425
BJ608223
710522627
54% to 692452215 BJ594604
BJ194881 BJ201010
BJ198114
MASAHSHTRRWWSQEAHGIRVSGEGTIATLLISSLVIYVTVVYQRRKKLPPGPWPWPVVGNL
AVLAGLPHRNLQNLAAKYGGLMYLQL (1)
GQVPCLVVSTAAAAKELFRTHDVIFSYRPKRLDHEIISGKSYKSLTSAPYGPYWRQIRRIC
NTELFSPAIHASHVSVRSEEIHSMMKVLLAESRTEKAIDLKSWLTGVTANNMTRMLINKR
(2)
FFGTGVSDQQEKKDFEEIFDHIFAAAGTFFISDFIPKLRFVEMLQGKIAKLTAFRK
FLHSVIGKIFEVEKHRQRALERGNDPIYVPDFVDVLLNTPLDNGERLTDREIISILS
(0)
SMIGAGTDTTATTVVWAMSELMVNPKIRKQAQEELDAVVGDSRLVEESDIPNLPFLRTIV
KETFRLHAPVPLSLPRCSEQPCEVAGSQFPANTRLILNVFAIHRDPIVYENPDSFQPSRF
VDHPEVDHMSGKDFYGLIPFGAGRRMCPGYHLGNVMVSLMLAHLLHSFDWRLPAGVTEEN
LDMSETYKLVGLRKKPLFLIAKPRSPAYLY*
>689472293 CYP758D1 692450239 BJ166421
36% to 92A14 52% to 755686556
863148306
1020712813 997099744 complete
MGFVEMTQNWRLWLQEGSNVSVYGTVLFVMFTTSCILHVLSAIE
RRKKLPPGPWPWPFIGNLGVVLRKTGARHKFLQALGAKYGGLMYLGL (1)
GQIPCLVVSSVRVVESMFKSHDATFSDRLQTYFRKVQYGDSAMRSLSSAGYGSYWRQVRRMCNT
ELFSPGTHASQEGVRREEIQNMLDVLVHECKRRKPIDLGDWLFGVSTNNMTRMLINKR (2)
YYGTGAEIPEKKEEFQGMVKSRTRAAGTFVISDFIPSLTFIAKLQGLPKRFRESH
ESAKAQMESVLDVEEHRKNAIARASVDIKSEYSPDFVDVLLKAPLDDGQPLADSDIKFLLT
(0)
DLMIAGTETTGITVEWAMVELMLRPELRKQAQEEIDAVVGADPER
FVQESDIQKLPFLVAILKETFRVHPVAPLNVMRSSYEPCEFAGYYLPAQTRLIVNQYAIH
RDPSVYENPDKFEPRRFMENPEVNPLSGRDSYQLIPFGVGRRMCPASNLAFTMALLMLAN
LLHTFDWSFPDGVTADNFDVSEEFLGTVLRKKTPTILMAKPRSHVQ*
>762521418 CYP758E4P N-term 828262863 755809804
816388645 862850609
pseudogene
stop codon seen in 9 seqs., 27 aa deletion finished
MDLKNSQSCTEWVQRQIGVQLTWAILVTAVATIVVLYVAVVYQRRKKLPPGPWPWPIV
GNLPVVFGSQKHKLAHKYGGLMI
(2)
VQPGQKPCLVVWTAAAAKEIFRKHDATFASRPSMLIFYILTGGRYRNLGFA
PYGPFWRRFRRIANTQLFSPAVHASHEPIREREIHSMLRVLLEDSYKGKPINIKSWLTSV
TANNMTMMLTNKRIFEIGADNDEKKRDLEDLMRRTFALM*SFIISDYVPW
27 aa deletion
GKNFQVDEHRKRAKEMDRNDTEMPDDIDVLLNTSLDDGDWLADRDIASL
GLMNAGTDTSANTVEWGMAELMANPEIRKQAQAELDAVVQDRLVKESDIPNLPF
LQAIVKETYRMHPSVPLSQRHESHQPCVISGWEFPALTELILNLYAIHRDPSVYENPDKF
DPSRFTRNPKVDPLAGNDFYELIPFGAGRRMCAGHHLGNVTVTSMLANLL
>686716102 CYP761E3P 713851494 38% to 705A3 stop codons seen in
at least 5 seqs
pseudogene finished
VFNIHKVKEIEGPILDKHQAFIDIFLNKMY*QDSNET
YQILTSPSLPCAQNLLVGDIY
451
TSMVTIKWAMFEMLQNSTIKAKLKAKLDTHI*KDKQLSKTNLPNLTHLQAITKETCLHS 275
274
MEHLLIPKQNKLELKI 227
>BJ976877 CYP753A1 BJ975911 BJ964760 BJ968388
BJ971180 BJ967389 BJ973295
BJ962654
824724989
830915456 31% to 77A3 complete
Similar to 77 and 89 families
MANYIIASNALVLLAFVTFFFVYFLRAFI
LRDKKLRPKYPPSPWKWPILGNLPQLLRGGPACHTTFRLLAKELGPVYNVWLGGSFPMVI
VTGEETVHEALIKQSSVFSSRPKLLSWQHISAGFKTTMTSPFGPHWQKLRKT
ISVDLLGPSKLASYKPIRDSEIQKLLARLREQA
HANAGLVSPLDQLRTSAVDVIMRIGFGEEFALMEAVNSNRRHAKIVELDRCF
RQLMDAGSIFQLVIDSSVVARTLLFPLARSANRNIE
TVADNTVSLVMPIVQQRKRYLQDHPATETRTFVDALISCKGESALTDLEIVW
NVVELMVGGTDNTSHILEWALANMVKYPHIQEKVYTEVRCAMGPNLERRLVEESELDKLPYLQAVVK
ESMRRHMMTPLAIPKLAAQDCKLSGYDIPKGTMVVFHAGALAMDDDIWTDPLNFRPERFL
AGTGSSNAPVTQTHKHAFMPFGAGRRSCPGAAMGFLHLHHLFANLIYAFEWGPESPRKAV
DFTEKFRMVVTMKNPLRATIKERTHFRMM*
>713878499 CYP753A2 716050461 716050845 716050461 710547221 complete
859967265 33% to
71P1 32% to 98A3 857998092 No ESTs
1003442146 61% to BJ976877
MEADHNRASDSLLILCVVTFLSMTLLLGYLGNVLGKEIKPKLPPSPPQWPVLGNLPHIMRKSAAL
HTTLRLLGKKLGPIYTLWLGRSFPLVIVTGEETAREAFVDQGHIFSARPTLISWQYISSG
619
YRTTMTTPAGAHWQKLRKIISHHLISPTNLARYIPVRDSEIQKLLERFRDQSRENEGVV
442
HPLQQLRISAVDIIMRIAFGDEFATLDNEGRKGNAVVLDRLFQEIMDAGSIFQIALDSSP
262
LTRTLLFSASRKCYSNIRTIAGQITDLVMPIVMERKRLLKERPLTEVKTFVDSLIMLKG
85
DDALTDMEIVWVVELMVGATDNTSHILEWIFANMITYPHVQQKAYEEVKRAMGPNLDRGLVQENHI
PNLPYIQAIVKESMRRHMMAVLAIPRIASRDCKLRGYDIPKGT
MVVLHAGALALNEEIWSDPLEFRPERFL NLDNEATHMHKMAFLPFGVGRRSCPGASMGLLHLHMLLANLIYRFEWGPE
SPGQAVDFSEKFRMVVTMKSRLRATITERSPV*
>686737904 CYP757A1 change to 758G1 36% to 92A9 36% to 703A2 no ESTs complete
815805623
856865465 857893627 39% to 755686556
MGALDHSNDMWLQILLALTLVSVVLTWILQCSSSAQKVHPPGPTPWP
(1)
VIGNLFLFFRAPL
PHRMLHNLAEKYGDLMYLRLGFTPCIVVSSPALADYIHKNHDTEFSSRPDGLITGILNGD
SQSVSMAKHGDLWKTLRSICWQILRPANIARYETRRMEEINIMLQSIQIAAEAGETVD
LSSMLYKLSSNSMTQMLINRR
(2)
YFTAGGNEENLREAVIF
KKMISERLKIASQFAIGDYIPYLRFIDYLFRYNAKAQEIQSMTMRVCDEIMNLEERRRRLTREENGDAQAVREEDFVDDLLSIQAEYTADNSRKIKLTDHQIKLLVQ (0)
DMLVAGTETSATTVDWAMAELLCHPKVLQQLRSEIVTVVGSRSAVTE
QDTKQMPYLNAVVMETLRLHPAAPLNLPRESKGACLFLGRYQLPAKTRVIFNTHSIHRSL
EAYDSPNAFKPERFLGVPQANVSGSSFFQLSPFGFGKRVCPGQALGTISVCAALANLVHR
FAWSLPCGLPPSHLDMIESFGLTAPRRCPLILLPTPRLKV*
>710925967 CYP758B2P 50% to 71A25 heme stop codon and frameshift
seen in 5 seqs.
Pseudogene finished 61% to CYP758B1
GRRICPGMSLAAINAPPILAHRVHSFD*RLTETLVN
RNPRELDMTEKSDSVMASRLYPLLIIAQPRKPAFLYYPKEA*
>755729305 CYP701B1 717596741 40% to 701A3 43% to 701A7 complete no ESTs
44% to 701A10, 43%
to 701A5
755836365
970783504 828200861 yellow
region uncertain
830512365
85% to 755729305
only seq in trace archive
closest match
= 848389444
= 755729305
This is probably a poor seq version of 755729305
MLNESTSGHSSDTCVQTSLGCRDGKRRLNEMLETKVIAHHVSHSPCAAIPGGLPVL
GNLLQLTEKKPHRTFTAWSKEHGPIFTIKVGSVPQAVVNNSEIAKEVLVTKFASISKRQM
PMALRVLTRDKTMVAMSDYGEEHRMLKKLVMTNLLGPTQVHDHR
VQQNPPCLKMCHVYASHSKGTPEEKIVCSPAFYRRISPSGMEFGCAEQKPI (0)
VEVLELGTCVSTWDMFDALVVAPLSAVINVDWRDFFPALRWIPNRSVEDL
VRTVDFKRNSIMKGLIRAQRMRLANLK (0)
EPPRCYADIALTEATHLTEKQLEMSLWEPIIESADTTLVTSEWAMYEIAKNPDCQDRLYR
EIVSVAGTERMVTEDDLPNMPYLGAIIKETLRKYTPVPLIPSRFVEEDITLGGYDIPK
GYQILVNLFAIANDPAVWSNPEKWDPERMLANKKVDMGFRDFSLMPFGAGKRMCAGITQ
(0)
AMFIIPMNVAALVQHCEWRLSPQEISNINNKIEDVVYLTTHKLSPLSCEATPRISHRLP*
>759473441 CYP761E4P 38% to 76C1 N-term pseudogene seen in 6
sequences finished
KQENGIGSSVRFLATRFCSQIMF*SQFVLHIIWHCRKLPPAPPEWPLI
GHLHLLGTHAHQSMAELAAK*QGILHLKFGLKGGVVVSIEAMSREIFKKHDLALSQR
CYP72 Clan
sequences (4 complete sequences, 1 finished pseudogene)
>DR578536 Picea glauca N-term half CYP72
clan seq.
MKIIWVGLAIASGYLVAPRVKQLWVDWIWTPNCLVQCCKRQGIPGFSHHEILKVLQL
RSMDTGQSWTILNSLRNHGMIFYTRVGKFVRLCVADPVFVRQILIDNAACYEKPSFIRSL
SAIGDGIFSAAGKDWFHQRQIFQPRFLSKEIKRKLELIKEAAIEALYDCEQNLQFNNEK
EIDIYQKFLELTLNSFGKFTFGSDLRSPMGEIFRSFDRYLYNNRKRMFGLASRLPGLLSLP
TSVNQEIKEDEYRLKMIAVDLLEKETTRQHFETDEEESLLALML
1 tttttttttt tttttgcaat ttgcgatgat gctactccga attagatttt tgtcatgaag
61 ataatctggg taggattagc gattgcatcg ggatatctgg tggcaccaag agttaagcaa
121 ttgtgggttg attggatttg gacacccaat tgtctagtcc agtgctgtaa gagacaaggc
181 attcctggtt tttctcacca tgaaattctc aaggtcttgc aattgagaag tatggatact
241 ggacaaagtt ggactatctt gaatagtcta agaaaccatg gaatgatctt ctacactagg
301 gttgggaagt ttgtcaggtt atgtgttgca gatcctgttt ttgttcgcca aatcctaatt
361 gataatgcag cttgttatga gaaaccatca tttattcgtt cattgtcagc tattggtgat
421 ggcatatttt cagctgctgg aaaagattgg tttcatcaac gacaaatatt tcaacctcgt
481 ttcctttcta aggaaatcaa gaggaagctg gagctgataa aggaagctgc aattgaggct
541 ttgtatgatt gcgaacagaa tttacagttc aacaacgaga aagaaataga tatttatcaa
601 aaatttctgg agttaacatt gaacagcttt ggaaagttca cttttggtag tgatttaaga
661 agcccgatgg gtgagatctt tcgtagcttt gatcgctatc tttataacaa cagaaaacgc
721 atgtttggac ttgcttcccg attacctggg ttgctgtcac tgccaacaag tgtcaaccag
781 gagatcaagg aagatgaata ccgacttaaa atgattgcag tggacctttt ggaaaaggaa
841 acgacccgtc aacattttga aacagatgag gaggaaagtt tgcttgcctt aatgctt
>686708190 CYP766A1 711871885 36% to 735A4 in CYP72 clan complete 44% to 715970850
BJ596980
mate = BJ193144
BJ601506
mate = BJ204578
BJ168764
825662336 936994386
1017450974 692499678
MMVEYSQSWTALAVVELSVVAITAVFVPLWNVCSTFLLEPLRLRRVMGKQDVRLAPFNLVFGNA
FEIGAHAQSFPETLPLKFDDLEPTATPQFDLYFSKY
(1)
GKRFLYHVGSETRLVVRDPEMAKEVLFNRMGWYERSPLDL
HIFSQVIGKGMFVVKGEEWEMQRRMLNPCFSNESLK (0)
PMVERMVKSAAQEMRNWEEMAAQAGGRVEHDVEHDIHIIA
YNIISYTAFNEGFDKGKQIYLMIYLMQDEIMGHLFAAGNPSFWIPGLR (2)
VLAGLLPTKHATAIAQLNGRTEKLIMELVKDRREAVQKGERDSYGDDLLGRMLTATERTDG
SSHKFILDAVINNCKNFFFAGSDSAANLTTFSLLMLANY
PEWQDRARKEVLEVFGDNDPCEMNDISRLKI
(0)
VGMISQEIARIFAVSPSIARLAVKDCELGDLLIPKGLVIEIATLAMHRDPELWGKDVAEFRP
ERFANGASAACTHHQAFLPFGAGPRSCIAEKISWLEVKVVLCMILRRFRILPSPKYK
HHPHFAMVNRPKYGLPLILEILPQSRSDSIMAEI*
>DR013841.1 Pinus taeda
EYTELIREALAQPMHNISHDIVPRITPQYHKWCQLYGETFFYWYGTHPRLYISEPELIKE
VLSNKFGYYGKETPRPFVLALLGRGLIFVDRLGWVKHRRIVGPVFNVDKLKPMVKKMAAC
TSSMLENWQEMMSQADSNGKEIDVHSEFRTLTADIIAHTAFSSSYNEGKEVFELQRELQA
MVAEAERSVYIPGSQSIPTRRNRYAWKIDRRIKEILNSIIQSRLESRTTSRTQVGYGSDL
LGIMMTANQKELGGSQRDLSMTIDEIMDECKTF
>715970850 CYP766C1 755717789 755842297 710486501 755717789 44%
to 714A2
890572813
832104723 complete 44% to 686708190 no ESTs
MVFTQWVRFAALAIPEDVRN
ALGVVLLAFVASAIVRVVFSLVKTYLYDPLSIGRIMAKQGIEGPPFHPIFGTTAELNAY
VKSVPESLPLDEDHDSMRTVSPHFHMYFPKF
(1)
GKRFLYWRGPHAKLVSKDPGLAKEVLLSQYEFFQRHPQDIKMLSNFVGMGLDNLTGEKWA
IERRTLNPFFYHDPLK
(0)
GMVEGMVKGAEPVLKSWEEEVARAGGTAEFNLEEDLHTISGNIIAHTAFGT
DHEKAKEIYQTQREYVNLLFQNLHSGWYWIPGFT
(2)
YLPTQTNVTMARLRSTIDSSLHELITERRKAAERGDTASYGNDLLGIMLAAASNST
DETATEFNLASVFNNAKLFFFAGQDTVATVLTFTLLQLARYPEWQDRARQEVLEEVGE
TEAYDSTTLNRLKI
(0)
VGMIVNETMRLFPAVISVSKVATKDMQINELFIPKGLTVEIPIVSYNQDPEIWGDDAHKFKP
DRFEHGVSKACKHPRAFLPFSMGPKMCIGKEFALMELKLVVAMVLRRFLSVSPHYK
HHPYSSLLTRPKYGMKLIFSSRQASKLEH*
>BJ196684
CYP766B1 complete
BQ827188
mate pair = BU052459 816039785 883669031 755835841
33% to
709B2
METVPVNVRNALAVVVASVIVYSVIKFLRVSVWQPLRLRRIMAKQGVSGPPF 168
169
RFVRGQFVEMWKFTESFPDALPIDDFANLTPTVTPQNALYYPKYGKIYLYWWGTITRLAV 348
349
RDPKIVKELMVSNHESLTRLQSESQFLAEVVGKGLLTQVGEKWASERRTLGPFFHQKSLE (0)
GMVGAIMEGAATELQKWEQEVEERGGTAELDVEPDLHKISGRIISRTAFGDEFEIGEQIF
KFQTLLSQELLKGFRSTAYWLVPGYR (2)
NLPTKRNRSMNLYGSQVDALVRGIINARREAVQKGVTSSYGDDLLGRMLTAATEGWSANTKEFNQL
AVFNICKFFYFAGQDTVANAIGFMILMLALYPEWQDRCRQEVTEILGDEQDWRASDISRL
KVVGMVFNETLRIFPPASTLTRVAAKDLQLEGLFIPKGMAIEFSLAAMHQDKDYWGDDVGKF
NPERFVNGAASACTHPQAFSPFGLGPKFCIGNNFAVMEAKIVLA
MMLRRFQLVLSPNYKHHPTSIMVQSPKFGLPIILKALKIT*
>857906422
CYP766B2P
857979019 80% to BJ196684, pseudogene finished
defective at EXXR and PERF motifs
1029015881 goes upstream
982645139 goes upstream
1036166944 goes upstream
756808170 goes upstream
MEAVSVNVRNAVAVVVASVIVYSVIKFLRDSVWEPLRLSRIMAKQGVSGPPFRFLLGQYMEMVK
FTESFPDVMPINDFANMSPTVTPQNALYYLKY
(1?)
GKMYLYWWGTMTRLAVGDPKFVKELLITNHDSLTRSRIENQFVAEVVGKGLLSQEGEKWT
SERRTLGPFFHQKSLE
()
GMVGAMVEGAATELQKWEQEVEKRGGTAELDVEPDLQKISRRIISCTAFGDDFE
IGEQICKRQILHSNELWKTFRSAAYWLVPSYR
()
NLPTKGNRSMNLYGSQVDALVRGLINARREAVQKGVTSSYGDDLLGWMLTVATEGWSAN
TKEFNQLSVINNCKLFYFAGQDTVAKAIVFTVLMLALHPEWQDRCR
25 QEVTEILGDEQDWRACDISHLNV (0) 96
195
VGMVLN*SMRLFPTAFQLTREVVKDLQLEGLFIPKGMHIEFSVMAMHQDKDLWGDDVGNEIC
NGVASACTHPQAFNPFGLGPKYCIGNNFAVMEAKIVIAMILRRFQLVYSPNYRHHPT 552
553
VTMLQEPKFGMPIILKALKIN* 618
>BJ581794 mate = BJ176571 CYP765A1 (N-term), BJ611461 mate = BJ192135
untranslated 5 prime end
BJ585659 mate = BJ180400
(N-term)
759450777
850637393 876269284 830401001 complete
Extend upstream with 860052391
Extend downstream with 756812101
Extend downstream with 832110254
Extend downstream with 830750922
32% to 734A1 31% to 72A7, 31% to 709E1 (72 clan), intron boundaries not very certain
green and cyan supported by ESTs from moss, grey supported by ESTs from Picea, Pinus
MPLAVAILYAANKLALAPALLHTMTIITILTWILGGALTLGLGFIVKEWLWNPLMLIEL
CKRQGIKGFPFVPFVGQMPAIDE (0?)
VLSGRNRRVQKQDNDEVEDEDRLTAVTNCYRNH
GSTFYFTVGRTVRLSIADPPLIKDILIANSESYSKPLHIRKLGVLGDGIFASSGSTWSP
QRSLFTGAFHTKEVK (0?)
SKIPTMIDCAHSAVEKWSRELNDGYSELDMYQKFAELTLDVIGKTAFGTEEIGGASEAAS
VIGSFNRYLLYCRELVFGPPATFPTSL (?)
KWLRTYMGRIISARRNSHHSGAAETVSDRHDLLDVIIG
AVDNIGHSEEGAKKALNEAPDQTISEKRKRAAEMTRLTEKRLLDNALTV
LLAGHETTASLLTWTIYLLAEHPLWQKRARAEVEEFCPGGVVEPQVLSHLKLLGMILL
ESLRLFPPVPLIGRMCIKDNKVGPDLLIPEGLEIVIPVAVLHRDRTIWGDNADEFAPARF
GNGISGACGNPLAFLPFGAGPRTCIGQTLALSEAKAVLAVMLPLFSWKLSTSYRHSPDV
TLTMMPEFGMPVVLEKIEK*
CYP74 Clan
sequences (3 complete sequences)
>CYP74x CYP74G1 AJ316567 divinyl
ether synthase 12-AUG-2002 ESTs BJ969377.1 complete
hydroperoxide
lyase (hpl gene) later designation 21-SEP-2005
39% to
74D1 39% to 74C1 41% to 74B2 42% to 74A5 39% to 74E2
38% to
74F1
BJ583519
BJ158039 BJ600124 BQ039802 BJ588073 BJ167917
BJ965095
BJ588288 BJ172167 BJ170523 BJ166295 BJ596848
BJ604932 BJ606484 BJ166967 BJ167535 BJ971827 BJ165862
BJ609895
BJ604409 BJ198021 BJ159658 BJ191836 BJ162716
BJ158982
BJ158406 BJ973612 BJ182787 BJ963293 BJ962527
BJ198621
BJ178387 BJ203304 BJ196986 BJ971054 BJ193013
MDRTLVLTCTTTCSHSAFRQSALPSNTSISVRLGTCSVRTQKR
RTVVASLGNIETTSTSTVGQESNLPLREIPGSYGIPYLSQLLDR
WTFFYREGEPQFWQSRMAKYGSTVIRSNMPPGWFWTDSRCIMLLDQKSYPTVFDYDKV
DKYKAFAGTIMPSTEYNGGYEVCAYLDASDKKHEQLKGYCFELLKFSSSKWAREFHTA
ISETFNQWEGKLAQKTPALINPTLPESLFSFVINALTTARFDDSSIPDAEKPVCGDLQ
KWAGFQLMPVIRTGAPIYIEEMLHVAPIPASLTKGGYDKMVVFLQKYAAETLSIAEKF
GLSQDEAVHNLIFFLILNAHGGFCRFLPVILREVAKNGQLQADLREEVRAAVKASGSD
QVTMKAVMNDMPLVASTVFEALRFDPPVPFQYARAKKDFIIESHDARYQIKTGDFLGG
VNYMVSRDPKVFTDRPNEFNARRFMGPEGDKLLAHLVWSNGRQTDETTVYTKQCAGKE
IVPLTGRLLLAELFMRFDSFNIEGLEMEATFTSLTPRSD*
>CYP74y CYP74A1 AJ316566.1 Allene oxide synthase complete BJ588378
BJ158304
BJ166090 BJ174856 BJ177756 BJ178183 BJ166090 BJ174856
BJ177756
BJ178183 BJ180158 BJ183079 BJ186758 BJ187084 BJ187922
BJ191679
BJ194574 BJ195562 BJ197445 BJ580030 BJ583315 BJ585498
BJ587579
BJ592267 BJ597789 BJ598633
46% to
74A1 40% to 74E1, 40% to 74F1 47% to 74C1 40% to 74B3
37% to
74D1
MAVPSSKLPLKAIPGDYGVPYFGAIKDRLDYFWLQGEEQFYRSR
MAKYNSTVFRVNMPPGPPISEHPQVICLLDQKSFPILFDVSKVEKKDVFTGTYMPSVS
FTSGYRVCSYLDPSEERHTKLKQWCFEVIAMNGRNFLPEFHKSIEESMVLWETSLAKG
EKTSVSDEVKQFAFNFLMRAVCHHDPAAPGEYSLGRNGGPYATAWANPQLAPIAGQTG
LPHVVEELVLHTVPLPSALVKKNYDALYNFIKNYATEALDRAEAMGIERNDATANLLF
FLCFNAYGGFSIFFPLITILISSCGPELMHDLHDEVTKAVAATDGKVTLQSIENMPLV
KSVVYEAFRFKPPVPYQYGKAKFDFTIENHENSFEVKKGEMLYGYQPIVMHDPKVFSD
PDQFLPRRFMGPDGEKLIKYIFWSNGYETDEPTTANKQCAGKDLVVTMARAFVAEMFL
RYKEYTLTMEGAGNATKVFFSDLKK
>CYP74z CYP74A8 856891456 70% to
74A1moss no ESTs complete
1028581451
1009321611 moves upstream
goes downstream 1036017357
MAVPVSNLPLRAIPGGYGISYLGAIKDRLDYFWIQGEEEF
YRSRVEKYNSTVFRVSMPPGPPIAKDARVICVLDQKSFPILFDVNKCEKRDLFLG
TYMPDLSYTSGHRVLSYLDPSEVRHEKLKQWCFDLIARNGRKFLPEFHTAMEESFAVWEE
AMEKGENANLSEEVQQFAFNFLVRAVLHHDPVAPGEASLGKNGGPYASAWHGPQLAPIAGQT
GLPHAVEELLHTIRLPSSVVKEQYDALYNFFKTYGGEELDRAVALGIKRDDAIANLLFL
LGFNAYGGFNFFFPQLTVHIAQCVPELMHELHEEVVAAVQATEGKVTPKSLENMPLLSSV
VYEGFRMKPPVPYQYARAKTDFLIESHENSFEVKKGEMLYGFQPYVMHDPNVFENPDKFL
PRRFMGPEGEALLGNVFWSNGRETDDPTVHDKQCAGKDLAVTISRAYVAEM
FLRYKEFTLEVQGSGVQTTLLFSALQKA*
>CYP74B3v1 Lycopersicon esculentum (tomato)
AJ239065 58% to 74B2
IPIMNPAPLSTPAPVTLPVRSIPGSYGLPLVGPIADRLDYFWFQ
KPENFFTKRMEKHKSTVFRTNVPPCFPFFGSVNPNVVAVLDVKSFSHLFDMEIVEKAN
VLVGDFMPSVVYTGDMRVCAYLDTSEPKHAQIKNFSQDILKRGSKTWVPTLLKELDTM
FTTFEADLSKSNTASLLPALQKFLFNFFSLTILGADPSVSPEIANSGYIFLDSWLAIQ
LAPTVSIGVLQPLEEILVHSFAYPFFLVKGNYEKLVQFVKNEAKEVLSRAQTEFQLTE
QEAIHNLWFILGFNAFGGFSIFLPTLLGNLGDEKNADMQEKLRKEVRDKVGVNPENLS
FESVKEMELVQSFVYETLRLSPPVPSQYARARKDFKLSSHDSVYEIKKGELLRGYQPL
VMKDPKVFDEPEKFVLERFTKEKGKELLNYLFWSNGPQTGRPTESNKQCAAKDMVTLT
ASLIVAYIFQKYDSVSFSSGSLTSVKKAS
>CYP74C1 Cucumis sativus
fatty acid hydroperoxide lyase AF229811
MASSSPELPLKPIPGGYGFPFLGPIKDRYDYFYFQGRDEFFRSR
ITKYNSTVFHANMPPGPFISSDSRVVVLLDALSFPILFDTTKVEKRNILDGTYMPSLS
FTGGIRTCAYLDPSETEHTVLKRLFLSFLASHHDRFIPLFRSSLSEMFVKLEDKLADK
NKIADFNSISDAVSFDYVFRLFSDGTPDSTLAADGPGMFDLWLGLQLAPLASIGLPKI
FSVFEDLIIHTIPLPFFPVKSRYRKLYKAFYSSSGSFLDEAEKQGIDREKACHNLVFL
AGFNAYGGMKVLFPTILKWVGTGGEDLHRKLAEEVRTTVKEEGGLTFSALEKMSLLKS
VVYEALRIEPPVPFQYGKAKEDIVIQSHDSCFKIKKGETIFGYQPFATKDPKIFKDSE
KFVGDRFVGEEGEKLLKYVYWSNERETVEPTAENKQCPGKNLVVMMGRIIVVEFFLRY
DTFTVDVADLALGPAVKFKSLTRATASV
>CYP74D1
Lycopersicon esculentum divinyl ether synthase (LeDES)
AF317515
MSSYSELSNLPIREIPGDYGFPIISAIKDRYDYFYNQGEDAWFH
NKAEKYKSTVVKINMAPGPFTSNDYKLVAFLDANSFVCMFDNSLIDKTDTLGGTFKPG
KEYYGGYRPVAFIDTKDPNHAALKGYILSSFAKRHNLFIPLFRNTLSDHLFNNLEKQV
TEQGKADFNALLPTMTFDFIFRLLCDQKNPSDTVLGAQGPEHLRKWLFPQLIPSLSAK
KLPNIIEDMLFHNFLIPFGFIKSDYNKLVDAFSKSAVSMLDEAEKLGIKREEAVQNIL
FLVGINMFAGLNAFFPHLFRFVGEAGASLHTQLAKEIRSVIKEEGGAITLSAINKMSL
VKSVVYETLRLRPPVPLQYGKAKKEFMVQSHDASYKINKGQFVVGYQPMASRDPKIFA
NPDEFVPDRFMNDGEKMLKHVLWSNGRETESPAPDNKQCPGKDLVHLLGRLILVEFFI
RYDTFTLEITPLFRAPNVAFNTLTKASK
CYP85 clan
sequences (5 complete sequences, 2 finished pseudogenes, 1 unfinished pseudogene)
>BJ975178
mate = BJ966664 (N-term) CYP763B1 34% to 90C1 complete 85 clan
890330488
982549587 35% to 85A1 rice, 41% to BJ580390
MDGRLFLQGLETVAFVCVSVLLISQLWPKNEERAKINTR
LPRGSYGLPLVGETLKYMASMMTSAPAFMAEHRQKYG
EMFKSKLMGAFCIITTKADTIKWVLNHEGKQFVTGYPKSFRKVLGEYAALSLHGDQWKSTRR
FLVNSLRVELLRERIPTIEQAVLENLNPWAAKESVSIREETKTLAFNVVAQYLLGSR
696
LKSGPVNDSLRNDFYTLTEGLFALPINLPGTQYRKGLEARARIIETLERDVVSHARPVGD
876
EDQYADYMDYMRKENLPGTTEELL
LEKTRCHVLGMLFAGHETAASA
MLFAVKYIMDNPRVLNELRAEHENIRISKFEGGSLTWDDYKNMRFTQSVITETLRLAN
PVALLWREATEDVQLNG
YVIPKGWKTVCAIREAHHDPALFDRPSEFNPWRHEQEVMNPAKKLPLLGFGGGPRYCPGA 587
ELARAEICIFLHHLVTKFDLKSCGEETVSFFPVPKFSNGLQVQVQERDLSTRISHKIRVH*
>755830456
CYP763B2 85% to BJ975178 CYP85 clan 34% to 90C1 complete
no introns
824727620
821875103 no ESTs
MDTTLVLHGLEIVAFICVSLTLIMQLWSRNNEQAKINKRLPGGSFGLPLLGETLKYMASMKTSM
PTFMAEHRQKYGEMLKSKMMGAFCIVTTKSDTIKWVLAHEGKQFVTGIPKSFRKVLGEYT
ALSLHGEQWKSTRRFLVNSLRVELLKERIPMIEQTVLENLNSWAIKGCVSIREETKTLAF
NVVAQYLLGSRLKSGPVNDSLRNDFYVLTEGLFALPINFPGTNFRNALEARARILKTLEE
DIVSKPRPAGDEDQYVDYMDYMRKENLPGTTDELLREKTRCHILGLLFAGHETAASAMLF
AVKYIMDNPRVWNELRAEHDNIQVSKFEGGNLTWDDYKNMRFSQSVITETLRLANPVALL
WREATEDVQLNGSLIPKGWKTVCAIREAHHDPEFFDHPHE
FNPWRHQHEVLNPAKKPPLLAFGGGPRYCPGAELARAELCIFLHHLVTKF
DLKACETEIVSFFPVPMFSNGLQVRVQERDPATKCSQKIMAY*
>815711319 CYP763B3P 76% to BJ975178 pseudogene 692447055 816362600 finished
KNIQISKFEGGSLTWDDYKTMRFTQSVITKTLRLGNPVALLWREAMENVKLNEKVIPK
CWKMVCAIQEAQLNSALFDPPYEFNPWRHKQEVMNPA*K
LPLLAFGGAPRYCPGAELAPAKICI
FLHYLVTKFNLKSCGEDTVSFFPVPKFSNGLQVQMHEHDSSTKI*HKIKVY*
>774548359 CYP763B4P 993478971 978014995 finished
34% to 707A2 seq middle may be a pseudogene fragment
cannot extend up or downstream
DQIRHELHTLLRGLRALPINLPGFTYYKSRK
(0)
AKLVLVSLMLKSIAKRREANIELNDFRNNLMKMTAIEVPDYDICCIMVVFMF
ASTDTTSALISWVVKYLHDFPEVRQRVQ
(0)
>BJ580390.1
mate pair = BJ175156 CYP763A1 36% to 90A1 complete
85 clan
BJ186430.1
mate = BJ593857
BJ186356.1
no mate
BJ175895.1
Mate pair = BJ581114
1020660301
1020679587 883835965
MAELGVSEMERMNTFGIGADAQRGLGAGMTLPLLFLATVVW
WIWQRHKANLESGLPGTFGLPFIGETLTYVAKMKSPLGNFVDEKTKR
(2)
YNGAQAFKSSLFFQPTVIATEVETVKMIVAKEGRSFVSNYPSSFALLLGRFNGLNMNGENWKRL
343
RKFVISHIMRVDLLKERMADIEDLVVRTLDSWADDEGRTIYVEDETKT
(0)
IAFNITALIVLNLKPGKVSQTMQRDYYPLIEGMFSLPINLPWTIYGKATQ
(0)
ARVRILKTLEEFLQSRTVK
DDVFDNYVQLLQEELPPGSPPALKHEMGLDLLTSLLFAGHDTTAATMVFSVKYIGENPKVLAELR
REHEELLKRKQPGERISWDDCKTLSFSNS
(0)
IITETLRMCNISTTVFRKSLEDVHVG (1)
DYVIPKGWLVLPYFRAVHFNPSIYPDPYTFNPFRYQDAAGSKLPFFGFG
GGARLCPGMDLARAELCLFLHHLVMKFESWELLGNDVVSYFPFPRLSA
RLPIRVKRRTPPQQPST*
>755808818
CYP762A1 33% to CYP707A3 complete in CYP85 clan
831702982
785869234 836314374 BJ969496.1 BJ160608 BJ162335 BJ977796
BJ168851
green supported by ESTs BJ170173 BJ965146
MATVSLQEPGLVVGLFLGAPLLLFLYILYYAISLHTTSVEGVR
VPRGNFWLLPLLGESISALTVPPKQFIDRQTRK
(2)
YGAMFTTHIGGDPMIMTTDVDLTRWVYQQTNRLFSVLSPKATYELLGHESIFYAKGDHH
LRLRKVFAGYLSTQKLVPFTPRIDKMAASIMESWKRKERVIVFDEAKM
(0)
YAIHLALAQLISIDTQEYPCMDHIFAHVPGENRLEKLVYLHYDIESGMMSVPLNIPGTAYHKANK (0)
AKILFRKALKVIINERRTGDVKCNDLLEGLLSPLEDGTLLDDEQVMDN
VITGVGAAEVTTTTALVWMVKWIQENPELHRELQ (0)
NEMDAIKKTKANGEELTYDDIKKMNLTLW (0)
TMYETLRLRKVTGFFIARTADQDVRYKDVVIPKNWVVAMTHGYHLDPNYYPEPEKFNPYRFQTMP
PAHTFTPFGASVRLCPGKEMAKIEILTFMYHMLTSFSWEPAEPEGETIWHLFPHPRNKLPIKVTPRT*
>CYP716 like CYP716F1 complete AW739242 AW497090 BJ610569.1
BJ600055.1
BJ599808.1 BJ597366.1 BJ592026.1 BJ167019 BJ586914
BJ172516
BJ590596
BJ590880 BJ606611 BJ599271 BJ611487 BJ164618
BJ608000
BJ193730 (N-term) opposite end of BJ597366
BJ181980
opposite end of BJ586914
BJ192165
BJ187702 BJ159038 BJ160786 BJ206410
BJ196921
BJ196248 BJ194582 BJ196596 BJ187985 BJ184639 BJ200159
BJ161170
BJ164034 BJ191214 BJ159266 BJ169016 BJ171514 BJ167564
830444348
44% to
Picea AY779542.1 CYP716B1, 38% to 716A2
MAHQQHGFLEGHAESTPAWAAVAAVVAMLVGWLFWRLFSVSPESQ
GKLPVPPGSFKWPLLGETLDYLDCARRNRVADFFNARVAKYGETFKTHILFNPTVSVTAP
DGNKFLFANENKLVQNHWPPSVSRLLGEHSMATKVGEEHRRARRVYTNFFK
PEGLQSFVPRIDELARSHNSKYWEGKEFILGGPTVRDFTFAVAADLFL
SLKHDDPMFRPFELAACDYLAGILQVPINLPGTAYRKGILGRESQLRVIDMSLKQRRQ
(0)
EMKEGRV
PPQQDLMSVLLNTLNEDGTPMSDDQIKDNMLLFVFAGHDTSSSALAG
LLKYLSLNPECLKKVLEEQMEIRKEKGGEDIPLSWDDTRKMKYTWRTIQETLRLQPSVQA
AFRTVIEEFEYDGYTIPKGWTIFWSVGRSHRNPKFFPDPEKFDPSRFEGTGPAPFTFVPF
GGGPHICPGNEFARTEILVYIHYLVLNYEWEMVDPTEDVCIDPMPLFTKQLQLRVRKRFPSL*
>830745357 CYP716F2P 830636975 37% to CYP716A1 44% to 716B1
53% to CYP716 like
Frameshift and small deletion after DKRR seen in 3 seqs (937552998, 830636975, 717629207)
pseudogene finished
986842815
MEGLHYVWKQVGAHTGVWTVGATLLAVLTGWLLWSTTAVPTRNPPVPPGSFGWPLVGET
LDQLDAAKANQVVKFYATRVAKYGE
(0)
VFRTHFLFNPAVSMGAPEGNKFLFGNENKLVQNSWPGPVTRLLGKNSLTVLVGEEHKC
VLAPCQLFCWNWFQISLC
(0)
RQLYVGTPPSTGEGKGARILGVPTAKEFAFTVAADLFMSMDNHDPLYRLFAQA
HEEFVTGFFKIPIYLPGSAYRKALQGREEQRRIIGTIIDKRR
EGINPPHDLLNVMLTVPYENDSFMTDDAIKDNILLMMTASHDTSSTTIAFVLKYLYLNPECLKEVIR
(1)
EQLAIAKDKRADAAVTWEDTKNMKYTWRAIQETMRLQPPVQAGFRRAIKDFEFGGFSIPKGWT
(0)
LIWSVARSHMSPKFFPDPEKFDPSRFEGSGPPPYVFIPFGGGPHICLGNEFARLE
MLLFLHHIVLNYEWEMVDPNEQVSITPVTHFKKGLELILRKRRFE*
>picea sitchensis 716B1
MVWKEAVSVLQKAQELKEPPLMFTVFLASFIGLAFFFYLISNHR
TKAWRGIPPGTFGWPLIGETLEFLGCQRKGNPRDFFDSRTQKYGNVFTTSLVGHPTVV
FCSPEGNRFLFSNENKLVVNSWPSSVGNLFRSSLITTVGDDAKRLRRILMTFLRPEAL
REFVGRVDSMTKRHLAEHWIGKDEVMALPLLKRYTFSLACDLFASINTKDDLDRLWLH
FMVFVKGVMQIPIDLPGTRYNKTKHAANAIRQQLGSIINERK
IALEAGNAS PEQDLLS
FLLSNVDEQGESLTDNEIQDNILLLLYAGHDTSSSTLTVLLKFLAENPHCYEEVLREQ
LNIAGSKEEGQLLEWEDLQRMKYSWRVAQEALRLFPAVQGSFRKAIKEFIYDGFTIPK
GWKLHWTVNSTHQKSEYFSNPEKFDPSRFEGEGPPPYTFVPFGGGPRMCPGNEFARME
ILIFLHNIVKNFNWNLVNPLEKVIVDPMPAPVNGLPIKLVPHD
CYP86 Clan
sequences (10 complete sequences and 5 finished pseudogenes, 1 artefact seq.)
>CYP86 like1 CYP86F1 710546622
831723548 complete
BJ183632.1 BJ976817 BJ584415 BJ585375
839341101
830649952 774610095
711878238
CYP86 like 48% to 86A8 Arab., 48% to 86B3 rice
MQYLVMSRGDNCTHFHSQNQGQTGLGMCTGPNR
MDSWMLTQVMLAGVVTFLVWHVIKYSRIKGPIVWPVFGTTPQFLWNLPRMHDWTTDMLVKHDGTYTS
IAPKCTCLTAVATCR
(2)
PENLEYVLKTNFANYPKGRSFTYPSHDLLGQGIFNTDHDLWKMQRKTASLEFSTRTLRDLM
VKANRSSVGQRLLPVLADVARNRA
PIDFQDLFLRYTFDNICMVGFGVDPGCLAP
GLPTVPFAQAFDLATEGTLTRMVVPEIFWRITRALGWGMEGRLAKAIST
IDKFAADVITERRRELNMLKTLNATEYPCDLLSRFMQTTDHEGNPYTDRFLRDVTTNFIL
AGRDTTAIALSWFFYLITQNPAVEEKILNEIREILQSRRQSGGVGEPDDDDAGRTTQEAS
LSFEELKQLHYLHAALSESMRLYPSVPIDNKDVTADDFLPDGTFVRKGTRLMYSIYSMGR
MESIWGKDCLEYKPERWLRNGVFTPESPFKYAVFNAGPRLCLGKELAYLQMKSVASAILR
NYHVKLVPEHKVEYKLSLTLFMKYGLHVTLHPRVTVAY*
>755797498
CYP86B like2 CYP86F2 49% to 86A8 91% to 86
like1 complete
1000171656
774610095 BJ183632 BJ180038 759454272
MAMNEGDNYTQFHSHNQAYAGMEMYIGQYRVESWMLTQAIITCVVAFLVWHVLKYS
RIKGPIVWPVFGTTPQFLWNLPRMHDWTTDMLVKFDGTYTSIAPKCTCLTAVATCR (2)
PENLEYVLKTNFANYPKGRSFTYPSHDLLGQGIFNTDHDLWKMQRKTASLEF
STRTLRDLMVKANRSSVGQRLLPVLADVAK
(2)
KRIPIDFQDLFLRYTFDNICMVGFGVDPGCLAPGLPTVPFAQAFDLATEGTL
TRMVVPEIFWRITRALGWGMEGRLAKAINVLDKFAMDVITERRKELAMLKTLNATDYPCD
LLSRFMQTTDHEGNPYTDKFLRDVTTNFILAGRDTTAIALSWFFYLITQH
PAVEEKILLEIGEILRSRNHGQDKEAADDDATRIT
QEASLSFEELKQLNYLHAALSESMRLYPSVPIDNKDVTADDFLPDGTFVRKGTRLMYSIY
SMGRMDSIWGKDCMEYKPERWLRKGVFTPESPFKYAVFNAGPRLCLGKELAYLQMKSVAS
AILRNYHVKLVPGHKVEYKLSLTLFMKYGLRVTLHPRVTVAY*
>815804922 CYP94G1 852116747 830503673
1020611714 complete
cyan = possible intron with GC boundary
46% to 94D2 after subtracting insertions
MMDRELVTLLYTAGILLVVTLWCIWYHHPKYGKNRGPKVY
PLLGSYLSLLHNKSRILDWMVDLIRDSPTMTVRTVRPGGRQ
FHITAGPANVEHILKTNFENYPKGENSYANLHDLLGNGIFNIDGKSWKLQRKVAS
HEFTTQSLKNFMVGAVHDELRGRFIPVLQECCNTGRTVDLQDLLARFTFDTICKLGFGVD
PACLDLCFPSVRFANAFDTATSITANRFITFASVWKTMRALNVGS
EKKLRAAVADIDDFAMFVIQNRRKQVAGQSNRQTDNSSDADDAAHLDLLSRFMGLTA
ADQDRRDFDTQDPSCDQ
NEGPQLGYSDEFLRDIVISFILAGRDTSTSSLTWFFWNLEHHRQV
EDAICKEVSEILKNRLVEDKDHNKHVPTSFFSFEELKKMHYLHAAVSESLRLYPPVPIEM
KLAHSSDEWPDGTRIDPNSTIIYHPYAMGRMERIWGPDCMKFKPERWLKDGVFVQESPYKHAVFQ (0)
AGPRMCLGRELALMQIKMVVAVLLQRFRFSSQKGFTPEYDLNLTLPMKNGFPVSVQSKVPM*
>CYP94D like CYP94H1 AW561645 44% to 94D2 complete
816253695
1020656227 876286269 BJ606150 BJ606502
MSTPNYMSPEMGRFEKWALLLREESEEHTLAFVATILFVAVNALIFIWWHHPLYGKN
IGPRVYPFVGSLPSAIQHAHRLLDFSVETLRKSPTLTIRYVQSGYTAYSTANVENVEYVL
KTKFDNFVKGERMGDVLFDLLGRGIFNADGNLWKLQRKLASHEFSSRSLREFGVE
CVQKELQNRLVPVLSQFSENGNVVDLQDLLMRFSFDNICQLGFGVDPNCLEPSLPPVKFA
EAFDKANECTLLRFRTFPIMLRLYKFFNIGIERGLKESMAVVHNFAQEVIEARRKEFNEN
HGDIGHARQDLLSR
(2)
DAKEKQKASDIFLRDMVISFVLAGR
DTTSLGLSWFFYALGHNPHVEAKIYDEIKEQLQLQAQEDDSLPSSRPPGQLFTFEQLKKL
HYLHAALHESLRLFPPVPWDSKHAVRDDVLPDGTVILKGERVTFNIYAMARMEANWGPDC
NEFKPERWLKDGVFVPESPFKFATFQAGPRICLGKEMALIQMKLVASSLVYCFKFTLLED
PPRTCLSFVFKMLNGFPGDVHKRAVST*
>815609477 CYP94H2P 61% to 94D like pseudogene stop codon seen
in 13 seqs
finished cannot extend
DGFLRDIATSFVFTGRDSTSVTLSWFFPSLCLNP*AEAK 466
>710485080 CYP94H3P 713838121 40% to 94D4 probably a pseudogene finished
it has
frameshifts, deletions and stop codons
997462133
998782531 713858321
824686025
= poor seq version of 710485080 58% to 710485080 artifact
note: top part of this seq has only one match in trace archive
as we move down it begins to match 710485080
bottom part matches 710485080
This is probably a poor quality version of 710485080
MDNSTVTLHKLECNPPKLFESSFGVWTATCIVALSFLYKWWKNHPLYGKN
PGAKAYPLIGNLVEFLWNITRIYDYETDLMRSKKILTIRNSLTYCRTTYMTANPAN
VE*ILKKNFANYPKGEYIRNTFQDYLGEGILNADGDIWKMQRKITSNEFSLRFMRNFMID
NAQQEIDIRLLPILLRASDDQKQ
LLDMQDILKRFNFDNICTLTVGINPGCLDLSLPNVEFEQAFDEAAACIAHRFTNVFWT
LYKALNIGYERRLRDSVPHIRNFIMPVI*ERRKEMTESSNKVHRVDLLSRFIDHGTDD
SEKLSDQFLCDMLVSFFQAGRDTIAMGMTWFFLELTSH
DFSNPQFQLRGCVL
SFDELKSMHYLQAALFESMRLHPSIPADQKVAASDDVWPDGTVIRKGETAGYSPYIMGRMEAL
WGPDVMEYKPERWLKDGVFVPENSYKFPVFQARPRICLGKDMGIMTMKLIAAS
LLQRFTLSVPDGFQSLYHVSVVLPMSGGLPVRVRRRS*
>BJ589415
CYP704B5 BJ592550 BJ584707 BJ588917 BJ593364 BJ581728
49% to
BJ957091 55% to 704B2 complete
976083649
869925771 1006228277 869807368 859499381
830613907
contains an insertion between J-helix and K-helix compared to 704B2: TTIDESWTLNQRIIQ, seen in EST
MAEEISWAAPRSGAATSVVLAVTTLLWIIVFRWRQRHRLGPKEWPILGSALEI
TSHFGDMHDWLLSYFEKGLKTFRVVIPGVVYTYTVDQANVEYILKTNFANFPK
(0)
GELYHRHMETLLGSGIFNVDGEEWRQQRKTASFEFASRILRDYSTVVFRNNAVKLAE
IIARMSQTQEPIEMQ
(0)
DLFMRLTLDGICTVGFGVEIGTLSPSLPAVPFATNFDNANEAVTY
RFFDPIWPLKQRLNIGNEAVLARSVKVVDDFTYKVIQTRRTELQYISSQGKEM
(0)
KADLLSRFILMGENSEESLTDKMLRDVILNFIIA (1)
GRDTTAVTLSWFVYLLSTHPEVADKVYEELRHLEEETTIDESWTLNQRIIQ
QASLLTYDALSKLNYLHAAITETVRLYPAVPQNPKGILEDDILPDGTLVKKGGL
VTYVPYAQGRMKELWGKDADEFRPERWLKDGIFIPASPSKFPAFQAGPRICLGKDSAYLQ
MKMAMAVLCRFFKFKLMPGHVVKYRMMATLSMQKGVRVYVSGRDV*
>820957973 CYP704B8P 66% to BJ589415
pseudogene seen on 8 seqs finished
cannot extend
upstream
755734545 865099818 1036028198
824622763 863035796 859709615
982621777 824721912
LKTYVP*GHGRMSQLWGKDPNEVQPERCLEGSIFIPVSPSVFPAFQ
(?)
AGPRICIGIKSAALHMKMATA
LCRLSSSELMRGHPVDCQPTANFSIQKRTALGREKAMFK*
>774439634 CYP704B6 774616817 872730703
1005941337 1006178134
79% to BJ604836 55% to CYP704B1 complete
MEEVMRASFSSGSVVAFVIIATLSYLWIFRW
RQRHRIEPKEWPIIGGALETIQHFDVMHDWILSYFNKGLKTFHVKYPGITYTYTIDP
NNIEYILKTNFANFPKVKFKNITS ()
LETLLGDGIFNSDGEAWRQQRKTASFEFTSRVLRDYSTVVFRENALKVGDILSSVCQKHQPIDMQ
(0)
DLFMRFTLEGICKVGFGVEIGTLSESLPAVPFATNFDNANEAVTYRFFDPFWPLKQMFNIGNEAVLSRSVKVVDDFTYKVIKIRRAEMDLATSEGHDK (0)
KADLLSRFILLGKDPEQNFTDKTLRDVILNFIIA
(1)
GRDTTAATLSWFVYLLSIYPHVADKIYDELHALEKDANINASQTLNQKMREYSSILS
YDVLTKVQYLHAAITETIRLYPAVPQ
(0)
DPKGILADDVLPDGTVLKKGGLVSYVPYAQGRAKVIWGDDAESFRPERWIKDGVFIPLSPFRFSAFQ
()
AGPRICLGKDSAYLQMKMVTALLCRFFKFDLMPGHQVKYRTMATLAMENGVKMFVTRR*
>BJ604836
CYP704 like CYP704B7 59% to 704B2 mate = BJ198529 complete
830772551
993524135 997128890 713796384
BJ598232.1
mate = BJ195079
BJ595821
mate = BJ190416
MEALISVPFSTESAVTFVIIATLSWLWIFRWQQRHRLAPKEWPVIGAAVETIR
NFDDLHDWVLSYFQKGIKTFRVKFPGTMYTYTVDPKDIEYILKTNFANFPK
(0)
GDLYHKNMETLLGDGIFNADGEVWRQQRKTASFEFASRVLR
DYSTVIFRENALKVGDIVVGASQTHNAVDMQ
(0)
DLFMRLTLEGICKVGFGVEIGTLSPSLPAIPFASNFDNANEAVTYRFFDPFWRLKQLFNIG
NEAVLSRSVKVVDDFTYNVIRTRRVELQSTEGENK (0)
KADLLSRFILLGEDPEQNFTDKTLRDIILNFIIA
GRDTTAATLSWFFYLLGNHPRVADKIYDELHALDDDANVNKSQSLNQEMSEYATQLT
YDVLLKLQYLHAAITETIRL
YPAVPQDPKGILADDVLPDGTVLKKGGLITYVPYSQGRMKDIWGEDAEDFRPERWIKDGV
FTPLSPFKFSAFQAGPRICLGKDSAYLQMKMASALLCRFFKFELAPGHPVKYRTMATLSM
QRGVKMYVTRR*
>BE643294.1 fern Ceratopteris richardii, CYP704B like, 47% to
704B2
50% to
704B7
SKLPQLSKGDRIHERFVDVLGDGIFNVDGEKWLHQRKVAILEFSSAKLRDFSTRAYRTHA
LRLLNVLLDAWKSGLVVDMQDLFMRLTLDSICNIGFGVNLGSLSSELPDVPFMRSFDDSN
KLIIRRYVDYAWKIKRALNVGGEAKLKKCVEIVDGFIYNVINIRRMEMTASRARGEELQK
EDLLSRFMSLSNEDGSPYSDKQLRDIVVNFIIAGRDTTALTLSWLFFMLCKNPTGIKNFC
RELTKSSTT*
>BJ967820 CYP704D1 993596810 46% to 704B1, 49% to 704B2 complete
835928964
903863118 862810596 833249970
MDFAQGWGSESIAGMMKTIVTAFSGILSLLLAYLLWAAADNWVLHRERKGPVQWPIL (1)
GVTLEALKNYQTLNDWVVYYFLRDGLTFSCKMMHLDLTFTADPVNVKHILKTNFANYDKRKFFH
ENFEIFLGDGIFNVDGEIWRTQRKTASFEFASRKLRDFSTVVFRDYSVKLASILARAATAQQSMDMQ
(0)
DLFMRFTLDSIFKVTFDYDVGTLQPGLPNIPFAQAFEITNEITSSRLINPIWKLNRAL
KIGSERVLLQSAKDVDEFIYGVIEAKKAEMANS
KSDLFSRFMRLEEDDSDIQFTDKNFRDTLLNFIIAGRDTTAVSLSWFVYRMAQNPEMTAR
LQQELRDFDTVRNWKQQPEGDEGLRRRVLGFAELLTFDNLVKLQYLHACILETLRLHPAVP
QDPKHAINDDILSDGTQIKKGSLIYYTPYAMGRMPALWGPDAMEFNP
QRWFVDGVVQTEQPFKFTAFQ (0)
AGPRICLGKDSAMLQLRMVLALLYRFFTFQIVEGTDIRYRQMATLLLANGLPAKIIKQKN*
>713859399 CYP704 like1 CYP704F1 complete 43% to 704B2
BJ588705.1 mate pair BJ183392
BJ181412 mate = BJ586316, BJ181232 mate = BJ594159
BJ944527 mate = BJ955305
1010161410 831733082 AW598874
MGDEGATLFGAFKSGNVLPAGVGQQEVWIM
AAVSLVVVTASMWLWLLSLRRRPPGPMIWPWLGSMLEIAPQFDTMNDWYLNYFSADVKTF
SFGMPGFPSCTKFVATVDPVIVEHILTNVYKYGKGDQLRDRLGDFLGRGIFLADGEDWRR
HRKIASTEFSTRKLRGHSASVFRGEGVKLANCLKVAMAADQPVEIQ
(0)
DLFLRMTLDSICKVAFGVEIGSLSPDLPDVQFAKDFDNAQAHISKRVVRPMFKILRA
LDIGEEHHFRIATNSVHSFAMDVIAKRRKEIAAAHDAGEEY (0)
HRDDLLSKFMANLTQDENSYDDKELRDVIISFMLAGRDTTAVTLSWFTYEMCCHPEIADKIY
EEGVAVIGKHTVVESAVEHLTHEALGQMHYLHAALSESLRLHPAVPRDGKCVLEEDVLPNGIKVKK
GDFVQYVPYSMGRMPFLWGPDALEFKPERWLKDGVYQSVSPYIHSAFQAGPRICLGKDSA
YLQLKVTAALITHFFKFHLVPGQEIAYTTTLVMPIKKGLKVTLSPRQ*
>816126256 CYP704F2P 870348294 816042352 85% to 713859399 CYP704 like1 finished
no sequence upstream matches P450 sequence, possible pseudogene fragment
AGPRICLGRDSACLQLKLTIALIAHFFKFHLVPGQEITYTTTLVMPIKNGLKVTLPSR
>710547862
CYP704A like2, CYP704F3P 59% to 704 like1 finished
41% to 704A2 717627085 1012795691 762523153 839344697 1003398464
815674025
cannot find last 68 aa
pseudogene
stop codon seen in 12 sequences
784301641 884956166
MGDEGPKLLEVLNGTKSFIGNAGRVEAWITVVVALVVIVSSTWLWLCKRS
GRRKRIPGPAIWPVIGSMLDIVQNFDTLNDWYL
SYFSDDVKTFAFAIPEFPRSLKFFATVDPVNVEYILTNIHKYGKLSTLTKI
(0)
GPGLQERAGDFLGSGMLMTDGAAWKRHRR
IASTEFSTSKLRDHCDTVFREGAILANVLKSAMAVKDPVEIQ
(0)
DLAIRLTLDNICKMAFGVELGCLAITLPDVPFARAFDDAQSLVVRRAMNPLFKI*RA
LNIGDECCFREAIRNLDMFAVNVINKRRKEISAAHDAGREF
(0)
NKEDLLSRFMSHRAGGDDSYNDKELRDVRVDFLLARRDTSAVTIAWFTYAMCRHSDI
ADKIHKEGVDVVGYHTDFESMAQRLKHEVLGRMHYLHAALSESLRLHPPLAR
(0)
DSKYVFEDDALPDGTKVRKDDFVHHVPYSMGRMPFLRDSDVLEFKPERWLKDGAFQS
(0)
>BJ957091 CYP704E1 45% to 704 like1 mate = BJ946301 complete
48% to 704B1, 50% to 704B2
BJ172920 BJ199044 BJ200146 BJ596315 BJ598545
BJ599748
BJ605379 BJ608983 BJ942257 BJ942853 BJ944859 BJ952929 BJ953251
BJ940397
BJ942576 BJ955633 BJ958706 982651912 830779862 862331944
BJ946301
BJ947855 AW509793
MLSGGPLETFSVWFAMNGESSFPVKTCLSLTWVTAESFLVAAIAWSFAAWIWWHWREQRKLPGPFAWPL
IGCLPELSANWDRLHDWVLEQFSDDRRTIYVQFGYPDVAVFTVDPANVEHLLKTNFSNYP
KGESNCNLMRELFGVGIFTTDGELWKEQRRMASYEFSSASLRDFSTDVFREYALKLVFIL
SRFASTGADFDLQEMCMRMTLGTTCKIGFGVVLDCLSPSLPKIQF
AQCFDDANFISYHRFVDPLWHVKRALNIGRERKLKHCVKVLNTFTYNVIEKRRQEMASFNTKVWSW
(0)
AAQSDLLSRLTDLCNRGGEISHYVDTALRDMILNFIVAGRDTTAGTLTWFFYMMSSHPEIADKIFDE
LSTVVAVAGKH (1)
IVEFSKLLTYEKLGKLHYLHAALSETLRLYPAVP
LDSKQAAEDDVLPDGTVVKKGSMVGYVPYSMGRMKCLWGDYAAEFKPERWIQEGE
FVPQSLFKFTAFQAGPRTCLGKDSAYLQMKMTAALVMRFFTIRVVPGHSMQYRTMLTLNM
KHGLRAVVSRR*
CYP97 clan
sequences (4 complete sequences)
>756728820 CYP97A8 CYP97A 62% to 755832269
61% to 97A4, 65% to 97A3
838595928
1017487654 1006241304 836332432 997378091
978023017
1012791403 863070727 993681861 957177019
993681765
1017433301 complete
MDLASDGRARTIRAALNGGDKMEMVLDDADEAERLRVWQKQDELASRIASGEFTVSQPS (2)
NDFLLTIRRILANSGPAGRALALQLAIYEARIKAEQSER
MPESRGDVGAIVGEPFFKPLQKLFLIYGGVFRLTFGPK
(0)
SFVIVSDPMVXKHLLKDNAKAYSK
(0)
GILAEILEFVMGTGLIPADGEVWRVRRRAIVPAVHRK (0)
YVAAMMEVFGQATQRLCDKLDEAAVSETSVEMESLFSRLTLDVIGKAVFNYEFDSLSNDAGIVE
(0)
AVYITLREAEDRSIAIFPYWNIPILRAIVPRQRRVAKALNLINEVLDNLIAICK
(0)
RMVEEEDVQFEDEYVNDRDPSILHFLLAAGDE (0)
VSSKQLRDDLMTLLIAGHETSAAVLTWTFYLLAQ
(0)
NPGAVAKLQEE (0)
VDRVLGDRIPTVEDMKKLRYTTRVINE
(0)
SLRLYPQPPVLIRRSLESDMLGKYRINK (2)
GEDIFISTWNIHRSPYLWEDPESFLPERFPLDGPDPTESNQNFR
(2)
YLPFGGGPRKCLGDMFATFE (0)
NITALAMLVRRLEFALAPDAPP (0)
VGMTTGATIHTSNGLHMSVVRRKIKSDSAKETPVTQTQSSNGLVSQGTS*
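Note on reading the records: the parenthesized markers between sequence pieces, such as (0), (1) and (2), appear to mark intron positions, with the digit read here as the intron phase. That reading is an assumption, since this section of the file does not define the notation. A minimal Python sketch for joining the exon pieces of one record into a continuous protein sequence follows. The function name join_exons is hypothetical, and the sketch only handles phase markers, whitespace and numeric coordinates, not free-text notes inside a sequence.

```python
import re

# Matches the markers seen in this file: (0), (1), (2), (?) and the empty ().
# Interpreting the digit as an intron phase is an assumption.
MARKER = re.compile(r"\((?:[012?])?\)")

def join_exons(lines):
    """Join sequence lines of one record, dropping intron markers.

    Returns (sequence, markers): the concatenated residues and the
    list of markers in the order they were encountered.
    """
    pieces = []
    markers = []
    for line in lines:
        markers.extend(m.group(0) for m in MARKER.finditer(line))
        text = MARKER.sub("", line)
        # Keep residue letters and the stop symbol; drop spaces and
        # genomic coordinates such as "708" or "33449".
        pieces.append(re.sub(r"[^A-Za-z*]", "", text))
    return "".join(pieces), markers

# Three consecutive lines taken from the CYP97A8 record above.
excerpt = [
    "NPGAVAKLQEE (0)",
    "VDRVLGDRIPTVEDMKKLRYTTRVINE",
    "(0)",
]
seq, found = join_exons(excerpt)
print(seq)
print(found)
```

Header lines (those starting with > and the accession lists that follow them) should be removed before calling the function, because they contain letters that would otherwise be kept as residues.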
>755832269 CYP97B8 CYP97B like1 70% to 97B3 Arab. 70% to 97B4 rice. complete
824710406
832112555 859647155 1006315928 1006354442 863063893
755832108
879804344 1006116672 713862054
BJ590155 BJ606606
ESTs
BJ206405.1 BJ200097.1 BJ187250.1 14 exons
MAMASALPNAGPASLMNQNSGSRRLSSSKSTILGHSFLLRHLPSKTRSRGIQC
(2)
LKTDRKPENERTLLDNASNLLTNLLSGGNMGTMPIAEGAVSDLFGRPLFFALYDWFMQ
(0)
HGPVYKLAFGPKAYVVVSDPIVARHILRENTFSYDK
(0)
GVLADILEPIMGKGLIPADLETWKVRRR
(1)
AIVPGFHAAYLEAMVEVFDNCAERTVEKIEGLLDAVQKECKSQ
IEIEMESEYSNLALDIIGLSVFNYDFGSVTRESP (0)
VLQAVYGTLSEAEHRSTFYIPYWKFPLSRWLVPRQRKFNEDLKVINDCLDDLIKRAQSTRQ
(0)
EEDVESLQQRDLSAAQ
(0)
DSSLLRFLVDMRGEDATNKQ
(0)
LRDDLMTMLIAGHETTAAVLTWATFHLAQ
(0)
NPDMVAKAQAEIDRVLQGRRPTLKDIQNLT
(2)
YIKLIVAESLRLFPQPPLLIRRSLQPDTLP
(1)
GGHKGDPNGYSIPKGVDLFIS (0)
VYNLHRSPYFWDEPEKFNPERFLKAKLSDGIEGWAGFDPKRGQGALYPNE (0)
VMADFAFLPFGGGARKCVGDQFALMESTVALAMLLQKFEVELRGSPEDVELVTGATIHTK
DGLWCKLSRRKSITNLN*
>BJ955568
CYP97C5 71% to 97C2 opposite end = BJ944795 complete
890401623
839322423 not all introns shown, most of seq from ESTs
MASPSLHVGIGTRRPGCCNRTIGPIHYDSKFSGAYKMGLRGGPAGRVGNFCDQGRLRVR
CSGGAEDNDNGRSSVEKSGAGKSWVSPDWLTKIVSLGKGPDTSGIPVADAKLEDVKDLLG
GALFLPLFKWMMENGPVYRLAAGPRNFVIVGDPAVAKHVLKGYGTKYSKGLVAEVSEFLF
GSGFAIAEDQLWTARRRAVVPSLHKKYLSTMVDRVFCRCSDALVAKLEKVVASGAPVNME
AQMSQLTLDIIGLSVFNYEFDSLKTDSPVIDAVYTALKETESRSTDILPYWQ
(0)
IPLLCKIVPRQQKAAKAVEIIRETVEKLVAQMQEMVEAEKETIEG
EEYVNESDPSVLRFLLASREE
(0)
VSSVQLRDDLLSMLVAGHETTGSVL
TWTVYLLSKNPAALAKVHEELDRVLAGRKPQFADIKELKYLTRCINESMRIYPHPPVLLR
RARVADELPGGYKIEAGQDVMISVYNIHHSPQVWDNAEEFVPERFDVDGPVPNETNTDFK
YIPFSGGPRKCVGDQFAMLEATVALAVLLQRFKFDLVPNQTIGMTTGATIHTTTGLYMTV
TDRQQTKVPDLVVA*
>AW097933 ga03g06.y1 Moss EST library Ceratodon purpureus 79% to BJ955568 97C
194 LLCKIVPRQQKVAKAVQIIRETVHRLVAQCKEIVEAEKETLQGEEYMNVSDPSVVRFLLAS*EE 385
>710502520 CYP764A1 name changed to CYP746B1
42% to 705A EXXR region BJ587261 BJ182326 complete
869797323 1006121234 710534477 755806706
27% to Chlamydomonas SCAF 1399, 28% to 97A4, 31% to 734A1
most like Chlamydomonas CYP746A1, revised May 4, 2006
MGSISAANLELIATLASHCLQRTTESVKQGQAVLQQAVQPLPSCFPPGPNGDVA
LDFARDPLECLASLKSRYGSLVGFKLASRPIVLVSSPNFSREVFVTQSSTFIKAGTAFFP
GSSLAGNGLLVSDGDIWKRQRRLSNPAFRRAAIQTYAE (0)
QAMVNITEKMVDKVWRTGGVRDVYADFNELTMEIVASALFGASEASEEMAQVGPAITQAF
QFFTRRATSMFI
(1)
VPEWVPTFDNIQYNNAVTDLNKVVFRLINERRRQLANSSAPPRKDLLTRLLHV (1)
NEDGSGMDNQSLRDELMTFLVA (0)
GQETSAILLTWALLMFALHPHTQELVFQEISEVLNGQLPRQTDVSKLR
(2)
708 YLEAFIWETLRLMPPAYVVGRCACHPTELGGYKIPQGTTILVSPYLLHQDPAFWPRV 529
528 SEFDPSRWMPGGDATEHMENDSFWPFGGGPRNCIGMGFAMMEVTLVLAVISSRFRVSL 355
354 PVGEPIPSPRAMITLRPESEVKLRLTSR 271
RQQRQRKSEAADEMKVIVCLN*
>Chlamydomonas CYP746A1
C_28140001 = C_250032 C-helix exon duplication
This is a bacterial related seq like CYP252A1, CYP197A1, CYP208A1
N-term is probably in a seq gap. C-term runs off the end
scaf 2814 is a repeat of the C-helix exon
39% to CYP252A1 from Streptomyces peucetius, but not bacterial because it has introns.
Intron boundaries not well supported
LQASASQISAAAAQLTAAAAGAATRLQRLSSPPPPAFPAGPSGDQTLPLLTDPLRFLTDAT
(1) GNGLLVSDGPVWQRQRRLSNPAFRRAAVEAYGGAMVAATEDMMRRVWGPA (1)
GGTRDVYADFNELTLQVTLEALFGFGSAERRQQQQQQVEGQEGAAGA
GAAGAQAAGPSFAASSSAASSEDAAQIVAAVEKAFTFFTQR
(2)
AATGFVIPEWLPTWDNLEFAAAVQQLDRVVYGMINRRRQELAAAFG
(2)
GGGGGGGSAGVPSDLLTSLLLARDEDGSGMSDQALRDELMTLLVAGQ (0) 30502
ETSAILLGWASALLAAHPEVQAAAAAEVAAVCG
VRHMPYLESVVLETLRLYSPAYMVGRCARRDAALGPYVLPAG
TTVLVSPYVMHRDPEVWEEPEVFRPERWQELQRR 29548
EGYSGYMGLMSNLGPNGAYLPFGGGPRN 29261 (SEQ GAP HERE)
>746A1 bottom seq compared to 710502520 top seq (parts only)
Length = 237
Score = 680 (239.4 bits), Expect = 2.0e-70, P = 2.0e-70
Identities = 137/237 (57%), Positives = 165/237 (69%)
Query: 120 GNGLLVSDGDIWKRQRRLSNPAFRRAAIQTYAEQAMVNITEKMVDKVWR-TGGVRDVYAD 178
           GNGLLVSDG +W+RQRRLSNPAFRRAA++ Y   AMV  TE M+ +VW   GG RDVYAD
Sbjct:   1 GNGLLVSDGPVWQRQRRLSNPAFRRAAVEAYGG-AMVAATEDMMRRVWGPAGGTRDVYAD 59
Query: 179 FNELTMEIVASALFG ADLLTRLLHV-NEDGSGMDNQSLRDELMTFLVAGQETSAILLTW 236
           FNELT+++   ALFG +DLLT LL   +EDGSGM +Q+LRDELMT LVAGQETSAILL W
Sbjct:  60 FNELTLQVTLEALFG SDLLTSLLLARDEDGSGMSDQALRDELMTLLVAGQETSAILLGW 119
Query: 237 ALLMFALHPHTQELVFQEISEVLNVSKLRYLEAFIWETLRLMPPAYVVGRCACHPTELGG 296
           A  + A HP  Q     E++ V  V  + YLE+ + ETLRL  PAY+VGRCA     LG
Sbjct: 120 ASALLAAHPEVQAAAAAEVAAVCGVRHMPYLESVVLETLRLYSPAYMVGRCARRDAALGP 179
Query: 297 YKIPQGTTILVSPYLLHQDPAFWPRVSEFDPSRWMPGGDATEHMENDSFWPFGGGPRN 354
           Y +P GTT+LVSPY++H+DP  W     F P RW   G  +    N ++ PFGGGPRN
Sbjct: 180 YVLPAGTTVLVSPYVMHRDPEVWEEPEVFRPERWQEMGLMSNLGPNGAYLPFGGGPRN 237
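Note on the percentages in the comparison above: 137/237 is about 57.8% and 165/237 is about 69.6%, so the reported 57% and 69% are consistent with truncating rather than rounding. The following Python sketch checks this from the statistics line. The function names are hypothetical, and the truncation rule is inferred from these two values only, not from any BLAST documentation.

```python
import re

def parse_blast_stats(line):
    """Parse 'Label = n/m (p%)' items from a BLAST statistics line.

    Returns a dict mapping each label to (matches, length, reported_pct).
    """
    pattern = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\((\d+)%\)")
    return {
        m.group(1): (int(m.group(2)), int(m.group(3)), int(m.group(4)))
        for m in pattern.finditer(line)
    }

def truncated_percent(matches, length):
    """Integer percentage, truncated toward zero (assumed convention)."""
    return 100 * matches // length

stats = parse_blast_stats(
    "Identities = 137/237 (57%), Positives = 165/237 (69%)"
)
for label, (matches, length, reported) in stats.items():
    # Print the recomputed value next to the value given in the report.
    print(label, truncated_percent(matches, length), reported)
```

The same check can be applied to the "NN% to ..." notes elsewhere in this file only when the underlying match counts are available, which is not the case for most records.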
CYP710 clan
sequences (2 complete sequences)
>CYP710Axx CYP710A13 gnl|ti|755830097 54% to 710A1 complete
BJ604054.1 mate = BJ195509
gnl|ti|785870283 ASYA328762.b1
gnl|ti|785844289 ASYA190398.b1
METLIGAVSRIEWRVGLVLVVSAYVVYEQLSFMTKRKHLPGPSFVLPFVGNVIAMVVDPAGFWDQQANYALK (0)
VGVSWNALLGKFVLFVRDSQLSQKVFANV
RPDAFHLVGHPLGRKLFGEHNLIFMFGEEHKDLRRRLAPLFTTKSLGVYISIQE
KTQKEHIAKWMAIAKDLGDD
PIRVRMLCRNMNLETSQNVFVGPYLTPDMRRQFDE
233
DYKNFNTGLMSLPINLPAFSFYKATRSVRNIQNMLTKCATASKARMAMGENPSCLMDFWM
413
IETVRELRDAEAANTPPPPHSSDYEIGCHLFDFLFAAQDASTSSLVWAITLIESHPYVL
590
EKLREELLCLRPDPLAPYTPESLREMKYTEMVVKEVLRYRPPATLVPHIANTDFALT
761
DTYTVPKGTIVFPSLLDSSFQGFKDPEVFDPERFSPERMEDLVYKKNWLLFGAG
923
PHQCIGQRYAINQLMLFISLFFTQVDVKRARKPGCDDLVYTPTIAPKDEGLVYLSPRIVKQKD
>CYP710Ayy CYP710A14 gnl|ti|692479609 54% to 710A1 complete
BJ954143.1
BJ599532.1 BJ603752.1 BJ601705.1
BJ606683.1
BJ603803.1 BJ598741.1 BG361639.1
BJ607040.1
AW496913.1 BI740999.1 AW699711
BJ206552
BJ196180 BJ206140 BJ943450
Mate pairs join these two seqs
BJ197697 CYP710 N-term BJ204777.1 BJ193269.1
AW145717 88% to CYP710Ayy heme signature chimeric EST with errors
MEGLFEVMPGLGWREGLGLVIFGYIIFEQLSFFSKRKHLPGPSFVLPFVGSVIAMVVNPTGFWVNQAKDASK
VGVSWNALLGKFVLFVRDSQLSQKIFANV
RPDAFHLVGHPLGKKLFGDHNLIFMFGEEHKDLRRRLAPLFTLKALGVYVSIQE
KTQKEHITKWLDRADKLKNN
PIRIRLLCRDMNLETSQNVFVGPYLTPDMRERFNEDYKNF
540
NVGLMSLPIYFPGFSFYKATKSVKNIQNSLARCAAASKARMATGAEPSCLMDFWMVETVR
360
ELGEAEAAGMPPPSHSSDFEIGCHIFDFLFAAQDASTSSLVWAITLTESHPHVLEKLREE
180
QSLLRPNPLAPYTPESLRELKYTEMVVKEVLRYRPPATMVPHIASTDFALTDTYTIPKGTIVFPSL
LESSFQGFTDPEVFDPERFSPERMEDQVYRKNWLLFGAGPHQCIGQRYAINQLILFTSLF
TTFVDYKRARTPGCDDLLYTPTITPKDEGLVYISPRIVSEH*
CYP711 clan
no sequences found yet
CYP727 clan
1 complete sequence
>815606257 CYP751A1 710918567 776043366
33% to 727A1 complete
MAAWKRESVVSVFATNESAAIGVCDVTPVHETPLWR
YVKHEQNTLAWVSLIVVTFLLSRRLCSVLRLLILGYRLPGPRARAFDGRSQC
(?)
DDIVELLARLHQEHGPLVKVWTGPAQLLVSVKDVDILQHVFERAHDRVPVLRMALQLLYGRRSLFTSNYSKVSCRSL
(0)
INGLVLRQAHISSIEVAEKMTQLGGLSKNGCHD
LDCMTFSKLMAFAALGTSLYGDGYMIWPVAREFERVMMEVMEALPIWMRYSVPPLWNAKF
VVFWKQCLRLRDLARELAAHGNQTSIQESEDRVEGLNILGKLLEEF
VSVLFSRMGMGSSTVAEPGAAGMMSHGSLNTAGVLCNVLAQLARHPHIQTK
VHNEISTISGFDKSLTETDVQKMIYLNATVLEAARLLPTVPFLQRCSDEH
(1)
DIALLPGVVIPAGAILTAPIQLIQRDTVYWGDDAAIFNPDRFLKPRRIASGELASEQNQ
NIPHPCNFTKEPLQLNPAFLVFGAGSRSCIGSSLAVKQISILVTVILKRFEVMLYPTLFISKL*
Three probable bacterial contaminants
>815725399 830616790 870054233 29% to 709C9
39% to CYP109B1 bacterial contamination
next gene 303 bp downstream 29% to CP000116.1 Thiobacillus denitrificans 778231-779040 (best GenBank hit)
mate pair of 870054233 = 870054137 same as Thiobacillus hit above
GCSHLKKGKRGCCFMKWNANGLLPLEWYKKMRRDSPVVQSNDGQVWDVYGYDDVKTVLRN
HDIFSSSAFPDSEDPREHSILRQEPPKHRQLRRLGPRRSRRASSNRWLPRSIRSPPRCWR
EAGEKKGGMSAIADLPSPLPIVVIAEMLGVSLDHREQFKAWSDALVGDSGEDFYQCQREMSEYF
SEIAEDRRRHPQDDLVTKLVEARIDNEHLTDLEIIGFCILLLVAGNETTTNLISSAMLGI
DTLPDVRAQLLADRSLIPGALEEVFRYFSPVQVMFRRVKQDTVLGGQEMKVGQFVHIWMA
SANHDEKVFERPEEFNIHRNPNPHLGLGSGIHYCLGSQLARLESKIAIETLLARFPEYRR
DRSAELARMDSMMMFALKELPIYLT*
>862312623 bacterial seq. only one hit to trace archive
58% to CYP152A2 Clostridium acetobutylicum bacterial P450
69% to AP006627.1 unnamed P450 from Bacillus clausii
GYGLAGTLTMLDLYGTNHHPDLWDKPEKFMPDRFQHWRGSPFRFIPQGGGDHD
TGHRCAGEWITLEIMKESLDFLANRMSYDVPQQDLSYSFSDIPSLPHSNIVIQNVRSV*
>862809891 also = 975706812, 884004771, 884935229, 717626545 862809508
looks like a bacterial seq, many accessions, not just one, but high seq similarity to bacterial P450s
next gene 87 bp upstream is 61% to Oceanobacillus iheyensis BA000028.3 complement(1763606..1764175) biotin synthase
New 45% to CYP205A1 Chloroflexus aurantiacus
40% to CYP197B1 Nostoc punctiforme
43% to CYP197A1 Bacillus halodurans
MLYDTRRTKMNETITGPKGLPITGNLLSFRRNPLQFIRSASKSHGDVVLFRFGPKRNVYLLTNPDHIKEVL
VTKQAHFRKGKGLQVAKAVVGDGILTSEGKKHLRQRRLMQPAFHRERIAAYGDVMVKQGV
DLMSEWRDGDVRDIHHDMMKVTLAIITETMFGKSVKEG
ASEIGHAIDVGLRYVANKASSFIDIPLSVPTRS
NREFLESNELLDKTIFSLIEARRNNDEPGNDLLGMLLAARDEEDGTGMTDEQVRDEVMTI
FVAGHETTANTMSWIFYLLARHPEAEKKLQEELATVLGERLPTVEDLPELKYTNLVVYES
LRLYPAAWTINREVVEEVEIG
GHTYQPGETLMMSQYVMHRDERFYENPDAFIPERFATDLLKQIPTFAFFPFGGGPRVCI
GNNFALMEATLLLATFAQRYQLRLAEPGQVVEPEPLVTLRPKNGLWMRLEKRG*
>AP006354.1 Lotus corniculatus var. japonicus genomic CYP711 73% to 711A1
MVFMDFEWLFQIPSVPWSS
33449
AMFTLLATIGGFLVYLYGPYWGVRKVPGPPSVPLIGHL
PLLAKYGPDVFSVLAKQYGPIYR (2) 33631
35424
FHMGRQPLIIIADAELCKEAGIKKFKDITNRSIPSPISASPLHQKGLFFTK (2) 35576
36301
RDSQWSTMRNTILSLYQPSHLSRLVPTMQSFIESATQNLDSQNEDFIFSNLSLSLATDVI 36480