98
sequence pieces from moss = 71 complete seqs, 24 finished pseudogenes,
plus 3 bacterial
contamination P450s.
15 sequences not
from Physcomitrella moss, used to assemble moss genes.
in progress D.
Nelson
July 31, 2005,
revised Jan. 31, 2006
CYP51 clan
sequences (1 complete sequence and 1 finished pseudogene)
>CYP51G1 complete
ESTs BJ585158.1
BJ591215.1 BJ592754.1 BJ963427.1 BJ165255.1 BJ157286.1
BJ188328.1
BJ185333.1
MGDAEMQGAPVGESAVFDRSKVMMLLGSLVVAAIVGHFVLAWNRKRRNLPPVVD
SAVPFVGGLLKFIKGPVPLLKEEYGRLGQVFTLQMLTRNVTFLIGPEVSAHFFKAQEADLSQRE
(0)
VYQFNVPTFGPGVVFDVDYSIRMEQFRFFTEALTVKRLRSYVEMMVEEAH
(0)
LFFSKWGEEGEVDLKVELEQLIVNTASRCLLGPEIRNSHLEKVTSLFHDLDNGMLPVSVLFPYLPIPAHKRRDR (2)
ARKELAEIFSKVIKARKASGKKEPDMLQAFMDSTYRSLKRGTTEEECTGLLIAALFAGQHTSSITSTWTGAYLMK
(2)
YKQFMPAVIEEQKEIMRRHGDHLDYDVLNEMSCLHRAMKEALRLHPPLILLLRQNHTDFSVTTREGKSYTIPKG
HIVGTSPAFANRLPYVYKDPDTFNPDRFAPGNEEDVKAGQFSYIAFGGGRHGCLGETFAYMQ
(0)
VKTVWSYLLQHFELELTGPKFPEVDWNAMVVGIKGEVMVRYKRRQLTCD*
>756723207
755804088 717597240 CYP51 pseudogene 54% to 51G1 rice with
multiple
frameshifts finished
TSPAFANRLP
HLHKDPDFFNPDRLDPENEETK
IGGIISCMCFGGGHHSCLRETYLH
VQGKTVWSFLLQYFVMELVSPKFPAVDWNVMVVGNKGEVLGLLFSL*
CYP71 clan
sequences (41 complete
sequences 15 finished
pseudogenes
0 partial sequences)
>710927435 CYP756A1 33% to 92A8 66% to 710912672
complete
870079473
1028549191 997124596 no ESTs
MAHLHTRLSEEAEAWIATGVDSFSRWQEYAAGFGRATYIVAALGFFAVVILELHNS
RKRRLSKLPPGPFQWPYLGSLPNLLLTVGVTSSFRLREKVSELGRNH
GPLMFLQIADTQILIVSSGTAAKE (0)
VLIARDEEFNFRPQCAVGKYLGFGSSDIAFAEGRHHWYLRKLCDTRLFSADSFVSYGHIPRAEAL
KMLHSVWEASKKGNGISVRETVTAFVRNSLCGMLLGSAHLDIENVSLQFTEKTLI
TLLDETICVVGEITLSDLAPGLKRVDFHGRTRKLKELHERWEKYLRVILEDRSHRLEKSA
KPEALVDVLLSLDDADMKLSNEAIMGVLL (0)
DTLVGGVYSTSATIEWALAELVRHPGVLEKVQLEMSEVVGPYHIVEDAEISQLPYFQ
(0)
ATVKETLRLHPVVPMSLPHMNKVATSISRYQIPANTSVVIDYKAIA
RDPAAWHKPLRFDPSRFLHTSAASQIDNIFKFLPFGYGRRGCPGANFAAVLLQLALAHLI
QAFDWAPTKGQLPHDIDVKESPGLVCFRFSPLVLSSTPRLANSLYQVSP*
>710912672
CYP71B like4 CYP756A2 complete BJ165273.1
BJ581063 BJ157306 879465659
828157377
830613141 815612895 755803843
MAQAVGLGRLPPPLSEENKAWMTNQLESYSTWQMNAAGLGRALYILAAVVFSAVVCFEQYSKRKERLSKV
PPGPFQWPYLGSLPYLIRTVGLRSSTRLREKITELAMKHGPLMFLQIADTQIVVVSSGKIAKE
(0)
VLVAHDAEFYFRPRCLVGKYLGFDS
TNMAFAEGRNHWRLRQLCDKHLFLPECFASYGHVQKQEAEKMLHSVWEATKRGENISV
RHTVITFARNSLCRMLLGTAHLDIENTSLEFNKETLITLLDEAFAVAGETTLIDLTPG
LNWLDFHGREQKMKYLRRRLEKYFQEILDDRSQRLDRSEPEAFVDVLLSLDEDKTLSDEAMMGVLL
(0)
DTLVAGTYSISASVEWALTELVRNPTMLENVQNEITQVVGPHHIVEVTEFSQLSYFQ
AIVKETLRLHPAMPMSIPHMNKVATPLSSYEIPPNTSVVVDYAAIGRDSEIWPN 179
PLDFDPRRFTDTDAASQIDSFKFLPFGYGRRVCPGANLGLRLLQLGLAHLVQGFDW
IPIEGQLPHDIDIKESSGAICFKSTPLILRAIPRLASTLYEI*
>692433988 CYP73A49 complete 846079417 836296434
BJ177355.1 BJ182198.1
BJ176060.1 BJ186266.1 BJ197128.1
BJ963588.1
BJ192275.1 BJ179214.1 BJ191683.1
BJ975047 = mate
pair of BJ966530.1 BJ600264 BJ611038
BJ975788 = mate
pair of BJ967270.1 BJ587131 BJ593693 BQ039467
BJ582985 = mate
pair of BJ177851.1 BJ581278 BJ582455 BJ584349
71% to 73A5
MGAGWKEAMALGLAGAGVARVLAISEDVVDNAAVSSFVKQF
MNLEGAVQALLVAVLLGLLIAKLRAPKLNLPPGPVALPIVGNWLQVGDDLNHRNLAEMSQ
KYGDVFLLKMGQRNLVVVSSPEVAKEVLHTQSVEFGSRTRNVVFDIFT
GNGQDMVFTVYGDHWRRMRRIMTVPFFTNKVVQQSRGAWEDEA
LRVIQDLKAKPEAS
TTGVVIRKRLQLMMYNIMYRLMFDSRFESEEDPLFLKLKALNGERS
RLAQSFEYNYGDFIPVLRPLLRGYLKVCQEIKDRRLALFKEHFLDERKKLLSTLGPRPDG
EKAAIDLILEAQKRGEINEENVLYIVENINVAA
IETTLWSIEWGVAELVNN
PEMQTRIREELDSTLGKGNLITEPDTYNNKLPYLSAFVKEVMRLH
MAIPLLVPHMNLHQAKLAGYDI
PAESKILVNAWWIANNPNHWDQPEKFIPERFLDGKIEAKGDDFRFLPFGSGRRSCPGIIIA
MPLLSIVLGRLVQSLELLPPPGTKKVDVSEKGGQFSLHIATHSTVVCKPIA*
>CYP73Ayy CYP73A48 705225761 complete 69% to 73A5 68% to upper seq
686707760 two
frameshifts near end near exact overlap with 705225761 (join)
ESTs
BJ156825 BJ601621
AW598785 AW156067 BJ960273 BJ962155
BJ173522 BJ170223
BJ955206 BJ608342 BJ603676 BJ955206
BJ605702 BJ593402
BJ607994 BJ596392 BJ609670 BJ944446
BJ606143 BJ598803
BJ583187 BQ041736 BJ597214 BJ599148
BJ607151 BJ595328
BJ163543 BJ203089 BJ199707 BJ201130
BJ197619 BJ178048
BJ195721 BJ970670 BJ193565
BJ199312
BJ206667 BJ204697
BJ200793 BJ173402 BJ185972 BJ190015
MEAASLASMFTFNNIVQGLCVAVFLGIVIMKLRAPKLKLPPGPFALPIVGNWLQVGDDLNQR
NLAEMSQKYGDVFLLKMGQRNLVVVSSPDIAKDVLHTQGVEFGSRTRNVVFDIFTGNGQDMVF
TVYGEHWRRMRRIMTVPFFTNKVVQHSRYAWEEETDYVIKDLKARPEAATSGVIIRRRLQ
LMMYNIMYRMMFNSRFETEDDPLFVKLKALNGERSRLAQSFDYNYGDFIPI
LRPFLKGYLKTCQDVKDRRLALFKEHFVDERK
(2)
KLNSVNPPKSEAEKCAIDHILEAQKKGEINEDNVLYIVENINVAAIETTLWSIEWGIAE
LVNNQEIQTRIRNELDSILGKGNLVSEPDTYNNKLPYLTAFVKEVMRLHMAIPLLVPHMN
INQAKLAGYDIPAESKILVNAWWIANNPKYWDQPEKFMPERFLDGKIEASGNDFRFLP
FGVGRRACPGIIIAMPLLAIVLGRLIQSFELLTPPGVKKIDLTETGGQFSLRIANHSTVV
ARPLAV*
>755680881 CYP73A50 76% to 710919188
1005948486 67% to 73Ayy 60% to 73A5
895358221
852123011 complete
CYP73Axx (no ESTs)
lower part from AIE to end = 755680881
exon starting with
KLGNA has no exact matches in trace archive
This is proably
the same seq as 755680881
710919274
759458758 61% to 73A5
MEKPTVVRGLLAIFVIGLVTGVEFKAP
STLDFVFSLDPCRQSLLLIVFTAIVINLRMKDRKMNLPPGPTALPIVGNWLK
(0)
VGNDLKHRTLAEMSERYGDVFMLKMGWRNYVVVSSPEAAKGVLHTQGEEFAC
RTRNAVFDIFAGKGQDIVFTNYGDHWRRMRRILTVPFVTAKVLQHAHFAWEEEVDEVMED
VQSRPESATAGVVLRHRFQLMMYNIVYRMMFNSRFASEYDPLYLKLKGLNGERCRLPAQN
YKYNYADFFPFLKLFIRGYLKICQEVRDKRLSLFKEHFVDERR
(2)
KLVNAVGVSGAEEKCAIDHMLEAQKKGEISEDNILYLIENINVA
(1)
AIESTLLSIEWGIAELVNHPNVQKRLQAELLEVIGEGNLVAEPDTHNNK
LPFLTAVVKETLRLHMPVPLLVPLMNMRQAKLAGYDIPPQSKVLVNAWWIGNNSKFWDQP
EKFMPERFLPAAAENQSLDFRFLPFGAGRRSCPGTAIAMPLLAIVLGRLLQKFDLLPPPG
MSKVDVAESGGQFSLHMATHSIVVLRPRN*
>710919188 CYP73A51 717596474 756814972 762526716 61% to 73A5
N-term
883857969
977890599 complete no ESTs
MGNPMGIGVLLVILVFGVVAKVDFKSRILLDFAFSSSLL
MQALLLGVFIVIVVSLRREGRNMNLPPCPAAFPIVGNWLQ
(0)
VGNDLKHRKLAEMAQKYGDVFMLRMGRRRFVVVSSPEAAKEVLHTHGVEFASRTRNAVY
DVFAGKGQDIVFTNYGDHWRRMRRIMTVPFVTHKVVKHAHFAWEDEVDHCIRDVEARSE
SATTGVVLRHRFQLMMYNIMYRMMFNSRFADEHDPLYLKLKGLNSERCRLPAQNSKYNY
ADFIPILRPFLRGYLKVCQDVRDRRLTLFKERFVDDQ
(2)
KLVNAVGAAGAGEKCAIDHILEAQKKGEISENNVLYLIENINIA (1)
AIESTLLSIEWGVAELVNHPEVQKRVQKEL
DEVLGDGHLVSEPDIHSGKLPFLTAVVKETLRLHMPIPLLVPHMNVKQAKIFGYDIPPES
KILVNAWWIGNNPKFWDQPERFMPERFLSATSEATTVDFRFLPFGAGRRSCPGSAIAVPL
LAIVLGRLVQKIDLLPPPGMTKVDVTESGGQFSLHMATHSTVVTRPRA*
>BJ163406 CYP73A52P aa 169-308 mate pair to BJ173377 finished
this seq seen in
at least 6 trace archive seqs 1023200166
883856438
869788433 891417707 866058619 884943991
890407139
838586233 816063061
This
seems to be an expressed pseudogene 76% to 692433988
LATS*EAGNSELGASFLKHF
MGLESAVQGLLVAVLVGLLIAKLRAPKLKLPPGTVALPIVGNWLQ
(0)
VGDDLNIGNLAEMSKKYGDVFMLKMGQRNLVMVSSPEAAKEVLHTQGTEFGSRERNVVY
DILIGNGQDMVFAVYEEDWRKMRRIMTVPFFTSKVVQQSRGTWEDDALRVVEELRTRPEAS
TTGVVIRNRLNSMMYNSIYRLMFDRRFENEEDPLFLRLKALNGGRSRLAQSFEYN*GDFI
PILHPVLRGYQRVCQEDQDRRLGVFNEHFVDVRKKLLETTGP
KPDGEKAAIYLILEAQRKGEISEDNLLYIVENINVAAI ETMLWSIEGGVAELVNNPDI
QTRVSEELDHTLGKGNLITELDTYNPK
(deletion)
HWDQPGRFMPERFSGNNIEVSGGDFRFLPFGSGRRSCRSIIIAMPLLTIVLSRLVQS
MGQLSPPGMKKVDVAEKGGQFSRHIATRSTMVCKPIA*
>755683069 CYP73A53P 710485186 72% to 73A5 with frameshift and
stop codon
pseudogene, stop
codon seen in 12 sequences, finished
815536458 41/47 (87%) to 73Ayy 824722097 pseudogene
816099016 87% to CYP73Ayy, suspect these join with 755683069
MATAGTIMMQAVGMASMFTFQNVVQGLIVAMVVWIFVIKLKSPKYKLPPGPVANWLE
(0)
FGDDLNHRYLAELTKKYGDIFLLKMGQRNVVVISSPEIAKDALQTQGIAFGSRPRNVVF
DIFSGKGQDMVFTPYGEHWRLMRRITTAPLFT
NKVVQQSRYAWEEEID*VSKDLKTLPE
XXTSGVIIRTKLQLMMYNIIYRMMFTSRFKDEKDPLYLKLKVLN
EDPRYLNLKVLNGEQNRMGQSFDYNSGDFIPIVRPFLRGYLKVC
QDVKDRRIALFKEYFVDERR
(2)
KLTSVNPPKSEEQKCAIDYIFEAEKLGEINEDNVLYIVENINVAAIDTTLWSIE*GIAE
LVNNPELDGIREELDAVLGKGNMVIEPDTYNNKLPLLTAFVKEVMRLHMSIPHVVHMN
LKHEKLAGYDIPAESKILVNVWWIGNNPKY*DQ
PEKFMPERFLDGNIEPSGNDFRYLPFGVGRRRCPGIFIAMPLLSIVLGRLIQSFELLPPP
GVKEVDVTEFCGQFSLRIANHSTVVVRPLL*
>710518592 CYP73A54P 755698511 44% to 73A5 C-term to PERF finished
pseudogene stop
codon seen in 11 seqs
GNFVSEPDTHNSKLQFLRAVSRKS
KVTLPSPLFSLHMNVKQVQIGGYYMFAESKISVIAW*IGNSPKLWDQLEKFMPEK
123
>CYP75B like1 CYP767A1 change to 757A1
716895728 755798807 39% to 92A11 38% to 75B1 complete
852155406
977922988 859713404 42% to CYP76C
like14 no ESTs
TIMKLRKKIGKLPPGPRALPLIGNIHQIGDFSRRNLMQMAE
KYGPIMYMRIGSKPLLVVSTAEAAHEFLKTQDKEWADRPTTTADKIFTNDHRNIVCAPYA
AHWRHLRKICTMDLFTPKRLMSFRTPRTEEINQMMTSIHEDVAAGKEVKLHVKLGHLTTNNITRMLLGKR
(2)
FFTVDEKGQMEAHRFKELVFELFRASSTPMIGDFIPWLKWVSIASGYVKYLKRVKADLDAFLQEFL
EIKKAASDQATAERAKDFVDLLLEQKTVSGDGPLEDATIRS (0)
DMLLAGTDTVSNAMEWTIAELMRHPECMRKLQQELDTVVGKSRIVSETDLPNLPYLQAVVK
EVMRFYPPAPLSLPHQSIVPTTVCGYDLPAGTQLCINLYAIQRDPKYWPNPVQFNP
DRFLNCDVDVGGTHFQLIPFGAGRRQCPGMPLGNLLLQISVARLVQAFEYSLPRGTK
RNYFMNYYSGANKLTSGIMYLI*
>CYP76C like14 CYP761E2 complete 74% to CYP76C like13 43% to 736A1
44% to 92A13 42%
to 92A14
692514018 no ESTs
816061186
859980678 755804916
759452715
755804916
759452715
MYFEKTTVARMLTAGFESENGLTGRRVEFYVFLAAIFIMPLVLL
KITRRPRLKLPPSPPAYPIIGHLHLLGKLPHQSMTNLAKKYGEIYSLRLGSVPAIVISTP
EMAKEFLLTNDKIWSSRSVHMTSGYYFSYDYA (1)
GIAFAPSTPVWRSLRKICMSELFTQRRLEASKGLREEEMQYMIR
(2)
SILDDAHQGRLIDLKLKINALTANIVARMVLNK
RFTGCIDSTVETEAEAHQFKEMMEEHFLLLGVFMIGDYIPWLSPLDLGGT
EKRMKSLRKRLDAFLDDILEVHEVKRAKGPIPEEDQDVIDVLLNEMYQQDSN
ESKQLDTNNVKSTIL (0)
NLFAGGTDTSTVTIEWAMSEMLRNPTIMGKLKAELDARIGKDRRVRETDLSDLPYLQAVT
530
KETFRLHPVGPLLIPHVSTHDCEVGGYHIPTGTRLYVNVYAIGRNPKVWDRPLE 692
FDPERFMTGLNAGVDVKGKHFHLLPFGTGRRGCPALPLGLLIVQWTLATLVHALDLSL
866
PQSMEPEDVDMTEAYGLTVPXGASLYLNAKLRAADHLY*
>BJ166750
CYP754A1 BJ171256 BJ172713 BJ602502 mate = BJ205564 complete
1014495025
879457809 883995654 1014487906 862362793 BJ165056
51% to
75B like3 BJ157072 BJ157394 BJ190531 BJ170915 BJ595929
MVEESWLWVLFVGALSFSILLQWGLNRKRKLKLPPGP
TAWPIVGCVFGLPRLNPPEKLFNKLSEKYGELMLLQLGSWSIVVTSSARMAMEILKTHDN
EFANRPDVISSRLNFNNTGLIQMHSTNPLFKRTRRMFS
AEIVSPRTVLETGVIRRKQ (0)
LRTLRSIVQDFDAGRSVNFTHEMKTLAMNLSMSICFGTDYATKVNDEAEALIHTYK (0)
IMAIWTRRSLGAIFPALRWLDLDGIESGFADVELQLRTNITALIEKKKQEMSMWSAE
DIQAGANEGDVMTKFLSMEGEDRCSEDQLISVVF
TILLAGTDTVFNVVTEAMYALLMHPNFYHRAVEELDAVVGKSRLV
EEADIPKLPMIQNIIKETFRIKPAGPSLVPRKNFEACEVAGYHIPANTTVFVNCIPLMRD
PSFWDSPDEFNPDRFIDSKVTVLGSDFNYLPFGYGKRTCPGLNLGMI
TVQYILAACLQCISWKLSRPRRLDIETDDDPRKVDDVMVDGKQRVDPALLEFAPQPVK*
>CYP75B like3 CYP754B1 complete BI437153 AW509529 BJ595585.1
BJ977649.1 BJ969326
713843806
713853618 CYP71 N-term 869925250 881364275
34% to
92A sequences, no high similarity to a known family
36% to
Panax ginseng seq. AB122079
yellow
exon questionable
MVGDVWIWVLITMVVAVIVGVGIDKKTKRGLKLPPGPPAWPVVGCLASLPAGHPPEVMFAKLAEKHGE
LMLLWLGSKPYVVASSARMAMEFLKRHDQEFANRPMSVVREYVSFKGNSIISMSASDPKY
QRLRRTFVMELLSPKKIAATRDLRKDQ
(0)
VLKMLRAIREDLDAKHEANFTEAVLTLGMSLSIGLLFGRDYGGKVFSEEIQTLVLTFKT
(0)
MVKYLSMINISDLIPSLRWLDLQGIERGLGLGEVQLRKSIMALIEQKRLDKIRLSSDE
IESGACQRDILSKLLSLEGEDRLDDDQLMGVVF
ALMLAGSDSISRGVGRAMQELLKQPLLHQRALDELDEVVGRRRLVEESDISSLPLINNIIKETLRLHP
PAQLLIPHGNVEQCEVAGYHIPARSTVLVNLYALSRDPSFWNSPLEFAPDRFVDSNLTVQGS
DFHYIPFGYGRRGCPGLNLGMITVQYALALCLQCILWRLPAGATISETYIDWKNSPDLIV
DGDLRVDLHLLEGL*
>AB122079 CYP768A1 change to 764A1 Panax ginseng (Apiales) 41% to
92A9 40% to 75B
MFPLAYPLLFVLLGALSWWILPIISPLKRHHKLPPGPRGLPIIG
SLHTLGALPHRTLQTLAKKYGPIMSMRLGSVPTIVVSSPQAAELFLKTHDNIFASRPK
LQAAEYMSYGTKGMSFTAYGPHWRNIRKFVVLELLTPAKINSFVGMRREELGMVVKSI
KEASAANEVVDLSAKVANIIENMTYRLLLGRTKDDRYDLKGIMNEALTLAGRFNIADF
VPFLGPLDIQGLTRQFKDTGKRLDKILEFIIDEHEQNSSNGNASGDFIDDMLSLKNKP
SNTHDELSKVIDRSVIKAIMIDIISAAIDTSDTSIEWILTELIKHPRAMKKCQEEIDA
VVGVDRMVEETDLPNLEYVYMVVKEGLRLHPVAPLLGPHESMEDITINGYFIPKQSRV
IVNSWALGRDPNVWSEDADEFLPERFEGSNIDVRGRDFQLLPFGSGRRGCPGMQLGLI
TVQLVVARLVHCFDWNLPNGITPDNLDMTEKFGLTTPRVKHLLAVPKYRL
>CYP76C like1 CYP758A1 complete 715966631 AXOS23673.g1 BQ040580 AW599561 48% to 76C2
BJ609203.1
opp end = BJ201966 BQ040759.1 711801903 BJ168318 BJ166758 BJ158754
BJ160069
CYP76C like4
713876163 46% to 76C2 no ESTs one frameshift and stop codon
no exact matches
in trace archive, best match = 715966631
probably
a poor version of the 715966631 sequence
MATPDSSGGAFDLAKWINGLVAHWGSVAVAVVAAAVIAKFIFNSTVGRRKL
PPGPAPWPILGNIASLAGLPHRSLEKLARKYGSLMYLRLGEVPCIVISSADVAKQLFKTH
DILFSNRPGGCFFEQLTEYRNITASRYGPHWRHLRKTCVHELFTQKRLEAYQATRLE
EISISIKELFEESDKKGPVDLHAWLHRLLFNNLTRVIMNNR
YFGTDEKGMKDAMDFNNVTALMFSQAGDVVISDFLPYLGFLTRLQGKPLLYR
KTREIVLEMMRRMTNFDERKKLHAEGRSTGEPEDFVDVLLSSTLSDGTTPLPDDICLMLL
MDVLVAGTDTSATTVEWTITELLRHPEAYKRVREELNSVVGSDQLVKEEHLEHLPYLNAVLQ
356
ESFRLHPATPLGLPRESSEAFEFLGYSLPAGTRLFVNQWAIHRDPAVYEQPEEFN
521
PERFLGREALKFIGDTQFQLVPFGSGRRNCAGLPMAVIVIPLVLAHLLHSVEFSLP
DGQQPKDLDMTETFGVAAPKASPLMIYATPRESAALY*
>CYP76C like2
This is a hybrid seq of two different genes
713877140
762522648 711800153
692509705
710494763 715965478 755800431 692507811 713794646 43% to 76C2
692476632
879809357 859709904 716891783
717625521 869788525 870057181
AW599509 BJ583667 BJ579892 BJ597932 BJ166653 BJ194761 BJ946070
BJ174288
BJ190667
BJ158648 BJ174717 BJ177099 BJ178535 BJ588129 BJ586603 BJ585830
BJ179043
BJ179518 BJ171222 BJ579424 BJ582321 BJ582678 BJ584175
715978988
756810370 CYP76C like11 51% to 76C4 95% to 76C like2
no other exact
matches in trace files
1025182054
= best match same as 76C like2 assume this is the same seq
MAEGGLLFGFELADVLVAAVLISVVVLYFHAEALQRRRCPPGPWPWPVVGN
FSALGDLPYRNLHKLAGKYGGLMYLRL
(1)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNN
NDENYRNIGLAEYGPYYRKLRRLLNTELFSPRRHASYEVTRAQEI
QCMMKVLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR
(2)
FYGVSGDDGEKQ
GQDLQK MTSSVFELLGSVV
ISDFVPYLS
FITKLQGHASKFSKIRDVSDKLTADFFDLD
SHRNNYKKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ
(0)
ELLNAGTETSSNTSEWAMAE
LIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV MKENFRLHPPAPLLLPHESREPTELLGYHFPAGTELLVNAFAIHRDPSVYDNPDSF
DPDRFLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG
QTEVDMTETIGISVRKKQPLFLVPKPRFELSLESVAEN*
>713877140 CYP758F2 (no AP insert, LHK insert) complete
also 824697165 893557678
993602306
869943560 997070612 993467812
1020671150
755800431 892704920 883863821
1006335547
857898605 BJ583667 mate = BJ178535
BJ579892
mate = BJ174717
BJ597932
mate = BJ194761
BJ179518
BJ158648 BJ190667 BJ177099 BJ179043
BJ946070 mate = BJ956863
957569174 883845570 713794646
MAEGGLLFGFELADVLVAAVLISVVVLYFH
AEALQRRRCPPGPWPWPVVGNFSALGDLPHRNLHKLAGKYGGLMYLRL (1)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNNDENYRNIGLAEYGPYYRKLRRLLNTELFSPRR
HASYEVTRAQEIQCMMKVLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR
(2)
FYGVSGDDGEKQGQDLQKMTSSVFELLGSVVISDFVPYLS
FITKLQGHASKFSKIRDVSDKLTADFFDLDSHRNNY
KKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ
(0)
EMLNAGTETSSNTSEWAMAELIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV
MKENFRLHPPLLLPHESREPTELLGYHFPAGTEVLVNSFAIHRDPSVYDNPDSFDPDR
FLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG
QTEVDMTETIGLSVSKKQPLFLVPKPRFELSLESVAEN*
>830443705 CYP758F1 (AP insertion) 96% identical to 713877140 (no AP insert)
879461440 1020623392 complete
may be alternative splicing in this gene. Two exon 2s exist
863042033 870057181 881406076 833254849
1003202172
BJ579424 mate = BJ174288
BJ588129 mate = BJ182840
BJ585830 mate = BJ180571
BJ586603 mate = BJ181679
828265864 833246011 977999457
(exon2a,
short)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNND
AFPSQSNNSVAVALSVDVSHLILPILSLEMHCATR
MAEGGLLFGFALADVLVAAVLISVVVLYFHAETLQRRRCPPGPWPWPVVGNFSALGDLPHRNLAGKYGGLMYLRL
(1)
GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNND
ENYRNIGLAEYGPYYRKLRRLLNTELFSPRRHASHEVTRAQEIQCMMK
VLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR
(2)
FYGVRVDDSEKERQDLQKMTSSVFELLGSVDLSDFVPYLS
FITKLQGHASKFSKIRDVSDKLTADFFDLDSHRNNY
KKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ
(0)
ELLNAGTETSSNTSEWAMAE
LIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV MKENFRLHPPAPLLLPHESREPTELLGYHFPAGTELLVNAFAIHRDPSVYDNPDSF
DPDRFLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG
QTEVDMTETIGISVRKKQPLFLVPKPRFELSLESVAEN*