98 sequence pieces from moss = 71 complete seqs, 24 finished pseudogenes,

plus 3 bacterial contamination P450s.

15 sequences not from Physcomitrella moss, used to assemble moss genes.

 

in progress D. Nelson

July 31, 2005, revised Jan. 31, 2006

 

517 moss ESTs

 

CYP51 clan sequences (1 complete sequence and 1 finished pseudogene)

 

>CYP51G1 complete

ESTs BJ585158.1 BJ591215.1 BJ592754.1 BJ963427.1 BJ165255.1 BJ157286.1

BJ188328.1 BJ185333.1

MGDAEMQGAPVGESAVFDRSKVMMLLGSLVVAAIVGHFVLAWNRKRRNLPPVVD

SAVPFVGGLLKFIKGPVPLLKEEYGRLGQVFTLQMLTRNVTFLIGPEVSAHFFKAQEADLSQRE (0)

VYQFNVPTFGPGVVFDVDYSIRMEQFRFFTEALTVKRLRSYVEMMVEEAH (0)

LFFSKWGEEGEVDLKVELEQLIVNTASRCLLGPEIRNSHLEKVTSLFHDLDNGMLPVSVLFPYLPIPAHKRRDR (2)

ARKELAEIFSKVIKARKASGKKEPDMLQAFMDSTYRSLKRGTTEEECTGLLIAALFAGQHTSSITSTWTGAYLMK (2)

YKQFMPAVIEEQKEIMRRHGDHLDYDVLNEMSCLHRAMKEALRLHPPLILLLRQNHTDFSVTTREGKSYTIPKG

HIVGTSPAFANRLPYVYKDPDTFNPDRFAPGNEEDVKAGQFSYIAFGGGRHGCLGETFAYMQ (0)

VKTVWSYLLQHFELELTGPKFPEVDWNAMVVGIKGEVMVRYKRRQLTCD*

 

>756723207 755804088 717597240 CYP51 pseudogene 54% to 51G1 rice with

multiple frameshifts finished

TSPAFANRLP

HLHKDPDFFNPDRLDPENEETK

IGGIISCMCFGGGHHSCLRETYLH

VQGKTVWSFLLQYFVMELVSPKFPAVDWNVMVVGNKGEVLGLLFSL*

 

CYP71 clan sequences (41 complete sequences 15 finished pseudogenes

0 partial sequences)

 

>710927435 CYP756A1 33% to 92A8 66% to 710912672 complete

870079473 1028549191 997124596 no ESTs

MAHLHTRLSEEAEAWIATGVDSFSRWQEYAAGFGRATYIVAALGFFAVVILELHNS

RKRRLSKLPPGPFQWPYLGSLPNLLLTVGVTSSFRLREKVSELGRNH

GPLMFLQIADTQILIVSSGTAAKE (0)

VLIARDEEFNFRPQCAVGKYLGFGSSDIAFAEGRHHWYLRKLCDTRLFSADSFVSYGHIPRAEAL

KMLHSVWEASKKGNGISVRETVTAFVRNSLCGMLLGSAHLDIENVSLQFTEKTLI

TLLDETICVVGEITLSDLAPGLKRVDFHGRTRKLKELHERWEKYLRVILEDRSHRLEKSA

KPEALVDVLLSLDDADMKLSNEAIMGVLL (0)

DTLVGGVYSTSATIEWALAELVRHPGVLEKVQLEMSEVVGPYHIVEDAEISQLPYFQ (0)

ATVKETLRLHPVVPMSLPHMNKVATSISRYQIPANTSVVIDYKAIA

RDPAAWHKPLRFDPSRFLHTSAASQIDNIFKFLPFGYGRRGCPGANFAAVLLQLALAHLI

QAFDWAPTKGQLPHDIDVKESPGLVCFRFSPLVLSSTPRLANSLYQVSP*

 

>710912672 CYP71B like4 CYP756A2 complete BJ165273.1 BJ581063 BJ157306 879465659

828157377 830613141 815612895 755803843

MAQAVGLGRLPPPLSEENKAWMTNQLESYSTWQMNAAGLGRALYILAAVVFSAVVCFEQYSKRKERLSKV

PPGPFQWPYLGSLPYLIRTVGLRSSTRLREKITELAMKHGPLMFLQIADTQIVVVSSGKIAKE (0)

VLVAHDAEFYFRPRCLVGKYLGFDS

TNMAFAEGRNHWRLRQLCDKHLFLPECFASYGHVQKQEAEKMLHSVWEATKRGENISV

RHTVITFARNSLCRMLLGTAHLDIENTSLEFNKETLITLLDEAFAVAGETTLIDLTPG

LNWLDFHGREQKMKYLRRRLEKYFQEILDDRSQRLDRSEPEAFVDVLLSLDEDKTLSDEAMMGVLL (0)

DTLVAGTYSISASVEWALTELVRNPTMLENVQNEITQVVGPHHIVEVTEFSQLSYFQ

AIVKETLRLHPAMPMSIPHMNKVATPLSSYEIPPNTSVVVDYAAIGRDSEIWPN 179

PLDFDPRRFTDTDAASQIDSFKFLPFGYGRRVCPGANLGLRLLQLGLAHLVQGFDW

IPIEGQLPHDIDIKESSGAICFKSTPLILRAIPRLASTLYEI*

 

>692433988 CYP73A49 complete 846079417 836296434

BJ177355.1 BJ182198.1 BJ176060.1 BJ186266.1 BJ197128.1

BJ963588.1 BJ192275.1 BJ179214.1 BJ191683.1

BJ975047 = mate pair of BJ966530.1 BJ600264 BJ611038

BJ975788 = mate pair of BJ967270.1 BJ587131 BJ593693 BQ039467

BJ582985 = mate pair of BJ177851.1 BJ581278 BJ582455 BJ584349

71% to 73A5

MGAGWKEAMALGLAGAGVARVLAISEDVVDNAAVSSFVKQF

MNLEGAVQALLVAVLLGLLIAKLRAPKLNLPPGPVALPIVGNWLQVGDDLNHRNLAEMSQ

KYGDVFLLKMGQRNLVVVSSPEVAKEVLHTQSVEFGSRTRNVVFDIFT

GNGQDMVFTVYGDHWRRMRRIMTVPFFTNKVVQQSRGAWEDEA

LRVIQDLKAKPEAS TTGVVIRKRLQLMMYNIMYRLMFDSRFESEEDPLFLKLKALNGERS

RLAQSFEYNYGDFIPVLRPLLRGYLKVCQEIKDRRLALFKEHFLDERKKLLSTLGPRPDG

EKAAIDLILEAQKRGEINEENVLYIVENINVAA IETTLWSIEWGVAELVNN

PEMQTRIREELDSTLGKGNLITEPDTYNNKLPYLSAFVKEVMRLH

MAIPLLVPHMNLHQAKLAGYDI

PAESKILVNAWWIANNPNHWDQPEKFIPERFLDGKIEAKGDDFRFLPFGSGRRSCPGIIIA

MPLLSIVLGRLVQSLELLPPPGTKKVDVSEKGGQFSLHIATHSTVVCKPIA*

 

>CYP73Ayy CYP73A48 705225761 complete 69% to 73A5 68% to upper seq

686707760 two frameshifts near end near exact overlap with 705225761 (join)

ESTs

BJ156825 BJ601621 AW598785 AW156067 BJ960273 BJ962155

BJ173522 BJ170223 BJ955206 BJ608342 BJ603676 BJ955206

BJ605702 BJ593402 BJ607994 BJ596392 BJ609670 BJ944446

BJ606143 BJ598803 BJ583187 BQ041736 BJ597214 BJ599148

BJ607151 BJ595328 BJ163543 BJ203089 BJ199707 BJ201130

BJ197619 BJ178048 BJ195721 BJ970670 BJ193565 BJ199312

BJ206667 BJ204697 BJ200793 BJ173402 BJ185972 BJ190015

MEAASLASMFTFNNIVQGLCVAVFLGIVIMKLRAPKLKLPPGPFALPIVGNWLQVGDDLNQR

NLAEMSQKYGDVFLLKMGQRNLVVVSSPDIAKDVLHTQGVEFGSRTRNVVFDIFTGNGQDMVF

TVYGEHWRRMRRIMTVPFFTNKVVQHSRYAWEEETDYVIKDLKARPEAATSGVIIRRRLQ

LMMYNIMYRMMFNSRFETEDDPLFVKLKALNGERSRLAQSFDYNYGDFIPI

LRPFLKGYLKTCQDVKDRRLALFKEHFVDERK (2)

KLNSVNPPKSEAEKCAIDHILEAQKKGEINEDNVLYIVENINVAAIETTLWSIEWGIAE

LVNNQEIQTRIRNELDSILGKGNLVSEPDTYNNKLPYLTAFVKEVMRLHMAIPLLVPHMN

INQAKLAGYDIPAESKILVNAWWIANNPKYWDQPEKFMPERFLDGKIEASGNDFRFLP

FGVGRRACPGIIIAMPLLAIVLGRLIQSFELLTPPGVKKIDLTETGGQFSLRIANHSTVV

ARPLAV*

 

>755680881 CYP73A50 76% to 710919188 1005948486 67% to 73Ayy 60% to 73A5

895358221 852123011 complete

CYP73Axx (no ESTs) lower part from AIE to end = 755680881

exon starting with KLGNA has no exact matches in trace archive

This is proably the same seq as 755680881

710919274 759458758 61% to 73A5

MEKPTVVRGLLAIFVIGLVTGVEFKAP

STLDFVFSLDPCRQSLLLIVFTAIVINLRMKDRKMNLPPGPTALPIVGNWLK (0)

VGNDLKHRTLAEMSERYGDVFMLKMGWRNYVVVSSPEAAKGVLHTQGEEFAC

RTRNAVFDIFAGKGQDIVFTNYGDHWRRMRRILTVPFVTAKVLQHAHFAWEEEVDEVMED

VQSRPESATAGVVLRHRFQLMMYNIVYRMMFNSRFASEYDPLYLKLKGLNGERCRLPAQN

YKYNYADFFPFLKLFIRGYLKICQEVRDKRLSLFKEHFVDERR (2)

KLVNAVGVSGAEEKCAIDHMLEAQKKGEISEDNILYLIENINVA (1)

AIESTLLSIEWGIAELVNHPNVQKRLQAELLEVIGEGNLVAEPDTHNNK

LPFLTAVVKETLRLHMPVPLLVPLMNMRQAKLAGYDIPPQSKVLVNAWWIGNNSKFWDQP

EKFMPERFLPAAAENQSLDFRFLPFGAGRRSCPGTAIAMPLLAIVLGRLLQKFDLLPPPG

MSKVDVAESGGQFSLHMATHSIVVLRPRN*

 

>710919188 CYP73A51 717596474 756814972 762526716 61% to 73A5 N-term

883857969 977890599 complete no ESTs

MGNPMGIGVLLVILVFGVVAKVDFKSRILLDFAFSSSLL

MQALLLGVFIVIVVSLRREGRNMNLPPCPAAFPIVGNWLQ (0)

VGNDLKHRKLAEMAQKYGDVFMLRMGRRRFVVVSSPEAAKEVLHTHGVEFASRTRNAVY

DVFAGKGQDIVFTNYGDHWRRMRRIMTVPFVTHKVVKHAHFAWEDEVDHCIRDVEARSE

SATTGVVLRHRFQLMMYNIMYRMMFNSRFADEHDPLYLKLKGLNSERCRLPAQNSKYNY

ADFIPILRPFLRGYLKVCQDVRDRRLTLFKERFVDDQ (2)

KLVNAVGAAGAGEKCAIDHILEAQKKGEISENNVLYLIENINIA (1)

AIESTLLSIEWGVAELVNHPEVQKRVQKEL

DEVLGDGHLVSEPDIHSGKLPFLTAVVKETLRLHMPIPLLVPHMNVKQAKIFGYDIPPES

KILVNAWWIGNNPKFWDQPERFMPERFLSATSEATTVDFRFLPFGAGRRSCPGSAIAVPL

LAIVLGRLVQKIDLLPPPGMTKVDVTESGGQFSLHMATHSTVVTRPRA*

 

>BJ163406 CYP73A52P aa 169-308 mate pair to BJ173377 finished

this seq seen in at least 6 trace archive seqs 1023200166

883856438 869788433 891417707 866058619 884943991

890407139 838586233 816063061

This seems to be an expressed pseudogene 76% to 692433988

LATS*EAGNSELGASFLKHF

MGLESAVQGLLVAVLVGLLIAKLRAPKLKLPPGTVALPIVGNWLQ (0)

VGDDLNIGNLAEMSKKYGDVFMLKMGQRNLVMVSSPEAAKEVLHTQGTEFGSRERNVVY

DILIGNGQDMVFAVYEEDWRKMRRIMTVPFFTSKVVQQSRGTWEDDALRVVEELRTRPEAS

TTGVVIRNRLNSMMYNSIYRLMFDRRFENEEDPLFLRLKALNGGRSRLAQSFEYN*GDFI

PILHPVLRGYQRVCQEDQDRRLGVFNEHFVDVRKKLLETTGP

KPDGEKAAIYLILEAQRKGEISEDNLLYIVENINVAAI ETMLWSIEGGVAELVNNPDI

QTRVSEELDHTLGKGNLITELDTYNPK (deletion)

HWDQPGRFMPERFSGNNIEVSGGDFRFLPFGSGRRSCRSIIIAMPLLTIVLSRLVQS

MGQLSPPGMKKVDVAEKGGQFSRHIATRSTMVCKPIA*

 

>755683069 CYP73A53P 710485186 72% to 73A5 with frameshift and stop codon

pseudogene, stop codon seen in 12 sequences, finished

815536458 41/47 (87%) to 73Ayy 824722097 pseudogene

816099016 87% to CYP73Ayy, suspect these join with 755683069

MATAGTIMMQAVGMASMFTFQNVVQGLIVAMVVWIFVIKLKSPKYKLPPGPVANWLE (0)

FGDDLNHRYLAELTKKYGDIFLLKMGQRNVVVISSPEIAKDALQTQGIAFGSRPRNVVF

DIFSGKGQDMVFTPYGEHWRLMRRITTAPLFT

NKVVQQSRYAWEEEID*VSKDLKTLPE

XXTSGVIIRTKLQLMMYNIIYRMMFTSRFKDEKDPLYLKLKVLN

EDPRYLNLKVLNGEQNRMGQSFDYNSGDFIPIVRPFLRGYLKVC

QDVKDRRIALFKEYFVDERR (2)

KLTSVNPPKSEEQKCAIDYIFEAEKLGEINEDNVLYIVENINVAAIDTTLWSIE*GIAE

LVNNPELDGIREELDAVLGKGNMVIEPDTYNNKLPLLTAFVKEVMRLHMSIPHVVHMN

LKHEKLAGYDIPAESKILVNVWWIGNNPKY*DQ

PEKFMPERFLDGNIEPSGNDFRYLPFGVGRRRCPGIFIAMPLLSIVLGRLIQSFELLPPP

GVKEVDVTEFCGQFSLRIANHSTVVVRPLL*

 

>710518592 CYP73A54P 755698511 44% to 73A5 C-term to PERF finished

pseudogene stop codon seen in 11 seqs

GNFVSEPDTHNSKLQFLRAVSRKS

KVTLPSPLFSLHMNVKQVQIGGYYMFAESKISVIAW*IGNSPKLWDQLEKFMPEK 123

 

>CYP75B like1 CYP767A1 change to 757A1 716895728 755798807 39% to 92A11 38% to 75B1 complete

852155406 977922988 859713404 42% to CYP76C like14 no ESTs

MMEIGGMRAEWHVVLSACVTIATMVL

TIMKLRKKIGKLPPGPRALPLIGNIHQIGDFSRRNLMQMAE

KYGPIMYMRIGSKPLLVVSTAEAAHEFLKTQDKEWADRPTTTADKIFTNDHRNIVCAPYA

AHWRHLRKICTMDLFTPKRLMSFRTPRTEEINQMMTSIHEDVAAGKEVKLHVKLGHLTTNNITRMLLGKR (2)

FFTVDEKGQMEAHRFKELVFELFRASSTPMIGDFIPWLKWVSIASGYVKYLKRVKADLDAFLQEFL

EIKKAASDQATAERAKDFVDLLLEQKTVSGDGPLEDATIRS (0)

DMLLAGTDTVSNAMEWTIAELMRHPECMRKLQQELDTVVGKSRIVSETDLPNLPYLQAVVK

EVMRFYPPAPLSLPHQSIVPTTVCGYDLPAGTQLCINLYAIQRDPKYWPNPVQFNP

DRFLNCDVDVGGTHFQLIPFGAGRRQCPGMPLGNLLLQISVARLVQAFEYSLPRGTK

RNYFMNYYSGANKLTSGIMYLI*

 

>CYP76C like14 CYP761E2 complete 74% to CYP76C like13 43% to 736A1

44% to 92A13 42% to 92A14

692514018 no ESTs

816061186 859980678 755804916 759452715

755804916 759452715

MYFEKTTVARMLTAGFESENGLTGRRVEFYVFLAAIFIMPLVLL

KITRRPRLKLPPSPPAYPIIGHLHLLGKLPHQSMTNLAKKYGEIYSLRLGSVPAIVISTP

EMAKEFLLTNDKIWSSRSVHMTSGYYFSYDYA (1)

GIAFAPSTPVWRSLRKICMSELFTQRRLEASKGLREEEMQYMIR (2)

SILDDAHQGRLIDLKLKINALTANIVARMVLNK

RFTGCIDSTVETEAEAHQFKEMMEEHFLLLGVFMIGDYIPWLSPLDLGGT

EKRMKSLRKRLDAFLDDILEVHEVKRAKGPIPEEDQDVIDVLLNEMYQQDSN

ESKQLDTNNVKSTIL (0)

NLFAGGTDTSTVTIEWAMSEMLRNPTIMGKLKAELDARIGKDRRVRETDLSDLPYLQAVT 530

KETFRLHPVGPLLIPHVSTHDCEVGGYHIPTGTRLYVNVYAIGRNPKVWDRPLE 692

FDPERFMTGLNAGVDVKGKHFHLLPFGTGRRGCPALPLGLLIVQWTLATLVHALDLSL 866

PQSMEPEDVDMTEAYGLTVPXGASLYLNAKLRAADHLY*

 

>BJ166750 CYP754A1 BJ171256 BJ172713 BJ602502 mate = BJ205564 complete

1014495025 879457809 883995654 1014487906 862362793 BJ165056

51% to 75B like3 BJ157072 BJ157394 BJ190531 BJ170915 BJ595929

MVEESWLWVLFVGALSFSILLQWGLNRKRKLKLPPGP

TAWPIVGCVFGLPRLNPPEKLFNKLSEKYGELMLLQLGSWSIVVTSSARMAMEILKTHDN

EFANRPDVISSRLNFNNTGLIQMHSTNPLFKRTRRMFS

AEIVSPRTVLETGVIRRKQ (0)

LRTLRSIVQDFDAGRSVNFTHEMKTLAMNLSMSICFGTDYATKVNDEAEALIHTYK (0)

IMAIWTRRSLGAIFPALRWLDLDGIESGFADVELQLRTNITALIEKKKQEMSMWSAE

DIQAGANEGDVMTKFLSMEGEDRCSEDQLISVVF

TILLAGTDTVFNVVTEAMYALLMHPNFYHRAVEELDAVVGKSRLV

EEADIPKLPMIQNIIKETFRIKPAGPSLVPRKNFEACEVAGYHIPANTTVFVNCIPLMRD

PSFWDSPDEFNPDRFIDSKVTVLGSDFNYLPFGYGKRTCPGLNLGMI

TVQYILAACLQCISWKLSRPRRLDIETDDDPRKVDDVMVDGKQRVDPALLEFAPQPVK*

 

>CYP75B like3 CYP754B1 complete BI437153 AW509529 BJ595585.1 BJ977649.1 BJ969326

713843806 713853618 CYP71 N-term 869925250 881364275

34% to 92A sequences, no high similarity to a known family

36% to Panax ginseng seq. AB122079

yellow exon questionable

MVGDVWIWVLITMVVAVIVGVGIDKKTKRGLKLPPGPPAWPVVGCLASLPAGHPPEVMFAKLAEKHGE

LMLLWLGSKPYVVASSARMAMEFLKRHDQEFANRPMSVVREYVSFKGNSIISMSASDPKY

QRLRRTFVMELLSPKKIAATRDLRKDQ (0)

VLKMLRAIREDLDAKHEANFTEAVLTLGMSLSIGLLFGRDYGGKVFSEEIQTLVLTFKT (0)

MVKYLSMINISDLIPSLRWLDLQGIERGLGLGEVQLRKSIMALIEQKRLDKIRLSSDE

IESGACQRDILSKLLSLEGEDRLDDDQLMGVVF

ALMLAGSDSISRGVGRAMQELLKQPLLHQRALDELDEVVGRRRLVEESDISSLPLINNIIKETLRLHP

PAQLLIPHGNVEQCEVAGYHIPARSTVLVNLYALSRDPSFWNSPLEFAPDRFVDSNLTVQGS

DFHYIPFGYGRRGCPGLNLGMITVQYALALCLQCILWRLPAGATISETYIDWKNSPDLIV

DGDLRVDLHLLEGL*

 

>AB122079 CYP768A1 change to 764A1 Panax ginseng (Apiales) 41% to 92A9 40% to 75B

MFPLAYPLLFVLLGALSWWILPIISPLKRHHKLPPGPRGLPIIG

SLHTLGALPHRTLQTLAKKYGPIMSMRLGSVPTIVVSSPQAAELFLKTHDNIFASRPK

LQAAEYMSYGTKGMSFTAYGPHWRNIRKFVVLELLTPAKINSFVGMRREELGMVVKSI

KEASAANEVVDLSAKVANIIENMTYRLLLGRTKDDRYDLKGIMNEALTLAGRFNIADF

VPFLGPLDIQGLTRQFKDTGKRLDKILEFIIDEHEQNSSNGNASGDFIDDMLSLKNKP

SNTHDELSKVIDRSVIKAIMIDIISAAIDTSDTSIEWILTELIKHPRAMKKCQEEIDA

VVGVDRMVEETDLPNLEYVYMVVKEGLRLHPVAPLLGPHESMEDITINGYFIPKQSRV

IVNSWALGRDPNVWSEDADEFLPERFEGSNIDVRGRDFQLLPFGSGRRGCPGMQLGLI

TVQLVVARLVHCFDWNLPNGITPDNLDMTEKFGLTTPRVKHLLAVPKYRL

 

>CYP76C like1 CYP758A1 complete 715966631 AXOS23673.g1 BQ040580 AW599561 48% to 76C2

BJ609203.1 opp end = BJ201966 BQ040759.1 711801903 BJ168318 BJ166758 BJ158754

BJ160069

CYP76C like4 713876163 46% to 76C2 no ESTs one frameshift and stop codon

no exact matches in trace archive, best match = 715966631

probably a poor version of the 715966631 sequence

MATPDSSGGAFDLAKWINGLVAHWGSVAVAVVAAAVIAKFIFNSTVGRRKL

PPGPAPWPILGNIASLAGLPHRSLEKLARKYGSLMYLRLGEVPCIVISSADVAKQLFKTH

DILFSNRPGGCFFEQLTEYRNITASRYGPHWRHLRKTCVHELFTQKRLEAYQATRLE

EISISIKELFEESDKKGPVDLHAWLHRLLFNNLTRVIMNNR

YFGTDEKGMKDAMDFNNVTALMFSQAGDVVISDFLPYLGFLTRLQGKPLLYR

KTREIVLEMMRRMTNFDERKKLHAEGRSTGEPEDFVDVLLSSTLSDGTTPLPDDICLMLL

MDVLVAGTDTSATTVEWTITELLRHPEAYKRVREELNSVVGSDQLVKEEHLEHLPYLNAVLQ 356

ESFRLHPATPLGLPRESSEAFEFLGYSLPAGTRLFVNQWAIHRDPAVYEQPEEFN 521

PERFLGREALKFIGDTQFQLVPFGSGRRNCAGLPMAVIVIPLVLAHLLHSVEFSLP

DGQQPKDLDMTETFGVAAPKASPLMIYATPRESAALY*

 

>CYP76C like2 This is a hybrid seq of two different genes

713877140 762522648 711800153

692509705 710494763 715965478 755800431 692507811 713794646 43% to 76C2

692476632 879809357 859709904 716891783 717625521 869788525 870057181

AW599509 BJ583667 BJ579892 BJ597932 BJ166653 BJ194761 BJ946070 BJ174288

BJ190667 BJ158648 BJ174717 BJ177099 BJ178535 BJ588129 BJ586603 BJ585830

BJ179043 BJ179518 BJ171222 BJ579424 BJ582321 BJ582678 BJ584175

715978988 756810370 CYP76C like11 51% to 76C4 95% to 76C like2

no other exact matches in trace files

1025182054 = best match same as 76C like2 assume this is the same seq

MAEGGLLFGFELADVLVAAVLISVVVLYFHAEALQRRRCPPGPWPWPVVGN

FSALGDLPYRNLHKLAGKYGGLMYLRL (1)

GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNN

NDENYRNIGLAEYGPYYRKLRRLLNTELFSPRRHASYEVTRAQEI

QCMMKVLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR (2)

FYGVSGDDGEKQ GQDLQK MTSSVFELLGSVV

ISDFVPYLS FITKLQGHASKFSKIRDVSDKLTADFFDLD

SHRNNYKKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ (0)

ELLNAGTETSSNTSEWAMAE

LIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV MKENFRLHPPAPLLLPHESREPTELLGYHFPAGTELLVNAFAIHRDPSVYDNPDSF

DPDRFLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG

QTEVDMTETIGISVRKKQPLFLVPKPRFELSLESVAEN*

 

>713877140 CYP758F2 (no AP insert, LHK insert) complete

also 824697165 893557678

993602306 869943560 997070612 993467812

1020671150 755800431 892704920 883863821

1006335547 857898605 BJ583667 mate = BJ178535

BJ579892 mate = BJ174717

BJ597932 mate = BJ194761

BJ179518 BJ158648 BJ190667 BJ177099 BJ179043

BJ946070 mate = BJ956863

957569174 883845570 713794646

MAEGGLLFGFELADVLVAAVLISVVVLYFH

AEALQRRRCPPGPWPWPVVGNFSALGDLPHRNLHKLAGKYGGLMYLRL (1)

GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNNDENYRNIGLAEYGPYYRKLRRLLNTELFSPRR

HASYEVTRAQEIQCMMKVLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR (2)

FYGVSGDDGEKQGQDLQKMTSSVFELLGSVVISDFVPYLS

FITKLQGHASKFSKIRDVSDKLTADFFDLDSHRNNY

KKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ (0)

EMLNAGTETSSNTSEWAMAELIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV

MKENFRLHPPLLLPHESREPTELLGYHFPAGTEVLVNSFAIHRDPSVYDNPDSFDPDR

FLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG

QTEVDMTETIGLSVSKKQPLFLVPKPRFELSLESVAEN*

 

>830443705 CYP758F1 (AP insertion) 96% identical to 713877140 (no AP insert)

879461440 1020623392 complete

may be alternative splicing in this gene. Two exon 2s exist

863042033 870057181 881406076 833254849

1003202172

BJ579424 mate = BJ174288

BJ588129 mate = BJ182840

BJ585830 mate = BJ180571

BJ586603 mate = BJ181679

828265864 833246011 977999457

(exon2a, short)

GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNND

AFPSQSNNSVAVALSVDVSHLILPILSLEMHCATR

 

MAEGGLLFGFALADVLVAAVLISVVVLYFHAETLQRRRCPPGPWPWPVVGNFSALGDLPHRNLAGKYGGLMYLRL (1)

GAKPCLVISTAAVAKEFYTTVDASFASRPKRFSWTVWNNND

ENYRNIGLAEYGPYYRKLRRLLNTELFSPRRHASHEVTRAQEIQCMMK

VLLEESEKGNPVNLQTWLHGTTSNNMTRMVVGKR (2)

FYGVRVDDSEKERQDLQKMTSSVFELLGSVDLSDFVPYLS

FITKLQGHASKFSKIRDVSDKLTADFFDLDSHRNNY

KKMKNDPSYVPDFEDVLMETPFENGTNLPDQDLLKLLQ (0)

ELLNAGTETSSNTSEWAMAE

LIRRPELIERAQTEMDSVIGSKRLVEESDIQQLPFLQAV MKENFRLHPPAPLLLPHESREPTELLGYHFPAGTELLVNAFAIHRDPSVYDNPDSF

DPDRFLARPHVDHMSTSDPYELMPFGKGLRMCPGYRLANTMVALMLANLLYVFDWSLPEG

QTEVDMTETIGISVRKKQPLFLVPKPRFELSLESVAEN*