CYP4C3 complete AL072439 AI457023 AI296945 AI106780 AL051651 MAESILLSKVGQVISGYSPITVFLLGSILIFLVVYNKRRSRLVKYIEKIPGPAAMPFLGNAIE* MNVDHDELFNRVIGMQKLWGTRIGINRVWQGTAPRVLLFEPETVE* PILNSQKFVNKSHDYDYLHPWLGEGLLTSTDRKWHSRRK* ILTPAFHFKILDDFIDVFNEQSAVLARKLAVEVGSEAFNLFPYVTLCTLDIVC* ETAMGRRIYAQSNSESEYVKAVY GIGSIVQSRQAKIWLQSD FIFSLTAEYKLHQSYINTLHGFSNMVIRERKAELAILQENNNNNNNN APDAYDDVGKKKRLAFLDLLIDASKEGTVLSN* EDIREEVDTFMFEGHDTTSAAISWTLFLLGCHPEYQERVVEELDSIFGDDKETPATMKNLMDMRYLE CCIKDSLRLFPSVPMMAKMVGEDVN* IGGKIVPAGTQAIIMTYALHRNPRVFPKPEQFNPDNFLPENCAGRHPFAYI PFSAGPRNCIGQKFAILEEKAVISTVLRKYKIEAVDRREDLTLLGELILRPKDGLRVKITPRD* AA952162 AI388993 AA538617 AA439290 AA940746 AA941522 AI106700 45% to 4c3 RLFPSSPCLAFAHADGVDLNFGGQLVACRLEAI 370 AC007928 Drosophila melanogaster chromosome 3 clone BACR04O02 This seq is improved over the old seq. At KYG region MLTINLLLAVGALFWIYFLWSRRSKLYFLMLKIPGPIGLPIL GSSLENIITYKRKLSFRTKYLNK*YGSTILTWMGPVPFIVTRDPKVVEDIF SSPDCHNKSQHIVNAITSCMGNGLLGKIDPHWLDRRKHFNPSFKQDLLLS FFHIFDAETKVLMNLLDTYVDKGEIDVVPEMLRWSFKIAA* ETTMGSEVKHDEHFKNGSLV NRKRNFKQTETNFIYLKCDNPIHFYRLISHSTLNILMPLVQNRMIS KICGYDKLRADNFSRIQKMLDNV VNPLPKTDSDPESNIVINRAMELYRKGDITYM DVKSECCIMIAAGYDTSALTVYHALFLLANHPEHQEAVFEELNGVFPDAGHFGITYPDMQ KLDYLERVIKETLRLIPAIPITARETKNDVRLSNGVLIPKGVVIGIDMFHTHRNPEVWGP DADNFNPDNFLAENMEQKHPYAYIPFARGKRNCIG SKYAMMSSKFALCRILRNYKISTSTLYKDLVYVDNMTMKLAEYPRLKLQRRG* AC007724 Drosophila melanogaster chromosome 3 clone BACR30N15 C-helix PMGFPFIGLNVFTQVRYACVPFYLKYME KR*YGKTVLTWIGLTPVLVTCEPKILEDIFTSPNCSNRSSVVDKAISSCLGLGLL *VSIDNHWNERRKLLLPSFKNNAVLSFVPVLN NEANFLVTLLAEFVDGGDINLLPELNKWSFKIAARK* AC017371 Query: 1 PMGFPFIGL----------------------NVFTQVRYACVPFYLKYMEKRY 31 PMGFPFIGL NVFTQVRYACVPFYLKYMEKRY Sbjct: 6454 PMGFPFIGLAFEYIRLKRK*LYFRI*LGKTKNVFTQVRYACVPFYLKYMEKRY 6612 AL076582 AL074717 AL072640 N-term to C-helix also AC007724 71656-71049 MDTFQLLLAVGVCFWIYFLWSRRRLYMMHFKIPGPMGLPILGIAFEYLITYKR* YGSTCLVWVGPTPFVITRDPKIAEEIFLSPECLNRSSIFSKPVNSCTGDGLLSLEGN KWVDRRKNLNPAFKQNVLLSFLPIFNSEAKTLVAFLDSLVGQGEKKVRDDIVRWSFRIAT* ETTVGTDVKKDASFKNDSVL Cyp4d1 AF016992 see also AF016993 to AF017004, MWLLLSLVLLLAIIALEMRRFLRNMRTIPGPLPLPLLGN AHIFLGLTPAEACLKIGELAERHGDTFGLFLGPSYSVMLFNPRDVERVLG SSQLLTKSQEYSFLGRWLNEGLLVSNGRKWHRRRKIITPAFHFRILEPYV EIFDRQSLRLVEELALRISRGQERINLGEAIHLCALDAICETA*MGVSIN AQSNADSEYVQAVKTISMVLHKRMFNILYRFDLTYMLTPLARAEKKALNV LHQFTEKIIVQRREELIREGSSQESSNDDADVGAKRKMAFLDILLQSTVD ERPLSNLDIREEVDTFMFEGHDTTSSALMFFFYNIATHPEAQKKCFEEIR SVVGNDKSTPVSYELLNQLHYVDLCVKETLRMYPSVPLLGRKVLEDCEIN GKLIPAGTNIGISPLYLGRREELFSEPNSFKPERFDVVTTAEKLNPYAYI PFSAGPRNCIGQKFAMLEIKAIVANVLRHYEVDFVGDSSEPPVLIAELIL RTKEPLMFKVRERVY CLOSEST FIRST 4D1 EXON (NOT USED, SKIPPED IN 6/7 ESTS) AI062205 MFLVIGAILASALFVGLLLYHLKFKRLIDLISYMPGPPVLPLVGHGHHFI GKPPHEMVKKIFEFMETYSKDQVLKVWLGPELNVLMGNPKDVEVVLGTLR FNDKAGEYKALEPWLKEGLLVSRGRKWHKRRKIITPAFHFKILDQFVEVF EKGSRDLLRNMEQDRLKHGESGFSLYDWINLCTMDTIC MORE DISTANT FIRST CYP4D1 EXON (THIS ONE IS USED) 6 ESTS MATCH THIS ONE MWLLLSLVLLLAIIALEMRRFLRNMRTIPGPLPLPLLGN AHIFLGLTPAEACLKIGELAERHGDTFGLFLGPSYSVMLFNPRDVERVLG SSQLLTKSQEYSFLGRWLNEGLLVSNGRKWHRRRKIITPAFHFRILEPYV EIFDRQSLRLVEELALRISRGQERINLGEAIHLCALDAICETA* Cyp4d2 Al009194 14092-15781 also X75955, AL023401 Cyp4d2 STS MLGVVGVLLLVAFATLLLWDFLWRRRGNGILPGPRPLPFLGNLLMYRGLD PEQIMDFVKKNQRKYGRLYRVWILHQLAVFSTDPRDIEFVLSSQQHITKN NLYKLLNCWLGDGLLMSTGRKWHGRRKIITPTFHFKILEQFVEIFDQQSA VMVEQLQSRADGMTPINIFPVICLTALDIIAETAMGTKINAQKNPNLPYV QAVNDVTNILIKRFIHAWQRVDWIFRLTQPTEAKRQDKAIKVMHDFTENI IRERRETLVNNSKETTPEEEVNFLGQKRRMALLDVLLQSTIDGAPLSDED IREEVDTFMFEGHDTTTSAISFCLYEISRHPEVQQRLQQEIRDVLGEDRK SPVTLRDLGELKFMENVIKESLRLHPPVPMIGRWFAEDVEIRGKHIPAGT NFTMGIFVLLRDPEYFESPDEFRPERFDADVPQIHPYAYIPFSAGPRNCI GQKFAMLEMKSTVSKLLRHFELLPLGPEPRHSMNIVLRSANGVHLGLKPRA AC015418 = AI261013 in ordered fragments 46% to 4D1 ELREKHGPVFRIWFGKDLMVMFTDPEDIKQLLGNNQLLTKSRNYELLEPWLGKGLLTNGGESWHRRRK LLTPGFHFRILSEFKEPMEENCRILVRRLRTKANGESFDIYPYITLFALDAICETAMGI KKHAQLQSDSEYVQAV HSICRVMHKQSFSFWQRLNVFFKHTKPGKEREAALKVLHDETNRVIRLRREQL IQERNEWKPEAEQDDVGAKRRLAFLDMLLLTQMEGGAELSDTDIREEVDTFMFEGH DTTSSAIAFALSLLSKNPDVQQRAFEEASELEGREKESMPYLEAVIKETLR IYPSVPFFSRKVLEDLEVGKLTVPKGASISCLIYMLHRDPKNFPDPERFDPDRFLVNE KQMHPFAFAAFSAGPRNCIG QKFAMLELKTSLAMLLRSYRFLPDKDHQPKPLAELVTKSGNGIRLRILPRDENGTTA* AI402187 GH09810 94% identical to Cyp4d2 7 diffs probably is 4d2 PPVPMIGRWFAEDVEKRGKHIPAGTNLTMGIFVLPRDPEYFESPDEFRPE RFDADVPQIHPYAYIPFSAGPRNCIGQKFAMLEMKSTVSKLLRQFELLPL APEPRQLMNIVLRSANGVHLGLKPRA Cyp4d8 MLLFLLVVLLFGAGWIIHLGQADRRRKVGNLPGPICPPLIGAMQLMLRLNPK TFIKVGREYVLKFGHLQRVWIFNRLLIMSGDAELNEQLLSSQEHLVKHPVYKVLGQWLGN GLLLKDGKVWHQRRKIITPTFHFSILEQFVEVFDQQSNICVQRLAQKANGNTFDV YRSICAAALDIIAETAMGTKIYAQANESTPYAEAVNE* CTALLSWRFMSVYLQVELLFTLTHPHLKW RQTQLIRTMQEFTIKVIEKRRQALEDQQSKLMDTADEDVGSKR RMALLDVLLMSTVDGRPLTNDEIREAVDTFM FEGHDTTTSALSFCLHELSRHPEVQAKMLEEIVQVLGTTRSRP VSIRDLGELKYMECVIKESLRMYPPVPIVGRKLQTDFKYS* DGVIPAGSEIIIGIFGVHRQPETFPNPDEFIPERHE NGSRVAPFKMIPFSAGPRNCIGQKFAQLEMKMMLAKIVREYELLPMGQRVECIVNIVLR SETGFQLGMRKRKHN* AC004717 DS01529 also AC005834 comp(100737-101282) P1s DS01589 DS02501 MWILLGIAVLIMTLVWDNSRKQWRVNTFEKSRILGPFTIPIVGNGLQALT LRPENFIQRFGDYFN*KYGKTFRLWILGECLIYTKDLKYFESILSSSTLL KKAHLYRFLRDFLGDGLLLSTGNKWTSRRKVLAPAFHFKCLENFVEIMDR NSGIMVEKLKNYADGKTCVDLFKFVSLEALDVTT*ETAMGVQVNAQNEPN FPYTKALK*SVVYIESKRLASVSMRYNWLFPLAAPLVYRRLQKDIAIMQD FTDKVIRERRAILERARADGTYKPLSTLT*DDIGGKAKMTLLDILLQATI DNKPLSDVDIREEVDVFIFAGDDTTTSGMSHALHAISRHPKVQECISEHV VSVLVPDPDASVTQTKLLEVKYLECVIKQTMRLHPPVPILGRYIPEDLII GEIAIPGNTSI*LMPYYVYRDPEYFPDPLVFKPERWMDMKTTSNTPPLAY IPFSSGPKNCIGQKFANLQMKALISKVIRHYELLPLGADLKATYTFILSS STGNNVGLKPRTRVK* AI106625 AI517120 N-term 65% to AC005834 MWLTLITGALILLLTWDFGRKRQRVLAFEKSAIPGPISIPILGCGLQALHLGAENIIGWV GEKFDKYGKTFRFWILGESLIYTKDLQYFETILSSTTLLEKGQLYEYLRPFLNDGLLVST GRKWHARRKIFTHAFHFKVLEHYVEIMDRHSNVMVDNL RKVADGKTAVDMLKYVSLAALDVIT* EAAMGVQVNAQNDPDFPYIKALKSVVYIQPDRMFRFSRRYNWLFPLAAPLLHRQLLSDIR VMHDFTDKVISERRETVRRAKADGTYRPLSGCT AEIGSKSQMALLDILLQSSINNQPLSDADIREEVDTFMFEGDDTTSSG VSHALYAIARHPEVQQRIFEELQRVLGPDASASGTQAQLQDLKYLDCVIKETMRLY PPVPAIGRHAQKELEIGDKTIPANTSIYLVLYYAHRDANYFPDPLSFRPERFLEDQEQGH NTFAYVPFSAGPKNCIGQKFAVLEMKVLISKVLRFYELLPLGEELKPMLTFILRSASGIN VGLRPRKALR AC017388 = AC010113 AC010557 35538 MILTATFICFCLASAFNYFRARRQRSLIKNLKGPFTWPLMGAMHKLLFLTPI 35744 NFFQRSTEYLTKYGTFSRCWVFHRLFIPLADLELSRQLLENDTHLETGYELMKDWLV 35913 35914 GGVLMCQSEQWQKRHSLISGLFDKGNLEQLIDLSRHQTEQLLQKLAKQADQKVFDIWYTV 36093 36094 SPIVLDLMVMTTCGAKPSEEYSKNLKD LSEIYRKRFLSLQ 36273 36274 SANRFNYWLSSPFMRKRQNRLIKRLNDEHNNLMAMHQSQNQLKIENGLDIYQLRPIP 36444 36445 LKDHKSLLEILLESKDPQLTGEEICGELNTCNYLGYQLCSPALCFCLVTIARNPSVQQK 36621 36622 CLDELNLAQIKDQGWDLEKLNYLDAVLHETMRLYPPQVIVGRQLKKDFPYN 36774 AELPCGSEIYINLYELQRNEVRYPKANHFDAQR 36948 36949 FLDSPPELLSYSLGPRCCPARKFSMQLLKTLLAPILANFEVL 37074 PYGDEVRLDLRLVLGSSNGFQLALKPR* Cyp4d10 U91634 Drosophila mettleri MSLSLPPLIAVACLVVALARISWLPLRSWLRRRRRTHQLAAQLPGPRNLP LLGNFHMFFGLEPWQVPHLINQLAKKYDGTFKLKMGSNFSLMMFQPRDIE VVLGSSQLLDKAVEYSFLRGWLNDGLLLSGGRKWHRRRKIITPAFHFRIL ESYDEIFDRQTRLLIHKWQQTLGHSFDLGHDVHLFTLDVICETAMGVSTN AQTNADSDYVRAVKTISTVLHKRMFNIFYRFDLTYMLTPLAWAERRALNV LHKFTEKIIVQRREELLRGGVTQTTDGADVGAKSKMVFLDILLQSNIDDK PLTNLDIREEVDTFMFEGHDTTSSGITFFFYNIALYPECQRKCVEEIVSV LGKDTETPVTYDLLNNLNYMDLCIKETLRMYPSVPLLGRKVLQECEINGK IIPAGTNIGISPLFLGRSEDISSEPNTFKPERFDVVTSAEKLNPHAYIPF SAGPRNCIGQKFAMLEIKAIAANVLRHYEIEFVGNAEESPVLIAELILRT KDPLMFKLKKRVI AC010003 chromosome 3L/75B13 clone RPCI98-6L11, = AC015424 AC009369 AC007573 MFWLGFGLLLLALSLYLLYVFERQSRIDRLTHKWPAPPALPFIGHLHILAKLVGPHPLRRATEMINEHLHDHRAKLWMGT KLYLVDCNPKDIQALCSAQQLLQKTNDYRVFENWLCEGLFTSGFEKWSHRRKIVMPAFNYTMIKQFVAVFEKQSRILLTN VAKFAESGDQIDFLQLISCFTLDTICETALGVSVGSQSSAKSEYLDAVKS*ILVIIDKRLKNIFYRNSFIFKRTSHYKR EQELIKTLSGFTEGIIQKRIDEINQDAENRNYQSSDAELDGVKRTLCFLDTLLLSKGPDG MFLTVKDIREEVDS FLFGGFDLTATTLNSLMYNMTLHPEHQQRCREEVWSVCGKDKSEPISIEQVRQLEFLEAC IKETLRMYPSGPLTARKATANCTISKYIIPKGSDVIISPI YMGRCKDFFPDPMVFKPDRWAIGAEPKLRHHF FIPFMAGARSCMGQRYAMVMLKMVLAHLLRNFLFEPLXERQVELKLNFVITLHTVDPIYVELK Cyp4d14 AL009194 152A3.2 comp(10682..12421) 55% identical to Cyp4d2 MYLELFAILLATALAWDYMRKRRHNKMYAEAGIRGPKSYPLVGNAPLLIN ESPKTIFDMQFRLIAEFGKNIKTQMLGESGFMTADSKMIEAIMSSQQTIQ KNNLYSLLVNWLGDGLLISQGKKWFRRRKIITPAFHFKILEDFVEVFDQQ SATMVQKLYDRADGKTVINMFPVACLCAMDIIAETAMGVKINAQLQPQFT YVQSVTTASAMLAERFMNPLQRLDFTMKLFYPKLLDKLNDAVKNMHDFTN SVITERRELLQKAIADGGDADAALLNDVGQKRRMALLDVLLKSTIDGAPL SNDDIREEVDTFMFEGHDTTTSSIAFTCYLLARHPEVQARVFQEVRDVIG DDKSAPVTMKLLGELKYLECVIKESLRLFPSVPIIGRYISQDTVLDGKLI PADSNVIILIYHAQRDPDYFPDPEKFIPDRFSMERKGEISPFAYTPFSAG PRNCIGQKFAMLEMKSTISKMVRHFELLPLGEEVQPVLNVILRSTTGINC GLKPRVY AC015418 2797- = AI261013 in ordered fragments 46% to 4D1 MSTLALVAFVLWAAFLRYLPKILNFLRLQRFAKTLPGPTIGELIANVKKGGEFVQRELIK ELREKHGPVFRIWFGKDLMVMFTDPEDIKQLLGNNQLLTKSRNYELLEPWLGKGLLTNGGESWHRRRK LLTPGFHFRILSEFKEPMEENCRILVRRLRTKANGESFDIYPYITLFALDAICETAMGI KKHAQLQSDSEYVQAV HSICRVMHKQSFSFWQRLNVFFKHTKPGKEREAALKVLHDETNRVIRLRREQL IQERNEWKPEAEQDDVGAKRRLAFLDMLLLTQMEGGAELSDTDIREEVDTFMFEGH DTTSSAIAFALSLLSKNPDVQQRAFEEASELEGREKESMPYLEAVIKETLR IYPSVPFFSRKVLEDLEVGKLTVPKGASISCLIYMLHRDPKNFPDPERFDPDRFLVNE KQMHPFAFAAFSAGPRNCIG QKFAMLELKTSLAMLLRSYRFLPDKDHQPKPLAELVTKSGNGIRLRILPRDENGTTA* Cyp4e1 K00045, AC005451 22741-24838 AI135768 AI293313 MWIVLCAFLALPLFLVTYFELGLLRRKRMLNKFQGPSMLPLVGNAHQMGN TP*TEILNRFFGWWHEYGKDNFRYWIGYYSNIMVTNPKYMEFILSSQTLI SKSDVYDLTHPWLGLGLLTSTGSKWHKHRKMITPAFHFNILQDFHEVMNE NSTKFIDQLKKVADGGNIFDFQEEAHYLTLDVICDTAMGVSINAMENRSS SVVQAFKD*ITYTFNMRAFSPWKRNKYLFHFAPEYPEYSKTLKTLQDFTN EIIAKRIEVRKSGLEVGIKADEFSRKKMAFLDTLLSSKVDGRPLTSQELY EEVSTFMFEGHDTTTSGVGFAVYLLSRHPDEQEKLFNEQCDVMGASGLGR DATFQEISTMKHLDLFIKEAQRLYPSVPFIGRFTEKDYVIDGDIVPKGTT LNLGLLMLGYNDRVFKDPHKFQPERFDREKPGPFEYVPFSAGPRNCIGQK FALLEIKTVVSKIIRNFEVLPALDELYDPILSASMTLKSENGLHLRMKQR LVCDST* Cyp4e2 U56957, X86076, U34332 AC005451 MWFVLYIFLALPLLLVAYLELSTFRRRRVLNKFNGPRGLPLMGN AHQMGKNPSEILDTVFSWWHQYGKDNFVFWIGTYSNVLVTSSKYLEF ILSSQTLITKSDIYQLTHPWLGLGLLTSTGSKWHKHRKMITPAFHFNILQ DFHEVMNENSTKFIKHLKTVAAGDNIFDFQEQAHYLTLDVICDTAMGVSI NAMENRSSSIVQAFKDMCYNINMRAFHPLKRNELLYRLAPDYPAYSRTLK TLQDFTNEIIAKRIEAHKSGAVSTNAGDEFTRKKMAFLDTLLSSTIDGRP LNSKELYEEVSTFMFEGHDTTTSGVSFAVYLLSRHQDEQRKLFKEQREVM GNSELGRDATFQEISQMKYLDLFIKEAQRVYPSVPFIGRFTEKDYVIDGD LVPKGTTLNLGLVMLGYNEKVFKDPHKFRPERFELEKPGPFEYVPFSAGP RNCIGQKFALLEIKTVVSKIIRNFEVLPALDELVSKDGYISTTIGLPDAE RKKRDPYRHKYDPILSAVLTLKSENGLYIRLKERH AC007291 Cyp4e3 BACR02I05 comp(115634-117642) also U34330 MWLAVLALLVLPLITLVYFERKASQRRQLLKEFNGPTPVPILGNANRIGK NP*TEILSTFFDWWYDYGKDNFLFWIGYSSHIVMTNPKQLE*YILNSQQL IQKSTIYDLLHPWLGHGLLTSFGSKWHKHRKMITPSFHFNILQDFHEVMN ENSAKFMTQLKKASAGDTIIDFQEHANYLTLDVICDTAMGVPINAMEQRD SSIVQAFREMCYNINMRAFHPFKRSNRVFSLTPEFSAYQKTLKTLQDFTY DIIEKRVYALQNGGSKEDHDPSLPRKKMAFLDTLLSSTIDGRPLTRQEIY EEVSTFMFEGHDTTTSGVSFSVYLLSRHPDVQRKLYREQCEVMGHDMNRS VSFQEIAKMKYLDLFIKEAQRVYPSVPFIGRYCDKDYDLDGSIVPKGTTL NLALILLGYNDRIFKDPHHFRPERFEEEKPAPFEYLPFSAGPRNCIGQKF ALLELKTVISKVVRSFEVLPAVDELVSTDGRLNTYLGLAPDEKLKREAGR HKYDPILSAVLTLKSDNGLHLRLRERRS* Cyp4e4 AL009194 152A3.6 16210-17867 also U34331 MLVVLLVALLVTRLVASLFRLALKELRHPLQGVVPSVSRVPLLGAAWQMR SFQPDNLHDKFAEYVKRFGRSFMGTVLGHVVMVTAEPRHIDALLQGQHQL KKGTMYFALRGWLGDGLLLSRGKEWHTMRKIITPTFHFSILEQFVEVFDR QSSILVERLRTLSYGNEVVNIYPLVGLAALDIITETAMGVNVDAQGADSE VVHAVKDLTNILATRFMRPHLLFPHLFRLCWPSGFRKQQAGVICLHEFTN GIIEQRRRLLAREANQDKPTKPHALLDTLLRATVDGQPLTDKQIRDEVNT FIFEGHDTTTSAVSFCLYLLSRHEAVQQKLFEELRMHYGQDLFRGVILSD FATLPYLSCVVKESLRLYPPIPAVARCLEKDLVIDEGYIPVGTNVVVLLW QLLRDEAIFTDPLVFQPERHLGEEAPRLSPYSYIPFSAGPRNCIGQKFAL LEMKTMVTKVIRHYQLLPMGADVEPSIKIVLRSKSGVNVGLRPRLY AC005451 comp(14729-16905) AC005415 ESTs AA202364 (LD02646) AI297753 MFLIAIAIILATILVFKGVRIFNYIDHMAGIMEMIPGPTPYPFVGNLFQF GLKPAEYPKKVLQYCRKYDFQGFRSLVFLQYHMMLSDPAEIQNILSSSSL LYKEHLYSFLRPWLGDGLLTSSGARWLKHQKLYAPAFERSAIEGYLRVVH RTGGQFVQKLDVLSDTQEVFDAQELVAKCTLDIVCENATGQDSSSLNGET SDLHGAIKDLCDIVCENAVVQERTFSIVKRFDALFKFSQYRARINKTPKL ITSQIISQRRHQLAAENTCQQGQPINKPFLDVLLTAKLDGKVLKEREIIE EVSTFIFTGHDPIAAAISFTLYTLSRHSEIQQKAAEEQRRIFGENFAGEA DLARLDQMHYLELIIRETLRLYPSVPLIARTNRNPIDLDGTKVAKCTTVI MCLIAMGYNEKYFDDPCTFRPERFENPTGNVGIEAFKSVPFSAGPRRCIG EKFAMYQMKALLSQLLRRFEILPAVDGLPPGINDHSREDCVPQSEYDPVL NIRVTLKSENGIQIRLRKR* Cyp4e5 U78486 Drosophila mettleri MWFIVYILLALPIMLFVFLSCEWPKRNDAEQIEWSSGVPFLGNAHQMGKT PAEILNTFFEFWHKYNKDNFRIWIGYYANILVSNPKHLEVIMNSTTLIEK LDIYDMLHPWLGEGLLTSKGSKWHKHRKMITPTFHFNILQDFHQVMNENS AKFIKRLKEVSAGDNIIDFQDETHYLTLDAICDTAMGVTINAIEKRDTVD VVKAFKDMCHIINMRAFRPLQRSDFLYRFSPEYATYAKTLKTLKDFTNDI IAKRIKVHRTAAAKTNQEGSEFSRKKMLPDTLLSATIDGRPLNQQEIYEE VSTFMFEGHDTTTSGVAFAGYILSRFPEEQRKLYEEQQAVMGNELNRDAT FQEISAMKYLDLFIKEAQRVYPSVPFIGRYTDKDYNIHGTIMPKGTTLNL GIIVLGYDDRVFEEPHRFYPERFEKQKPGPFEYVPFSAGPRNCIGQKFAL LELKTVISKLVRTFEVLPAVDELVSKDGNLNTYVGLPKEEKERKERMGYK YDPILSAVLTLKSENGLHLRLR Cyp4g1 AL009188 also U34328, MAVEVVQETLQQAASSSSTTVLGFSPMLTTLVGTLVAMALYEYWRRNSRE YRMVANIPSPPELPILGQAHVAAGLSNAEILAVGLGYLNKYGETMKAWLG NVLLVFLTNPSDIELILSGHQHLTKAEEYRYFKPWFGDGLLISNGHHWRH HRKMIAPTFHQSILKSFVPTFVDHSKAVVARMGLEAGKSFDVHDYMSQTT VDILLSTAMGVKKLPEGNKSFEYAQAVVDMCDIIHKRQVKLLYRLDSIYK FTKLREKGDRMMNIILGMTSKVVKDRKENFQEESRAIVEEISTPVASTPA SKKEGLRDDLDDIDENDVGAKRRLALLDAMVEMAKNPDIEWNEKDIMDEV NTIMFEGHDTTSAGSSFALCMMGIHKDIQAKVFAEQKAIFGDNMLRDCTF ADTMEMKYLERVILETLRLYPPVPLIARRLDYDLKLASGPYTVPKGTTVI VLQYCVHRRPDIYPNPTKFDPDNFLPERMANRHYYSFIPFSAGPRSCVGR KYAMLKLKVLLSTIVRNYIVHSTDTEADFKLQADIILKLENGFNVSLEKR QYATVA AL067059 (BACR015N20), AL057969 (BACR024M08) Drosophila melanogaster MSLMFLGAECSSMVLAAFETSAHSVFFALVLLAMFPEHQXLVFXXFKEHFLLAKG IEVTHTVLQX 425 LXFXVRXLNETLRLMPSVPFSSRETLEDLRXSXGVVIPKGMTFSIDIFNTQRNTXX WG 251 SEAAQFNPENFLPEKIHDRHPYAFIPFSKGKXNCIG 143 Cyp4g15 AC013897 AI403684 AI388987 N-term AI134510 Cyp4g15 C-term MEVLKKDAALGSPSSVFYFLLLPTLVLWYIYWRLSRAHLYRLAGRLPGPRGLPIVGHLFD VIGPASSVFRTVIRKSAPFEHIAKMWIGPKLVVFIYDPRDVELLLSSHVYIDKASEYKFF KPWLGDGLLISTGEKWRSHRKLIAPTFHLNVLKSFIELFNENSRNVVRKLRAEDGRTFDC HDYMSEATVEILLGE*TAMGVSKKTQDKSGFEYAMAVMRMCDILHARHRSIFLRNEFGFT LTRYYKEQGRLLNIIHGLTTKVIRSKKAAFEQGTRGSLAQCELKAAALEREREQNGGVDQ TPSTAGSDEKDREKDKEKASPVAGLSYGQSAGLKDDLDVEDNDIGEKKRLAFLDLMLESA QNGALITDTEIKEQVDTIMFEGHDTTAAGSSFFLSLMGIHQDIQDRVLAELDSIFGDSQR PATFQDTLEMKYLERCLMETLRMYPPVPLIARELQEDLKLNSGNYVIPRGATVTVATVLL HRNPKVYANPNVFDPDNFLPERQANRHYYAFVPFSAGPRSCVG*RKYAMLKLKILLSTIL RNYRVYSDLTESDFKLQADIILKREEGFRVRLQPRTS* Cyp4p1 U34327, AC008186 comp(2-88, 148-465) EST AI293255 TSIGLIFGLMNMSLNPDKQELCYQEIQEHIDDDLSNLDVGQLNKLKYLEY FMKETTRLFPSVPIMGREAVQETELANGLILPKGAQITIHVFDIHRNAKY WDSPEEFRPERFLPENVQDRHTYAYVPFSAGQRNCIGKKYAMQEMKTLMV VLLKQFKVLKAIDPQKIVFHTGITLRTQDKIRVKLVRRT* Cyp4p2 AL055774 BACR021H19 80% identical to Cyp4p1 ETAMGIKLDEMAEKGDRYRANFHIIDEGLTRRIVNPLYWDDCVYNMFTG HKYNAALKVVHEFSREIIAKRRVLLEEELENRRATQTADDDM*KETFAML DTLICAEKDGLIDDIGISEEVDTLMAEGYDTTSIGLVFGLMNMSLYAAEQ ELCYQEIQEHIXDDLSNLNLSQLSKLNYLGYFIKETMRLYPSIPIMGRQT LQETELENGLILPKRSQINIHVFDI Cyp4p3 AC008186 48361-48564, 48626-49174 AL063060 FAMLDTLILAEKDGLIDHIGICEEVDTLMFEGYDTTSIGVVLGKRNMSLYAAEQNLC SQEIQEHIXDDLSNLTLSQLSKLNYLGYFIKETRRRYLSIPIMGRQTLQEPELENGLFLPKRSQINI HVFDIHRNPKYWESPEEFRPERFLPQNCLKRHPYAYIPFSAGQRNCIGK KYAMQEMKTLMVVILKHFKILPVIDPKSIVFQVGITLRFKNKIKVKLVRRNCV* N-term A AI389883 56% to 4p Nterm about 15 aa short to overlap with AL055774 MMICLLWISVAILVVIHWIYKVNKDYNILAFFARRVQTKDGKP LDSLVPMIKGRTVFANCFDLLGKDTDQVFTHLRQLAKNSGDSYLQYSMGF SNFNVIDAHNAANILNHPNLITKGVIYNFLHPFLRTGVLTATEKKWHTRR SMLTRTFHLDILNQFQEIFIAESLKFVSQFQGQNEV N-term B follows Cyp4p3 sequence on AC008186 MLILWLVGAFIVLIQWIYRLNRDYCILGFFAKRIRTKNGQNPESIAPLVKGSTIFANSFD LYGKDHAGVFEHSRDCAKKLGKSYAEYAMGTAIYNVIDADSAERVLNDPNLINKGTIYDFLHPFLRTGLLTSTG N-term C AI405947 GH26104, AI109737 GH09012 AL063060 MIILWLILALSALLYWLHRANKDYHILSFFTKRIRLKDG TPVEIIAPIAKGKTIFGNTLDLYGRDHAGVFNYSRERAKEMGTSYIEYVF GKAIYNIIDADSAENVLNHPNLITKGLVYNFLHPFLRTGLLTSTGKKWHA RRKMLTPTFHFNILNQFQEIFKTESQKFLLQFEGQDEVTITLHDVIPRFT LNSICETAMGVKLDEMAEKGDRYR AC018129 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered Cyp4p1 and N-term C intact 69662 MIILWLILALSALLYWLHRANKDYHILSFFTKRIRLKDGTPVEIIAPIAKGKTIFGNTLD 69483 69482 LYGRDH 69465 69405 AGVFNYSRERAKEMGTSYIEYVFGKAIYNIIDADSAENVLNHPNLITKGLVYNFLHPFLR 69226 69225 TGLLTSTGKKWHARRKMLTPTFHFNILNQFQEIFK 69058 69010 TESQKFLLQFEGQDEVTITLHDVIPRFTLNSICET 68831 68830 AMGVKLDEMAEKGDRYRENFSQIEECFIRRLSNPLLWGDKLFEMFAAKDFASALDVVHRF 68651 68650 SSEIIAKRRDLLKDELDKSSSTADDDXX 68573 68501 KKRFAMLDTLIYAEKDGLIDHIGICEEVDTLMFEGYDTTSIGLIFGLMNMSLNPDKQELC 68322 68321 YQEIQEHISDDLSNLDVGQLNKLKYLEYFMKETTRLFPSVPI 68142 68141 MGREAVQETELANG 68103 68100 LILPKGAQITIHVFDIHRNAKYWDSPEEFRPERFLPENVQDRHTYAYVPFSAGQRNCIGK 67921 67864 KYAMQEMKTLMVVLLKQFKVLKAIDPQKIVFHTGITLRTQDKIRVKLVRRT* 67706 Cyp4p3 N-term B intact matches 1st part of 4p3 but this was a chimera 67385 MLILWLVGAFIVLIQWIYRLNRDYCILGFFAKRIRTKNGQNPESIAPLVKGSTIFANSFD 67206 67205 LYGKDH 67105 AGVFEHSRDCAKKLGKSYAEYAMGTAIYNVIDADSAERVLNDPNLINKGTIYDFLHPFLR 66926 66925 TGLLTSTG 66869 KKWHARRKMLSPTFHFNILNQFQEIF 66768 66708 RTESLKFLEQFKGNDEAIISLNEVIPRFTLNSIC 66607 66565 ETAMGVKLDEMAEKGDRYRENFRQIEECFIRRMSNPLLWSDTLFKMFAEKDYAS 66386 66385 ALDVVHGFSSEIIAKRRDQLNDEIDSRGNTQTAEDEL 66272 66213 KKRFAMLDTLILAEKDGLIDHIGICEEVDTLMFEGYDTTSIGLMFGLMNMSLYPEEQEKC 66034 66033 YQEIQANISDELNILNIGQLNKLKNLEYFIKETMRLFPSVP 65854 65853 AMGRETTRETELSNGLILPKGSQIFVHVFDIHRNPEYWDSPEEFRPERFLPENSQNRHTY 65674 65673 AYIPFSAGQRNCIGM 65578 KFAMQEMKTLMVALLKQFQILPEIDPKTIVFQTGLTLRTKNQIHVFDI 65435 Cyp4p2 (plus N-term A) 72180 MMICLLWISVAILVVIHWIYKVNKDYNILAFFARRVQTKDGKPLDSLVPMIKGRTVFANCFDLL 72001 72000 GKDTDQ 71992 71902 VFTHLRQLAKNSGDSYLQYSMGFSNFNVIDAHNAANILNHPNLITKGVIYNFLHPFLRTG 71723 71722 VLTATGKKWHTRRSMLTRTFHLDILNQFQEIF 71573 71448 RAESLKFVSQFQGQNEVVVSLKDRISRFTLNSIC 71308 71264 ETAMGIKLDEMAEKGDRYRANFHIIDEGLTRRIVNPLYWDDCVYNMFTGHKYNAALK 71085 71084 VVHEFSREIIAKRRVLLEEELENRRATQTADDDM 70980 70912 KKRFAMLDTLICAEKDGLIDDIGISEEVDTLMAEGYDTTSIGLVFGLMNMSLYAAEQELC 70733 70732 YQEIQEHIL 70647 DDLSNLNLSQLSKLNYLGYFIKETMRLYPSIPIMGRQTLQETELENGLILPKRSQINI 70468 70467 HVFDIHRNPKYWESPEEFRPERFLPQNCLKRHPYAYIPFSAGQRNCIG 70287 KKYAMQEMKTLMVVILKHFKILPVIDPKSIVFQVGITLRFKNKIKVKL 70108 70107 VRRNCV* 70099 AC008324 chromosome 2 clone BACR25K01 (D854)Gene 1 76245-78135 MFLEVLFAAPLVIFIFRKLWAHLNRTYFILSLCKRIRTEDGSLLESKIYVAPSKTRFGNNFDLVNFT TESIFNFMRDASAKAKGRNYLWYFFHAP MYNIVRAEEAEEILQSSKLITKNMIYELLKPFLGEGLLISTDQKWHSRRKALTPAFHFKVLQSFLIIFK EECNKLVKVLHQSVNMELELNQVIPQFTLNNVC ETALGVKLDDLSEGIRYRQSIHAIEEVMQQRLCNPFFYNIVYFFLFGDYRKQV NNLKIAHEFSSNIIEKRRSLFKSNQLGQEDEFGKKQRYAMLDTLLAAEADGQI DHQGICDEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEIKYLPDDSDDISVFQ FNELVYMECVIKESLRLFPSVPFIGRQCVEETVVNGMVMPKDTQISIHLYEIMRDARHFS NPDLFQPDRFFPENTVNRHPFAFVPFSAGQRNCIGQKFAILEIKVLLAAVIRNFKILPVT LLDDLTFENGIVLRTKQNIKVKLVHRENK* AC017771 = AC008324 Gene 2 78359-79584 and AC008003 AA567719 AA697618 AI517752, AL108342 21996 MWIALLGSSLLIGALWLLLRQLNKTYFILSLCKRVRTADGSPLESKVFVVPGKTRFGNNLDLNLTP 21811 21743 ANIFSYIRESTAKANGQNYIWNFLFAPEYNIVRAEDAEEIFQSTKITTKNMSYELIRPFL 21564 21563 GDGLLISI DQKWHTRRKTLTPAFHFNILQSFLSIF REESKKFIKILDKNVGFELELNQIIPQFTLNNIC 21203 ETALGVKLDDMSEGNEYRKAIHDFEIVFNQRMCNPLMFFNWYFFLFGDYKKY 21027 21026 SRILRTIHGFSSGIIQRKRQQFKQKQLGQVDEFGKKQRYAMLDTLLAAEAEGKID 20862 20861 HQGICDEVNTFMFGGYDTTSTSLIFTLLLLALHADVQERCYEELQDLPEDIDEVSMFQF 20685 20684 NELIHLECVIKESLRLFPSAPIIGRTCIEESVMNGLVLPKNAQISIHIYDIMRDARHFP 20508 20507 KPNQFLPERFLPENSVNRHPFAFVPFSAGPRNCIGQKFGVLEIKVLLAAVIRNFKLLPAT 20328 20327 QLEDLTFENGIVLRTQQNIKVKFEAR 20250 AC008325 Drosophila melanogaster chromosome 2 clone BACR05M06 (D855) RPCI-98 MWIALLGIPILLAVLTLLLKHINKTYFILSLTKRVRTEDGSPLESKVAIMPGKTRFGNNL DILNFTPASVFNFVRESTAKAKGQNYLWYFLYAPMYNVVRPEEAEEVFQSTKLITKNVVY ELIRPFLGDGLLISTDHKWHSRRKALTPAFHFNVLQSFLGIFK EECKKFLNVLEKNLDAELELNQVIPPFTLNNIC ETALGVKLDDMSEGNEYRKAIH Cyp6a2 M88009, S51248, U78088, AC007549 MFVLIYLLIAISSLLAYLYHRNFNYWNRRGLPHDAPHPLYGNMVGFRKNR VMHDFFYDYYNKYRKSGFPFVGFYFLHKPAAFIVDTQLAKNILIKDFSNF ADRGQFHNGRDDPLTQHLFNLDGKKWKDMRQRLTPTFTSGKMKFMFPTVI KVSEEFVKVITEQVPAAQNGAVLEIKELMARFTTDVIGTCRFGIECNTLR TPVSDFRTMGQKVFTDMRHGKLLTMFVFSFPKLASRLRMRMMPEDVHQFF MRLVNDTIALRERENFKRNDFMNLLIELKQKGRVTLDNGEVIEGMDIGE LAAQVFVFYVAGFETSSSTMSYCLYELAQNQDIQDRLRNEIQTVLEEQEG QLTYESIKAMTYLNQVISETLRLYTLVPHLERKALNDYVVPGHEKLVIEK GTQVIIPACAYHRDEDLYPNPETFDPERFSPEKVAARESVEWLPFGDGPR NCIGMRFGQMQARIGLAQIISRFRVSVCDTTEIPLKYSPMSIVLGTVGGI YLRVERI AL062684.1|CNS002GP Drosophila melanogaster genome survey sequence TET3 end of LAGXXXSTTMGFTLYELACNQDVQDKLRAEIXSVLERYNGKLEYDSMQDLFYME KVINESLRKHPVXAHLARIATKPYQHSNPKYFIEAGTGVLVSTLGIHHDPEXY PEPEKFIPERFDEEQVKRASHLRFPTFGAGPRNCIGLRFGRMQVIIGLALLIHNXRF EXHPKTPVPMKYTINNLLLGSEGGIHLNITKVVRD* AC010578 21500-20681 AA696094, AA803578, AA202305, AI546241 MAILLGLVVGVLTLVAWWVLQNYTYWKRRGIPHDPPNIPLGNTGELWRTMP LAGILKRTYLKFRKQTDGPFAGFYLYAMKYIVITDVDFVKTVLIRDFDKF HDRGVYHNDKDDPLTNNLATIEGQKWKNLRQKLTHTFTSAKMKSMFSTVL NVGDEMIRVVDEKISSSSQTLEVTDIVSRFTSDVIGICAFGLKCNSLRDP KAEFVQMGYSALRERRHGWLVDLLIFGMPKLAGELGFQFLLPSVQKFYMK IVQDTIDYRMKRKVTRNDFMDTLIDMKQQYDKGDKENGLAFNEVAAQAFV FFLAG 20681 AI063421 GH03219 AI532649 SD04231, 63% identical to AA696094 LIVLLIGVITFVAWYVHQHFNYWKRRGIPHDEPKIPYGNTSELMKTVHFA DIFKRTYNKLRNKTDGPFVGFYMYFKRMVVVTDIDFAKTVLIREFDKFHD RGVFHNERDDPLSANLVNIDGQKWKTLRQKLTPTFTSGKMKTMFPTILTV GDELIRVFGETASADSDSMEITNVVARFTADVIGSCAFGLDCHSLSDPKAKF AA951440 LD31895, AA816508 LD01943, AA201305 LD04267 MLDVVALLLIALAVGFWFVRTRYSYWTRRGIGSEPARFPVGNMEGFRKNKHFI DIVTPIYEKFKGNGAPFAGFFMMLRPVVLVTDLELAKQILIQDFANFEDR GMYHNERDDPLTGHLFRIDGPKWRPLRQKMSPTFTSAKMKYMFPTVCEVG EELTQVCGELADNAMCGILEIGDLMARYTSDVIGRCAFGVECNGLRNPEA EFAIMGRRAFSERRHCKLVDGFIESFPEVARFLRMRQIHQDITDFYVGIV RETVKQREEQGIVRSDFMNLLIEMKQRGELTIEEMAAQAFIFFAAGFDTS ASTLGFAXYELAKQP AC010578 Drosophila melanogaster chromosome 2 clone BACR03K23 (D1086) RPCI-98 EFLAQAIIFLGAGFETSSTTMGFGIYELGRNQDVQDKLREEIGNVFGKHNKEFTYEGIKEMKYLEQVVMETLRKYPVLAHLTRMTD TDFSPEDPKYFIAKGTIVVIPALGIHYDPDIYPEPEIFKPERFTDEEIAARPSCTWLPFGEGPRNCIGLRFGMMQTCVGLAYLIRG YKFSVSPETQIPMKIVVKNILISAENGIHLKVEKLAK* L46858, AL061295 (BACR004K24) 52% identical to cyp6a5 EFTYDSMQELRYMELVIAETLRKYPILPQLTRISRHLYAAKGDRHFYIEP GQMLLIPVYGIHHDPALYPEPHKFIPERFLADQLAQRPTAAWLPFGDGPR NCIGMRFGKMQTTIGLVSLLRNFHFSVCPRTDPKIEFLKSNILLCPAHGI YLKVQQLSQMSS* AL061650.1|CNS00613 Drosophila melanogaster genome survey sequence TET3 end of TLRGXPLLPRLTRFSGLLYAARGVRLFXFGPGLLLLXPVYGIXXVPALXPXPHRFI PERX 539 LAGRLAPRPAAAWLPFGVGPRXCVGMGFGRVPAAVGLVGLLRIFRFGVCPRPGP GVAFLR 359 SPFLLCPAXGFCLGVPRL 305 AA801503 HL02667 I-helix and before, 53% identical to Cyp12a2 TVQEYRSPNGFLLRLGRETSLYRYIPTPTYKKFSRAMDEIFDT CSMYVNQAIERIDRKSSQGDSNDHKSVLEQLLQIDRKLAVVMAMDMLMGG VDTTSTAISGILLNLAKNPEKQQRLREEVLSKLTSLHSEFTVEDMK AC007137.10|AC007137 Drosophila melanogaster chromosome 2 clone BACR25C02 MWLLLPILLYSAVFLSVRHIYSHWRRRGFPSEKAGITWSFLQKAYRREFR HVEAICEAYQSGKDRLLGIYCFFRPVLLVRNVELAQTILQQSNGHFSELK WDYISGYRRFNLLEKLAPMFGTKRLSEMFGQVQKVGDHLIHHLLDRQGQG CPQEVDIQQKLRV*YSVNIIANLIYGLDINNFEHEDHILTSYLSHSQASIQS* FTLGRLPQKSSYTYRLRDLIKQSVELREDHGLIRKDILQLLVRFRN*NRLM VSFEVCIVKSCTNSLFLDADKLLSIKRLAKVAEDLLKVSLDAVASTVTFT LLEILQEPLIVEKLRAEIKELSNENGQLKFEELNGLRYMDMCLK*ETLRK YPPLPIIERVCRKSYSLPNSKFTIDEGKTLMVPLLAMHRDEKYFSEPMKY KPLRFLQTANDVGQCEDKTKSNVFIGFGIGGSQCVGTYRIQIILAMFRQY FSHRNYRTELCKVGN* Cyp6a8 L46859, AL054065, AI258590 LP01819, AI107730 GH05558 MALAYILFQVAVALLAILTYYIHRKLTYFKRRGIPFVAPHLIRGNMEELQ KTKNIHEIFQDHYNKFRESKAPFVGFFFFQSPAAFVIDLELAKQILIKDF SNFSNKGIFYNEKDDPISAHLFNLDGAQWRLLRNKLSSTFTSGKMKLMYP TVVSVANEFMTVMHEKVPKNSVLEIRDLVARFTVDVIGTCAFAIQCNSLR DEKAEFLYFGKRSLVDKRHGTLLNGFMRSYPKLARKLGMVRTAPHIQEFY SRIVTETVAVREKEHIKRNDFMDMLIELKNQKEMTLENGDVVRGLTMEEV LAQAFVFFIAGFETSSSTMGFALYELAKNPDIEQHDQNFTYECTKDLKYL NQVLDETLRLYTIVPNLDRMAAKRYVVPGHPNFVIEAGQSVIIPSSAIHH DPSIYPEPFEFRPERFSPEESAGRPSVAWLPFGDGPRNCIGLRFGQMQAR IGPALLIRNFKFSTCSKTPNPLVYDPKSFVLGVKDGIYLKVETV AC008285 AC017567 Drosophila melanogaster chromosome 3 clone BACR31P16 (D1002) MQLTYFLFQVAVALLAIVTYILHRKLTYFKRRGIPYDKPHPLRGNMEGYKKTRTVHEIHQ EYYNKYRNSKAPFVGFYLFQKPAAFVIDLELAKQILIKNFSNFTDKGIYYNEKDDPMSAH LFNLDGPQWRLLRSKLSSTFTSGKMKFMYPTVVSVAEEFMAVMHEKVSENSILDVRDLVA RFTVDVIGTCAFGIKCNSLRDEKAEFLHFGRRALLDSRHGNLVSGLMRSYPNLARRLGLC RNTAQIQEFYQRIVKETVTLREKENIKRNDFMDMLIGLKNQKNMTLENGEVVKGLTMDEI VAQAFVFFIAGFDTSSSTMGFALYELAKNPSIEQHDQKFTYECIKDLKYLDQVLSETLRH YTIVPNVDRVAAKRFVVPGNPKFVIEAGQSVIIPSSAIHHDPSIYPEPNEFRPERFSPEE SAKRPSVAWLPFGEGPRNCIGLRFGQMQARIGLAMLIKNFTFSPCSATPDPLTFDPHSAI LLGIKGGIQLKVEAI* AC004721 AC017455 Cyp6a16 DS03308 complete 2-455 comp(2956-5070) 47% to 6a8 MDFTLLLLTSLLSFLLGYLRYRFTYWELRGIPQLRPHFLFGHFFRLQSVHYSELLQETYDAF RGSAKVAGTYVFLRPMAVVLDLDLVKAVLIRDFNNFVDRRSFHGDPLTANLFNLQGEEWRNL RTKLSPTFTSGKMKYMFGTVSTVAQQLGGTFDELVAVLELHDLMARYTTDVI GSCAFGTECSSLREPQAEFRQVGRRIFRNSNRSIRWRIFKMTYLSSLAKLGLPVRI LHPDITKFFNRIVRETVELRERENIRRNDFMDLLLDLRR KGLTMEQMAAQAFVFFVAGFETSSSNMSYALFELAKNQDVQQKLRMEINDSIGKHG KLTYEAMMEMPYLDQTI PETLRKYPALSSLTRLASEDYEIPSPDGGDPVVLEKGTSVHIPVLAIHYDPEVYPEP HEFRPERFAPDACRERHPTAFLGFGDGPRNCIGLRFGRMQVKVGLITLLR RFRFSLPPGSPTQLKVTKRNLILLPSDGVRLQVDPVESRLM* Cyp6a9 L46860, AL054861, AL053264, AL072094, AL055555 MGVYSVLLAIVVVLVGYLLLKWRRALHYWQNLDIPCE EPHILMGSLTGVQTSRSFSAIWMDYYNKFRGTGPFAGFYWFQRPGILVLD ISLAKLILIKEFNKFTDRGFYHNTEDDPLSGQLFLLDGQKWKSMRSKLSS TFTSGKMKYMFPTVVKVGHEFIEVFGQAMEKSPIVEVRDILARFTTDVIG TCAFGIECSSLKDPEAEFRVMGRRAIFEQRHGPIGIAFINSFQNLARRLH MKITLEEAEHFFLRIVRETVAFREKNNIRRNDFMDQLIDLKNSPLTKSES GESVNLTIEEMAAQAFVFFGAGFETSSTTMGFALYELAQHQDIQDRVRKE CQEVIGKYNGEITYESMKDMVYLDQVISETLRLYTVLPDLNRECLEDYEV PGHPKYVIKKGMPVLIPCGAMHRDEKLYANPNTFNPIFFARTSEGSDSVE WLPFGDGPRLCIGMRFGQMQARSGLALLINRFKFSVCEQTTIPIVYSKKT FLISSETGIFLKVERV AL072844 Drosophila melanogaster genome survey sequence TET3 end LLIPTAAIHMDPGIYENPQRFYPERFXEQAXRSRPAAAFLPXGDGLRGCIAARFAEQ QLLVGLVALLRQHRYAPSAETSIPVEYDNRRLLLMPKSDIKLSVERVDXL* AC008288 = AL072844 AC009342 AC017706 MDLMHRTLLTALGELSVVYALVKFSLGYWKRRGILHEKPKFL WGNIKGVVSGKRHAQDALQDIY TAYKGRAPFVGFYACLKPFILALDLKLVHQIIFTDAGHFTSRGLYSNPSGEPLSHNLLQLNGHKWRSLHAKSAEVFTPANMQKLLV RLSQISSRIQRDLGEKSLQTINISELVGAYNTDVMASMAFGLVGQDNVEFAKWTRNYWADFRMWQAYLALEFPLIARLLQYKSYAE PATAYFQKVALSQLQLHRRRDRQPLQTFLQLYSNAEKPLTDIEIAGQAFGFVLAGLGPLNATLAFCLYELARQPEVQDRTRLEINK ALEEHGGQVTPECLRELRYTKQVLNETLRLHTPHPFLLRRATKEFEVPGSVFVIAKGNNVLIPTAAI HMDPGIYENPQRFYPERFEEQARRSRPAAAFLPFGDGLRGCIAARFAEQQLLVGLVALLRQHRYAPSAETSIPVEYDNRRLLLMPK SDIKLSVERVDKL* AL069964 Drosophila melanogaster genome survey sequence T7 end of MSVGTVLLTALLALVGYLLMKWRSXMRHWQDLGIPCEEPHILMG SMKGVRTARSFNEIWTSYYNKFRGSGPFAGFYWFRRPAVFVLEK SLXKQILIKEFNKFTDRGXFHNPEDDPLSGQLFLLDGQKWRTMR NSTSSTFTSGKMKY Cyp6a9 HOMOLOG Drosophila grimshawi U87164 DPEAEFRIMGRKSLTDQRHGNLGNALLNGFPNFSRRIHMKLTPEHIEKFF MRIVKETVDYREKNNVRRNDFMDQLIDLKNKPLMKSETGESMNLTIEEIS AQALVFFAAGFETSSTTMGFALYELARAEDVQNRLRKECNEVLARHNGDL TYESIKDMKYLDQVISETLRLYTVLPILNRQCLEDYVVP Cyp6a13 Drosophila melanogaster AC005457, DS08616, AA941155 LD25139 MLTLLVLVFTVGLLLYVKLRWHYSYWSRRGV AGERPVYFRGNMSGLGRDLHWTDINLRIYRKFRGVERYCGYFTFMTKSLFIMDLELIRDI MIRDFSSFADRGLFHNVRDDPLT GNLLFLDGPEWRWLRQNLTQVFTSGKMKFMFPNMVEVGEKLTQACRLQVGEIEAKD LCARFTTDVIGSCAFGLECNSLQDPESQFRRMGRSVTQEPLHSVLVQAFMFAQPELARKL RFRLFRPEVSEFFLDTVRQTLDYRRRENIHRNDLIQLLMELGEEGVKDALSFEQIAAQALV FFLAGFDTSSTTMSFCLYELALNPDVQERLRVEVLAVLKRNNQKLTYDSVQEMPYLDQ VVAETLRKYPILPHLLRRSTKEYQIPNSNLILEPGSKIIIPVHSIHHDPELYPDPEKFDPSRFE PEEIKARHPFAYLPFGEGPRNCIGERFGKLQVKVGLVYLLRDFKFSRSEKTQIPLKFS SRNFLISTQEGVHLRME AC009844 65863-64467 45% to 6a2 same as AL097801, AL057750 MAVMIVLLIGVITFLAWYVHQHFNYWKRRGI FPR*APKLPTVIPAYLMKTRPFCGYFSRDPPTKLRTKPAGPFVGFLYVFQEDW*L*PNIDSAKPE LIREFDKFPVGGVFHNERED PLSATLVNIDGQKWKPLRQKLTPTFTSGKMKTMFPTILTVGDELIRVFGETASADSDSME ITNVVARFTADVIGSCAFGLDCHSLSDPKAKFVQMGTTAITERRHGKSMDLLLFGAPELA AKLRMKATVQEVEDFYMNIIRDTVDYRVKNNVKRHDFVDMLIEMKLKFDNGDKENGLTFN EIAAQAFIFFLAGFETSSTTMGFALYELACHQDIQDKLRTEINTVLKQHNGKLDYDSMRE MTYLEKVITETMRKRPVVGHLIRVATQHYQHTNPKYNIEKGTGVIVPTLAIHHDPEFYPE PEKFIPERFDEDQVQQRPXCTFLPFGDGPRNCIGLRFGRMQVIVGXALLIHNFKF AC009844 28950-28363 and 42575-43341 same sequence VLEIVDLVARYTPDVIGNCAFGLNCNSLQNPNAEFVTIGKRAIIERRYGGLLDFLIFGFP 28771 KLSRRLRLKLNVQDVEDFYTSIVRNTIDYRLRTNEKRHDFMDSLIEMYEKEQAGNT 28603 EDGLSFNEILAQAFIFFVAGFETSSTTMGFALYELALDQDIQKHNNEF 28423 TYEGIKEMKYLEQVVMETLRKYPVLAHLTRMTQTDFSPEDPKYFIPKGTTGVIPALGIHYDPEIYPEP GEVKPERLTDEAIAARPSCTWL AC009844 44542-44881 MLLLALIVVILSLLVFAARRRHGYWQRRGIPHDVPHPIYGNIKDWPKKRHIAMIFRDYYFK YKRSVYPFAGFYFFFTRSAVITDLELVKRVLIKDFNHFENRGIFYNEIDDPLS AC009844 35993-35161 IERFFMRIVRETVAFREQNNIRRNDFMDQLIDLKNKPLMVSQSGESVNLTIEEIAAQAF 35817 VFFAAGFETSSTTMGFALYELAQNQDIEKCNGELNYESMKDLVYLDQ 35637 ETLRLYTVLPVLNRECLEDYEVPGHPKYVIKKGMPVLIPCGAMHRD 35440 EKLYANPNTFNPDNFSPERVKERDSVEWLPFGDGPRNCIGMRFGQMQARIGLALLIKDFK 35260 FSVCEKTTIPMTYNKEMFLIASNSGIYLKAERV 35161 AC009844 82008-82526 KGLYCNQKSDPLSGDLYALRGESWKEMRQKLDPSLEGDRMSLLYDCLYEEAEQLLLTVNS 82187 TLMSQPHSTVHIQKIMRRYVLSSLAKCVFGLNAEQRKTYPLEDFEQMTELALNSHKHGYL 82367 MNLMMIRVPNFCRMLRMRRTPKQAEEYFIKLLTSIVEQRETSGKPQKDYLQLL 82526 AC009844 90483-90241 = AL069964 MSVGTVLLTALLALVGYLLMKWRSTMRHWQDLGIPCEEPHILMGSMKGVRTARSFNEIWTSYYNKFRGSGPFAGFYWFRRP AVFVLEKSLGKQILIKEFNKFTDRGFFHNPEDDPLSGQLFLLDGQKWRTMRNSTSSTFTSGK 6a9 MGVYSVLLAIVVVLVGYLLLKWRRALHYWQNLDIPCE EPHILMGSLTGVQTSRSFSAIWMDYYNKFRGTGPFAGFYWFQRPGILVLD ISLAKLILIKEFNKFTDRGFYHNTEDDPLSGQLFLLDGQKWKSMRSKLSS TFTSGKMKYMFPTVVKVGHEFIEVFGQAMEKSPIVEVRDILARFTTDVIG TCAFGIECSSLKDPEAEFRVMGRRAIFEQRHGPIGIAFINSFQNLARRLH MKITLEEAEHFFLRIVRETVAFREKNNIRRNDFMDQLIDLKNSPLTKSES GESVNLTIEEMAAQAFVFFGAGFETSSTTMGFALYELAQHQDIQDRVRKE CQEVIGKYNGEITYESMKDMVYLDQVISETLRLYTVLPDLNRECLEDYEV PGHPKYVIKKGMPVLIPCGAMHRDEKLYANPNTFNPIFFARTSEGSDSVE WLPFGDGPRLCIGMRFGQMQARSGLALLINRFKFSVCEQTTIPIVYSKKT FLISSETGIFLKVERV AC009844 694-846 WFGFGVGARSCIGIQFAQLQLRLALALLLSEYEFSLNTRKPLINLEDGIAL AL057750.1|CNS00162 Drosophila melanogaster genome survey sequence TET3 end of YVIGKFSXGLDCHSXSDPKAKFVQMGTTAITERRHGKSMXLLLFGAPELAAXX RMKATVQEVEDFYMNIIRDTVDYRVKNNVKRHDFVDMLIEMKLKFDN GDKENGLTFNEIAAQAFIFFLAGFETSSTTMGFALYELACHQDIQDKLRTEINTVLKQHN GKLDYDSMREMTYLEKVITETMRKRPVVGHLIRVATQHYQHTNPKYNIEKGTGVIVPTLA IHHDPEFYPEPEKFIPERFDEDQVQQRPXCTFLPFGDGPRNCIGLRFGRMQVIVGXALLI HNFKF Cyp6a14 AC005457 P1 clone DS08616 AC007085 (BACR21H10) 16539-17204 MLFTIALVGVVLGLAYSLHIKIFSYWKRKGVPHETPLPIVGNMX 13699 RSTTSAISTKEFIRSLRGRVPSPECTFFKRTALITDLDFIKQVMIKDFSYFQ 13868 DRGAFTNPRDDPLTGHLFALEGEEWRAMRHKLTPVFTSGKIKQMSKVIVDVGLRLGDAM 14045 DKAVKEAKVEEGNVEIKDLCARFTTDVIGSCAFGLECNSLQDPSAEFRQKGREIFTRR 14219 RHSTLVQSFIFTNARLARKLRIKVLPDDLTQFFMSTVKNTVDYRLKNGIKRNDFIEQMIE 14399 LRAEDQEAAKKGQGIDLSHGLTLEQMAAQAFVFFVAGFETSSSTMSLCLYELALQPDIQQ 14579 RLREEIESVLANVDGGELNYDVLAQMTYLDQVLSETLRKHP 14759 LLPHLIRETTKDYQIPNSDIVLDKGILALIPVHNIHHDPEIYPEPEKFDPSRFDPEEVKN 14939 RHPMAYLPFGDGPRNCIGLRFGKIQAKIGLVSLLRRFKFSVSNRTDVPLIFS 15095 KKSFLLTTNDGIYLKVE Cyp6a15p AC007085 Drosophila melanogaster GenEMBL AC005457 P1 clone DS08616 AFTVTNSKLAKKLKMKILRDDLTDFFLSVVKPALSGMTLWTSPPSGGRSSRQGGSKFDLS 9960 HNWTLEQMAAQAIVFFLAGFETSSSTMSSCKYELALQPEI*NQIRDEIERVLEGNAITYDALAK 9768 INYPEQVLSETLRKHPIQLIKFLLETQES 9681 FRVRNTELIVEKGTSLLIPVHSVHYDPHLYPHPKLFDSSRLKAYKSNSRHPFAYLPFGTF 9467 GPRSCIGLRFGKMQAKIGIVSLCQRFKFGDSDLTDIPLASDTRSAIVERI* 9322 Cyp6a17 AL052842 (BACR001L16), AL074108 (BACR035C16) 49% with 6a2 AA699131 MLLLALIVVILSLLVFAARRRHGYWQRRGIPHDEVHPLFGNIKDWPNKRHIAEIFR DYYFKYKNSDYPFAGFYFFFTRTAVVTDMELLKRVLIKDFNHFENRGVFYNEIDDPLSATL FSIEGQKWRHLRHKLTPTFTSGKMKNMFPIVVKVGEEMDKVFRSKTAADRGQVLEVV DLVARYTADVIGNCAFGLNCNSLYDPKAEFVSIGKRAITEHRYGNMLDIFLF GFPKLSRRLRLKLNIQEAEDFYTKIVRETIDYRLRTKEKRND 338 FMDSLIEMYKNEQSGNSEDGLTFNELLAQAFIFFVAGFETSSTTMGFALYELARNQD 509 VQDKLREEIGNVFGKHNKEFTYEGIKEMKYLEQVVMETLRKVPVLAHLTR MTDTDFSPEDPKYFIAKGTIVVIPALGIHYDPDIYPEPEIFKPE 263 RFTDEEIAARPSCTWLPFGEGPRNCIGLRFGMMQTCVGLAYLIRGYKFSVSPETQIPMK 86 IVVKNILISAENGIHLKVEKLA AC007594 sequence A AI257340 AI259899 one small segment is from seq. B MLWEFFALFAIADALLYRWASANNDFFKDRGIAYEKPELYFGNMAGMFLRKRAMFDIVCD 205 LYTKGGSKKFFGIFEQRQPLLMVRDPDLIKQITIKDFDHFINHRNEFATSSDDDPHDMSN 385 LFGSSLVSMRDARWKDMRSTLSPAFTGSKMRHMVQLMNHEA 508 KEAVDCLKQDDSRVQENELDMKDYCTRFTNDVIASTAFGLQVNSFKDRENTFY QMGKKLTTFTFLQSMKFMLFFALKGLNKILKVELFDRKSTQYFVRLVLDAMKYRQEHNIV 180 RPDMINMLMEARGIIQTEKTKASAVREWS DRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKELTYEA 37807 IMGMKYLDQVVNEVLRKWPAAIAVDRECNKDITFDVDGQKVEVKKGDVIWLPTCGFHRDP 37987 KYFENPMKFDPERFSDENKESIQPFTYFPFGLGQRNCIGSR 38167 FALLEAKAVIYYLLKDYRFAPANKSCIPLKLITSGFQLSPKGGFWIKLVQR 38320 AC007594 sequence B (pseudogene) AA567377 AA698035 MLWEFFALFAIAAALFYRWASANNDFFKDRGIAYEKPVLYFGNMAGMFLRKRAMFDIVCD 39306 LYTKGGSK 39330 KFFGIFEQRQPLLMVRDPDLIKQITIKDFDHFINHRNVFATSSDDDPHDMSNLFGSSLFS 32428 MRDARWKDMRSTLSPAFTGSKMRQMFQLMNQVAKEAVDCLKQDDSRVQENELDMKDYCTR 32248 FTNDVIASTAFGLQVNSFKDRENTFYQMGKKLTTFTFLQSMKFMLFFALKGLNKI LKVELFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQTEKTKASAVREWS 31842 DRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKEL 31674 KYLDQVVSEVLRKWPPAIAFDRECNKE 31593 EAKAVIYYLLKDYRFAPAKKSCIPLELISSGFQLSPKGGFWIKLVQR 31455 AC009741 Drosophila melanogaster chromosome 3 clone BACR44K17 (D976) RPCI-98 QRQPLLMVRDPDLIKQITIKDFDHFINHRNEFDTSSDDDPHDMSNLFSSSLFSMRDARWK 146378 DMRSTLSPAFTGSKMRQMFQLMNQVAKEAVDCLKQDDSRVQENELDMKDYCTRFTNDVIA 146558 STAFGLQVNSFKDRENTFYQMGKKLTTFTFLQNMKFILLFALKSLNK 146699 ILKVEIFDRKSTQYFVRLVLDAMKYRQEHNIGRPDMINML 146887 Cyp6d2 AC004377 DS00837 MWTILLTILIAGLLYRYVKRHYTHWQRLGVDEEPAKIPFGVMDTVMKQER SLGMALADIYARHEGKIVGIYMLNKRSILIRDAQLARQIMTSDFASFHDR GVYVDEDKDPLSANLFNLRGASWGSVIFGLEIDSFRNPKNEFREISSSTS RDESLLLKIHNMSMFICPPIAKLMNRLGYESRILTSLRDMMKRTIEFREE HNVVRKDMLQLLIRLRNTGKIGEDDDQVWDMETAQEQLKSMSIEKIAAQA FLFYVAGSESTAAASAFTLYELSMYPELLKEAQEEVDAVLMKHNLKPKDR FTYEAVQDLKFLDICIMETIRKYPGLPFLNRECTEDYPVPGTNHIIAKGT PILISLFGMQRDPVYFPNPNGYDPHRFDSNNMNYDQAAYMPFGEGPRHCI GKALRMGKVNSKVAVAKILANFDLVQSPRKEVEFRFDAAPVLVTKEPLKL RLTKRK* AC007440 comp(89081-89665) region also on same fragment Cyp6g1 and one other MLLIWLLLLTIVTLNFWLRHKYDYFRSRGIPHLPPSSWSPMGNLGQLLFL RISFGDLFRQLYADPRNGQAKIVGFFIFQTPALMVRDPELIRQVLIKNFN NFLNRFESADAGDPMGALTLPLAKYHHWKESRQCMSQLFTSGRMRDVMYS QMLDVASDLEQYLNRKLGDRLERVLPLGRMCQLYTTDVTGNLFYSLNVGG LRRGRSELITKTK ELFNTNPRKVLDFMSVFFLPKWTGVLKPKVFTEDYARYMRHLVDDHHEPTKGDLINQLQHFQLSRSSNHYSQHP DFVASQAGI ILLAGFETSSALMGFTLYELAKAPDIQERLRSELREAFISTATLSYDTLMTLPYLKMVCLEALRLYPAAAFVNRECTSSASEG FSLQPHVDFI*VMIIPTYLCQSKF QFWPEPGVFDPKRFGPERSRHIHPMTYIPFGAGPHGCIGSRLGVLQLKLGIVHILKQYW VETCERTVSEIRFNPKSFMLESENEIYLRFCRSSL* AC007441 AC018176 AI403829 chromosome 3 clone BACR10E03 (D690) RPCI-98 50% to 6D1 MIGIYLLIAAVTLLYVYLKWTFSYWDRKGFPSTGVSIPFGALESVTKGK RSFGMAIYDMYKSTKEPVIGLYLTLRPALLVRDAQLAHDVLVKDFASFHD RGVYVDEKNDPMSASLFQMEGASWRALRNKLTPSFTSGKLKAMFETSDSV GDKLVDSIRKQLPANGAKELELKKLMATYAIDIIATTIFGLDVDSFADPN NEFQIISKKVNRNNIEDIIRGTSSFL LEKFFVKIGWKQEATERMRELSNRTVDL REQNNIVRKDLLQLLLQLRNQGKINTDDNIWSAESTKNGVKSMSKDLIAGQLFLFYVAGYETTASTTSFT LYELTQNPEVMEKAKEDVRSAIEKHGGKLTYDAISDMKYLEACILETARKYPALPLLNRICTKDYPVPDSKLVIQKGTPIIISLIG MHRDEEYFPDPLAYKPERYLENGKDYTQAAYLPFGEGPRMCIGARMGKVNVKIAIAKVLSNFDLEIRKEKCEIEFGVYGIPLMPKS GVPVRLSLKK* AC015208 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered MELVLLILVASLIGIAFLALQQHYSYWRRMGVREIRPKWIVGNLMGLLNMRMSPAEFISQLY NHPDAENEPFVGIHVFHKPALLLRDPEMVRNILVKDFAGFSNRYSSSDPKGDPLGSQNIF FLKNPAWKEVRLKLSPFFTGNRLKQMFPLIEEVGASLDAHLRQQPLHNERMRCFDLEAK ELCALYTTDVIATVAYGVSANSFTDPKCEFRRHGRSVFEFNLLRAAEFTLVFFLPHLVPF VRFKVVPAEATRFLRKTINYVMSEREKSGQKRNDLIDILIEFRRSTQLAKASGIKDQF VFEGDILVAQAVLFFTAGFESSSSTMAFAMYELAKDTDVQQRLREEIKDALVESGGQVTL KMIESLEFMQMILLEVLRMYPPLPFLDRECTSGRDYSLAPFHKKFVVPKGMPVYIPC YALHMDPQYFPQPRKFLPERFSPENRKLHTPYTYMPFGLGPHG CIGERFGYLQAKVGLVNLLRNHMITTSERTPHRMQLDPKAIITQAKGGIHLRLVRDALGV* AF083946 Cyp6g1 = AC007440, AL065705, AA698945, AI402390 GH21606, AI403823 MVLTEVLFVVVAALVALYTWFQRNHSYWQRkgipYIPPtpiigNTKVVFK MENSFGMHlSEIYNDPRLKDEAVVGIYSMNKPGLIIRDIELIKSILIKDF NRFHNRYarcdphgdplgynNLFFVRDAhwkgiRTKLTPVFTSGKVKQMY TLmQEIGKDLELALQRRGEKNSGSFITEIKEICAQFSTDSIATIAFGIRA NSLENPNAEFRNYGRKMFTFTVARAKDFFVAFFLPKLVSLMRIQFFTADF SHFMRSTIGHVMEERERSGLLRNSLIDVLVSLRKEAAAEPSKPHYAKNQD FLVVSAGVFFTAGFETSSSTMSFALYEMAKHPEMQKRLRDEINEALVEGG GSLSYEKIQSLEYLAMVVDEVLRMYPVLPFLDREYESVEGQPDLSLKPFY DYTLENGTPVFIPIYALHHDPKYWTNPSQFDPERFSPANRKNIVAMAYQP FGSGPHNCIGSRIGLLQSKLGLVSLLKNHSVRNCEATMKDMKFDPKGFVH QADGGIHLEIVNDRLYDQSAPSLQ AC008257 AC014226 AL062352 AL054245 AI108091 AI113367 AI064259 AI064268 36% with 6a2 43% identical to 6g1 possible frameshifts after EKDRRKA and GEDV MLLLLLLGSLTIVFYIWQRRTLSFWERHGVKYIRPFPVVGCTREFLTAKVPFFEQIQKFH EAPGFENEPFVGVYMTHRPALVIRDLELIKTVMIKKFQYFNNRVLQTDPHNDALGYKNLF FARSPGWRELRTKISPVFTSGKIKQMYPLMVK*IGKNLQDSAERLGSGTEVQVKDLCSRF TTDLIATIAFGVEANALQDAKSEFFYHNRAIFSLTLSRGIDFAIIFMIPALASLARVKLF SRETTKFIRSSVNYVLKEREKDRRKA*RNDLIDILLALKREAAANPGEDV*KEVDLDYLV AQAAVFQTAGFETSASTMTMTLYELAKNEALQDRLRQEIVDFFGDEDHISYERIQEMPYL SQVVNETLRKYPIVGYIERECSQPAEGERFTLEPFHNMELPHGMSIYMSTVAVHRDPQYW PDPEKYDPERFNSSNRDNLNMDAYMPFGVGPRNCIGMRLGLLQSKLGLVHILRNHRFHTC DKTIKKIEWAPTSPETFSTRRIISRFEAITGPAN* AC015002 = Dm0590 STS from DS07966-Sp6 in vector ad10sacBII, AL058810 MVYSTNILLAIVTILTGVFIWSRRTYVYWQRRRVKFVQPTHLLGNLSRVLRLEESFALQLRRFY 11344 FDERFRNEPVVGIYLFHQPALLIRDLQLVRTVLVEDFVSFSNRFAKCDGRSDKMGALSLF 11164 LAKQPEWREIRTRLAPAFAGAKLKQMFSLMEE IGCDLEWYLKRLTRDLRRGDAERGAIVSIKDVCDLYNTDMIASIAFGLRSYSLRNTQSE 9947 IGSHCQDLFRPNVRRIIDLFVIFYLPKLVPLLRPKLFTEPHAEFLRRVIQLVIEERERGG 9767 DLRNDLIEMLLTLKKEADLQQDKSHFTHHRDFLAAQAASFEVAGIETCSASMSFALYELA 9587 KQPLMQSRLRREIREAFASNPNGRLTYEAVARMEFLDMVVEETLRKYPIVPLLERECTPI 9407 NKKRFYSLRPHAECYTRRGMPVFISNLAIHHDPK 9305 YWPDPDRFDPERFSAANKALQAPMSYMPFGAGPRNCIGMQIGLLQIK 8830 LGLVYFLHQHRVEICDRTVERIQFDAKFALLASEQRIYLKVDCL* 8695 AC006469 Cyp9b1 chromosome 2R DS02730 and DS07472 also U34324 MSFVEICLVLATIGLLLFKWSTGTFKAFEGRNLYFEKPYPFLGNMAASAL QKASFQKQISEFYNRTRHH*KLVGLFNLRTPMIQINDPQLIKKICVKDFD HFPNHQTLNIPNERLVNDMLNVMRDQHWRNMRSVLTPVFTSAKMRNMFTL MNESFAQCLEHLKSSQPIAAGENAFELDMKVLCNKLSNDVIATTAFGLKV NSFDDPENEFHTIGKTLAFSRGLPFLKFMMCLLAPKVFNFFKLTIFDSTN VEYFVRLVVDAMQYREKHNITRPDMIQLLMEAKKESKDNWTDDEIVAQCF IFFFAAFENNSNLICTTAYELLRNLDIQERLYEEVKETQEALKGAPLTYD AAQEMTYMDMVISESLRKWTLSAAADRLCAKDYTLTDDEGTKLFEFKAGD NINIPICGLHWDERFFPQPQRFDPERFSERRKKDLIPYTYLPFGVGPRSC IGNRYAVMQAKGMLYNLMLNYKIEASPRTTRDMWESARGFNIIPTTGFWM QLVSRK* AC006469 Cyp9b2 DS02730 and DS07472 also U34325, AI402118 MALIEICLALVVIGYLIYKWSTATFKTFEERKLYFEKPYPFVGNMAAAAL QKSSFQRQLTEFYERTRQH*KLVGFFNMRTPMITLNDPELIKKVCVKDFD HFPNHQPFITSNDRLFNDMLSVMRDQRWKHMRNTLTPVFTAAKMRNMFTL MNESFAECLQHLDSSSKTLPGRKGFEVDMKVMCNKLSNDIIATTAFGLKV NSYDNPKNEFYEIGQSLVFSRGLQFFKFMLSTLVPKLFSLLKLTIFDSAK VDYFARLVVEAMQYREKHNITRPDMIQLLMEAKNESEDKWTDDEIVAQCF IFFFAAFENNSNLICTTTYELLYNPDVQERLYEEIVETKKALNGAPLTYD AVQKMTYMDMVISESLRKWTLAAATDRLCSKDYTLTDDDGTKLFDFKVGD RINIPISGLHLDDRYFPEPRKFDPDRFSEERKGDMVPYTYLPFGVGPRNC IGNRYALMQVKGMLFNLLLHYKIEASPRTIKDLWGSASGFNFTPRSGFWM HLVPRK* Cyp9b3 Drosophila mettleri AF083945 MDLILLLSIVGLIYFVYKWATARHNEFELRGLPFEKPLPIFGNN AAVVTGRASFQKSSPSSIARTRQHKMVGFFNFRTPMIQLNDPEIIKKITVKDFEYFP NHQLFFTTEERLINDMLSVMKDQRWKHMRNTLTPVFTSAKMRSMFSLMNESFAEC MDHLDQMSKTAVKPGGSFELELKEVCNRLSNDLIATTAFGLKVSSYKKPDNDFYEIGKSI VFFRGKALYKLFACPTTLPAVFKLLGFKIFDAQKTDFFIRLVVDAMKYREENNIVRPD MIQLLMEAKKESTEHWSDDELVAQCFIFFFAAFENNASLICTTAYELLNNPDVQQRLY EEVQETYDALKGEMLTYDAVTKMKYMDLVASESLRKWTLAASTDRECAKDYTLYDD DASKLFEFKAGDRINIPIVGLHLDDKFFPEPHKFIPERFSDGNKDQIVPYTYLPFGAGPRN CIGNRYALMQAKAMLYNLVLKYKIERSPKTVKDLLSDSRGFQLTPQSGYWVHLVPRK Cyp9c1 U34326, AL063862 (BACR007C22), AL058497 (BACR024A05) MVFVELSIFVAFIGLLLYKWSVYTFGYFSKRGVAHEKPIPLLGNIPWSV LMGKESYIKHSIDLHLRLKQHKVYGVFN*LRDPLYYLSDPELIRQVGIK NFDTFTNHRKGITEGFNDTSVISKSLLSLRDRRWKQMRSTLTPTFTSLK IRQMFELIHFCNVEAVDFVQRQLDAGTSELELKDFFTRYTNDV IATAAFGIQVNSFKDPNNEFFSIGQRISEFTFWGGLKVMLYILMPKLMK VKTSPLHSFF NVDYFKKLVFGAMKYRKEQSIVRPDMIHLLMEAQRQFKAEQEGSAESAAQ QDKAEFNDDDLLAQCLLFFSAGFETVATCLSFTSYELMMNPEVQEKLLAE ILAVKEQLGEKPLDYDTLMGMKYLNCVVSESLRKWPPAFIVDRMCGSDFQ LKDEEGEVVVNLREDDLVHINVGALHHDPDNFPEPEQFRPERFDEEHKHE IRQFTYLPFGVGQRSCIGNRLALMEVKSLIFQLVLRYHLKPTDRTPA DMMSSISGFRLLPRELFWCKLESRGPA* Cyp9f1 Drosophila mettleri AF083947 MLVEFLALSVVVLLLAYRWATANYNFFKERGIPYHKPYPFVGNMGKMLLR QKSMFDLIVELYNRGDSKVFGIFEQRKPLLMIRDPELVKQITIKDFDHFI NHRNIFGVDNNDPHDMDNLFGSSLFSMRDARWKDMRRPLSPAFTGSKMRQ MFQLMDIVANEAVECLKRDDIPENGIELDMKDYCTRFTNDVIASTAFGLQ VNSFKDRENQFYMMGKKLTPLQPLTNLKFLLFTSAQKIFKALKISLFDRQ STNYFVRLVLDAMKYRQENNIIRPDMINMLLEARGLINSDKLKSSVVRDW SDRDIVAQCFVFFFAGFETSAVLMCFTAQELLENEDVQEKLYEEVAQVDS DLQGGQLTYEAIMGMKYLDQVVSEVLRKWPAAIAVDRECNKDITYEVDGK SVQIKKGEAVWLPTCGFHRDPKYFENPNKFDPDRFSEENKDKIQPFTYYP FGVGPRNCIGSRFALLEAKAVIYYLLREFRLVPAKKTCIPLVLSSSGFQL APKTGFWVKLIPRK Cyp9f2 AA735946 78% IDENTICAL TO Cyp9f1 This appears to be derived from a full length version of the pseudogene sequence DRENTFYQMGKKLTTFTFLQNMKFILLFALKSLN KILKVEIFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQT EKTKASAVREWSDRSIVAQCFAFFFAGFETSAVLMCFTAHELMENQDVQQ RLYEEVQQVDQDLEGKELTYEAIMGMKYLDQVVSEVLRKWPPAIAFDREC NKDITFDVDGQKVEVKKGDVIWLPTCGFHRDPK AC017170 Drosophila melanogaster, AI259899 and parts of AC007594 N-terminal (39127-39333) and comp(32169-32604) up to DRENTFY MLWEFFALFAIAAALFYRWASANNDFFKDRGIAYEKPVLYFGNMAGMFLRKRAMFDIVCD LYTKGGSKKFFGIFEQRQPLLMVRDPDLIKQITIKDFDHFINHRNVFATSSDDDPHDMSN LFGSSLFSMRDARWKDMRSTLSPAFTGSKMRQMFQLMNQVAKEAVDCLKQDDSRVQENEL DMKDYCTRFTNDVIASTAFGLQVNSFKDRENTFYQMGKKLTTFTFLQSMKFMLFFALKGLNK ILKVELFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQTEKTKASAVRE WSDRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKELTY EAIMGMKYLDQVVNEVLRKWPLAIAVDRECNKDITFDVDGQKVEVKKGDVIWLPTCGFHR DPKYFENPMKFDPERFSDENKESIQPFTYFPFGLGQRNCIGSRFALLEAKAVIYYLLKDYRFAPAK KSCIPLELITSGFQLSPKGGFWIKLVQR 9f pseudogene AI113499 AL105104 AC017240 AC007594 comp(31455-32006) NSFKDRENTFYQMGKKLTTFTFLQNMKFILLFALKSLNK ILKVELFDRKSTQYFVRLVLDAMKYRQEHNIVRPDMINMLMEARGIIQTEKTKASAVRE WSDRDIVAQCFVFFFAGFETSAVLMCFTAHELMENQDVQQRLYEEVQQVDQDLEGKELTY EAIMGMKYLDQVVSEVLRKWPAAIAFDRECNK EAKAVIYYLLKDYRFAPAKKSCIPLELISSGFQLSPKGGFWIKLVQR There are at least three 9f genes. One is a pseudogene. The pseudogene has an EST AI113499 and two genomic sequences AL105014 and AC017240 One of the other two sequences is represented by the EST AA735946 that is nearly identical to the pseudogene except it does not have the two deletions seen in the pseudogene. The other sequence is represented by EST AI259899 and genomic sequence AC017170. The genomic sequence AC007594 has three gene fragments. The first comp(31455-32607) and the last (39127-39333) seem to belong together. The middle sequence (37628-38320) matches AC017170 and probably is the same gene. The first sequence may to be chimeric since the first part of this sequence up to ILKVEL matches AC017170 and AI259899, but the second part matches the pseudogene and does not match AC017170 and AI259899. For an alignment of 9f related Drosophila sequences see 9f alignment AC005450 Cyp9h1 comp(17940-19601) = AC007453 comp(140861-142543) = AC005472 MDQSMIALALFIILLVLLYKWSVAKYDVFSERGVSHEKPWPLIGNIPLKAMIGGMP VLKKMIELHTK HTGSPVYGIYALRDAVFFVRDPELIKLIGIKEFDHFVNHNSMHNNIQESILSKS LISLRDGRWKEMRNILTPAFTGSKMRIMYDLIQSCSEEGVIHIQEQLELSQDASIELE MK DYFTRFANDVIATVAFGISINSFRRKDNEFFRIGQAMSRISAWSVVKAMLYALFPRL MK VLRIQVLDTKNIDYFSSL VTAAMRYRQEHKVVRPDMIHLLMEAKQQRLADLSDKSKDELYYSEFTADDLLAQC LLFFF AGFEIISSSLCFLTHELCLNPTVQDRLYEEIISVHEELKGQPLTYDKLTKMKYLDMV VLE ALRKWPPSISTDRECRQDIDLFDENGQKLFSARKGDVLQIPIFSLHHDPENFEDPEF FNP ERFADGHALESRVYMPFGVGPRNCIGNRMALMELKSIVYQLLLNFKLLPAKRTSR DLL NDIRGHGLKPKNGFWLKFEARQ* cyp12a4 AC006091 Drosophila melanogaster chromosome 3 clone BACR48G05 (D475) MLKVRSALSLIQSQKATLSLATQK* RWQTNVATAEAREDSEWLQAKPFEQIPRLNMWALSMKMSMPGGKYKNMELME MFEAMRQDY GDIFFMPGIMGNPPFLSTHNPQDFEVVFRNEGVWPNRPGNYTLLYHREEYRKDFY QGVMG VIPTQGKPWGDFRTVVNPVLMQPKNVRLYYKKMSQVNQEF ILELRDPDTLEAPDDFIDTINRWTLESVSVVALDKQLGLLKNSNKESEALKLFHYL DEFFIVSIDLEMKPSPWRYIKTPKLKRLMRALDGIQEVTLAYVDEAIERLDKEAKEG VVR PENEQSVLEKLLKVDRKVATVMAMDMLMAGVDT TSSTFTALLLCLAKNPEKQARLREEVMKVLPNKNSEFTEASMKNVPYLRACIKESQ RLHP LIVGNARVLARDAVLSGYRVPAGTYVNIVPLNALTRDEYFPQASEFLPERWLRSPK DSE SKCPANELKSTNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRNFNVEFNYPTE NAFR SALINLPNIPLKFKFIDLPN* Cyp12a5 AC006091 Drosophila melanogaster chromosome 3 clone BACR48G05 (D475) MLKGRIALNILQSQKPIVFSASQQ*RWQTNVPTAEIRNDPEWLQAKPFEE IPKANILSLFAKSALPGGKYKNLEMMEMIDALRQDYGNIIFLPGMMGRDG LVMTHNPKDFEVVFRNEGVWPFRPGSDILRYHRTVYRKDFFDGVQGIIPS QGKSWGDFRSIVNPVLMQPKNVRLYFKKMSQVNQEFIKEIRDASTQEVPG NFLETINRWTLESVSVVALDKQLGLLRESGKNSEATKLFKYLDEFFLHSA DLEMKPSLWRYFKTPLLKKMLRTMDSVQEVTLKYVDEAIERLEKEAKEGV VRPEHEQSVLEKLLKVDKKVATVMAMDMLMAGVDTTSSTFTALLLCLAKN PEKQARLREEVMKVLPNKDSEFTEASMKNVPYLRACIKESQRVYPLVIGN ARGLTRDSVISGYRVPAGTIVSMIPINSLYSEEYFPKPTEFLPERWLRNA SDSAGKCPANDLKTKNPFVFLPFGFGPRMCVGKRIVEMELELGTARLIRN FNVEFNHSTKNAFRSALINLPNIPLKFKFKFTDVPN* AC012807 Drosophila melanogaster, AC009385 AA801503 MLRLTVKHGLRANSQLAATRNPDASSYVQQL ESEWEGAKPFTELPGPTRWQLFRGFQKGGEYHQLGMDDVMRLYKKQFGDICLIPGLFGM 53300 PSTVFTFNVETFEKVYRTEGQWPV 53228 RGGAEPVIHYRNKRKDEFFKNCMGLFGN GAEWGKNRSAVNPVLMQHRNVAIYLKPMQRVNRQFVNRIREIRDKESQEVPGDFMNTINH 52788 LTFESVATVALDRELGLLREANPPPEASKLFKNIEVLMDSFFDLGVRPSLYRYIPTPTYK 52608 KFSRAMDEIFDTCSMYVNQAIERIDRKSSQGDSNDHKSVLEQLLQIDRKLAVVMAMDM 52434 LMGGVDTTSTAISGILLNLAKNPEKQQRLREEVLSKLTSLHSEFTVEDMKSLPYLRAVIK 52254 ESLRLYPVTFGNARSAGADVVLDGYRIPKGTKLLMTNSFLLKDDRLYPRAKEFIPERWLR 52074 RKDDDKSDVLMNKDLNAFIYLPFGFGPRMCVGKRIVDLEMELTVANLVRNFHIEYNYS 51900 TEKPYKCRFLYKPNIPLKFKFTDLKY* 51819 AL063519 T7 end of BAC BACR07K20 STS G01307 is exact match AC006496 AC012699 MAVILLLALALVLGCYCALHRHKLADIYLRPLLKNTLLEDFYHAELIQPEAPKRRRRGI WDIPGPKRIPFLGTKWIFLLFFRRYKMTKLHE YGDIVLEVMPSNVPIVHLYNRDDLEKVLKYPSKYPFRPPTEIIVMYRQSRPDR YASVGIVNEQGPMWQRLRSSLTSSITSPRVLQNFLPALNAVCDDFIELLRARRDPDTLV VPNFEELANLMGLEAVCTLMLGRRMGFLAIDTKQPQKISQLAAAVKQLFISQRDSYYGLG LWKYFPTKTYRDFARAEDLIYE Orf with I helix SQSSVISEIIDHELEELKKSAACEDDEAAGLRSIFLNI LELKDLDIRDKKSAIIDFIAAGIET Orf with EXXR motif LANTLLFVLSSVTGDPGAMPRILSEFCEYRDTNILQDALTNA TYTKACIQESYRLRPTAFCLARILEEDMELSGYSLNAG Orf with PERW and heme motifs TVVLCQNMIACHKDSNFQGAKQFTPERWIDPATENFTVNVDNASIVV PFGVGRRSCPGKRFVEMEVVLLLAK AC008187 Drosophila melanogaster chromosome 2 clone BACR13D20 (D604) RPCI-98 A cyp12 family member 45% to 12a5 LQLITKRNRMNTLSSARSVAIYVGPVRSSRSASVLAHEQAKSS*ITEEH KTYDEIPRPNKFKFMRAFMPGGEFQNASITEYTSAMRKRYGDIYVMPGMFGRKDWVTTF 85761 NTKDIEMVFRNEGIWPRRDGLDSIVYFREHVRPDVYGEVQGLVAS QNEAWGKLRSAINPIFMQPRGLRMYYEPLSNINNEF 85452 IKEIRDPKTLEVPEDFTDEISRLVFESLGLVAFDRQMGLIRKNRDNSDALTLFQTSRDIF 85211 RLTFKLDIQPSMWKIISTPTYRKMKRTLNDSLNVAQKMLKENQDALEKRRQAGEKINS 85037 NSMLERLMEIDPKVAVIMSLDILFAGVDATATLLSAVLLCLSKHPDKQAKLREELLSIMP 84857 TKDSLLNEENMKDMPYLRAVIKETLRYYPNGLGTMRTCQNDVILSGYRVPKGTTVLLGS 84680 NVLMKEATYYPRPDEFLPERWLRDPETGKKMQVSPFTFLPFGFGPRMCIGKR 84524 VVDLEMETTVAKLIRNFHVEFNRDASRPFKTMFVMEPAITFPFKFTDIEQ* 84371 AL063519 BACR07K20 STS G01307 is exact match MXILXLNRQYGDIVLEVMPSNVPIVHLYNRDDLEKVLKYPSKYPFRPPTEIIVMYRQSRPDR YASVGIVNEQGPMWQRLRSSLTSSITSPRVLQNFLPALNAVCDDFIELLRARRDPDTLV VPNFEELANLMGLEAVCTLMLGRRMGFLAIDTKQPQKISQLAAAVKQLFISQRDSYYGLG LWKYFPTKTYRDFARAEDLIYE*VHR AL074984 note this sequence has the end of an early exon at its beginning PWLXXXDRPMXLISRNRDDPDALT LNIQPSMWRXISTPNFRMMMRLLDDILMFSQKMIKDTEDSVEKRRQ* LQLIPKRNRMNTLSSARSVAIYVGPVRSXXXASVLAHEQAKSS AC007398 AC007418 mito clan best matches = 37-38% to 12a sequences MQRLRTGESSNPKKLNV* SQQPVTSVATTRTTASSLPA ETTSSPAAAVRPYSEVPGPYPLPLIGNSWRFAPLIG* TYKISDLDKVMNELHVNYGKMAKVGGLIGHPDLLFVFDG DEIRNI* THFLFAMELRPSMPSLRHYKGDLRRDFFGDVAGLIGV* HGPKWEAFRQEVQHILLQPQTAKKYIPPLNDIASEFMGR* IELMRDEKDELPANFLHELYKWALE* SVGRVSLDTRLGCLSPEGSEEAQQIIEAINTFFWAVPELELRMPLWRIY PTKAYRSFVKALDQFT*EITLHPRICMKNIGKTM DKADADEARGLSKSEADISIVERIVRKTGNRKLAAILALDLFLVGVDT TSVAASSTIYQLAKNPDKQKKLFDELQKVFPHREADINQNVLEQM PYLRACVKETLRMRPVVIANGRSLQSDAVINGYHVPKGVSFREMVTIW* DPAYFPEPKRFLPERWLKQSTDXXX SAGCPHANQKIHPFVSLPFGFGRRMCVGRRFAEIELHTLLAKV GFNHVLHLALPELPPFNLPYLIPGTDLPQIQGLVQFRRVC VPCELHVHSAVASEFQTNAEGRVSCLHQREPEAHHGAQSCSYTRRMCDIY SQSSAAVAMLHFN* AC007356 Drosophila melanogaster chromosome 2 clone BACR24H09 (D595) RPCI-98 MNNLSLKAWRSTVSCGPNLRQCVPRISGA GSRRAQCRESSTGVATCPHLADSEEASAPRIHSTSEWQNALPYNQIPGPK PIPILGNTWR LMPIIGQYTISDVAKISSLLHDRYGRIVRFGGLIGRPDLLFIYDADEIEK CYRSEGPTPFRPSMPSLVKYKSVVRKDFFGDLGGVVG RHGEPWREFRSRVQKPVLQLSTIRRYLQPLE VITEDFLVRCENLLDENQELPEDFDNEIHKWSLE GIGRV ALDTRLGCLESNLKPDSEPQQIIDAAKYALRNVATLELKAPYWRYFPTPLWTRYVK NMNFFV SVCMKYIQSATERLKTQDPSLRAGEPSLVEKVILSQK DEKIATIMALDLILVGIDTVSMAVCSM LYQLATRPVDQQKVHEELKRLLPDPNTPLTIPLLDQMHHLKGFIKEVFRMYSTVIG NG RTLMEDSVICGYQVPKGVQAVFPTIVTGNMEEYVTDAATFRPERWLKPQHGGTPG KLHPF ASLPYGYGARMCLGRRFADLEMQILLAKLLRNYKLEYNHKPLDYAVTFMYAPDG PLRFKMTRV* AC017275 = AC007356 in ordered 36160 SMAVCSMLYQLATRPVDQQKVHEELKRLLPDPNTPLTIPLLDQMHHLKGFIKEVFRMYST 35981 35980 VIGNGRTLMEDSVICGYQVPKGVQAVFPTIVTGNMEEYVTDAATFRPERWLKPQHGGTPG 35801 35800 KLHPFASLPYGYGARMCLGRRFADLEMQILLAKLLRNYKLEYNHKPLD 35657 Cyp12b1 U78485 Drosophila acanthoptera MWKFAIHSQQPFCWQQLCNRRHLYVGNVQQQTHLELLDAAPTRSDDEWLQAKPY EKVPGPGTWQVLSYFLPG GKQYNTNLIQMNRRMREWYGDIYRFPGLMGKQDVIFTYNPNDFELTYRNEGVWP IRIGLESFTYYRKVHRPE VFGSIGGLVSEQGKDWAHIRNKVNPVQMRVQNVRQNLPQIDQISREFVDKLDTLR DPVTHILNDNFHEQLKM WAFESISFVALNTRMGLLSDRPDPNAARLAEHMTDFFNYSFKYDVQPSIWPYYKT PGFKKFLQTYDKITEIT TAYIDEAIKRFEIEKDSGNECVLQQLLSLNKKVAVVMAMYMLMAGIDTTSSAFVTIL YHLARNPHKQRQLHR ERRRILPDSDEPLTPENTKNMPYLRACIKECMRITSITPGNFRIATKDLVLSGYRVPR GEGVLMGVLELSNS EKYFGQSGQFMPERWLKADTDPDVKACPAARSRNPFVYLAFGFGPRTCIGKRIAE LEMETLLTRLLRRYQVS WLAEMPLQYESNIILSPHGIYVQVRAAC cyp12b2 AC018326 <7400-9141 AC004657 AC004345 MWKYSNKIIYRNVSGNQLWFNRNSSVGGTLSQQVRSWQKEQELLKSRNLFTNNGYICSQT QLELADSRIDEKWQQARSFGEIPGPSLLRMLSFFMPGGKLF*NTNLIQMNRLMREMYGDI YCIPGMMGKPNAVFTYNPDDFEMTYRNEGVWPIRIGLESLNYYRKIHRPDVFKGVGGLAS E*QGQEWADIRNKVNPVLMKVQNVRQNLPQLDQISKEFIDK*LETQRNPETHTLTTDFHN QLKMWAFESISFVALNTRMGLLSDNPDPNADRLAKHMRDFFNYSFQFDVQPSIWTFYKTA GFKKFLKTYDNITDITSNYIETAMRGFGKNDDGKTKCVLEQLLEHNKKVAVTMVMDMLMA GIDT*TSSACLTILYHLARNPSKQEKLRRELLRILPTTKDSLTDQNTKNMPYLRACIKEG LRITSITPGNFRITPKDLVLSGYQVPRGTGVLMGVLELSNDDKYFAQSSEFIPERWLKSD LAPDIQACPAARTRNPFVYLPFGFGPRTCIGKRIAELEIETLLVRLLRSYKVSWLPETPI EYESTIILSPCGDIRFKLEPVGDLM* Cyp18 U44753 AL062343 AI294030 (3 diffs with cyp18) MLADSYLIKFVLRQLQVQEDGDAQHLLMVFLGLLALVTLLQWLVRNYREL RKLPPGPWGLPVIGYLLFMGSEKHTRFMELAKQYGSLFSTRLGSQLTVVM SDYKMIRECFRREEFTGRPDTPFMQTLNGYGIINSTGKLWKDQRRFLHDK LRQFGMTYMGNGKQQMQKRIMTEVHEFIGHLHASDGQPVDMSPVISVAVS NVICSLMMSTRFSIDDPKFRRFNFLIEEGMRLFGEIHTVDYIPTMQCFPS ISTAKNKIAQNRAEMQRFYQDVIDDHKRSFDPNNIRDLVDFYLCEIEKAK AEGTDAELFDGKNHEEQLVQVIIDLFSAGMETIKTTLLWINVFMLRNPKE MRRVQDELDQVVGRHRLPTIEDLQYLPITESTILESMRRSSIVPLATTHS PTRDVELNGYTIPAGSHVIPLINSVHMDPNLWEKPEEFRPSRFIDTEGKV RKPEYFIPFGVGRRMCLGDVLARMELFLFFASFMHCFDIALPEGQPLPSL KGNVGATITPESFKVCLKRSPLGPTAADPHHMRNVGAN AI294030 3 DIFFS WITH CYP18 MLADSYLIKFVLRQLQVQQDGDAQHLLMGFLGLLALDT AL067521 similar to CYP18 I-helix to K-helix region LLWXSVFHCXIRGCCARVXXALARVVGRLRLPXFXXXXXLPLPASPFLASXRRSXXVPLAPTPSPXR 515 AL077732.1|CNS00KLP similar to CYP18 C-terminal IDTEGKXATXSTSYPSXXAXXXSXAXFWRGVELFLFFASFIHCFXIXLPXGQPLPSLKGN 183 VGXTITPESFNVCLXXRPLWPTSADPHHMRNVGAN 288 AC012164 293-399 27006-27380 RGLGKEELAGHATTLLLEGYETSAMLLAFALYELALNEDAQRHAGNLI 27185 DPGALGELRYSEAALLEALRLHPAMQALQKRCTKTFTLPDQKSGASSELKVHLGTELVLP 27365 VHAIH 27380 DSALYPAPNQFRPERF 27492 AC012164 477-537 109248-109066 not on AC015216 or AC012373 FLFFAPLKAWFGIGPPEGLPLARFKGKVGAPLPPGLVKVCPKGPPLGAPSPHSPPMGKVGA 109066 AC015216 Drosophila melanogaster, in ordered fragments MSADIVDIGHTGWMPSVQSLSILLVPGALVLVILYLCERQCNDLMGAP PPGPWGLPFLGYLPFLDARAPHKSLQKLAKRYGGIFELKMGRVPTVVLSDAALVRDFFRR 49011 DVMTGRAPLYLTH GIICAQEDIWRHARRETIDWLKALGMTRRPGELRARLERRIARGVDECV 49301 VNPLPALHHSLGNIINDLVFGITYKRDDPDWLYLQRLQEEGVKLIGVSGVVNFLPWLRHL 49902 PANVRNIRFLLEGKAKTHAIYDRIVEACGQRLKEKQKVFKELQEQKRLQRQLEKEQLRQS 50082 KEADPSQEQSEADEDDEESDEEDTYEPECILEHFLAVRDTDSQLYCDDQLRHLLAD 50250 LFGAGVDTSLATLRWFLLYLAREQRCQRRLHELLLPLGPSPTLEELEPLAYLRACIS 50421 ETMRIRSVVPLGIPHGCKENFVVGDYFIKGGSMIVCSEWAIHMDPVAFPEPEEFRPERFL 50601 TADGAYQAPPQFIPFSSGYRMCPGEEMARMILTLFTGRILRRFHLELPSGTEVDMAGES 50778 GITLTPTPHMLRFTKLPAVEMRHAPDGAVVQD 50877 AC015216 Assembled sequence 72624-74273 also = AC012164 and GSS AL098201 MLPLVLFILLAATLLFWKWQGNHWRRLGLEAPFGWPLVGNMLDFALGRRSYGEIYQEIYT* RNPGLKYVGFYRLFNEPAILVRDQELLRQILVGRNFADCADNAVYVDHQRDVLASHNPFIANGDRWRVLRADLVP LFTPSRVRQTLPHVARACQLLRDQVPLGRFEAKDLATRYTLQVVASAIFGLDAHCLGIHMRVAHEPSRWLEWLAPLFQPS VWSLLETMSLLHTPRLGRLIGHR*YVPLPLQHWFRELVEARSGGDNLLQWLAESKRGLGKEELAGHATTLLLEGYETSA MLLAFALYELALNEDAQRRLHIELDEVAQRHAGNLI DPVALGELRYSEAALLEALRLHPAMQALQKRCTKTFTLPDQKSGASSELKVHLGTVLVLPVQAIH LDPALYPAPNQFXPERFLNQPPMGCRFLGFGAGPRMCPGMRLGLLQTKAALTTLLQDHCVQLADEDQCRVEVSPLTFLTASRNGIWLSFKRRTRRY* AC014292 AC009393 AC008359 AC014072 AI257867 MITETLLTICAAVFLCLSYRYAVGRPSGFPPGPPKIPLFGSYLFMLIINF KYLHKAALTLSRWYKSDIIGLHVGPFPVAVVHSADGVREILNNQVFDGRP QLFVAAMRDPGQDVRGIFFQDGPLWKEQRRFILRYLRDFGFGRRFDQLEL VIQEQLNDMLDLIRNGPKYPHEHEMVKSGGYRVLLPLLFNPFSANAHFYI VYNECLSREEMGKLVKLCQMGIQFQRNADD YGKMLSIIPWIRHIWPEWSGYNKLNESNLFVRQFFADFVDKYLDSYEEGVE RNFMDVYIAEMRRGPGYGFNRDQLIMGLVDFSFPAFTAIGVQLSLLVQYLMLYPAVLRRVQNEIDEVVGCGRLPNLEDRKNLPFTE ATIREGLRIETLVPSDVPHKALEDTELLGYRIPKDTIVVPSLYAFHSDARIWSDPEQFRPERFLDADGKLCLKLDVSLPFGAGKRL CAGETFARNMLFLVTATMCQHFDFVLGPNDRLPDLSQNLNGLIISPPDFWLQLQDRH* Cyp28a1 U89746 Drosophila mettleri MLEITLILVLLLLGLFYVFMTWNFGYWRKRKVPGPKPHCFTGNY PHMYNMKRHSVYDLNDIYSEYEHKFDAVGIYGARSPQLLVISPQVARRVFVSDFR HFH DNEISLMVDEKSDFIMANNPFSQIGDEWKQRRADITPGLTMGRIKTVYPVTQEVCQ KM TDWLRKQIRLPPSGGIDAKDMSLRFTSEMVTDCVLGLKAESFSDKPTPIMGYIKDL FA QSWTFIIYFVLVSTLPALRHVFKLRFVPLRIENFFVNLMQTAIDARRQQLAAGKQFE R VDFLDYILHLGKKKNLDTRHLTAHTMTFLLDGFETTALYDSCALVLSRDQEAQQKL RE ELEAHLDDKGIIDFEKLNELPFLDACVQESLRIFPPAFMSNKLCTEPIELPNKTGENF TVERGTTVVVPHYCFMMDEEFFPDPQAFKPERFMQPDAAKMYREQGVFMAFGDG PRVC IGMRFALTQIKGALVELLTKFIIRVNPKTRSDNEYDPTTFIGTCKGGIWLDFELRQ Cyp28a2 U89747 Drosophila mettleri MLVTLILLGLVVFLGYKFLIWNYDYWRKRKVPGPKPALFTGNYP HLFTGKQHPVYAVNEIYRKYKNDYDAVGIYISRMPQLLIVNPDLAHRVFVSNFKN FHD NEISALVVEKSDYIFANNIFSMTGDAWKERRSDITPGLTISRIKSVYPVTNQVCKKM T EYIKKQIRIAPKDGLNGKDLSLCFTTEMVTDCVLGLGAQSFTDNPTPVMAKMRNL FRQ DLPFLINTIAMALFPPLRRIIRLRFLSKTIVEFFVRFMETPLEERQKHISAGANINRV DMLGYIIQLSPKRNMDSLKITACTMSFLLDGDDTPPSLLSNTLLLLGRNPQGHQRL RE ELSEHLCDQGFIDFDKLVDLPYLNACVHESIRIFLTAVSSKLCTESIELSNRNGPNFT VEKGTVVLVPITCFMYDDDHFPNANEYNPERFLKPDSIKKYRDQGLFLGFGDGPRI CIGMRFGLAQAKAALVEILVNFDVSVNARTRKDNLYDPKNLLSTLEGGIWLDFAARS AI403094 AI403665 AA698711 AA697564 50% to 28A2 AC008324 AI062825 MCPISTALFVIAAILALIYVFLTWNFSYWKKRGIPTAKSWPFVGSFPSVF TQKRNVVYDIDEIYEQYKNTDSIVGVFQTRIPQLMVTTPEYAHKIYVSDF RSFHDNEMAKFTDSKTDPILANNPFVLTGEAWKERRAEVTPGLSANRVKA AYPVSLRVCKKFVEYIRRQSLMAPAQGLNAKDLCLCYTTEVISDCVLGIS AQSFTDNPTPMVGMTKRVFEQSFGFIFYTVVANLWPPITKFYSVSLFAKDVAAFFYDL MQKCIQVRRESPAAQQRDDFLNYMLQLQEKKGLNAAELTSHTMTFLTDGFETTAQVL THTLLFLARNPKEQMKLREEIGTAELTFEQISELPFTEACIH ETLRIFSPVLAARKVVTEPCELTNKNGVSVKLRPGDVVIIPVNALHHDPQYYEEPQSFKP ERFLNINGGAKKYRDQGLFFGFGDGPRICPGMRFSLTQIKAALVEIVRNFDIKVNPKTRK DNEIDDTYFMPALKGGVWLDFVERN* AC008324 = AL070820 AC017780 114532 MCPVTTFLVLVLTLLVLVYVFLTWNFNYWRKRGIKTAPTWPFVGSFPSIFTRKRNIAYDI 114353 114352 DDIYEKYKDTDNMVGVFTTRVPQLLVMCPEYIHKIYA 29560 TDFRSFHNNEWRNFVST 29613 29666 KTDMILGNNPFVLTGDEWKERRSEIMPALSPNRVKAVYP 29845 29846 VSQSVCKKFVEYIRRQQQMATSEGLDAMDLSLCYTTEVVSDCGLGVSAQSFTDTPTPLLK 30025 30026 MIKRVFNTSFEFIFYSVVTNLWQKVRKFYSVPFFNKETEVFFLDIIRRCITLRLEKPEQ 30202 30203 QRDDFLNYMLQLQEKKGLHTDNILINTMTFILDGFETTALVLAHIMLMLGRNPEEQDKVR 30382 30383 KEIGSADLTFDQMSELPHLDAC 30454 30519 ILETLRLFSPQVAARKLVTEPFEFANKNGRTVHLKPGDVVTIPVKALHHDPQYYEDPLTF 30698 30699 KPERFLESNGGGMKSYRDRGVYLAFGDGPRHCPG 30800 30860 MRFALTQLKAALVEILRNFEIKVNPKTRSDNQIDDTFFMATLKGGIYLDFKDL* 31021 Cyp28a3 U91565 Drosophila nigrospiracula MRFALIQIKAAVVEVITKFNVRVNPKTRKDNEYEPTAFITSLKGGIWLDFESRP Cyp28a4 U91565 Drosophila hydei MRFAMTQIKGALVEVLTKFNVRVNPKTRTDNEYEPTRFITTLKGGIWLDFEPRQ Cyp28a5 DS00180_1 : contig 1 (85065 bases) of 1 for P1 d29 (AC001660) MVLITLTLVSLVVGLLYAVLVWNYDYWRKRGVPGPKPKLLCGNYPNMFTMKRH AIYDLDDI YRQYKNKYDAVGIFGSRSPQLLVINPALARRVFVSNFKNFHDNEIAKNIDEKTDFI FANNP FSLTGEKWKTRRADVTPGLTMGRQIKTVYPVTNKVCQKLTEWVEKQLRLGSKDG IDAKQMS LCFTTEMVTDCVLGLGAESFSDKPTPIMSKINDLFNQPWTFVLFFILTSSFPSLSHLI KLR FVPVDVERFFVDLMGSAVETRRAQLAAGKQFERSDFLDYILQLGEKRNLDNRQLLA YSMTF LLDGFETTATVLAHILLNLGRNKEAQNLLREEIRSHLQDGTIAFEKLSDLPYLDACV QETI RLFPPGFMSNKLCTESIEIPNKEGPNFVVEKGTTVVVPHYCFMLDEEFFPNPQSFQ PERFL EPDAAKTFRERGVFMGFGDGPRVCIGMRFATVQIKAAIVELISKFNVKINDKTRKD NDYEP GQIITGLRGGIWLDLEKL* AC014191 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered MFGSLLLGIATLLGAIYAFLVSNFGHWRRRGVTEPRALPLFGSFPNMIWPRQHFTMDMRDIY 9811 RHYRNTHSYVGCYLLRAPKLLVLEPRLVYEIYVSAFSHFENN 9631 DASKMVDIAKDRLVALNPFVLEGEEWRHQRAVFSTLLTNGRIRTTHAIMQRVCLDLCQFI 9451 AIKSAGGKDLDCID LGLRFTGESLFDCVLGIQARTFTDNPLPVVRQNHEMSAENRGLAIAGAVHGLFPNLPRWL 9171 RPKVFPRSHDRFYGQMISEALRLRRSKHQERNDFINHLLEMQRELDLSEEDMASHAMT 8997 FMFDGLDTTSNSIAHCLLLLGRNPDCQRRLYEELQLVNPGGY 8817 LPDLDALIDLPYLSACFN 8763 *ESLRIYPAGGWASKTCTKEYELRGS 8629 HHSEPLKLRPGDHVMVPIYALHNDPDLYPEPDVFRPERFLDGGLKNCKQQG 8476 IFLGFGNGPRQCVGMRLGLAMAKAALAAIVQRFEVVVSPRTLNGTELDPLIFVGVHKGGI 8296 WLQFVPRKNVTTK* 8254 CYP4AA1 AC004516 68725-68267 AC004426 AC005556 comp(849-1502) MHLRLLSPPQLERTTNLELCSILILLVISLSIYTFYATLNTYLRSVLLSLRLTGPPSLPF L GNCMLVTDKDCK*YGSLVRIWVLLFPFFAVLEPEDLQVILSSKKHTNKVFFYRLM HNFLGD GLITSSGSKWSNHRRLIQPAFHHNLLEKFIDTFVDASQSLYENLDAEAVGTEINIAK YVNN CVLDILN*EAVLGVPIKKRGQDVAMMEDSPFRQGKIMMPARFTQPWLLLDGIYHW TKMAND ELNQKKRLNDFTRKMIQRRRQIQNNNNGNSERKCLLDHMIEISESNRDFTEEDIVN EACTF MLAGQDSVGAAVAFTLFLLTQNPECQDRCVLELATIFEDSNRAPTMTDLHEMRYM EMCIKE ALRLYPSVPLIARKLGEEVRLAKHTLPAGSNVFICPYATHRLAHIYPDPEKFQPERF SPEN SENRHPYAFLPFSAGPRYCIGNRFAIMEIKTIVSRLLRSYQLLPVTGKTTIAATFRITL RA SGGLWVRLKERDHPLIAH* L49408 DS02740, AI405120 MFYTVIWIFCATLLAILFGGVRKPKRFPPGPAWYPIVGSALQVSQLRCRLGMFCKV IDVFA RQYVNPYGFYGLKIGKDKVVIAYTNDAISEMMTNEDIDGRPDGIFYRLRTFNSRLG VLLTD GEMWVEQRRFILRHLKNFGFARSGMMDIVHNEATCLLQDLKDKVLKSGGKQTRI EMHDLTS VYVLNTLWCMLSGRRYEPGSPEITQLLETFFELFKNIDMVGALFSHFPLLRFIAPNF SGYN GFVESHRSLYTFMSKEIELHRLTYKNYDEPRDLMDSYLRAQDEGNDEKGMFSDQS LLAICL DMFLAGSETTNKSLGFCFMHLVLQPEIQERAFQEIKEVVGLERIPEWSRDRTKLPY CEAIT LEAVRMFMLHTFGIPHRAVCDTRLSGYEIPKDTMVIACFRGMLINPVDFPDPESFN PDRYL FDGHLKLPEAFNPFGFGRHRCMGDLLGRQNLFMFTTTVLQNFKMVAIPGQVPEEV PLEGAT AAVKPYDIMLVAREQ* AC003055 (P1 DS06332) second gene complete 5801-7437 MFTLVGLCLTIVHVAFAVVYFYLTWYHKYWDKRGVVTAEPLTILGSYPGILINKSR SLILDVQDVYSKYKDKYRTVGTFITRQPQLLVLDPALAHEILVDKFSHFRDTITSSFVGHNP DDKYVAGSPFFSAGDKWKRLRSENVGGLTPSRLKMAYSIWEQSGRKLVEYIERARREQG DIIETRDLAYRFTANAMADFIWGIDAGSLSGKVGEIGDFQKTSTDWSAHAFSSMIRFNKTL VAIFVRKLFSMRFFTKATDEFFLRLTQDAVNLRQGGSGEGRTDYLSHLIQLQQRGNSIHDSV GHALTVHLDGFETSGAVLYNAVSYQLSEHHEEQEKLRSEILEALASEGQISYDQINNLPYLD QCFNESLRLTTPIGFFMRICTKPTQINLGDDKTLDLEPGVTVMVPAYQYHHDNDIYPEASE FRPDRFENGAASVLTKRGCFLPFGDGPRICLGMRVGQLSVKTAIVHILSNYQVEQMKKV PLGADSGMGIFLNGDVELKYTKLQK AC009909, AC002444, AC003055 MYILASLALILLHLLVLPIYLYLTWHHKYWRKRGLVTARPLTLLGTYPGLLTRKSNLVFD VQKIYSKYKGKHRAVGVFVTRQPQILVLDPELAHEVLVSNFRCYKDSLQSSYLRHAKWDK YARLNPFWASGQSWRRLRTDAQAGISGSRLRQAYNIWEQGGQMLTEYMTQQVAEKNNILE TRDVCFRYTAHVMADFIWGIDAGTLTRPMEQPNKVQEMASKWTSYAFYMLTLFMATIVAP CSRLLLRFRFYPKETDEFFSNLTKESIELRLKAGDSTRTDYLSHLLQLRDQKQATHDDLV GHALTVMLDGYDTSGTALLHALYYVLAENPAVQQKLRVEILSCMASEKSLDFEKLSSLQY LEQCFNESLRLSSLIPQYTKVCTLPTVIRLSESKSLDVEVGMTIMIPNYQFHHDKQYFPE PEAFKPERFDNGAYQELMRKGIFLPFSDGPRICMGVPLAMLTLKSALVHILSNFQVVRGR DRLIPKGDSGFGVVLQGDVNLEYRRFFR* AC007571 MLASIILSGWLLLAWLYFLWS RRRYYKVAWQLRGPIGWPLIGMGLQMMNPESKSWTGF* YMDGLSRQFKAPFISWMGTSCFLYINDPHSVEQILNSTHCTNKGDFYR FMSSAIGDGLFTSSSPRWHKHRRLINPAFGRQILSNFLPIFNAEAEVLLQ KLELEGVQHGKRLEIYQILKKIVLEAAC* QTTMGKKMNFQHDGSLCIFKAYNGLVQMLRHLIMVNNIC WLITFAFSLTEVCVKRMLSPWLYPDLIYRRSGLFRLQQKVVGILFGFIEQVSYWSDQSCTH *LISQTFETTSTALYFTILCLAMHPCYQ EKLHKELVTELPPSGDINLEQLQRLEYTEMVINEAMRLFAPVPMVLRSAD QDIQLKRGDGEFLIPRGTQIGIDIYNMQRDERVWGPLSRTYNPDAHFGLD SPQRHAFAFVPFTKGLRMCIGYRYAQMLMKLLLARIFRSYRISTEARLEE LLVKGNISLKLKDYPLCRVERR* AC017336 153877-155807 = AC007752 62176-61546 and 120713-121149 MLTWTLWCGLLFLLWIYFLWSRRRFYLLTLKIPGPLGYPILGMAHWLMRRE* DILNAFGCFLDKHGPTIFSWLGPIPFMIVSDPQVVQDIFTSPHCVNKGIIYKAVDDGAGVGLFSLKD* PRWSIHRKLLNPAFGHKVLLSFLPIFNRETALLLDQLEPLQDDGEKDLIPLLQSFTLGIAT* QTTMGSDVKDEESFRSNSLLGRYQC* ILETMTDMCFSPWLNSRFCRQLAGKESHYYQAKTEIRQFIRK* DEMGALPSIQSNDKNLFLNLVTDLMRRGVFTLKNVEDESNI IVFGAFETTANAVYYTLMLLAMFPEYQERAFEEIKTIFPNTGDFDVSYADTQQMVYLDLI LNESMRVIPPVPVVSRQTSQDLKLSNGIVVPKGVQIAIDIYHMHRSKKIWGPDAETF NPDHFLPHNIQDKHPYAYIPFTKGIRNCIG* WRYALISAKVTLAKLLRNYRFKTSFPFENLYFVEDITMKLKSVPLLELQKRT* AL068269 AA141181 might be same as AC005130 AAQAXFLXLAGFDTSSSTYHFLRCTSWPRTPXFRTPXNGXSGCLAV *PRSATELRHFTGLXYLRQVVDEVLRXYPPTXFLDRCCNSRTGYDL SPWNGGSPFKLRAGTXVYISVLGIHRDAQYWPNPESFDPERFSAEQ RQQHHPMTYLPFGAGPRXCIGXXLGQLEIKVGLLHILNXFRVEXCE RTLPEMRFDPKAFVLTAHNGTYLRFVKNXL* AC012831 comp(42975-44564) = AL068269 AA141181 similar to AC014742 MIAVFSLIAAALAVGSLVLLPVVLRGGCLLVVTIVWLWQILHFWHWRRLGVPFVPAAPFVGNVWNLLRGACCFGDQFRELYESKEA AGRAFVGIDVLHNHALLLRDPALIKRIMVEDFAQFSSRFETTDPTCDTMGSQNLFFSKYETWRETHKIFAPFFAAGKVRNMYGLLE NIGQKLEEHMEQKLSGRDSMELEVKQLCALFTTDIIASLAFGIEAHSLQNPEAEFRRMCIEVNDPRPKRLLHLFTMFFFPRLSHRV GTHLYSEEYERFMRKSMDYVLSQRAESGENRHDLIDIFLQLKRTEPAESIIHRPDFFAAQAAFLLLAGFDTSSSTITFALYELAKN TTIQDRLRTELRAALQSSQDRQLSCDTVTGLVYLRQVVDEVLRLYPPTAFLDRCCNSRTGYDLSPWNGGSPFKLRAGTPVYISVLG IHRDAQYWPNPEVFDPERFSAEQRQQHHPMTYLPFGAGPRGCIGTLLGQLEIKVGLLHILNHFRVEVCERTLPEMRFDPKAFVLTA HNGTYLRFVKNSL* AC014742 Drosophila melanogaster, in ordered pieces = AC005130 MVIAFFIFL*CAALAVGSVVLLPLIALLAVWLWQRRHFRIWRRLGVPYLPAAPVLG 1204 NVLNVETAACCFGDQFRELYERKEAAGRAIVGINVLHSHALLLRDPALIRRILVEDFPEFSSSFKST 1398 DAIRDTMGSGNLLFTKYKTWWETHKIFAIRLGGRRIRSLLYGLLERI*QNLEAHMAQK 1557 LNGAESVELEVKQLCALFTTDIFAKFALQSLQNPEAEFRPMCIEVNDPKPKRLSHHLFTGF 1710 TPPIYRVRTHLYSEEYERFMRKSMNYVLAQRAEN*EKRYDLIDMFLQMHRT 1841 ETAEGIIHRPDFYVAQAAFLLLAGFDTSSSFALYELAKNPTIEHRLQAELRVDLQSSHNHQLSYDTLTGLVYLRQVLED 2077 LPFGAGPRGCIGTLLGQLGIKVGLLHTLKHFRVELCERTLPEMRFDP*ASVLTAHNGTFLRFVRNSL* 2504 AC005130 DS01560, complete sequence AL062712 (BACR006M08), AL075733 (BACR037C16), = AC014742 CAALAVGSVVLLPLIALLAVWLWQRRHFRIWRRLGVPYLPAAPVL GNVLNVETAA *FGDQFRELYERKEAAGRAIVGINVLHSHALLLRDPALIRR ILVEDFPEFSSSFKSTDAIRDTMGSGNLLFTKYKTWWETHKIFAVRTSGK* *YVFADAPYRNCGRHNPSARLLRSPGC FLLLAGFDTSSSFALYELA KNPTIEHRLQAELRVDLQSSHNHQLSYDTLTGLVYLRQVLEDDPQFHDEI SYPPRELQRIASV* *SDTCLAVYFIIFKIRNPKPRIL HQIDLTRATLTASLHALPFGAGPRGCIGTLLGQLGIKVGLLHTLKHFRVE LCERTLPEMRFDP* AL070820.1|CNS00FNQ Drosophila melanogaster genome survey sequence TET3 end LSLCYTTEVVSDCGLGXSAXSFTDTPTPLLKXIKRVFNTSFEFIFYSVVTNLWQKV RKFY SVPFFNKETEVFFLDIIRRCITLRLEKPKQQRDDFLNYILQLQEKKGLHTDNILIN TMTFILDGFETTALVLAHIMLMLGRNPESK LTFDQMSELPHLDACI*LETLRLFSPQVAARKLVTEPFEFANKNGRTVHLKPGDVV TIPVKAL 794 HHDPQYYEDPLTFKPERF 848 AL062352 AL054245 AI108091 AI113367 AI064259 AI064268 36% with 6a2 FRPPGXITXXPXMTXTXYXLAKNEXLQDRLRQEIVDFFGDEDHISY ERIQEMPYLSQXVNETLRKYPIVGYIEREAIHPRTIPQHGVPHGMS IYMSTVAVHRDPQYWPDPEKYDPERFNSSNRDNLNMDAYMPFGVGP RNCIGMRLGLLQSKLGLVHILRNHRFHTCDKTIKKIEWAPTSPETF CTRRIISRFEAITGPAN* AC014186 AA803931 AA821188 AA803220 AI404794 MCCLQSTTLDRKKNASWNRSAIVGSDTSDAPEHCPL RCGALLAVLLAWQQRKCWRLIWQLNGWRGVIQQPVLWLLLCINLHPNS ILEKVSQYRVHFQRPLRVLVGTRVLLYIDDPAGMECVLNAPECLDKTFLQDGFFVRRGLLHAR GQKWKLRRKQLNPAFSHNIVASFFDVFNSVGNQMVEQFQTQTNLHGQAVKFTAAEDLLSRAVLE* DTSMGAQLDTQSVDHSPIIQAFHLSSKLLFKRMINPLLSSDWIFQRTQLWRDLDEQLQ VIHSQMESVIEKRAKELLDMGEPAGRAHNLLDTLLLAKFEGQSLSRREI RDEINTFVFXGVDTTTAAMSFVLYALAKFPETQTRLRKELQDVALDETTDLDALNGLPYLEALIKE VLRLYTIVPTTGRQTTQSTEIGGRTYCAGVTLWINMYGLAHDKEYYPDPYAFKPERWLPE DGAVAPPAFSYIPFSGGPHVCIGRRYSLLLMKLLTARLVREFQMELSPEQAPLRLEAQM VLKAQQGINVSFLKQ* AI404831 GH24669 52% to 6d2 N-term AC008197, AC009846, AA141002 C-term MFSLILLAVTLLTLAWFYLKRHYEYWERRGFPFEKHSGIPFGCLDSVWRQEKSMGLAIYDVYV KSKERVLGIYLLFRPAVLIRDADLARRVLAQDFASFHDRGVYVDEERDPLSANIF SLRGQSWRSMRHMLSPCFTSGKLKSMFSTSEDIGDKMVAHLQKELPEEGFKEVDIKKVMQ NYAIDIIASTIFGLDVNSFENPDNKFRKLVSLARANNRFNAMFG VSNGCSCRIAQFLFRIGFKNPVGLAMLQIVKETVEYREKHGIVRKDLLQLLIQLRNTGKIDENDEKSFSIQKTPDD IKTISLEAITAQAFIFYIAGQETTGSTAAFTIYELAQYPELLKRLQDEVDETLAKNDGKITYDSLNKMEFLDLCVQETIRKYPGLP ILNRECTQDYTVPDTNHVIPKGTPVVISLYGIHHDAEYFPDPETYDPERFSEESRNYNPTAFMPFGEGPRICIAQRMGRINSKLAI IKILQNFNVEVMSRSEIEFENSGIALIPKHGVRVRLSKRVPKLS* AC008197 chromosome 3 clone BACR02L12 (D753) RPCI-98 AC009846 AA141002 MFSLILLAVTLLTLAWFYLKRHYEYWERRGFPFEKHSGIPFGCLDSVWRQEKSMGLAIYD 68158 VYVKSKERVLGIYLLFRPAVLIRDADLARRVLAQDFASFHDRGVYVDEERDPLSANIFSL 68338 RGQSWRSMRHMLSPCFTSGKLKSMFSTSEDIGDKMVAHLQKELPEEGFKEVDIKKVMQNY 68518 AIDIIASTIFGLDVNSFENPDNKFRKL 68599 VISESEFCYGVSNGCSCRIAQFLFRIGFKNPVGLAMLQIVKETVEYREKHGIVRKDLLQLLIQLRNTGKIDENDEKSFSIQKTPDD IKTISLEAITAQAFIFYIAGQETTGSTAAFTIYELAQYPELLKRLQDEVDETLAKNDGKITYDSLNKMEFLDLCVQETIRKYPGLP ILNRECTQDYTVPDTNHVIPKGTPVVISLYGIHHDAEYFPDPETYDPERFSEESRNYNPTAFMPFGEGPRI AI064680 N-TERM TO C-HELIX AL061315 AI404867 AI406053 AI1516839 AC009382 MSALIFLCAILIGFVIYSLISSARRPKNFPPGPRFVPWLGNTLQFRKEASAVGGQHILFERWA KDFRSDLVGLKLGREYVVVALGHEMVKEVQLQEVFEGRPDNFFLRLRTMGTRKGITCTDG QLWYEHRHFAMKQMRNVGYGRSQMEHHIELEAEELLGQLERTEEQPIEPVTWLAQ SVLNVLWCLIAGKR AL078186 MLTSVFYVLFAIAITIILISYVFLLLKCKQKAFVVIGLLYQEKKY QCFDQAPGPHPWPIIGNINLLGRFQYNPFYGFGTLTKKYGDIYSLSLGHT RCIVVNNVDLIKEVLNKNGKYFGGRPDFFRYHKLFGGDRNNCKFIXXLRF AC014810 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered MLAALIYTILAILLSVLATSYICIIYGVKRRVLQPVKTKNSTEINHNAYQKYTQAPGPRP WPIIGNLHLLDRYRDSPFAGFTALAQQYGDIYSLTFGHTRCLVVNNLELIREVLNQNGKV MSGRPDFIRYHKLFGGERSNSLALCDWSQLQQKRRNLARRHCSPREFSCFYM KMSQIVARKWSTGIRELGNQLVP GEPINIKPLILKACANMFSQYMCSLRFDYDDVDFQQIVQYFDEIFWEINQGHPLDFLPWL YPFYQRHLNKIINWSSTIRGFIMERIIRHRELSVDLDEPDRDFTD ALLKSLLEDKDVSRNTIIFMLEDFIGGHSAVGNLVMLVLAYIAKNVDIGRRIQEEIDAII EEENRSINLLDMNAMPYTMATIFEVLRYSSSPIVPHVATEDTVISGYGVTKGTIVF INNYVLNTSEKFWVNPKEFNPLRFLEPSKEQSPKHF LPFSIGKRTCIGQNLVRGFGFLVVVNVMQRYNISSHNPSTIKISPESLALPADCFPLVLTPREKIGPL* AC008307 AC015141 AC007725 chromosome 3 clone BACR03D22 (D709) RPCI-98 TAFSSQWALFALSKEPRLQQRLAKERATNDSRLMHGLIKESLRLY 45293 PVAPFIGRYLPQDAQLGGHFIXXX 45230 TMVLLSLNTAGRDPSHFEQPERVLPERWCIGETEQVHKSHGSLPFAIGQRSCIGRRVALK 44986 QLHSLLGRCAAQFEMSCLNEMPVDSVLRMVTVPDRTLRLALRPR 44854 SRCRWLPPAATVTYTDDREEGAAGPAALAETPARPAPGANP*PEPLPFALRSAAFAAFSRNGTTACRRRQIRAHSKGEGT SGSWDTCGSYSRRRSHAVNP*SLTSKSRRESNRVENA*YILVSENSIKLAYAGLVQC*TIQP*LATF*KK*LSIGFSFKK VLNPTKLPKSAKRNRVFYI*NAS*DTKLY*NLTNRNI*IEISKLQRRKIC*NIQGRN*KTNHNLFDLREFNYNTYVLIYF RRKDP*FNPYILYLFSLHKYIDARHKQYGPIFRERLGGTQDAVFVSSANLMRGVFQHEGQYPQHPLPDAWTLYNQQHACQ RGLFFM*VEWTK*VR*IANVEPTKTISCLFFDGNREGAEWLHNRRILNRLLLNGNLNWMDVHIESCTRRMVDQWKRRTAE AAAIPLAESGEIRSYELPLLEQQLYRWSIEGTRRHALRADPHTK*FHPNSIHL*FCAASCLAPACSPAPRSSPRWTTSRR LCTRCLSIARD**HSRLAWPRFCACPSGGISRPMWMRCCVRELP*SITASECRRTKGDRTMRRFTIASRRRMCQAI*SSG YL*TWSLQQVTR*AIQLNCKAHY*L*LHLI*TAFSSQWALFALSKEPRLQQRLAKERAT EQVQVVTPRCHRDVHR*PRRGSGRARCAG*DTCSTSSWCESLA*ASSVRAAIRRLCSVFPQRNYRLPSPPNTCPFQG*RD FR*LGHLWIL*PPAEPRSKSVKFNLEKSERKQSC*KCIIYSCVRKLYKIGLCRTSSMLNNSTLIGNLLKKVIINRFFF*E SA*PY*ITKKCKT**SFLYLKCFMRY*IILKSHKSKYLNRNIKAAKKKNMLKYTRTKLKN*SQSLRLKRVQLQYICINIF SPKGSIV*PVYPLSV*PS*VHRCEAQAVWSHFPGAIGRYPGCSVRIVRKSHARSLPARGSVSAASAAGCLDAV*PATCLP TGTVLHVSGVDQMSPLNCKCGADENDLLSLLRWE*GGRRVAAQPTHT*STAAQRKFELDGRAY*ELYQTNGGSVEKTHCG GGGDSASGEW*NTKLRTAPVGTTALPLVHRRYKTSCIKG*SPYKMISSQLYPLVVLCCIMFGTSVLTCPKIQSSLDYFTQ IVHKVFEHSSRLMTFPPRLAQILRLPIWRDFEANVDEVLREGAAIIDHCIRVQEDQRRPHDEALYHRLQAADVPGDMIKR IFVDLVIAAGDTVSNPIELQSPLLIVVTPHLDRIQQSVGFVCPFKGAEAPATTGQGASY RAGAGGYPPLPP*RTPMTEKRERPGPLRWLRHLLDQLLVRILSLSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKG LPVVGTLVDLIAAGGATQ*IRKV*PRKVGEKAIVLKMHNIFLCPKTL*NWLMPD*FNVKQFNLNWQPFEKSDYQ*VFLLR KCLTLLNYQKVQNVIEFFISKMLHEILNYIKISQIEIFKSKYQSCKEEKYVEIYKDETKKLITISST*ESSTTIHMY*YI FAERIHSLTRISFICLAFISTSMRGTSSMVPFSGSDWAVPRMQCSYRPQISCAESSSTRVSIRSIRCRMPGRCITSNMPA NGDCSSCKWSGPNESVELQMWSRRKRSLVSSSMGIGRAPSGCTTDAYLIDCCSTEI*IGWTCILRAVPDEWWISGKDALR RRRRFR*RRVVKYEATNCPCWNNSSTVGP*KVQDVMH*GLIPIQNDFIPTLSTCSSVLHHVWHQRAHLPQDPVLAGLLHA DCAQGV*A*LATDDIPASLGPDFAPAHLAGFRGQCG*GAA*GSCHNRSLHQSAGGPKETAR*GALPSPPGGGCARRYDQA DICRLGHCSR*HGEQSN*IAKPITNCSYTSFRPHSAVSGLCLPFQRSRGSSNDWPRSELP AC015141 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered pieces Length = 21229 Score = 124 bits (308), Expect = 5e-29 Identities = 63/63 (100%), Positives = 63/63 (100%), Gaps = 13/63 (20%) Frame = +1 Query: 1 MTEKRERPGPLRWL-------------LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 47 MTEKRERPGPLRWL LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP Sbjct: 229 MTEKRERPGPLRWLRHLLDQLLVRILSLSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 408 Query: 48 RVKGLPVVGTLVDLIA 63 RVKGLPVVGTLVDLIA Sbjct: 409 RVKGLPVVGTLVDLIA 456 AC007648 Drosophila melanogaster chromosome 3 clone BACR13A02 (D705) RPCI-98 13.A.2 map 88E-88F strain y; cn bw sp, WORKING DRAFT SEQUENCE, 84 unordered pieces Length = 128775 Score = 123 bits (306), Expect = 9e-29 Identities = 62/63 (98%), Positives = 63/63 (99%), Gaps = 13/63 (20%) Frame = -1 Query: 1 MTEKRERPGPLRWL-------------LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 47 MTEKRERPGPLRW+ LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP Sbjct: 115974 MTEKRERPGPLRWMRHLLDQLLVRILSLSLFRSRCDPPPLQRFPATELPPAVAAKYVPIP 115795 Query: 48 RVKGLPVVGTLVDLIA 63 RVKGLPVVGTLVDLIA Sbjct: 115794 RVKGLPVVGTLVDLIA 115747 AC008307 Drosophila melanogaster chromosome 3 clone BACR03D22 (D709) RPCI-98 03.D.22 map 86F-87A strain y; cn bw sp, *** SEQUENCING IN PROGRESS ***, 94 unordered pieces Length = 91546 Score = 102 bits (253), Expect = 1e-22 Identities = 49/49 (100%), Positives = 49/49 (100%) Frame = +2 Query: 15 LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKGLPVVGTLVDLIA 63 LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKGLPVVGTLVDLIA Sbjct: 28868 LSLFRSRCDPPPLQRFPATELPPAVAAKYVPIPRVKGLPVVGTLVDLIA 29014 AC015396 AL099182 Cyp302a1 dib1 gene MLTKLLKISCTSRQCTF AKPYQAIPGPRGPFGMGNLYNYLPGIGSYSWLRLHQAGQDKYEKYGAIVRETIVPGQDI 32181 VWLYDPKDIALLLNERDCPQRRSHLALAQYRKSRPDVYKTTGLLPTNGPEWWRIRAQ 32352 VQKELSAPKSVRNFVRQVDGVTKEFIRFLQESRNGGAIDMLPKLTRLNLE 32502 32643 VTCLLTFGARLQSFTA QEQDPRSRSTRLMDAAETTNSCILPTDQGLQLWRFLETPSFRKLSQAQSYMESVALELVE 32870 ENVRNGSVGSSLISAYVKNPELDRSDVVGTAADLLLAGIDTTSYASAFLLYHIA 33032 RNPEVQQKLHEEARRVLPSAKDELSMDALRTDITYTRAVLKESLRLNPIAVGVGR 33197 GRGQDLNQDAIFSGYFVPKG TTVVTQNMVACRLEQHFQDPLRFQPDRWLQHRSALNPYLVLPFGHG 33441 MRACIARRLAEQNMHI 33489 LLLRLLREYELIWSGSDDEMGVKTLLINKPDAPVLIDLRLRRE* 33705 AC017770 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered pieces Length = 67513 Cyp4e4 75-203 31598 EYVKRFGRSFMGTVLGHVVMVTAEPRHIDALLQGQHQLKKGTMYFALRGWLGDGLLLSRG 31419 31418 KEWHTMRKIITPTFHFSILEQFVEVFDRQSSILVERLRTLSYGNEVVNIYPLVGLAALDI 31239 31238 ITETAMGVNVD 31206 Cyp4d2 67-206 33788 AEIMDFVKKNQRKYGRLYRVWILHQLAVFSTDPRDIEFVLSSQQHITKNNLYKLLNCWLG 33609 33608 DGLLMSTGRKWHGRRKIITPTFHFKILEQFVEIFDQQSAVMVEQLQSRADGMTPINIFP 33432 33431 VICLTALDIIAGEFA 33387 Cyp4d2 245-483 33126 RQDKAIKVMHDFTENIIRERRETLVNNSKETTPEEEVNFLGQKRRMALLDVLLQST 32959 32958 IDGAPLSDEDIREEVDTFMFEGHDTTTSAISFCLYEISRHPEVQQRLQQEIRDVLGEDRK 32779 32778 SPVTLRDLGELKFMENVIKESLRLHPPVPMIGRWFAEDVE 32656 32605 LSGGKHIPAGTNFTMGIFVLLRDPEYFESPDEFRPERFDADVPQIHPYAYIPFSAGPRN 32429 32428 CIGQKFAMLEMKSTVSKLLRHFELL 32354 Cyp4d14 67-483 36011 AAIFDMQFRLIAEFGKNIKTQMLGESGFMTADSKMIEAIMSSQQTIQKNNLYSLLVNWLG 36190 36191 DGLLISQGKKWFRRRKIITPAFHFKILEDFVEVFDQQSATMVQKLYDRADGKTVINMFPV 36370 36371 ACLCAMDIIAETAMGVKINAQLQPQFTYVQSAHLFSASAML 36550 36551 AERFMNPLQRLDFTMKLFYPKLLDKLNDAVKNMHDFTNSVITERRELLQKAIADGGDADA 36730 36731 ALLNDVGQKRRMALLDVLLKSTIDGAPLSNDDIREEVDTFMFEGHDTTTSSIAFTCYLLA 36910 36911 RHPEVQARVFQEVRDVIGDDKSAPVTMKLLGELKYLECVIKESLRLFPSVPIIGRYISQD 37090 37091 TVLDGKLIPADSNVIILIYHAQRDPDYFPDPEKFIPDRFSMERKGEISPFAYTPFSAGP 37267 37268 RNCIGQKFAMLEMKSTISKMVRHFELL 37348 more distant 4d1 exon 1 103-194 usedin most ESTs of 4d1 49069 ERVLGSSQLLTKSQEYSFLGRWLNEGLLVSNGRKWHRRRKIITPAFHFRILEPYVEIFDR 49248 49249 QSLRLVEELALRISRGQERINLGEAIHLCALDAIC 49353 closest 4d1 exon 1 118-200 50309 YKALEPWLKEGLLVSRGRKWHKRRKIITPAFHFKILDQFVEVFEKGSRDLLRNME-QDRL 50485 50486 KHGDSGFSLYDWINLCTMDTICGRDLGL 50569 Cyp4d1 192-512 50677 SIPETAMGVSINAQSNADSEYVQAVKTISMVLHKRMFNILYRFDLTYMLTPLARAEKKAL 50856 50857 NVLHQFTEKIIVQRREELIREGSSQESSNDDADVGAKRKMAFLDILLQSTVDERPLSNLD 51036 51037 IREEVDTFMFEGHDTTSSALMFFFYNIATHPEAQKKCFEEIRSVVGNDKSTPVSYELLNQ 51216 51217 LHYVDLCVKETLRMYPSVPLLGRKVLEDCEISDGKLIPAGT 51396 51397 NIGISPLYLGRREELFSEPNIFKPERFDVVTTAEKLNPYAYIPFSAGPRNCIGQKFAMLE 51576 51577 IKAIVANVLRHYEVDFVGDSSEPPVLIAELILRTKEPLMFKVRER 51711 AC017388 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered Cyp4d8 74-212 38766 REYVLKFGHLQRVWIFNRLLIMSGDAELNEQLLSSQEHLVKHPVYKVLGQWLGNGLLLSD 38587 38586 GKVWHQRRKIITPTFHFSILEQFVEVFDQQSNICVQRLAQKANGNTFDVYRSICAAA 38416 38415 LDIIAETAMGTKIYAQANESTPY 38347 Cyp4d8 250-512 38155 IRTMQEFTIKVIEKRRQALEDQQSKLMDTADEDVGSKRRMALLDVLLMSTVDGRPLTN 37982 37981 DEIREEVDTFMFEGHDTTTSALSFCLHELSRHPEVQAKMLEEIVQVLGTDRSRPVSIRDL 37802 37801 GELKYMECVIKESLRMYPPVPIVGRKLQTDFKVHGD 37622 37621 GVIPAGSEIIIGIFGVHRQPETFPNPDEFIPERHENGSRVAPFKMIPFSAGPRNCIGQ 37448 37447 KFAQLEMKMMLAKIVREYELLPMGQRVECIVNIVLRSETGFQLGMRKR 37304 AC017371 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered 1st gene with annotation MDTFQLLLAVGVCFWIYFLWSRRRLYMMHFKIPGPMGLPILGIAFEYLITYK* R at intron = CGT RKMSIRTKYMDIYGSTCLVWVGPTPFVITRDPKIAEEIFLSPECLNRSSIFSKPVNSCTGDGLLSLE* G at intron = GGT ASKWVDRRKNLNPAFKQNVLLSFLPIFNSEAKTLVAFLDSLVGQGEKKVRDDIVRWSFRIATRK* RK at intron = CGTAAGT Intron = ACAGAA QTTVGTDVKKDASFKNDSVLKSYET T = ACGT FMKIIVMNVLLPFTHNKIFSTLGGFETQKALAKSNVNKMIGTV 1 R = AGG RFMKIIVMNVLLPFTHNKIFSTLGGFETQKALAKSNVNKMIGTV 2 R = AGG RMLRLTIINIFVPFVQNKIVSKLFGLEWLRRRDASAINKMINNV 3 S = AGC SLNNLIPIGVVMPWLRNKYLGKLFSYEKRRLEAATQSNAFIKDV IVDKKLMTKPESGSQPEITSVINKAIELHRNGEMSREEVQSECCSFVVAAFETTGDTVYHALILLAMFPEHQDTVYQELK ELFPVAGDFEVTYDDLQRMVFLERVVNETLRLIPSVPFTPRETIRDFRLSSGVVIPKGVGIGIDIFATHRNRDHWGTDPS SFNPDHFLPDNVRDRHPYAYIPFSKGRRNCI GWKYGLMSSKLALSKILRNCKVSTSFRYEDLEFVDNIGMELAQSPGLEFHRRT* AC017371 1st gene 1890-3720 AC007724 AL072640 AL074717 AL076582 MDTFQLLLAVGVCFWIYFLWSRRRLYMMHFKIPGPMGLPILGIAFEYLITYK RKMSIRTKYMDIYGSTCLVWVGPTPFVITRDPKIAEEIFLSPECLNRSSIFSKPVNSCTGDGLLSLE ASKWVDRRKNLNPAFKQNVLLSFLPIFNSEAKTLVAFLDSLVGQGEKKVRDDIVRWSFRIATR QTTVGTDVKKDASFKNDSVLKSYET FMKIIVMNVLLPFTHNKIFSTLGGFETQKALAKSNVNKMIGTV IVDKKLMTKPESGSQPEITSVINKAIELHRNGEMSREEVQSECCSFVVAAFETTGDTVYHALILLAMFPEHQDTVYQELK ELFPVAGDFEVTYDDLQRMVFLERVVNETLRLIPSVPFTPRETIRDFRLSSGVVIPKGVGIGIDIFATHRNRDHWGTDPS SFNPDHFLPDNVRDRHPYAYIPFSKGRRNCI GWKYGLMSSKLALSKILRNCKVSTSFRYEDLEFVDNIGMELAQSPGLEFHRRT* 2nd gene with annotation MIVIQLLIAASLILWIRFLWSRRKLYMLMMQLPGRMGLPLLGNSVRYLIISR G at intron = GGT next joint = AGGC so joint =GGC = G GRMSSRTTYMDKHGSTYMAWIGTTPIVITRDPKIAEKVLTSPFCINRSSQTTNALALSMGYGLLTLQ G at intron = GGT AGSKWMARRKHMNPAFKHSVLLSFLPIFNAETDLLVSVFDSFVGQGEKDVLSDLIRWSFAIATRK* QTTLGTDVTKDDNFENDAILKTYQS S = TCGT 2 MLRLTIINIFVPFVQNKIVSKLFGLEWLRRRDASAINKMINNV ILDKKLNSNPENYCESELKTVIHRAIELFRNDEMSLMELGAECSSMVLAAFETSAHTVYYALVLLAMFPEHQEMVFNEIK EHFPLAKGIEVTHTDLQQLVYLDRVLNETLRLMPSVPFSSRETLEDLRLSNGVVIPKGMTISIDIFNTQRNTDYWGSEAA QFNPENFLPEKIHDRHPYAFIPFSKGKRNCI GWRYGLMSSKLALVKILRNYKLKTSFPYENLEFVDHMVIKLAQSPQLAFERRTL* AC017371 2nd gene 4100-5929 AC007724 AL057969 AL067059 MIVIQLLIAASLILWIRFLWSRRKLYMLMMQLPGRMGLPLLGNSVRYLIISR GRMSSRTTYMDKHGSTYMAWIGTTPIVITRDPKIAEKVLTSPFCINRSSQTTNALALSMGYGLLTLQ AGSKWMARRKHMNPAFKHSVLLSFLPIFNAETDLLVSVFDSFVGQGEKDVLSDLIRWSFAIATR QTTLGTDVTKDDNFENDAILKTYQS MLRLTIINIFVPFVQNKIVSKLFGLEWLRRRDASAINKMINNV ILDKKLNSNPENYCESELKTVIHRAIELFRNDEMSLMELGAECSSMVLAAFETSAHTVYYALVLLAMFPEHQEMVFNEIK EHFPLAKGIEVTHTDLQQLVYLDRVLNETLRLMPSVPFSSRETLEDLRLSNGVVIPKGMTISIDIFNTQRNTDYWGSEAA QFNPENFLPEKIHDRHPYAFIPFSKGKRNCI GWRYGLMSSKLALVKILRNYKLKTSFPYENLEFVDHMVIKLAQSPQLAFERRTL* 3rd gene with annotations MLTLQIFEAFAIILCVYFLWSRRRFYIMMLKLPGPMGFPFIGLAFEYIRLKRK R at intron = CGT G at other end of intron is AGGT so joint = CGT = R RKIRLRTILFKIYGKTVLTWIGLTPVLVTCEPKILEDIFTSPNCSNRSSVVDKAISSCLGLGLLTLK S at intron = AGT D at intron = AGAT so joint = AAT = N NNHWNERRKLLLPSFKNNAVLSFVPVLNNEANFLVTLLAEFVDGGDINLLPELNKWSFKIAARK* QITMGDEVRNQANYQNGNLLESYKA A = GCGT LNNLIPIGVVMPWLRNKYLGKLFSYEKRRLEAATQSNAFIKDV IIDKKLSSTDNSSEPALIDRILNLVRIGELSYDDVMGEFSNIIFAASDTLSITVNNVLILMAMFPKYQDNVFEELAEVFP SGGEFEASHADLEKLVKLDRVLHETMRLIPAVPLLIRQTSHSIQLSNGFYIPEGVTLMIDIFHTHRNKDIWGPQANAFNP DNFLPENKRARPPYSYLPFSKGKKTCL GWKLSLISAKLALAKILRNYMLSTTFLYKDLRFIDNTTMKLAEQPLLAVKRRI* AC017371 6352-8145 3rd gene AC007724 AL066602 MLTLQIFEAFAIILCVYFLWSRRRFYIMMLKLPGPMGFPFIGLAFEYIRLKRK RKIRLRTILFKIYGKTVLTWIGLTPVLVTCEPKILEDIFTSPNCSNRSSVVDKAISSCLGLGLLTLK NNHWNERRKLLLPSFKNNAVLSFVPVLNNEANFLVTLLAEFVDGGDINLLPELNKWSFKIAAR QITMGDEVRNQANYQNGNLLESYKA LNNLIPIGVVMPWLRNKYLGKLFSYEKRRLEAATQSNAFIKDV IIDKKLSSTDNSSEPALIDRILNLVRIGELSYDDVMGEFSNIIFAASDTLSITVNNVLILMAMFPKYQDNVFEELAEVFP SGGEFEASHADLEKLVKLDRVLHETMRLIPAVPLLIRQTSHSIQLSNGFYIPEGVTLMIDIFHTHRNKDIWGPQANAFNP DNFLPENKRARPPYSYLPFSKGKKTCL GWKLSLISAKLALAKILRNYMLSTTFLYKDLRFIDNTTMKLAEQPLLAVKRRI* AC017740 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered 271-461 = AC007928 168661 QVVNKKVNPLPKTDSDPESNIVINRAMELYRKGDITYMDVKSECCIMIAAGYDTSALTVY 168482 168481 HALFLLANHPEHQEAVFEELNGVFPDAGHFGITYPDMQKLDYLERVIKETLRLIPAIPI 168305 168304 TARETKNDVRLSNGVLIPKGVVIGIDMFHTHRNPEVWGPDADNFNPDNFLAENMEQKHPY 168125 168124 AYIPFARGKRNCIG 168083 AC017250 87-512 = AI404831 38209 YLLFRPAVLIRDADLARRVLAQDFASFHDRGVYVDEERDPLSANIFSLRGQSWRSMRHML 38388 38389 SPCFTSGKLKSMFSTSEDIGDKMVAHLQKELPEEGFKEVDIKKVMQNYAIDIIASTIF 38562 38563 GLDVNSFENPDNKFRKLVSLVSNGCSCRI 38742 38743 AQFLFRIGFKNPVGLAMLQIVKETVEYREKHGIVRKDLLQLLIQLRNTGKIDENDE 38910 38911 KSFSIQKTLTFSGHIKTISLEAITAQAFISYIAGQETTGST 39087 39088 AAFTIYELAQYPELLKRLQDEVDETLAKNDGKITYDSLNKMEFLDLCVQETIRKYPGLPI 39267 39268 LNRECTQDYTVPDTNHVIPKGTPVVISLYGIHHDAEYFPDPETYDPERFSEESRNYNPT 39444 39445 AFMPFGEGPRICIAQRMGRINSKLAIIKILQNFNVEVMSRSEIEFENSGIALIPKHGVR 39621 39622 VRLSKR 39639 AC017648 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered Cyp9h1 306-485 9674 DELYYSEFTADDLLAQCLLFFFAGFEIISSSLCFLTHELCLNPTVQDRLYEEIISVHEE 9850 9851 LKGQPLTYDKLTKMKYLDMVVLEALRKWPPSISTDRECRQDIDLFDENGQKLFSARKGDV 10030 10031LQIPIFSLHHDPENFEDPEFFNPERFADGHALESRVYMPFGVGPRNCIGNRMALMELK 10204 10205SIVYQLLLNFKLLPA 10249 AC018013 = AC009382 EST AI064680 37224 MSALIFLCAILIGFVIYSLISSARRPKNFPP 37132 GPRFVPWLGNTLQ 35956 FRKEASAVGGQHILFERWAKDFRSDLVGLKLGREYVVVALGHEMVKEVQLQEVFEGR 35786 35785 PDNFFLRLRTMGTRKGITCTDGQLWYEHRHFAMKQMRNVGYGRSQMEHHIELEAEELLGQ 35606 35605 LERTEEQPIEPVTWLAQSVLNVLWCLIAGKRIARQEDGTLRRLLDLMNRRSKLFDIC 35435 35434 GGLLAQFPWLRHVAPDRTGYNLIQQLNTELYGFFMDTIEEHRRQLAKDPSPAESDLIYA 35258 35257 YLQEMKDRSAGGESSTFNETQLVMTILDFFIAGSQTTSNTINLALMVLAMRPDVQEKLFS 35078 35077 QVTASVAAASTDAFPHLSRREAFDYMDAFIMEVQRFFHITPITGPRRALWATKLGGYDIP 34898 34897 KNATILISLRSVHLDKEHWKDPLEFRPERFIDSAGKCFKDEYFMPFGMGRRRCLGDALAR 34718 34717 ACIFSFLVRIVQHFSVVLPAGESPSMVLLPGITLTPKPYKVQFVKRT* 34574 AC017306 = L49408 CYP2 like 12933 MFYTVIWIFCATLLAILFGG 12993 VRKPKRFPPGPAWYPIVGSALQVSQ LRCRLGMFCKVIDVFARQYVNPYGFYGLKIGKDKV 13172 13173 VIAYTNDAISEMMTNEDIDGRPDGIFYRLRTFNSRLGVLLTDGEMWVEQRRFILRHLKNF 13352 13353 GFARSGMMDIVHNEATCLLQDLKDKVLKSGGKQTRIEMHDLTSVYVLNTLWCMLSGRRYE 13532 13533 PGSPEITQLLETFFELFKNIDMVGALFSHFPLLRFIAPNFSGYNGFVESHRSLYTFMSKE 13712 13713 IELHRLTYKNYDEPRDLMDSYLRAQDEGNDEKGMFSDQSLLAICLDMFLAGSETTNKSL 13889 13890 GFCFMHLVLQPEIQERAFQEIKEVVGLERIPEWSRDRTKLPYCEAITLEAVRMFMLHTFG 14069 14070 IPHRAVCDTRLSGYEIPKDTMVIACFRGMLINPVDFPDPESFNPDRYLFDGHLKLPEAF 14246 14247 NPFGFGRHRCMGDLLGRQNLFMFTTTVLQNFKMVAIPGQVPEEVPLEGATAAVKPYDIM 14423 14424 LVAREQ* 14444 AC017771 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered = AC008325a 26507 MWIALLGIPILLAVLTLLLKHINKTYFILSLTKRVRTEDGSPLESKVAIMPGKTRFGNNLDIL 26319 26318 NFTTASVFNFVRESTAKAKGQNYLWYFLYAPMYNVVRPEE 26139 26138 AEEVFQSTKLITKNVVYELIRPFLGDGLLISTGK 25959 25958 GFNARLSLQCPFREECKKFLNVLEKNLDAELELNQ 25779 25778 VIPPFTLNNIC 25746 25686 ETALGVKLDDMSEGNEYRKAIHAIEEVLIQRVCNPLMYYNWYFFVYGDYRKHLQNLR 25516 25515 IVHDFSSRIIERKRQQFQQKQLGEVDEFGRKQRYAMLDTLLAAEADGQIDHQGIC 25351 25350 DEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEVENLPEDSDDISMFQFNKLVY 25174 25173 LECVIKESLRMFPSVPFIGRQCVEETVVNGMVMPKDTQISIHIYDIMRDPRHFPKPDLF 24997 24996 QPDRFLPENTVNRHPFAYVPFSAGQRNCIGQKFAILEMKVLLAAVIRNFKLLPATQLEDL 24817 24816 TFENGIVLRTQENIKVKLSKRVK* 24745 16-194 24052 WAHLNRTYFILSLCKRIRTEDGSLLESKIYVAPSKTRFGNNFDLVNFTSG 23698 YDLRAFETVFGRRTAD*HRYFYIFILNSYYFKEF*TNLDQKWHSREKP*LLLFTLRCCNL 23519 23518 FLSIFREECNKLVKVLHQSVNMELELNQVIPQFTLNNVC 23348 69-134 = AC008324 gene 1 23841 IFNFMRDASAKAKGRNYLWYFFHAPMYNIVRAEEAEEILQSSKLITKNMIYELLKPFLGE 23662 23661 GLLISTG 23641 190-512 = AC008324 gene 1 23179 INVIAETALGVKLDDLSEGIRYRQSIHAIEEVMQQRLCNPFFYNIVYFFLFGDYRKQVN 23003 23002 NLKIAHEFSSNIIEKRRSLFKSNQLGQEDEFGKKQRYAMLDTLLAAEADGQIDHQ 22838 22837 GICDEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEIKYLPDDSDDISVFQFNE 22661 22660 LVYMECVIKESLRLFPSVPFIGRRCVEEGVVNGLIMPKNTQINIHLYEIMRDARHFSNP 22484 22483 KMFQPDRFFPENTVNRHPFAFVPFSAGQRNCIGQKFAILEIKVLLAAVIRNFKILPVTLL 22304 22303 DDLTFENGIVLRTKQNIKVKLVHR 22232 AC017771 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered pieces Length = 49626 Score = 250 bits (632), Expect = 1e-65 Identities = 133/321 (41%), Positives = 204/321 (63%) Frame = -1 Query: 186 ETAMGIQMNAQEESESEYVKAVYEISELTMQRSVRPWLHPKVIFDLTTMGKRYAECLRIL 245 ETA+G++++ E +EY KA++ I E+ +QR P ++ F + +++ + LRI+ Sbjct: 25686 ETALGVKLDDMSEG-NEYRKAIHAIEEVLIQRVCNPLMYYNWYFFVYGDYRKHLQNLRIV 25510 Query: 246 HGFTNKVIQERKSLRQMTGMKPTISNEEDELLGKKKRLAFLDLLLEASENGTKMSDTDIR 305 H F++++I+ ++ Q + E DE G+K+R A LD LL A +G ++ I Sbjct: 25509 HDFSSRIIERKRQQFQQKQL-----GEVDEF-GRKQRYAMLDTLLAAEADG-QIDHQGIC 25351 Query: 306 EEVDTFMFEGHDTTSAGICWALFLLGSHPEIQDKVYEELDHIFQGSDRSTTMRDLADMKY 365 +EV+TFMFEG+DTTS + + L +L H ++Q K YEE++++ + SD +M + Y Sbjct: 25350 DEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEVENLPEDSD-DISMFQFNKLVY 25174 Query: 366 LERVIKESLRLFPSVPFIGRVLKEDTKIGDYLVPAGCMMNLQIYHVHRNQDQYPNPEAFN 425 LE VIKESLR+FPSVPFIGR E+T + ++P +++ IY + R+ +P P+ F Sbjct: 25173 LECVIKESLRMFPSVPFIGRQCVEETVVNGMVMPKDTQISIHIYDIMRDPRHFPKPDLFQ 24994 Query: 426 PDNFLPERVAKRHPYAYVPFSAGPRNCIGQKFATLEEKTVLSSILRNFKVRSIEKREDLT 485 PD FLPE RHP+AYVPFSAG RNCIGQKFA LE K +L++++RNFK+ + EDLT Sbjct: 24993 PDRFLPENTVNRHPFAYVPFSAGQRNCIGQKFAILEMKVLLAAVIRNFKLLPATQLEDLT 24814 Query: 486 LMNELILRPESGIKVELIPRL 506 N ++LR + IKV+L R+ Sbjct: 24813 FENGIVLRTQENIKVKLSKRV 24751 Score = 249 bits (628), Expect = 3e-65 Identities = 132/326 (40%), Positives = 203/326 (61%) Frame = -3 Query: 180 ALDIICETAMGIQMNAQEESESEYVKAVYEISELTMQRSVRPWLHPKVIFDLTTMGKRYA 239 A+++I ETA+G++++ E Y ++++ I E+ QR P+ + V F L ++ Sbjct: 23182 AINVIAETALGVKLDDLSEG-IRYRQSIHAIEEVMQQRLCNPFFYNIVYFFLFGDYRKQV 23006 Query: 240 ECLRILHGFTNKVIQERKSLRQMTGMKPTISNEEDELLGKKKRLAFLDLLLEASENGTKM 299 L+I H F++ +I++R+SL K +EDE GKK+R A LD LL A +G ++ Sbjct: 23005 NNLKIAHEFSSNIIEKRRSL-----FKSNQLGQEDEF-GKKQRYAMLDTLLAAEADG-QI 22847 Query: 300 SDTDIREEVDTFMFEGHDTTSAGICWALFLLGSHPEIQDKVYEELDHIFQGSDRSTTMRD 359 I +EV+TFMFEG+DTTS + + L +L H ++Q K YEE+ ++ SD + + Sbjct: 22846 DHQGICDEVNTFMFEGYDTTSTCLIFTLLMLALHEDVQKKCYEEIKYLPDDSDDISVFQ- 22670 Query: 360 LADMKYLERVIKESLRLFPSVPFIGRVLKEDTKIGDYLVPAGCMMNLQIYHVHRNQDQYP 419 ++ Y+E VIKESLRLFPSVPFIGR E+ + ++P +N+ +Y + R+ + Sbjct: 22669 FNELVYMECVIKESLRLFPSVPFIGRRCVEEGVVNGLIMPKNTQINIHLYEIMRDARHFS 22490 Query: 420 NPEAFNPDNFLPERVAKRHPYAYVPFSAGPRNCIGQKFATLEEKTVLSSILRNFKVRSIE 479 NP+ F PD F PE RHP+A+VPFSAG RNCIGQKFA LE K +L++++RNFK+ + Sbjct: 22489 NPKMFQPDRFFPENTVNRHPFAFVPFSAGQRNCIGQKFAILEIKVLLAAVIRNFKILPVT 22310 Query: 480 KREDLTLMNELILRPESGIKVELIPR 505 +DLT N ++LR + IKV+L+ R Sbjct: 22309 LLDDLTFENGIVLRTKQNIKVKLVHR 22232 Score = 247 bits (624), Expect = 9e-65 Identities = 167/453 (36%), Positives = 253/453 (54%), Gaps = 59/453 (13%) Frame = -2 Query: 54 VPRNKLFQVFDRRAKLYGPLYRIWAGPIA-QVGLTRPEHVELILRDTKHIDKSLVYSFIR 112 + N + + AK G Y IW A + + R E E I + TK K++ Y IR Sbjct: 21749 ITANIFSYIRESTAKANGQNY-IWNFLFAPEYNIVRAEDAEEIFQSTKITTKNMSYELIR 21573 Query: 113 PWLGEGLL--------------------TGTGAKWHSHRKMITPTFHFKILDIFVD---- 148 P+LG+GLL T G + + + +F+ Sbjct: 21572 PFLGDGLLISIGE*SYIIVNI**SHSF*TRNGTQEEKL*PQLFILIFCSLFSLFLSK*E* 21393 Query: 149 --------------VFVEKSEILVKKLQSKVGGKDFDIYPFITHCALDIIC--------- 185 +F E+S+ +K L VG + ++ I L+ IC Sbjct: 21392 RFLTYYLFK*SIILIFREESKKFIKILDKNVGF-ELELNQIIPQFTLNNICGKHNHLFIY 21216 Query: 186 -----------ETAMGIQMNAQEESESEYVKAVYEISELTMQRSVRPWLHPKVIFDLTTM 234 ETA+G++++ E +EY KA+++ + QR P + F L Sbjct: 21215 EIVLLTIFSSIETALGVKLDDMSEG-NEYRKAIHDFEIVFNQRMCNPLMFFNWYFFLFGD 21039 Query: 235 GKRYAECLRILHGFTNKVIQERKSLRQMTGMKPTISNEEDELLGKKKRLAFLDLLLEASE 294 K+Y+ LR +HGF++ +IQ ++ K + DE GKK+R A LD LL A Sbjct: 21038 YKKYSRILRTIHGFSSGIIQRKRQQ-----FKQKQLGQVDEF-GKKQRYAMLDTLLAAEA 20877 Query: 295 NGTKMSDTDIREEVDTFMFEGHDTTSAGICWALFLLGSHPEIQDKVYEELDHIFQGSDRS 354 G K+ I +EV+TFMF G+DTTS + + L LL H ++Q++ YEEL + + D Sbjct: 20876 EG-KIDHQGICDEVNTFMFGGYDTTSTSLIFTLLLLALHADVQERCYEELQDLPEDIDE- 20703 Query: 355 TTMRDLADMKYLERVIKESLRLFPSVPFIGRVLKEDTKIGDYLVPAGCMMNLQIYHVHRN 414 +M ++ +LE VIKESLRLFPS P IGR E++ + ++P +++ IY + R+ Sbjct: 20702 VSMFQFNELIHLECVIKESLRLFPSAPIIGRTCIEESVMNGLVLPKNAQISIHIYDIMRD 20523 Query: 415 QDQYPNPEAFNPDNFLPERVAKRHPYAYVPFSAGPRNCIGQKFATLEEKTVLSSILRNFK 474 +P P F P+ FLPE RHP+A+VPFSAGPRNCIGQKF LE K +L++++RNFK Sbjct: 20522 ARHFPKPNQFLPERFLPENSVNRHPFAFVPFSAGPRNCIGQKFGVLEIKVLLAAVIRNFK 20343 Query: 475 VRSIEKREDLTLMNELILRPESGIKVELIPRL 506 + + EDLT N ++LR + IKV+ R+ Sbjct: 20342 LLPATQLEDLTFENGIVLRTQQNIKVKFEARV 20247 21749 ITANIFSYIRESTAKANGQNYIWNFLFAPEYNIVRAEDAEEIFQSTKITTKNMSYELIR 21573 21572 PFLGDGLLISIGTQEEKL*PQLFILIFCSLFSLFLS IFREESKKFIKILDKNVGFELELNQIIPQFTLNNIC ETALGVKLDDMSEGNEYRKAIHDFEIVFNQRMCNPLMFFNWYFFLFGD 21039 21038 YKKYSRILRTIHGFSSGIIQRKRQQFKQKQLGQVDEFGKKQRYAMLDTLLAAEA 20877 20876 EGKIDHQGICDEVNTFMFGGYDTTSTSLIFTLLLLALHADVQERCYEELQDLPEDIDE 20703 20702 VSMFQFNELIHLECVIKESLRLFPSAPIIGRTCIEESVMNGLVLPKNAQISIHIYDIMRD 20523 20522 ARHFPKPNQFLPERFLPENSVNRHPFAFVPFSAGPRNCIGQKFGVLEIKVLLAAVIRNFK 20343 20342 LLPATQLEDLTFENGIVLRTQQNIKVKFEARV 20247 Score = 48.8 bits (114), Expect(2) = 7e-11 Identities = 31/92 (33%), Positives = 46/92 (49%), Gaps = 3/92 (3%) Frame = -2 Query: 33 LVNKLPGPTAYPV---VGNAIEAIVPRNKLFQVFDRRAKLYGPLYRIWAGPIAQVGLTRP 89 ++N PG Y + V N + + + V + AK G Y + + RP Sbjct: 26324 ILNFTPGSFCYSMRL*VYNYFSSFITASVFNFVRESTAKAKGQNYLWYFLYAPMYNVVRP 26145 Query: 90 EHVELILRDTKHIDKSLVYSFIRPWLGEGLLTGTG 124 E E + + TK I K++VY IRP+LG+GLL TG Sbjct: 26144 EEAEEVFQSTKLITKNVVYELIRPFLGDGLLISTG 26040 Score = 44.1 bits (102), Expect(2) = 1e-06 Identities = 23/61 (37%), Positives = 33/61 (53%) Frame = -1 Query: 64 DRRAKLYGPLYRIWAGPIAQVGLTRPEHVELILRDTKHIDKSLVYSFIRPWLGEGLLTGT 123 D AK G Y + + R E E IL+ +K I K+++Y ++P+LGEGLL T Sbjct: 23823 DASAKAKGRNYLWYFFHAPMYNIVRAEEAEEILQSSKLITKNMIYELLKPFLGEGLLIST 23644 Query: 124 G 124 G Sbjct: 23643 G 23641 Score = 39.9 bits (91), Expect(2) = 7e-11 Identities = 14/25 (56%), Positives = 18/25 (72%) Frame = -3 Query: 126 KWHSHRKMITPTFHFKILDIFVDVF 150 KWHS RK +TP FHF +L F+ +F Sbjct: 25978 KWHSRRKALTPAFHFNVLQSFLGIF 25904 Score = 30.1 bits (66), Expect(2) = 1e-06 Identities = 11/20 (55%), Positives = 15/20 (75%) Frame = -2 Query: 131 RKMITPTFHFKILDIFVDVF 150 RK +TP FHFK+L F+ +F Sbjct: 23564 RKALTPAFHFKVLQSFLIIF 23505 AC018294 Drosophila melanogaster, *** SEQUENCING IN PROGRESS ***, in ordered pieces Length = 96075 36-529 44% to 12A2 probable mito seq. 1755 MYHTLMLNIGFICRMSTQK* 1814 CANNOT IDENTIFY THE N-TERMINAL 2205 ATAVNLEEAKPYADIPGPSKLQLIRAFLPGGK 2369 GLYKNLPVHEMFLDMNRQYGSIFRMPSVAGTDLVLTMNPQDYEVIFRNEGQYPYRRSFE 2545 2546 VMDYFKRVHRREVFDGYDGLTSGNGPAWGKMRTAVNPILLQPRN 2725 2726 AKLYMTNLVQVSDEFLER* 2776 2838 IRIIRDPVTQEMPDDFAVDIRHLVIESICSVALNTHLGLLGEQRNNKDIQKLVLALQDVV 3017 3018 ELGFQLDIMPAFWKYLPMPNFKKLMRSLDTITDFCYFHIGNALKRIEEDAKAGTLNEIGL 3197 3198 ETSLLEKLARFDRQTAVIIAMDLLFAGADPVSLTLGGILFS 3377 3378 LSKSPDKQARLLEEIRGILPNKDSSLTIENMRNLPYLRACIKEGIRMYPIGPGTLRRMPH 3557 3558 DVVLSGYRVVAGTDVGIAANYQMANMEQFVPKVREFIPERWLRDESNSHLVGE 3716 3717 TATPFMYLPFGFGPRSCAGKRIVDMMLEIAISRLVRNFKIGFDYPIENAFKAQFFVQPNI 3896 3897 PFKFKFIERNE* 3932 MLKVRSALSLIQSQKATLSLATQK* RWQTNVATAEAREDSEWLQAKPFEQIPRLNMWALSMKMSMPGGK ||| || | |||| 2070 MLSTQWNANKQISRQIYQLCRGLAQKVRKIYFKYIFIL*NLEEAKPYADIPGPSKLQL IRAFLPGGK MNTLSSARSVAIYVGPVRSSRSASVLAHEQAKSS an EST like this seq NTLSSARSVAIYVGPVRSSRSASVLAHEQAKSSITEEHKTYDEIPRPNKFKFMRAFMPGGEFQNASITEYTSAMRKRYGD IYVMPGMFGRKDWVTTFNTKDIEMVFRNEGIWPRRDGLDSIVYFREHVRPDVYGEVQGLVASQNEAWGKLRSAINPIFMQ PRGLRMYYEPLSNINNEFIERIKEIRDPKTLEVPEDFTDEISRLVF 54F 12a4 MLKVRSALSLIQSQKATLSLATQKRWQTNVATAEAREDSEWLQAKPFEQIPRLNMWALSMKMSMPGGK 55F 12a5 MLKGRIALNILQSQKPIVFSASQQRWQTNVPTAEIRNDPEWLQAKPFEEIPKANILSLFAKSALPGGK 56F 12b2 MWKYSNKI(36aa)KSRNLFTNNGYICSQTQLELADSRIDEKWQQARSFGEIPGPSLLRMLSFF-MPGGK 57F AC008187 ??TKRNRMNTLSSARSVAIYVGPVRSSRSASVLAHEQAKSSITEEHKTYDEIPRPNKFKFMRAF-MPGGE AC018294 NLEEAKPYADIPGPSKLQLIRAF-LPGGK