Jan. 25, 2000 N-terminal sequences of all 90 Drosophila melanogaster P450 proteins/genes. 90 N-terminal sequences from Drosphila melanogaster have been listed here. This includes two different N-terminals for the 4d1 gene, that has alternative splicing. Cyp6a15p has no N-terminal and is not included here. There are three other pseudogenes Cyp307a2p, Cyp6t2p and Cyp9f3p. All 86 other P450s are full length. Cyp12e1 AC018294 has an N-terminal in the genomic sequence, but it is not obvious. I chose The first 38 amino acids based on a best guess for exon 1 with acceptable GT-AG boubdaries. An effort has been made to align the P(hyd)(hyd)GN motif at the right end. This was not possible for all sequences, since this motif is not seen in every P450. 317a1 and 318a1 may not be aligned correctly. See FASTA file for complete sequence info and accession numbers. F after the number indicates full length sequence is known (or alternate splice). 1F 4c3 MAESILLSKVGQVISGYSPITVFLLGSILIFLVVYNKRRSRLVKYIEKIPGPAAMPFLGNA-IEMNV 2F 4aa1 MHLRLLSPPQLERTTNLELCSILILLVISLSIYTFYATLNTYLRSVLLSLRLTGPPSLPFLGNC-MLVTD 3F 4g1 MLTTLVGTLVAMALYEYWRR-NSREYRMVANIPSPPELPILGQA-HVAAG 4F 4g15 MEVLKKDAALGSPSSVFYFLLLPTLVLWYIYWRLSRAHLYRLAGRLPGPRGLPIVGHL-FDVIG 5F 4s3 AC015418 MSTLALVAFVLWAAFLRYLPKILNFLRLQRFAKTLPGPTIGEL-IANVK 6F 4p1 AI405947 MIILWLILALSALLYWLHRANKDYHILSFFTKRIRLKDGTPVEIIAPIAKGKTIFGNT-LDLYG 7F 4p2 AI389883 MICLLWISVAILVVIHWIYKVNKDYNILAFFARRVQTKDGKPLDSLVPMIKGRTVFANC-FDLLG 8F 4p3 AC008186 MLILWLVGAFIVLIQWIYRLNRDYCILGFFAKRIRTKNGQNPESIAPLVKGSTIFANS-FDLYG 9F 4ac1 AC008325aMWIALLGIPILLAVLTLLLKHINKTYFILSLTKRVRTEDGSPLESKVAIMPGKTRFGNN-LDILN 10F 4ac3 AC017771 MWIALLGSSLLIGALWLLLRQLNKTYFILSLCKRVRTADGSPLESKVFVVPGKTRFGNN-LDLLN 11F 4ac2AC008324aMFLEVLFAAPLVIFIFRKLWAHLNRTYFILSLCKRIRTEDGSLLESKIYVAPSKTRFGNN-FDLVN 12F 4d1 MWLLLSLVLLLAIIALEMRRFLRNMRTIPGPLPLPLLGNA-HIFLG 13F 4d1 alt splice MFLVIGAILASALFVGLLLYHLKFKRLIDLISYMPGPPVLPLVGHG-HHFIG 14F 4d2 MLGVVGVLLLVAFATLLLWDFLWRRRGNGILPGPRPLPFLGNL-LMYRG 15F 4d14 MYLELFAILLATALAWDYMRKRRHNKMYAEAGIRGPKSYPLVGNA-PLLIN 16F 4d8 MLLFLLVVLLFGAGWIIHLGQADRRRKVANLPGPICPPLIGAM-QLMLR 17F 4d20 AI106625 MWLTLITGALILLLTWDFGRKRQRVLAFEKSAIPGPISIPILGCG-LQALH 18F 4d21 AC004717 MWILLGIAVLIMTLVWDNSRKQWRVNTFEKSRILGPFTIPIVGNG-LQALT 19F 4e1 MWIVLCAFLALPLFLVTYFELGLLRRKRMLNKFQGPSMLPLVGNA-HQMGN 20F 4e2 MWFVLYIFLALPLLLVAYLELSTFRRRRVLNKFNGPRGLPLMGNA-HQMGK 21F 4e3 MWLAVLALLVLPLITLVYFERKASQRRQLLKEFNGPTPVPILGNA-NRIGK 22F 4ae1 (old 4e4) MLVVLLVALLVTRLVASLFRLALKELRHPLQGVVPSVSRVPLLGAA-WQMRS 23F 4ad1 AC005451 MFLIAIAIILATILVFKGVRIFNYIDHMAGIMEMIPGPTPYPFVGNL-FQFGL 24F 303a1 L49408 AC017306 MFYTVIWIFCATLLAILFGGVRKPKRFPPGPAWYPIVGSA-LQVSQ 25F 18a1 MLADSYLIKFVLRQLQVQEDGDAQHLLMVFLGLLALVTLLQWLVRNYRELRKLPPGPWGLPVIGYL-LFMGS 26F 304a1 AC014292 AI257867 MITETLLTICAAVFLCLSYRYAVGRPSGFPPGPPKIPLFGSY-LFMLI 27F 305a1 AC009382 AI064680AC018013MSALIFLCAILIGFVIYSLISSARRPKNFPPGPRFVPWLGNT-LQFRK 28F 306a1 MSADIVDIGHTGWMPSVQSLSILLVPGALVLVILYLCERQCNDLMGAPPPGPWGLPFLGYL-PFLDA 29F 307a1 AC014810 MLAALIYTILA(15aa)GVKRRVLQPVKTKNSTEINHNAYQKYTQAPGPRPWPIIGNL-HLLDR 30p 307a2p MLTSVFYVLFAIAITIILISYVFLLLKCKQKAFVVIGLLYQEKKYQCFDQAPGPHPWPIIGNI-NLLGR 31F 6a2 MFVLIYLLIAISSLLAYLYHRNFNYWNRRGLPHDAPHPLYGNM-VGFRK 32F 6a9 MGVYSVLLAIVVVLVGYLLLKWRRALHYWQNLDIPCEEPHILMGSL-TGVQT 33F 6a21 AL069964 on AC009844 MSVGTVLLTALLALVGYLLMKWRSTMRHWQDLGIPCEEPHILMGSM-KGVRT 34F 6a8 MALAYILFQVAVALLAILTYYIHRKLTYFKRRGIPFVAPHLIRGNM-EELQK 35F 6a18 AC008285 AI403838 MQLTYFLFQVAVALLAIVTYILHRKLTYFKRRGIPYDKPHPLRGNM-EGYKK 36F 6a22 AC010578 80744 AA951440 MLDVVALLLIALAVGFWFVRTRYSYWTRRGIGSEPARFPVGNM-EGFRK 37F 6a19 AC010578 21500 AA696094 MAILLGLVVGVLTLVAWWVLQNYTYWKRRGIPHDPPNIPLGNT-GELWR 38F 6a20 AI063421 AC009844 MAVMIVLLIGVITFVAWYVHQHFNYWKRRGIPHDEPKIPYGNT-SELMK 39F 6a23 AC009844 44542-44721 MSLLLTLIALLVSLLLFMARRRHGYWQRRGIPHDVPHPIYGNM-KDWPK 40F 6a17 MLLLALIVVILSLLVFAARRRHGYWQRRGIPHDEVHPLFGNI-KDWPN 41F 6a13 MLTLLVLVFTVGLLLYVKLRWHYSYWSRRGVAGERPVYFRGNM-SGLGR 42F 6a14 MLFTIALVGVVLGLAYSLHIKIFSYWKRKGVPHETPLPIVGNM-XRSTT 43F 6a16 AC004721 MDFTLLLLTSLLSFLLGYLRYRFTYWELRGIPQLRPHFLFGHF-FRLQS 44F 6u1 AC008288 AC017706 MDLMHRTLLTALGELSVVYALVKFSLGYWKRRGILHEKPKFLWGNI-KGVVS 45F 6d2 MWTILLTILIAGLLYRYVKRHYTHWQRLGVDEEPAKIPFGVM-DTVMK 46F 6d4 AC008197 AI404831 MFSLILLAVTLLTLAWFYLKRHYEYWERRGFPFEKHSGIPFGC-LDSVW 47F 6d5 AC007441 AI108995 AI402085 MIGIYLLIAAVTLLYVYLKWTFSYWDRKGFPSTGVSIPFGAL-ESVTK 48F 6t1 MIAVFSLIAAALAVGSLVLLPVVLRGGCLLVVTIVWLWQILHFWHWRRLGVPFVPAAPFVGNV-WNLLR 49p 6t2p MVIAFFIFLCAALAVGSVVLLP-------LIALLAVWLWQRRHFRIWRRLGVPYLPAAPVLGNV-LNVET 50F 6t3 AC007440 MLLIWLLLLTIVTLNFWLRHKYDYFRSRGIPHLPPSSWSPMGNL-GQLLF 51F 6g1 MVLTEVLFVVVAALVALYTWFQRNHSYWQRKGIPYIPPTPIIGNT-KVVFK 52F 6g2 AC015208 19014-20624 MELVLLILVASLIGIAFLALQQHYSYWRRMGVREIRPKWIVGNL-MGLLN 53F 6v1 AC015002 MVYSTNILLAIVTILTGVFIWSRRTYVYWQRRRVKFVQPTHLLGNL-SRVLR 54F 6w1 AC008257 AI403674 MLLLLLLGSLTIVFYIWQRRTLSFWERHGVKYIRPFPVVGCT-REFLT 55F 9b1 MSFVEICLVLATIGLLLFKWSTGTFKAFEGRNLYFEKPYPFLGNM-AASAL 56F 9b2 MALIEICLALVVIGYLIYKWSTATFKTFEERKLYFEKPYPFVGNM-AAAAL 57F 9f2 AC007594 39127-39333 MLWEFFALFAIAAALFYRWASANNDFFKDRGIAYEKPVLYFGNM-AGMFL 58p 9f3p MLWEFLALFAIATALFYRWASANNDFFKDRGIAYEKPVLYLGNM-AGMFL 59F 9h1 MDQSMIALALFIILLVLLYKWSVAKYDVFSERGVSHEKPWPLIGNI-PLKAM 60F 9c1 MVFVELSIFVAFIGLLLYKWSVYTFGYFSKRGVAHEKPIPLLGNI-PWSVL 61F 317a1 MWIIFLIIGLLVLGLLVLLIIAARYQRDYWRYLDIPHERPKKLWPII-RQIMT 62F 28a5 MVLITLTLVSLVVGLLYAVLVWNYDYWRKRGVPGPKPKLLCGNY-PNMFT 63F 28d1 AI403094 AC008324 MCPISTALFVIAAILALIYVFLTWNFSYWKKRGIPTAKSWPFVGSF-PSVFT 64F 28d2 AC008324 MCPVTTFLVLVLTLLVLVYVFLTWNFNYWRKRGIKTAPTWPFVGSF-PSIFT 65F 28c1 AC014191 MFGSLLLGIATLLGAIYAFLVSNFGHWRRRGVTEPRALPLFGSF-PNMIW 66F 308a1 AC015216a MLPLVLFILLAATLLFWKWQGNHWRRLGLEAPFGWPLVGNM-LDFAL 67F 309a1 AC003055 AC009909 MFTLVGLCLTIVHVAFAVVYFYLTWYHKYWDKRGVVTAEPLTILGSY-PGILI 68F 309a2 AC002444 AC009909 MYILASLALILLHLLVLPIYLYLTWHHKYWRKRGLVTARPLTLLGTY-PGLLT 69F 310a1 AC007137 MWLLLPILLYSAVFLSVRHIYSHWRRRGFPSEKAGITWSFL-QKAYR 70F 311a1 MALWPLLLITLTIWILVRKWTLLRLGSSLPGPWAFPLLGNA-QMVGK 71F 312a1 AC0015424 AC010003 MFWLGFGLLLLALSLYLLYVFERQSRIDRLTHKWPAPPALPFIGHL-HILAK 72F 313a1 AC007809 AC007928 MLTINLLLAVGALFWIYFLWSRRSKLYFLMLKIPGPIGLPILGSS-LENII 73F 313a2 AC017371 MIVIQLLIAASLILWIRFLWSRR-KLYMLMMQLPGRMGLPLLGNS-VRYLI 74F 313a3 AC017371 AL076582 MDTFQLLLAVGVCFWIYFLWSRR-RLYMMHFKIPGPMGLPILGIA-FEYLI 75F 313a4 AC007752 MLTWTLWCGLLFLLWIYFLWSRR-RFYLLTLKIPGPLGYPILGMA-HWLMR 76F 313a5 AC017371 MLTLQIFEAFAIILCVYFLWSRR-RFYIMMLKLPGPMGFPFIGLA-FEYIR 77F 313b1 AC007571 MLASIILSGWLLLAWLYFLWSRR-RYYKVAWQLRGPIGWPLIGMG-LQMMN 78F 318a1 AC014186 MCCLQSTTLDRKKNASWNRSAIVGSDTSDAPEHCPLRCGALLAVLLAWQQRKCWRLIW 79F 301A1 MNNLSLKA(29aa)ESSTGVATCPHLADSEEASAPRIHSTSEWQNALPYNQIPGPKPIPILGNL-MPIIG 80F 49a1 MQRLRTGESSNPKKLNVSQQPVTSVATTRTTASSLPAETTSSPAAAVRPYSEVPGPYPLPLIGNS-WRFAP 81F 12a4 MLKVRSALSLIQSQKATLSLATQKRWQTNVATAEAREDSEWLQAKPFEQIPRLNMWALSMKMSMPGGK 82F 12a5 MLKGRIALNILQSQKPIVFSASQQRWQTNVPTAEIRNDPEWLQAKPFEEIPKANILSLFAKSALPGGK 83F 12c1 AC012807 MLRLTVKHGLRANSQLAATRNPDASSYVQQLESEWEGAKPFTELPGPTRWQLFRGF-QKGGE 84F 12d1 AC008187 MNTLSSARSVAIYVGPVRSSRSASVLAHEQAKSSITEEHKTYDEIPRPNKFKFMRAF-MPGGE 85F 12e1 MLSTQWNANKQISRQIYQLCRGLAQKVRKIYFKYIFILNLEEAKPYADIPGPSKLQLIRAF-LPGGK 86F 12b2 MWKYSNKI(36aa)KSRNLFTNNGYICSQTQLELADSRIDEKWQQARSFGEIPGPSLLRMLSFF-MPGGK 87F 302a1 AC015396 MLTKLLKISCTSRQCTFAKPYQAIPGPRGPFGMGNLYNYLPGIGSY-SWLRL 88F 314a1 MAVILLLALALVLG(16aa)PLLKNTLLEDFYHAELIQPEAPKRRRRGIWDIPGPKRIPFLGTK-WIFLL 89F 315a1 AC008307 MTEKRERPGPLRWL(19aa)RCDPPPLQRFPATELPPAVAAKYVPIPRVKGLPVVGTL-VDLIA 90F 316a1 AC017388 AC010113 MILTATFICFCLASAFNYFRARRQRSLIKNLKGPFTWPLMGAM-HKLLF