P450s that have appeared since the 1993 P450 nomenclature update.

This is part E of the bibliographic P450 files.
This section contains bacterial sequences CYP101 to CYP174.
This includes references that were incomplete and duplications of sequences that were already in the update. If a sequence is assigned an accession number that was not in the old update, it is included in this list.

48 new P450s were added July 27, 2000
Four new sequences were added Jan. 9, 2001: CYP102C1, CYP172-174.
Added CYP175A1 9/17/2001

Compiled by David R. Nelson

Last modified June 2, 2003 added 25 new sequences.
Last modified Nov. 5, 2003 There are now 501 bacterial P450s
Last modified Dec. 2, 2003 added 4 seqs in CYP153 family
Last modified Feb. 23, 2005 added 26 seqs from Rhodococcus sp. RHA1
Last modified Feb. 23, 2005 added 155C1 and 155C2
Last modified Sept. 20, 2005 added 18 Sorangium cellulosum seqs.
Last modified Nov. 17, 2006
Last modified Dec. 31, 2007

51 Family
101 Family
102 Family
103 Family
104 Family
105A Subfamily
105B Subfamily
105C Subfamily
105D Subfamily
105E Subfamily
106 Family
107A Subfamily
107B Subfamily
107C Subfamily
107D Subfamily
107E Subfamily
107F Subfamily
107G Subfamily
107H Subfamily
107J Subfamily
108 Family
109 Family
110 Family
111 Family
112 Family
113A Subfamily
113B Subfamily
114 Family
115 Family
116 Family
117 Family
118 Family
119 Family
120 Family
121 Family
122 Family
123 Family
124 Family
125 Family
126 Family
127 Family
128 Family
129 Family
130 Family
131 Family
132 Family
133 Family

51 Family

A note on nomenclature. CYP51s were originally all called CYP51, because only one gene was found per species and they all seemed to be in this one conserved family. However, rice had many CYP51s in at least two sequence groups, so subfamilies have been designated for CYP51s. These are not typical subfamilies: only one subfamily is created for each major taxonomic group, namely CYP51A for animals, CYP51B for bacteria,
CYP51C for Chromista, CYP51D for Dictyostelium, CYP51E for Euglenozoa, CYP51F for fungi. Groups with only one CYP51 per species share a single name: CYP51A1 is used for all animal CYP51s, since they are orthologous. The same is true for CYP51B, C, D, E and F. CYP51G (green plants) and CYP51H (monocots only so far) have individual sequence numbers.

CYP51B1 Mycobacterium tuberculosis (Actinobacteria)
GenEMBL Z80226 (34809bp) gi 1550642
Rv0764c complement (6140-7495)
33.7% identical to CYP51 over 439AA overlap
this is a bacterial CYP51

CYP51B1 Mycobacterium marinum
No accession number
Tim Stinear
MM4932
82% to 51B1 M. tuberculosis

CYP51B1 Mycobacterium ulcerans
No accession number
Tim Stinear
99% to CYP51B1 Mycobacterium marinum = ortholog

CYP51B1 Mycobacterium bovis subsp. bovis AF2122/97 (Actinobacteria)
NC_002945 complete genome
complement(858662..858868)
CYP51 100% match
locus_tag = Mb0786c

CYP51B1 Mycobacterium avium (Actinobacteria)
TIGR contig:3273:m_avium Length = 5,475,738
79% to CYP51 M. tuberculosis
3021360 TSTVVPRVSGGEEEHGHLEEFRTDPIGLMQRVRDECGDVGWFQLVDKHVILLSGAQANEF 3021539
3021540 FFRSADEDLDQAEAYPFMTPIFGKGVVFDASPERRKEMLHNSALRGEQMKGHASTIEGEV 3021719
3021720 KKMIADWGDEGEIELLDFFAELTIYTSTACLIGLKFREQLDHRFAEYYHDLERGTDPLCY 3021899
3021900 VDPYLPIESFKRRDEARVKLVALVQEIMDQRLANPPKDKADRDMLDVLVSIKDEDGKPRF 3022079
3022080 SADEITGMFISLMFAGHHTSSGTSAWTLIELIRHPDVYAEVLAELEELYADGQEVSFHAL 3022259
3022260 RSIPKLDNVVKETLRLHPPLIILMRVAKGEFEVEGFPIHEGDYVAASPAISNRIPEDFPD 3022439
3022440 PDAFKPDRYNKPEQADIVNRWTWIPFGAGRHRCVGAAFAQMQIKAIFSVLLREYDFEMAQ 3022619
3022620 PADSYRNDHSKMVVQLARPAKVRYRKR 3022700

CYP51B1 Mycobacterium smegmatis (Actinobacteria)
TIGR contig:3439:m_smegmatis Length = 6,989,783
80% to CYP51 M.
tuberculosis 4858809 VPRVSGGEEEHGHLEEFRTDPIGLMKRVRSECGDVGWFQLADKQVVLLSGAEANEFFFRS 4858988 4858989 SDSELNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEQMKGHAATIENEVRRMV 4859168 4859169 ESWGDEGEIDLLEFFAELTIYTSTACLIGVKFRNQLDKRFADYYHLLERGTDPLCYVDPY 4859348 4859349 LPIESFRIRDEARANLVELVQEVMNGRIANPPKDKSDRDLLDVLVSIKDEDGTPRFSANE 4859528 4859529 VTGMFISLMFAGHHTSSGTASWTLIELLRHPEFYAKVQAELDDLYADGQEISFHALRQIP 4859708 4859709 NLDNALKETLRLHPPLIILMRVAQDEFEVAGRPIHKGQMVAASPAISNRIPEDFPDPDTF 4859888 4859889 DPDRYDKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLRDFEFEMAQPSES 4860068 4860069 YRNDHSKMVVQLARPAKVRYRRR 4860137 CYP51B1 Methylococcus capsulatus (Proteobacteria) TIGR contig:221:m_capsulatus 49% to CYP51 M. tuberculosis NOTE FUSION PROTEIN EXTENDS C-TERMINAL. SEE J. Biol. Chem., Vol. 277, Issue 49, 46959-46965, December 6, 2002 A Novel Sterol 14-Demethylase/Ferredoxin Fusion Protein (MCCYP51FX) From Methylococcus capsulatus Represents a New Class of the Cytochrome P450 Superfamily Colin J. Jackson, David C. Lamb, Timothy H. Marczylo, Andrew G. S. Warrilow, Nigel J. Manning, David J. Lowe, Diane E. Kelly, and Steven L. Kelly 908332 MSHPPSNTP 908305 PVKPGGLPLLGHILEFGKNPHAFLMALRHEFGDVAEFRMFHQRMVLLTGSQASEAFYRAP 908126 908125 DEVLDQGPAYRIMTPIFGRGVVFDARIERKNQQLQMLMPALRDKPMRTYSEIIVAEVEAM 907946 907945 LRDWKDAGTIDLLELTKELTIYTSSHCLLGAEFRHELNTEFAGIYRDLEMGIQPIAYVFP 907766 907765 NLPLPVFKRRDQARVRLQELVTQIMERRARSQERSTNVFQMLIDASYDDGSKLTPH 907598 907597 EITGMLIATIFAGHHTSSGTTAWVLIELLRRPEYLRRVRAEIDALFETHGRVTFESLRQM 907418 907417 PQLENVIKEVLRLHPPLILLMRKVMKDFEVQGMRIEAGKFVCAAPSVTHRIPELFPNPEL 907238 907237 FDPDRYTPERAEDKDLYGWQAFGGGRHKCSGNAFAMFQIKAIVCVLLRNYEFELAAAPE 907061 907060 SYRDDYRKMVVEPASPCLIRYRRRDAP 906980 CYP51B1 Rhodococcus sp. RHA1 (Actinobacteria) No accession number Marianna A. Patrauchan Rha05830 Submitted to nomenclature committee 12/13/04 77% to CYP51B1 M. avium or M. 
tuberculosis CYP51B1 Nocardia farcinica IFM 10152 (Actinobacteria) GenEMBL AP006618.1 CDS complement(2757924..2759282) MTLVKPRRVSGGEHEHGHLEEFRTDPIALMRRVRQECGDVGAFE LAGKQVILLSGAEANEFFFRSGDEDLDQGAAYPFMKPIFGEGVVFDASPERRKEMLHN SALRAEQMRGHATTIAAEVDRMIAGWDDEGEIDLLDFFAELTIYTSSACLIGVKFRNE LDDRFARLYHELERGTDALAYVDPYAPIESFRRRDEARAALVALVQAIMDERAANPPA DKSDRDLLDVLVSVPNEDGGPRFSASEITGIFISMMFAGHHTTSGTAAWTVIELLRHP ELRDRVVAELDELFADGKDVSFHALRQIPLLEATLKETLRMHPPLIILMRVAQGDFEV CGHHIAAGDHVAATPAISNRLPEDFPDPDTFDPGRYIDPNQEDLVNRWTWIPFGAGRH RCVGAAFALMQLKAIFSILLRDWEFEMAQPSESYRNDHSKMVVQLQQPCRVRYRRRVR TS CYP51B1 Mycobacterium vanbaalenii ZP_01207535.1, AAT40578.1, EAS23094.1 Q5IZM4CP51_MYCVN Cytochrome P450 51 (CYPLI) (P450-LIA1) (Sterol 14-alpha 80% TO CYP51 Mycobacterium tuberculosis MTAVKEVPRVSGGEEEHGHLEEFRTDPIGLMKRVREECGDVGWFQLADKQVILLSGAEANEFFFRSSDSE LNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEHMKGHATTIEAEVRKMIEGWGESGEIDLLEF FAELTIYTSTACLIGLKFRNQLDSRFANYYHLLERGTDPLCYVDPYLPIESFRIRDEARAGLVELVQDVM HGRIANPPKDKSDRDMLDVLVSIKDEDGNPRFTANEITGMFISLMFAGHHTSSGTSSWTLIELLRHPEFY AKVQQELDDLYADGQEVSFHALRQIPSLDNALKETLRLHPPLIILMRVAQDEFEVAGYPIHKGQMVAASP AISNRIPEDFPNPDDFDPDRYEKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLREYEFEM AQPPESYQNDHSKMVVQLARPAKVRYRRRVRD 101 Family CYP101A1 Pseudomonas putida GenEMBL D00528 (1950bp) Koga,H., Yamaguchi,E., Matsunaga,K., Aramaki,H. and Horiuchi,T. Cloning and nucleotide sequences of NADH-putidaredoxin reductase gene(camA) and putidaredoxin gene(camB) involved in cytochrome P-450cam hydroxylase of Pseudomonas putida J. Biochem. 106, 831-836 (1989) Note: only the last 93 nucleotides of the cam gene was cloned along with two downstream genes. CYP101A1 Pseudomonas putida PIR C60886 (last 8 amino acids) Romeo, C., Moriwaki, N., Yasunobu, K.T., Gunsalus, I.C., Koga, H. Identification of the coding region for the putidaredoxin reductase gene from the plasmid of Pseudomonas putida. J. Protein Chem. 
6, 253-261 (1987) CYP101B1 Novosphingobium aromaticivorans NZ_AAAV01000165.1 complement(29626..30870) gene = Saro2804 43% to CYP101 MLPHDRGQNSTRRITAMEAPAHVPADRVVDIDIYMPPGLAEHGF HKAWSDLSAGNPAVVWTPRNEGHWIALGGEALQEVQSDPERFSSRIIVLPKSVGEMHG LIPTTIDPPEHRPYRQLLNAHLNPGAIRGLSESIRQTAVDLIEGFAAQGHCNFTAQYA EQFPIRVFMALVGIEASEAPRIRHWAECMTRPGMDMTFDEAKAVFFDYVGPLVDARRE TPGEDMISAMINADLGDGRRLTRDEALSVVTQVLIAGLDTVVNVLGFIMRELAGNPAL RADLRQRGADILPVVHELFRRFGLVSIAREVRRDIEFHGVHLKAGDMIAIPTQVHGLD PRVNPDPLAIDPSRKRARHSTFGSGPHMCPGQELARKEVAITLEEWLRRIPDFALGPN SDLSPVPGIVGALRRVELVWNT CYP101C1 Novosphingobium aromaticivorans NZ_AAAV01000133.1 complement(4199..5389) gene = Saro1574 44% to CYP101A1 MIPAHVPADRVVDFDIFNPPGVEQDYFAAWKTLLDGPGLVWSTA NGGHWIAARGDVVRELWGDAERLSSQCLAVTPGLGKVMQFIPLQQDGAEHKAFRTPVM KGLASRFVVALEPKVQAVARKLMESLRPRGSCDFVSDFAEILPLNIFLTLIDVPLEDR PRLRQLGVQLTRPDGSMTVEQLKQAADDYLWPFIEKRMAQPGDDLFSRILSEPVGGRP WTVDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQRLLRERPDLIPAAADELMR RYPTVAVSRNAVADVDADGVTIRKGDLVYLPSVLHNLDPASFEAPEEVRFDRGLAPIR HTTMGVGAHRCVGAGLARMEVIVFLREWLGGMPEFALAPDKAVTMKGGNVGACTALPL VWRA CYP101D1 Novosphingobium aromaticivorans NZ_AAAV01000085.1 complement(6803..8068) gene = Saro0669 44% to CYP101 MNAQTSTATQKHRVAPPPHVPGHLIREIDAYDLDGLEQGFHEAW KRVQQPDTPPLVWTPFTGGHWIATRGTLIDEIYRSPERFSSRVIWVPREAGEAYDMVP TKLDPPEHTPYRKAIDKGLNLAEIRKLEDQIRTIAVEIIEGFADRGHCEFGSEFSTVF PVRVFLALAGLPVEDATKLGLLANEMTRPSGNTPEEQGRSLEAANKGFFEYVAPIIAA RRGGSGTDLITRILNVEIDGKPMPDDRALGLVSLLLLGGLDTVVNFLGFMMIYLSRHP ETVAEMRREPLKLQRGVEELFRRFAVVSDARYVVSDMEFHGTMLKEGDLILLPTALHG LDDRHHDDPMTVDLSRRDVTHSTFAQGPHRCAGMHLARLEVTVMLQEWLARIPEFRLK DRAVPIYHSGIVAAVENIPLEWEPQRVSA CYP101D2 Novosphingobium aromaticivorans NZ_AAAV01000042 complement(5601..6899) gene = Saro0208 63% to 101D1 MGTTRMDTFNPQESRLATNFDEAVRAKVERPANVPEDRVYEIDM YALNGIEDGYHEAWKKVQHPGIPDLIWTPFTGGHWIATNGDTVKEVYSDPTRFSSEVI FLPKEAGEKYQMVPTKMDPPEHTPYRKALDKGLNLAKIRKVEDKVREVASSLIDSFAA RGECDFAAEYAELFPVHVFMALADLPLEDIPVLSEYARQMTRPEGNTPEEMATDLEAG 
NNGFYAYVDPIIRARVGGDGDDLITLMVNSEINGERIAHDKAQGLISLLLLGGLDTVV NFLSFFMIHLARHPELVAELRSDPLKLMRGAEEMFRRFPVVSEARMVAKDQEYKGVFL KRGDMILLPTALHGLDDAANPEPWKLDFSRRSISHSTFGGGPHRCAGMHLARMEVIVT LEEWLKRIPEFSFKEGETPIYHSGIVAAVENVPLVWPIAR CYP101D3 Sphingomonas sp. SKA58 ZP_01304513 78% to CYP101D2 1 MPEHVPETMS RARAPRPEHI PEQYVHEIDM YALEGIEQGY HEAWKNIPKP DMPDLIWTPF 61 TGGHWIATNG DTVREVYSDP TRFSSEVIFL PKEAGEKYEM VPTRMDPPEH TPYRKALDKG 121 LSLAQIRKVE SKVRKVAVDL IDSFVSRGEC DFSAEYANVF PVRVFMALAD LPESDVPTLS 181 RFAKMMTRPE GNTPEEMAKH LEEGNKGFFA YVEPIIQARR GKEGEDLITV MVNAEINGER 241 ITHDKALGLI SLLLLGGLDT VVNFLSFMMI HLAKNPQVVE ELRADPLKLM RSAEEMFRRF 301 PVVSEARMVA KDQDFRGIEL KRGDMILLPT ALHGLDDQLN DDPWRINLER RGISHSTFGG 361 GPHRCAGLHL ARMEVIVTIE EWLKRIPTFA MKPGAQPIYH SGIVAAVDNV PLIWSER 102 Family CYP102A1 Bacillus megaterium Ruettinger,R.T.,Wen, L.-P. and Fulco, A.J. Coding Nucleotide, 5'-Regulatory, and Deduced Amino Acid Sequences of P450BM-3, a Single Peptide Cytochrome P450:NADPH-P450 Reductase from Bacillus megaterium. J. Biol. Chem. 264, 10987-10995 (1989) CYP102A1 Bacillus megaterium GenEMBL J04832 (4957bp) Ravichandran,K.G., Boddupalli, S.S., Hasemann,C.A., Peterson,J.A. and Deisenhofer,J. Crystal structure of hemoprotein domain of P450BM-3, a prototype for microsomal P450s. Science 261, 731-736 (1993) P450 is N-terminal CYP102A2 Bacillus subtilis GenEMBL D87979 Yamamoto, H., S. Uchiyama, F. A. Nugroho, and J. Sekiguchi. A 23.4 kb segment at the 69 degrees-70 degrees region of the Bacillus subtilis genome. Microbiology. 143, 1317-20 (1997) Gene name yfnJ 66.4% identical to CYP102A1 P450 part only also called YetO (fusion of P450 and reductase like CYP102A1, P450 part is N-terminal) CYP102A3 Bacillus subtilis GenEMBL U93874, Z99117 Sorokin, A., A. Bolotin, B. Purnelle, H. Hilbert, J. Lauber, A. Dusterhoft, and S. D. Ehrlich. Sequence of the Bacillus subtilis genome region in the vicinity of the lev operon reveals two new extracytoplasmic function RNA polymerase sigma factors SigV and SigZ. 
Microbiology. 143, 2939-43 (1997) Gene name yrhJ most similar to CYP102A2 (fusion of P450 and reductase like CYP102A1 P450 part is N- terminal) CYP102A4 Bacillus anthracis str. Ames GenPept AAP27014 bifunctional P-450:NADPH-P450 reductase 1 79% to 102A2 1 MDKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKLAEEYG PIFRMQTLSD TIIVVSGHEL 61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETQEPNWQ KAHNILMPTF SQRAMKDYHA 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM 181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSE NQEENDLLSR 241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR 301 VLTDSTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV 361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG 421 MLLQHFEFID YEEYQLDVKQ TLTLKPGDFK IRIVPRNQTI SHTTVLAPTE EKLKNHEIKQ 481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVAAL NDRIGSLPKE 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKG DELKGVQYAV FGCGDHNWAS TYQRIPRYID 601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQRMWSDAMK VFGLELNKNM EKERSTLSLQ 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYK EGDHLGVLPI 721 NSEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVR LYDLLSYSVE VQEAATRAQI 781 REMVTFTACP PHKKELESLL EDGVYQEQIL KKRISMLDLL EKYEACEIRF EPFLELLPAL 841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP 901 QSNFQLPENP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNVGEAHLYF GCRHPEKDYL 961 YRTELENDER DGLISLHTAF SRLEGQAKTY VQHVIKEDRI HLISLLDNGA HLYICGDGSK 1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR LQEEGRYGKD VWAGI CYP102A5 Bacillus cereus ATCC 14579 GenPept AAP10153 NADPH-cytochrome P450 reductase/P450 fusion 79% to 102A2 Bacillus subtilis 1 MEKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKIAEEYG PIFQIQTLSD TIIVVSGHEL 61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETHEPNWK KAHNILMPTF SQRAMKDYHA 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM 181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSG DQEENDLLSR 241 MLNVPDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR 301 VLTDPTPTYQ QVMKLKYMRM 
ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV 361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG 421 MLLQHFELID YQNYQLDVKQ TLTLKPGDFK IRILPRKQTI SHPTVLAPTE DKLKNDEIKQ 481 HVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKP DELKGVQYAV FGCGDHNWAS TYQRIPRYID 601 EQMAQKGATR FSKRGEADAS GDFEEQLEQW KQNMWSDAMK AFGLELNKNM EKERSTLSLQ 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSDRSTRHIE VSLPEGATYK EGDHLGVLPV 721 NSEKNINRIL KRFGLNGKDQ VILSASGRSI NHIPLDSPVS LLALLSYSVE VQEAATRAQI 781 REMVTFTACP PHKKELEALL EEGVYHEQIL KKRISMLDLL EKYEACEIRF ERFLELLPAL 841 KPRYYSISSS PLVAHNRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP 901 QSNFELPKDP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNLGQAHLYF GCRHPEKDYL 961 YRTELENDER DGLISLHTAF SRLEGHPKTY VQHLIKQDRI NLISLLDNGA HLYICGDGSK 1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR VQDEGRYGKD VWAGI CYP102A6 Bradyrhizobium japonicum USDA 110 GenPept BAC48147 NC_004463 complete genome 3173438..3176674 NADPH-cytochrome P450 reductase/P450 fusion 54% to 102A2 1 MSSKNRLDPI PQPPTKPVVG NMLSLDSAAP VQHLTRLAKE LGPIFWLDMM GSPIVVVSGH 61 DLVDELSDEK RFDKTVRGAL RRVRAVGGDG LFTADTREPN WSKAHNILLQ PFGNRAMQSY 121 HPSMVDIAEQ LVQKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE 181 SLVRSLETIM MTRGLPFEQI WMQKRRKTLA EDVAFMNKMV DEIIAERRKS AEGIDDKKDM 241 LAAMMTGVDR STGEQLDDVN IRYQINTFLI AGHETTSGLL SYTLYALLKH PDILKKAYDE 301 VDRVFGPDVN AKPTYQQVTQ LTYITQILKE ALRLWPPAPA YGISPLADET IGGGKYKLRK 361 GTFITILVTA LHRDPSVWGP NPDAFDPENF SREAEAKRPI NAWKPFGNGQ RACIGRGFAM 421 HEAALALGMI LQRFKLIDHQ RYQMHLKETL TMKPEGFKIK VRPRADRERG AYGGPVAAVS 481 SAPRAPRQPT ARPGHNTPML VLYGSNLGTA EELATRMADL AEINGFAVHL GALDEYVGKL 541 PQEGGVLIIC ASYNGAPPDN ATQFVKWLGS DLPKDAFANV RYAVFGCGNS DWAATYQSVP 601 RFIDEQLSGH GARAVYPRGE GDARSDLDGQ FQKWFPAAAQ VATKEFGIDW NFTRTAEDDP 661 LYAIEPVAVT AVNTIVAQGG AVAMKVLVND ELQNKSGSNP SERSTRHIEV QLPSNITYRV 721 GDHLSVVPRN DPTLVDSVAR RFGFLPADQI RLQVAEGRRA QLPVGEAVSV GRLLSEFVEL 781 QQVATRKQIQ IMAEHTRCPV TKPKLLAFVG EEAEPAERYR TEILAMRKSV 
YDLLLEYPAC 841 ELPFHVYLEM LSLLAPRYYS ISSSPSVDPA RCSITVGVVE GPAASGRGVY KGICSNYLAN 901 RRASDAIYAT VRETKAGFRL PDDSSVPIIM IGPGTGLAPF RGFLQERAAR KAKGASLGPA 961 MLFFGCRHPD QDFLYADELK ALAASGVTEL FTAFSRADGP KTYVQHVLAA QKDKVWPLIE 1021 QGAIIYVCGD GGQMEPDVKA ALVAIRHEKS GSDTATAARW IEEMGATNRY VLDVWAGG CYP102A7 Bacillus licheniformis ATCC 14580 GenEMBL AAU24352 Rey,M.W., Ramaiya,P., Nelson,B.A., Brody-Karpin,S.D., Zaretsky,E.J., Tang,M., de Leon,A.L., Xiang,H., Gusti,V., Clausen,I.G., Olsen,P.B., Rasmussen,M.D., Andersen,J.T., Jorgensen,P.L., Larsen,T.S., Sorokin,A., Bolotin,A., Lapidus,A., Galleron,N., Ehrlich,S.D. and Berka,R.M. Complete genome sequence of the industrial bacterium Bacillus licheniformis and comparisons with closely related Bacillus species Genome Biol. 5 (10), R77 (2004) 74% to CYP102A3 for the P450 domain 1 MNKLDGIPIP KTYGPLGNLP LLDKNRVSQS LWKIADEMGP IFQFKFADAI GVFVSSHELV 61 KEVSEESRFD KNMGKGLLKV REFSGDGLFT SWTEEPNWRK AHNILLPSFS QKAMKGYHPM 121 MQDIAVQLIQ KWSRLNQDES IDVPDDMTRL TLDTIGLCGF NYRFNSFYRE GQHPFIESMV 181 RGLSEAMRQT KRFPLQDKLM IQTKRRFNSD VESMFSLVDR IIADRKQAES ESGNDLLSLM 241 LHAKDPETGE KLDDENIRYQ IITFLIAGHE TTSGLLSFAI YLLLKHPDKL KKAYEEADRV 301 LTDPVPSYKQ VQQLKYIRMI LNESIRLWPT APAFSLYAKE ETVIGGKYLI PKGQSVTVLI 361 PKLHRDQSVW GEDAEAFRPE RFEQMDSIPA HAYKPFGNGQ RACIGMQFAL HEATLVLGMI 421 LQYFDLEDHA NYQLKIKESL TLKPDGFTIR VRPRKKEAMT AMPGAQPEEN GRQEERPSAP 481 AAENTHGTPL LVLYGSNLGT AEEIAKELAE EAREQGFHSR TAELDQYAGA IPAEGAVIIV 541 TASYNGNPPD CAKEFVNWLE HDQTDDLRGV KYAVFGCGNR SWASTYQRIP RLIDSVLEKK 601 GAQRLHKLGE GDAGDDFEGQ FESWKYDLWP LLRTEFSLAE PEPNQTETDR QALSVEFVNA 661 PAASPLAKAY QVFTAKISAN RELQCEKSGR STRHIEISLP EGAAYQEGDH LGVLPQNSEV 721 LIGRVFQRFG LNGNEQILIS GRNQASHLPL ERPVHVKDLF QHCVELQEPA TRAQIRELAA 781 HTVCPPHQRE LEDLLKDDVY KDQVLNKRLT MLDLLEQYPA CELPFARFLA LLPPLKPRYY 841 SISSSPQLNP RQTSITVSVV SGPALSGRGH YKGVASNYLA GLEPGDAISC FIREPQSGFR 901 LPEDPETPVI MVGPGTGIAP YRGFLQARRI QRDAGVKLGE AHLYFGCRRP NEDFLYRDEL 961 EQAEKDGIVH LHTAFSRLEG RPKTYVQDLL REDAALLIHL LNEGGRLYVC 
GDGSRMAPAV 1021 EQALCEAYRI VQGASREESQ SWLSALLEEG RYAKDVWDGG VSQHNVKADC IART CYP102A8 Bacillus thuringiensis serovar konkukian str. 97-27 AAT62301 98% to CYP102A4, 96% to CYP102A5 1 MDKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKLAEEYG PIFQIQTLSD TIIVVSGHEL 61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETDEPNWK KAHNILMPTF SQRAMKDYHA 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM 181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSE NQEENDLLSR 241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR 301 VLTDSTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV 361 LIPQLHRDKD AWGDDVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG 421 MLLQHFEFID YEDYQLDVKQ TLTLKPGDFK IRIVPRNQTI SHTTVLAPTE EKLKKHEIKK 481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKP DELKGVQYAV FGCGDHNWAS TYQRIPRYID 601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQSMWSDAMK AFGLELNKNM EKERSTLSLQ 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYK EGDHLGVLPI 721 NNEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVR LYDLLSYSVE VQEAATRAQI 781 REMVTFTACP PHKKELESLL EDGVYQEQIL KKRISMLDLL EKYEACEIRF ERFLELLPAL 841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP 901 QSNFQLPENP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MKVGEAHLYF GCRHPEKDYL 961 YRTELENDER DGLISLHTAF SRLEGHPKTY VQHVIKEDRI HLISLLDNGA HLYICGDGSK 1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR LQEEGRYGKD VWAGI CYP102A9 Bacillus weihenstephanensis KBAB4 ZP_01184381 96% to CYP102A5 1 MDKKVSAIPQ PKTYGLLGNL PLIDKDKPTL SFIKIAEEYG PIFRIQTLSD TIIVVSGHEL 61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETHEPNWK KAHNILMPTF SQRAMKDYHA 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSYYR ETPHPFITSM 181 SRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSG NQEENDLLSR 241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR 301 VLTDPTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV 361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKI PHHAYKPFGN GQRACIGMQF ALHEATLVMG 421 MLLQHFEFID 
YQDYQLDVKQ TLTLKPGDFK IRILPRNQTI SHTTVLAPIE EKLKNDEIEQ 481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKS DELKGVQYAV FGCGDHNWAS TYQRIPRYID 601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQSMWSDAMK AFGLELNKNI EKERSTLSLQ 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYQ EGDHLGVLPI 721 NSEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVS LFDLLSYSVE VQEAATRAQI 781 REMVTFTACP PHKKELELLL EEGVYHEQIL KKRMSMLDLL EKYEACEIRF ERFLELLPAL 841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP 901 QSNFQLPENP ETPIIMVGPG TGVAPFRGFL QARRVQKQKG INLGQAHLYF GCRHPEKDYL 961 YRTELENDER DGLISLHIAF SRLEGYPKTY VQHLIKQDRI NLISLLDNGA HLYICGDGSK 1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR VQEEGRYGKD VWAGI CYP102A10 Erythrobacter litoralis HTCC2594 YP_456909 60% to 102A6 1 MDAPTALAPI PQPPGKPIVG NAFTVDSSRL IQSLMELAEE YGPIFQLEVM GTPLVFVSGA 61 DMVAEICDES RFDKTVRGPL KRLRLIAGDG LFTGDTDDPN WAKAHHILLP SFSQKAMGSY 121 LPMMTDIASQ LMLKWERLNS DDVIDVPMDM VRLTLDTIGV CGFGYRFNSF YREDFHPFIE 181 ALNRTLDTTQ KMRGLPGEKL LKRQQIEQLN EDAAYMNNLV DEIIRERRQT GESGQGDLLD 241 FMLSGRDPVT GERLSDENIR YQINTFLIAG HETTSGLLSF TLYYLLKNRD VLQRAYAEVD 301 EVLGRNIDQT PTLSQIGRLP YIRAILSEAL RLWPTAPAMG LAPFEDEVLG GKYAIAKGTF 361 TTVLIPSLHR DKLVWGENPE AFNPDNFSPK AEAARPPHAY KPFGNGQRAC IGRQFAIQES 421 ILVLGMLLQR FELFDHADYQ LRIKETLSIK PDGFTIKARL RHDVERGGVA TVEPESKTPD 481 QAAAVPSHGT PLLVLYGSNL GSSEGFAREL AQRGEFSGFD VTMAPLDAHV AKLPTDGAVA 541 IACASYNGMP PDNAAKFVDW LEQADAADAP LSNVSYLVLG CGNSDWAATF QVVPRKIDAL 601 MEQHGAERLV PAEELDARGD LDTQFHDWLD GLIPQLGDAF DIDLESGFDA VFEPLYTVEI 661 TDSITGNTVA DRVGAREVEV VANRELKDTS KDEGRSTRHL EVRLPEGMEY EPGDHLCVVP 721 VNDPAVVDRL LKRFGLDRDT FVRIESRSDM RGPFPSGSTF SVLNLAETAG ELQAVATRKD 781 IATLARYSEC PNSRAALEAL AAPPSADGTD RYTSEVLEKR RSVLDMLEEF PACDVPLAVF 841 LELIPFLSPR YYSISSAPEA NQGLCSITVG VVKGPALAGT GEFKGTCSAY LADLPPGDRF 901 RAVVRKPTAQ FRLPDNPETP VIMIGPGTGV APFRAFLQRR DHLQEDGAVL GEAMLFFGCR 961 HPDIDYLYRE ELDDYDQRGV ATVHAAFSRH DGSRTYVQDL IAREADRVWE LIEQDARIYV 1021 
CGDGARMEPD VRKALMAIYA EKKSSDEASA KAWIDDLVAQ DRYLLDVWVG CYP102A11 Erythrobacter sp. NAP1 ZP_01041731 75% to CYP102A10 1 MATNATLTPI PQPPGKPLIG NALTVDASQQ IQSLMELAEE YGPIFQLDMM GTPIVIISGA 61 DLVAEVCDEK RFDKSVRGPL KRLRLIGGDG LFTGDTDAPN WSKAHNILLP SFSQKAMGSY 121 LPMMTDIATQ LVMKWERMNS DDVIDVPKDM IRLTLDTIGV CGFGYRFNSF YREDFHPFIR 181 ALTRTLETTQ KIRGIPGEKL LKGDAVKQLH RDAKYMNNLV DEIIRERQRS GGDGPEDLLD 241 FMLSGRDPLT GERLSDENIR YQINTFLIAG HETTSGLLSF TLYYLLKNRD VLTRAYAEVD 301 TVLGRNIDQP PSLKQIGQLP YIRAILFEAL RLWPTAPAFG LAPFEDEVLG GKYLIPKGTF 361 TTVLIPSLHR DKSVWGENPE VFDPENFTAE AEAARPPHAY KPFGNGQRAC IGRQFAIQES 421 ILVLAMILQR FELFDHSDYK LDIKETLSIK PDEFTIKARM RKDVERGGKA TEDAEEASTE 481 PAEPKVPKHD TPLLVLYGSN LGSSESFARE VAQKGEFSGF EVEMAPLDDY VGKLPEDGAV 541 AIACASYNGM PPDNAAKFID WIEGKDGAAP DLSGVSYMML GCGNSDWAAT FQAVPRLIDA 601 RMEELGAKRI VPTTELDARS DVDTQFHTWL DALMPQLGEH FDLDLSGEAA GIAEPLYKVE 661 VTQSVTANTV ASRVGAHAVK MVANRELKNT DIGEGRSTRH IEVELAQGET YQPGDHLCVV 721 PENDDAVVER LLRRFNLDAD TYVRIESRSE MRGPFPSGST FSVYNLAKTA GELQAVATRK 781 DIATLALYTE CPNSKPALEK LAQPPQEDGT DQYATDVLAK RKSVLDLLED YPACDLPLAV 841 FLEMIPFLSP RYYSISSAPG DTPQTCSITV GVVKGPALSG KGTFKGTCSN YLAELEPGAS 901 FNAVVREPTA NFRLPDDPKV PLIMVGPGTG LAPFRGFLQE RDALASSGEE LGPARLYFGC 961 RTPDEDFLYR DELEDYDKRG IVTLRTAFSR VDEGKCYVQD HIADDADAIW EMLEAGGRIY 1021 VCGDGARMEP DVRAALAKIH SDKTGSTPAE AQSWVGDLIT NERYSLDVWV G CYP102A12 Rhodopseudomonas palustris HaA2 YP_487251 84% to CYP102A6 1 MSSSNKLAPI PHPPKQPVVG NMLSIDTKAP VQHLVRLAEE LGPIFWLDMM GAPIVIVSGY 61 DLVDEISDEK RFDKAVRGAL RRVRTVGGDG LFTADTSEPN WSKAHNILLT PFGGRAMQSY 121 HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE 181 SLVRSLETIM MTRGLPLENL WMKKRRDTLA EDVAFMNAMV DEIIAERRKA AAVADKMDML 241 GAMMTGVDKV TGEPLDDVNI RYQINTFLIA GHETTSGLLS CAIYALLKHP EVLQKAYDEV 301 DRVLGADTSV EPSYQQVNQL GYITQILKET LRLWPPAPAY GVAPIQDETI GGQYHLKRGT 361 FTTVLVLALH RDPSIWGPNP DAFDPENFSR EAESKRPANA WKPFGNGQRA CIGRGFAMHE 421 AALALGMILQ RFKLIDHTRY RMVLKETLTI KPEGFKIKVR PRSDKDRATR IASGVSHSVA 481 
PAPAAPRARP GHNTPLLVLY GSNLGTAEEL AHRVADLADL NGFATRLGAL DQYVGQLPEE 541 GGVLIFAASY NGAPPDNATQ FVRWLSGDLP PDAFAKLRYA VFGCGNRDWT ATYQAIPRLI 601 DERLAAHGGR NIFVRGEGDA RDDLEGQFEA WFATLGPLAV KEFGIDAAFD RGADDTPLYG 661 IEPLAPAASQ PLAATGVAVA MRVLENRELQ DRAASGRSTR HIEIALPQGM SYRVGDHLSV 721 IPRNDPALVA AVAQRFGFAP DDQIRLSAAP GRRAQLPVGE AVSIGGLLGD HVELQQVATR 781 KQIVALAAHT RCPQTRPKLQ ALAGGDGAAD DAYRAEVLGK RRSVFDLLQE HPACELPFAA 841 YLEMLTPLQP RYYSISSSPA RDPARASVTV AVVEGPALSG RGIYRGACSS WLAGRGSGDT 901 VQATVRATKA CFRLPDDDRV PLIMIGPGTG VAPFRGFLQE RSARKVGGAT LGPALLFFGC 961 RHPAQDYLYA DELQGFAADG IVELHAAFSR GDGPKTYVQH LIAAQKDRVF ALIEQGAIVY 1021 VCGDGGRMEP DVKAALCAIH RERSGADATA AAAWIADLGA RDRYVLDVWA SV CYP102A13 Rhodopseudomonas palustris HaA2 YP_568957 87% to CYP102A12 1 MPSTNKLDPI PHPPKKPVVG NMLSLDTTAP VQHLVRLAKE LGPIFWLDMM GAPLVIVSGY 61 DLVDEISDEK RFDKAVRGAL RRARAVGGDG LFTADTKEPN WSKAHNILLT PFGGRAMQSY 121 HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE 181 SLVRSLETIM MTRGLPLENL WMKKRRETLA DDVVFMNAMV DEIIAERRKA SESAADKKDM 241 LGAMLAGVDR ATGEPLDDVN IRYQINTFLI AGHETTSGLL SCAIYALLKH PDVLQKAYDE 301 VDRVLGSDTA VRPSYQQVNQ LSYITQILKE TLRMWPPAPA YGVAPIKDEV IGGKYHLKRG 361 TFVTVLVLAL HRDPAIWGPN PDAFDPENFS REAESKRPAN AWKPFGNGQR ACIGRGFAMH 421 EAALALGMIL QRFQLIDHQR YRMVLKETLT IKPEGFKIKV RPRSDKDRGD FVAAGASQVS 481 TPALAQAAPR ARPDHNTPLL VLYGSNLGTA EELATRVADL AELNGFSTRL GALDQYVGHL 541 PEEGGVLIFT ASYNGAPPDN ATQFVQWLSG DLPKDAFAKL RYAVFGCGNR DWTATYQAIP 601 RLVDERLAAH GGRNIFLRGE GDARDDLEGQ FESWFAKLGP LAVKEFGIDA KFARAVDDAP 661 LYRIEPVAPA AGNAVAAAGG AVPMKVLANR ELQDCAASGR STRHIEIALP EGISYRVGDH 721 LSVMPRNDPA LVAAVAQRLG FAPDDQIKLQ VAPGRRAQLP IGEAISVGRL LGDFVELQQV 781 ATRKQIAVMA EHTRCPQTRP KLQALAGGDG AADEAYRAGV LAKRKSVYDL MQEHPACELP 841 LHAYLEMLSP LAPRYYSISS SPLRDPSRAA ITVAVVDGPA LSGRGHYRGV CSTWLAGRSV 901 GDTIHATVRA TKAGFRLPDD DRVPLIMIGP GTGLAPFRGF LQERAARQQN GATLGPALLF 961 FGCRHPAQDY LYADELQGFA AEGVVELHTA FSRGEGPKTY VQHLIAAQKD RVFTLIEQGA 1021 IIYVCGDGGK MEPDVRAALM AIHRERSGAD AAAASTWIDD 
LGACNRYVLD VWASA CYP102A14 uncultured soil bacterium ABD83817 74% to 102A12 1 MASNNKMSPI PQPPTRPVVG NMLSLDSAAP VQDLTRLAKE LGPIFWLDMM GAPIVIVSGY 61 TLVDELSVET RLDKVVRGAL RRVRAIGGDG LFTADTAEPN WSKARNILLQ PFGNRAMQSY 121 HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE 181 SLVRSLETIM MIRGLPLENF WMRRRRSDLA TDVAFMNKMV DEIVAERRKS AEASDGKKDM 241 LNAMMSGVDR STGEQLDDVN IRYQINTFLI AGHETTSGLL SYAIYALLKH PDVLKKAYAE 301 VDRVLGADIE ARPSYQQVTQ LTYITQILKE ALRLWPPAPA YGIAPLKDET IAGGKYSLKK 361 NTFISILVTA LHHDPAVWGP NPDLFDPENF SPEAEAKRPV NAWRPFGNGQ RACIGRGFAM 421 HEAALALGMI LQRFKLIDHQ RYQIRLKETL TIKPDGFKIK VRPRSGHDRT VHAEAATAAV 481 ATGAALPRAR PRPGHNTPLL VLYGSNLGTA EDLATRVADL AEVNGFATRL APLDDCAGQL 541 PDSGGVLIFC ASYNGAPPDN ATKFVGWLRG ELPNDAFAKL RYAVFGCGNR DWAATYQSVP 601 RLIDETLSAH GGKRVFPRGE GDARSDLDGQ FESWFAALGA AAVKEFGLES RFSRSADDAP 661 LYSVEPVAPS AVNAVAALGG TVPMTILVSR ELQNKSGPDA SERSTRHIEV QLPGGMTYRV 721 GDHLSVVPCN APALVDRVAR RFGFLPADQI RLAVAEGRRA QLPVGEAVSI GQLLTDFVEL 781 QQVATRKQIQ IMSEHTRCPV TKPKLVAYVG DDADSSERYR ADILSRRKSV YDLLEEFPAI 841 ELPFPAYLEM LSLLAPRYYS ISSSPTGDAS RCSITVGVVS CPASSTGRGL YRGVCPYYLA 901 SRREGESVFA TVRETKAGFR LPDDPSVPII MIGPGTGLAP FRGFLQERAA RKAGGATLGP 961 AMLFFGCRHP EQDYLYADEL KAFADEGITE LFVAFSRSEG PKTYVQHLLA TQKARVWDLI 1021 EQGAVIFVCG DGSKMEPDVK ATLVQIYRDC TGADANGGAK WIADLGAQNR YVLDVWAGG CYP102B1 Streptomyces coelicolor cosmid F43. GenEMBL AL136502 CDS 10570..12153 gene="SCF43.12" Highly similar to the N-terminal P450 domain of Bacillus megaterium 41.9% identity in 497 aa overlap. 45% to 102A1 over 433 amino acids cloned and expressed by David Lamb and Steve Kelly CYP102B2 Streptomyces avermitilis GenEMBL AP005050 Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV7426 78% to 102B1 from Streptomyces coelicolor CYP102B3 Rhodococcus sp. RHA1 No accession number Marianna A. 
Patrauchan Rha05872 Submitted to nomenclature committee 12/13/04 62% to CYP102B2 CYP102B4 Streptomyces scabies SCAB9321 David Lamb Submitted to Nomenclature committee Nov. 10, 2006 80% to CYP102B2 Streptomyces avermitilis CYP102C1 Rhodococcus sp. X309 GenEMBL AF059700.1 complement(3619-4584) runs off end of sequence partial gene 48% to 102B1 CYP102C2 Rhodococcus erythropolis PR4 YP_345116 88% to CYP102C1 1 MNGPRGSRTR PRRASLVNLA VLGVVLVHTV LMSADKCPYP KSATRAGEIT AVQPQVFESI 61 PSPAWRLPLL GDLLTVDSEK PIQKEMALAS KLGPIFEWKI VNNRVTVVSG VDLVAEVNNE 121 ALWAKSVGLP ILKLRKVAED GLFTAFNSEP NWRKAHNILS EGFSRSALRN YHPSMLRALG 181 GLTDSWDRVA DAGETIDASS DANKLALDVI GLAGFGYDFA SFIGEEHPFV GAMSRVLAHV 241 NSTSNDIPFL RKLRGNGADL QNEKDIALLR TVVDNVIAER QSKPGEHQDD LLDLMLHSAD 301 AETGEKLDPV NIRNQVFTFL VAGNETTAGT LAFALYFLSR HPDVADTARA EVADVTAGET 361 PAFEDVARMR YLRRVVDETL RLWPSAPGYF RKVRTDTTLG GRYDMPKGSW VFVLLPQLHR 421 DPVWGEDPES FDPDRFKPEN VKKRPAHAYR PFGTGPRACI GRQFALHEAV LALAIILQRY 481 NFQSDPEYKL DIRETLSLKP VGFELSLQRR CYP102D1 Streptomyces avermitilis GenEMBL AP005023 Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV575 47% to 102A3 40% to 102B1, 44% to 102C1 partial seq CYP102E1 Ralstonia metallidurans GenEMBL NZ_AAAI01000371 104500-107000 region 51% to 102D1 MSTATPAAALEPIPRDPGWPIFGNLFQITPGEVGQHLLARSRHHDGIFELDFAGKRVPFVS SVALASELCDATRFRKIIGPPLSYLRDMAGDGLFTAHSDEPNWGCAHRILMPAFSQRAM KAYFDVMLRVANRLVDKWDRQGPDADIAVADDMTRLTLDTIALAGFGYDFASFASDELDP FVMAMVGALGEAMQKLTRLPIQDRFMGRAHRQAAEDIAYMRNLVDDVIRQRRVSPTSGMD LLNLMLEARDPETDRRLDDANIRNQVITFLIAGHETTSGLLTFALYELLRNPGVLAQAY AEVDTVLPGDALPVYADLARMPVLDRVLKETLRLWPTAPAFAVAPFDDVVLGGRYRLRKD RRISVVLTALHRDPKVWANPERFDIDRFLPENEAKLPAHAYMPFGQGERACIGRQFALTE AKLALALMLRNFAFQDPHDYQFRLKETLTIKPDQFVLRVRRRRPHERFV TRQASQAVADAAQTDVRGHGQAMTVLCASSLGTARELAEQIHAGAIAAGFDAKLADLDDA VGVLPTSGLVVVVAATYNGRAPDSARKFEAMLDADDASGYRANGMRLALLGCGNSQWATY QAFPRRVFDFFITAGAVPLLPRGEADGNGDFDQAAERWLAQLWQALQADGAGTGGLGVDV QVRSMAAIRAETLPAGTQAFTVLSNDELVGDPSGLWDFSIEAPRTSTRDIRLQLPPGITY 
RTGDHIAVWPQNDAQLVSELCERLDLDPDAQATISAPHGMGRGLPIDQALPVRQLLTHFI ELQDVVSRQTLRALAQATRCPFTKQSIEQLASDDAEHGYA CYP102F1 Actinosynnema pretiosum subsp. auranticum AF453501 complement(6501..9518) maytansinoid antitumor agent ansamitocin biosynthetic gene cluster I 49% to 102A3 gene = asm30 MVATGTRIPGPKPLPLVGNLLDVLTSDLDTDVDFLDRCHREHGG IVALTFAGQRQVFASSHELVARMCSDPSWGKAVHPALEQVRDFAGDGLFTARGDEPNW GKAHRLLMPAFGPTAMRDHFPAMLDIAEQMLVRWRRFGPDHRIDVADDMTRLTLDTIA LCAFGARFNSFYRDRAHPFVDAMVRSLVEAGERAERLPGVQPFLVGRNQRYRDDIATM NRIADGIVAARAALPAGERPDDLLERMLTCADPVTGERLSARNVRYQLATFLIAGHET TSGLLSFAVHRLLAHPEVLRKAKDAVDGVLGDRVPAFEDLARLDYLGQVLRETLRLHP TAPAFALAPDEPAELGGHAIGAGEPVLVMLPTLHRDPAVWRDPDVFDPERFAPERMDE IPACAWMPFGHGARACIGRPFALQEATLVLALVLQRFDLALADPDHRLTIKQTLTLKP DSLVVRARPRADRPGATATVETVVPHQVPATHRHGTPLHVFYGSNGGSGEGLARTIAG DGAARGWATSVAPLDDAVRALPASGPVVIVSSSYNGAPPDNAAHFVRWLTQDGPDLSG VDYLVLGCGNLDWSATYQRVPTLIDEAMAAAGARRLRERGATDARADFFGDWERWYEP LWPLLSAECGVEVGEIGPRFRVVESDAADGLGDLASAVVLENRELVRGPDAGSKRHLE LRLPDGTSYRTGDYLSVLPQNHPDLVRRAVARLGTRAERVVTVESSAPTGLVPVGRAL RVDELLTRCVDLSAPAGAGVVARLAERCPCPPERAELAATTGATLLELLERFPSCAVD LALALELLPAPRTRLYSISSAAEEQRAEVALTVSVTGVTSGYLSRVRPGDRVAVGIAS PPESFRPPADNTVPVVLIAAGTGIAPFRGFLRARAALGGEPGPALLLFGCRGPELDDL YAEEFAALGDWLEVDRAYSRHPDGEVRHVQHRLWQRRDRVRELVDAGARVYLCGDATR VGPAVEEVLGRIGPGAGWLDALRAGGRYATDVF CYP102G1 Streptomyces scabies SCAB5931 David Lamb Submitted to Nomenclature committee Nov. 
10, 2006 52% to 102A5 Bacillus cereus P450 part 52% to 102A3 Bacillus subtilis, 45% to 102D1 CYP102G2 Saccharopolyspora erythraea NRRL23338 SACE_4205, 4676800,4679985 (-) STRAND 56% to CYP102G1 Streptomyces scabies, 51% to CYP102B2 MTQTPLHHDDVPVADVSGTGLTATPTQQAMELARRHGPVFRRRTREFQSLLVSDVDLVAE LSDEQRFAKAVGPALENVREFAADGLFTAYNDEFNWAKAHDILMPAFALGSMRTYHPVML RVARRLLDSWDRAAAASAPVDVPDDMTRMTLDTIGLAGFGFDFGSFGRAEPHPFVGAMVR CLDWSMTRLSRVPGTDHSERDEAFRADARYLASVVDEVINTRAAEGDTSGEDLLGLMLGA RHPADGTTLDAANIRNQIITFLIAGHETTSGTLSFALHYLAKNPTVLRLVQREADELWGD SPDPEPSFEDIGRLTYTRQVLNETLRLWPSAPAFGRQARHDTVLGGRIPMRAGEAAAVLI PMLHRSPVRGDNPELFDPARFAPEAEAARGPHAFKPFGTGERACIGRQFALDEATMVLAM LAHRYRLVDHAGYRLKVKETLTLKPEGLTLAVRARTAADRVTNRLALPVGLPSAAPGEPA DAARRPGRVLPGTGLLLLHGSNYGTCRDFAAQLALAAGELGCDTAVAPLDEYAGNLPSDR PVIVVAASYNGRPTDDAVSFSRWLDEAEPGAADGVDFAVLGVGDRNWAATYQHVPTRIDA RLAELGGTRILERGEADASGDLAGAVRRFSAALETALLERSGDPDAVAAAPEGDGPAYTV SEVTGGALDSLAARHGMVEMTVTEVADLTAPDYPRTKRFVRLALPEGTAYRTADHLAVLP VHDAALVERAAGVLGVDLDTVLDIRAKRPGRLTFDRSLTVRELLSHHVELQDPPTPDGLD ALAALNPCPPERAALRGLAEEARSGTADHRTLWDLIEDHPALRDALSWSALLELLPATRP RQYSVSSSPAVDPRHVDLMVSVLRAPARSGRGEFRGAGSRHLSEVRPGDTVLARVQPCRE DFRVAPDEPLIMVAAGTGLAPFRGVIADRRERVANGARQAPALCYFGCDAPDADYLHSAE LRAAESAGAVAMRPAFNEAPVGGQCFVQHRIAAEAGEVWALLESGARVLVCGDGRHMAPG VREAFRGIYRERTPGADDASAHEWLQAMIAGGCYVEDVYAG CYP102H1 Nocardia farcinica IFM 10152 plasmid pNF1 GenEMBL AP006619 177679..179100 51% TO 102C1 MAVTTSTTSGGHSNPPLPHPKWRLPIINDLLTINPIKPTLTSLR DAEQLGGIFERRLVDWPMIVVSDSELITEICDERNWAKHLGVPLRKMRHIARDGLFTA RNDEPNWAKAHAVLAPAFTKEAMRSYHQTMLTTIGELLDYWAKRDGQWVDVGEDMNKL TLEIIARTGFDYTFDSFTRSEVHPFVAAMLRGLTYISRNSNMPPFLQKTIGARAAARH SRDITYVRTVVDDVIKARQASGTVGDHHDLLDRMLTVPDPASDELLDTTNVRSQILTM LVAGHETSAGVLAFALYELSRRPELVAAARAEIETRFADGDLSTIAYDDVAKLRTLRR IVDETLRLYPVAPGFFREARHETTIGGGRYRFGPGDWVLVLTLHAHRDPATWGPDAGE FQPDRWLPERMRSLDGRQVFKPFGTGLRACIGRQFAYHEIVLALAHILHTFEFTPDPG YELDIAEQITLKPHRFRLRLNHR CYP102J1 Burkholderia sp. 
383 ABB05850 50% to CYP102A4 1 MKSSSLVPQP PLKPVIGHLM EVLGPSPLAK MMDLARTYGP VYWFEVFGQG YYVVSGQTLV 61 DEVCDETRFQ KCVHQSLLEL RPAIGDGLFT AFGDEPNWAK AHRVLMQAFG PLSIWSMFDK 121 MVDIADQMFL HWERFGPETP VDVSDHMTRL TLDTIALCAF DCRFNSFYRE DQHPFVDAMV 181 NTLSEAGKRE LRPKLVSKLM VKRSRQFDAD IEVMRSLATK MIEDRRKNPH VNEESMDLLD 241 RMLNGIDPVT GEKLDDENIV FQMITFLIAG HETTSGLLSF ATYFLLKNPD ILQKARDMVD 301 EVVGSETPRI EHLARLRYVE QILMETLRIW PTAPGFAVKP LADTTFGGKY AVSPDDIIMI 361 LTPMLHRDVS VWGEDVEAFR PERFAPENAE QLPPNSWKPF GNGARACIGR PFAMQEAHLV 421 LIMLLQRFDF SFADPDYELD VAETLTLKPA GFRVNVKPRA RGALKVPDTA LHARSNAQSP 481 SVAPVQSIAP GEDLSNLLVL FGGNTGSAES FARRIAGDAS RHGFHATCAP LDDFAGKLGG 541 YPAIVIVTAS YEGQPPDNAK SFVPYVEALD EGALDGVHFS VLGCGNKQWA RTYQAIPKRV 601 DEALEKAGAT RVHFRGELDS GGDFFGEFDR WYTEMWNRFA ISAGKEIPVI QHDDVALKVS 661 FAGSSREKVL NLGDMAHASI VDNRELVDIS VAGSRSKKHI ELKLPEAMTY RSGDYLAVLP 721 RNAKNNVDRV LRRFRVSWDT QVVIEGTSSN PRLPLGQAIG CGELFSSYVE LAVPATRSQV 781 SSLAAATRCP PEKVELERLS ADGFECEILG KRTTVMDLLE RFGSVDLSLE KFLDMLPALK 841 ARQYSISSSP LWKADHVTLT VAVVDAPALS GNGRHEGVAS SYLARLNTGD SLSVAVRPSN 901 ARFRPPAEPD LPMILICAGS GIAPFRGFLQ ERALQKQRGE NVGTSLLFFG IDDPDVDFLY 961 RDELDEWARC GVVEVMPAYS NRPEEGARFV QDKVWLERER ISALFSQGAT VFVCGDGKNM 1021 APAVRATLGR IYQETTGEND ESASAWIDTM EREHGRYVAD VFA 103 Family CYP103A1 Agrobacterium tumefaciens GenEMBL M19352, AF242881 CDS 141158.142426 gene="virH1" CYP103A2 Agrobacterium tumefaciens GenEMBL AF034769 GenEMBL AB016260 CDS 124584..125759 CYP103A3 Agrobacterium tumefaciens plasmid pTiAB2/73 vir region GenEMBL AF329849 892..2148 gene = virH 61% TO 103A1 MNARGPEKVSQTSGPIISASLDPDNVSVSDLDRSGHAIFAEWRP KRPFLRRQDGVYVLLRADDVLGLSSDPRTRQIETELMLNRGINEGAVFDFVRYSMLFS NNEVHSRRRSPFTRTFAFRMIENLRPQVSQLTETLFQDLKELDSFNFVEEFASKLPAV AIAGLLGLPPSDIPYFTQLVYRVARCLSPSWRDADLPDIEASAAEFKNYVQAVIDDRR SNPRDDFLSSFIRATREAEDLSPDEGLAQLMLIVLAGTDTTKTGLTALTGQLLRHRHV WEALLKDESLVPAAVEEGLRFEPPVGSYPRLALADIDLEGFILPKGSLLALCTMSALR DEKHFAHPELFDIHRKQMHWHMVFGAGAHRCLGEALARLELQEGLATVLRYAPTLSIE 
GEWPTVQGHGGVRRIAEMRVGFRRQI 104 Family CYP104A1 Agrobacterium tumefaciens GenEMBL M19352, AF242881 CDS 142447..143670 gene="virH2" CYP104A2 Agrobacterium tumefaciens GenEMBL AB016260 103A2 CDS 124584..125759 and 104A2 CDS 125919..127094 83% to 104A1 105A Subfamily CYP105A1 Streptomyces griseolus GenEMBL M36480 (1629bp) Y18556 CDS 2447..3703 Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M., Leto,K.J., Romesser,J.A. and O'Keefe,D.P. Genes for two herbicide-inducible cytochromes P-450 from Streptomyces griseolus J. Bacteriol. 172, 3335-3345 (1990) Gene suaC CYP105A2 Amycolata autotrophica GenEMBL D26543 (1197bp) Kawauchi,H., Sasaki,J., Adachi,T., Hanada,K., Beppu,T. and Horinouchi,S. Cloning and nucleotide sequence of a bacterial cytochrome P-450 VD25 gene encoding vitamin D-3 25-hydroxylase Biochim. Biophys. Acta 1219, 179-183 (1994) CYP105A3 Streptomyces carbophilus GenEMBL D30815 PIR JC4287 Watanabe,I., Nara,F. and Serizawa,N. Cloning, characterization and expression of the gene encoding cytochrome P-450sca-2 from Streptomyces carbophilus involved in production of pravastatin, a specific HMG-CoA reductase inhibitor Gene 163 (1), 81-85 (1995) 105B Subfamily CYP105B1 Streptomyces griseolus GenEMBL M36481 (1688bp) M32239 Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M., Leto,K.J., Romesser,J.A. and O'Keefe,D.P. Genes for two herbicide-inducible cytochromes P-450 from Streptomyces griseolus J. Bacteriol. 172, 3335-3345 (1990) Gene subC, SU-2 CYP105B2 Streptomyces tubercidicus strain R-922 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. 
Submitted to nomenclature committee June 2, 2003 Clone name Cyp229 78% to 105B1 CYP105B3 Saccharopolyspora erythraea NRRL23338 SACE_2842, se: 3099326,3100528 (+) STRAND 55% to CYP105B2, 54% to 105Q1, 53% to 105B1 MTSTLDESTPDYPMPRARGCPFDPPPALRDLQRETPMARVRLWDGSTPWLVTRYADQRAV LRDSHVSADMNHPTYPRQAPGGGTLSFIGMDDPEHARLRRMVGSAFAVKNVERMRPWVQR IVDEAVDELLAGPRPADLVEEFALPVPSLVICGLLDVPYADHAFFQSNSKTMINRDSTPE QRSQASGRLAEYLSDLLSSKMDTRGDDLRSRLCGRIEAGDLTLRQATEMAVLLLIAGHET TANMIALSTLLLLRHPDQLALLRESDDPDVARRAVEEMLRYLNITHGGRRRVALEDVEVA GQRVRAGEGLVLPNEIANRDPDAFPDPDRLDITREARHHVAFGFGVHQCLGQPLARLELE IVYRTLYRRAPRLALAAGIEDIPFKHDGFVYGVYELPVTW 105C Subfamily CYP105C1 Streptomyces sp. GenEMBL M31939 PIR S19629 (381 amino acids) Horii, M., Ishizaki, T., Paik, S.Y., Manome, T. and Murooka, Y. An operon containing the genes for cholesterol oxidase and a cytochrome P-450-like protein from a Streptomyces sp. J. Bacteriol. 172, 3644-3653 (1990) Gene choP 105D Subfamily CYP105D1 Streptomyces griseus GenEMBL S45823 X63601 (1700bp) PIR S24750 (412 amino acids) Trower,M.K., Lenstra,R., Omer,C., Buchholz,S.E., and Sariaslani,F.S. Cloning, nucleotide sequence determination and expression of the genes encoding cytochrome P-450soy (soyC) and ferredoxinsoy (soyB) from Streptomyces griseus. Mol. Microbiol. 6, 2125-2134 (1992) PIR S35901 (412 amino acids) Erratum. Cloning, nucleotide sequence determination and expression of the genes encoding cytochrome P-450(soy) (soyC) and ferredoxin(soy) (soyB) from Streptomyces griseus. Mol. Microbiol.
7, 1024-1025 (1993) CYP105D2 Streptomyces griseus GenEMBL AF071145 84% identical to 105D1 CYP105D3 Streptomyces sclerotialus GenEMBL AF071149 68% identical to 105D1 CYP105D4 Streptomyces lividans GenEMBL AF072709 CDS complement(1593..2813) 69% to 105D1 67% to 105D2 82% to 105D3 57% to 105A1 CYP105D5 Streptomyces coelicolor 3StF60 [Full Sequence] Sanger cosmid CDS comp(2106-3344) 98% identical to CYP105D4 cloned and expressed by David Lamb and Steve Kelly CYP105D6 Streptomyces avermitilis GenEMBL AB070949.1 69121-70371 Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV412_pteD 55% to 105D1 from Streptomyces griseus, 53% to 105D4, 54% to 105D5 (if first 17aa left off 105D5) Gene = pteD CYP105D7 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV7469 73% to 105D4 from Streptomyces lividans CYP105D8 Streptomyces tubercidicus strain I-1529 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Cyp233 68% to 105D7 CYP105D9 Streptomyces sp. JP95 GenEMBL AF509565 11774..13024 griseorhodin biosynthesis gene cluster 55% to 105D6 gene = grhO3 MTDTLDEPQTLADGAEDAPAYPVKRTCPYRMPPGYEELREKGPI SRVTLWNGRTAWLVTGNDLGRRLFPDARLSSDVLDPRFPLLAPRIEAQRQQAAAPPLV GVDDPVHARQRRMVLPSFGIRQINALRPEIQKYADDLLDTMLAKGPGVTVDLLTEYAL PMPSAVICMLLGVPYEDHHYFDERSRHVLSSSGEEQAAQAQQAFTEILAYLDDLIVRK QAEPGDTLLDELIARQLEEGKVDRQELAMIATVLLVSGHETTSNMIALSTMALLADPD QLAALRADESLMPRAVDELMRFSSIGDMLMRVAKEDIEIEGHLIRAGDGVILSTMLMN RDPGAFERPDELDIRRPAGRHVAFGYGIHQCIGQNLARAEMEIALATLFRRVPTLKLA VPAEQVPVNAPFVLQGVSELPVTW 105E Subfamily CYP105E1 Rhodococcus fascians GenEMBL Z29635 (7139bp) PIR S42052 (399 amino acids) Crespi,M., Vereecke,D.M., Temmerman,W.G., Van Montagu,M. and Desomer,J. The fas operon of Rhodococcus fascians encodes new genes required for efficient fasciation of host plants. J. Bact. 
176, 2492-2501 (1994) MAGTADLPLEMRRNGLNPTEELAQVRDRDGVIPVGELYGAPAFL VCRYEDVRRIFADSNRFSNAHTPMFAIPSGGDVIEDELAAMRAGNLIGLDPPDHTRLR HILAAEFSVHRLSRLQPRIAEIVDSALDGLEQAGQPADLMDRYALPVSLLVLCELLGV PYADRDELRDRTARLLDLSASAEQRAVAQREDRRYMATLVTRAQEQPGDDLLGILARK IGDNLSTDELISIISLIMLGGHETTASMIGLSVLALLHHPEQAAMMIEDPNCVNSGIE ELLRWLSVAHSQPPRMAVTEVQIAGVTIPAGSFVIPSLLAANRDSNLTDRPDDLDITR GVAGHLAFGHGVHFCLGHSLARMTLRTAVPAVLRRFPDLALSPSHDVRLRSASIVLGL EELQLTW CYP105F1 Streptomyces lavendulae GenEMBL AF127374 CDS 2006..3229 48% to 105C1 42% to 105B1 40% to 105D1 new subfamily in 105 CYP105F2 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to nomenclature committee Nov. 2, 2003 85% to 105F1 clone name SP8812 CYP105G1 Amycolatopsis mediterranei GenEMBL AF040571 CDS complement(5011..6066) 49% to 105C1, 105B1 new subfamily in 105 looks like an insertion in the seq from 80-120 CYP105H1 Streptomyces noursei ATCC 11455 nyst GenEMBL AF263912 CDS comp (58637..59833) gene="nysN" 47% to 105B1 46% to 105A1 46% to 105D1 function="presumably involved in modification of the nystatin macrolactone ring" CYP105H2 Streptomyces albus GenEMBL AF071143 77% to 105H1 LLIAGHETTANNIGLGVVTLLSHPQWAGDERAVEELLRLHSVAD MVALRVAVDDVEIAGQVIRKGEGIVPLLAAANHDTEVFGCPHAFDPERSERRHVAFGY GVHQCLGQNL CYP105H3 Streptomyces natalensis GenEMBL AJ278573 52789..53985 pimaricin biosynthetic gene cluster.
68% to 105H1 gene = pimG MTYTDPAAPETDPPAVDFPQRKPGVPFPPPDYADYRDRKGLVLS QLSDGKRVWLVTRHEDVRAVLTSPSISSNPEHKGFPNVGNLGVPKQDQIPGWFVGMDS PEHDRFRKALIPEFTVRRVRAMKPAIERTVDAQLDAMLAAGNTADLVADFALPIPSLV ISALLGVPPADREFFESRTRVLVSLRSSTDDDRMAAAKDLLRYINRLVEIKQKWGGDD LITRLLATGAIAPHEMSGVLMLLLIAGHETTANNIALGVVTLLANPQWIGDDRAVEET LRFHSVADLVSLRVAVQDVEIAGQLIKAGEGIVPLVAAANHDENAFECPHAFDPSRSA RHHVAFGYGVHQCLGQNLVRIEMEVAYRKLFERIPNLELAVPTDGLDIKYDGVLYGLN ELPVRW CYP105H4 Streptomyces nodosus GenEMBL AF357202 complement(62051..63250) amphotericin biosynthetic gene cluster 84% to 105H1 MTAETEMTTFAPGCPVAFPLRRPGRPFPPPEYADYRAGEGLVRS ELPASGPVWLVTRHEDVRTVLTDPRISADPSRPGFPRARRTGGAPSQSEIPGWFVALD PPEHDRFRKTLIPEFTVRKVRELRPAIQQIVDERIDALLAAGNSADLIADFALSVPSL VISDLLGVPKADRDFFEAKTKVLVTLSSTDEQRDEASKALLRYLNRLIQIKGRRPGED LISRLLQAGTMNRQELSGVSMLLLIAGHETTANNIGLGVVQLLTNPQWIGDDRIVEEM LRYYSVADLVSFRVAVEDVEIGGQLIKAGEGIVPLIAAANHDGSVFDKPEEFNPERSA RSHVAFGYGVHQCLGQNLVRVEMEIAYRTLFERIPTLELAVPVEELPLKYDGVLFGLH ELPVTWS CYP105H5 Streptomyces griseus GenEMBL AJ300302 10678..11859 Gene = canC 72% to 105H3 MTTSPGPTVVDFPRRTPREPLPLSQYAEHRKQNGLVQTHLPNGR PIWLVTRHEDVRAVLTHPRISANPDNEGFPNVGETMGVPKQEQIPGWFVGLDSPEHDR FRKVLIPEFTVRRVRELRPAIERTVDERIDAMLAGGNTADLVNDFALPVPSLVISALL GVPSADRDFFESRTRTLVAIRTSTDEERAEATRQLLRYINRLIVIKKKWRGEDLISRL LSTGKLSDEELSGVLLLLLIAGHETTANNIGLGVVTLLSHREWIGDDRLVEELLRLHS VADMVALRVAVDDVEIAGQTIRKGEGIVPLLASANHDTEAFGCPHAFNPERTERRHVA FGYGVHQCLGQNLVRVEMEIAYRKLFERIPELRLAVPEDQLAYKYDGILFGLHELPVR W CYP105J1 Amycolatopsis mediterranei rifamycin GenEMBL AF040570 CDS comp (67462..68673) 52% to AF072709 105D4 50% to 105D1 new subfamily in 105 CYP105K1 Streptomyces tendae strain Tue901 GenEMBL Y18574 CDS 6325..7557 45% to 105A3 46% to 105D1 43% to 105B1 new subfamily in 105 gene="nikF" CYP105K2 Streptomyces ansochromogenes GenEMBL AF469953 14..1246 95% to 105K1 note="involved in nikkomycin biosynthesis MTEAFDHDIPSFPMARECPMHPPAEYRELRGQEPVSRVRMPDGQ VAWLVLKHALARKLLADPRVSADRLHPAFPGRLTAEQRAATERVRRLTTRRSMIHLDG 
DEHGAHRRILTGEFSLRRIAAQRPRVQEIVDRSIDEMLAAPQPADLVEHVSQAVPSLV ICELLGVPHEQRRDFHEWAGMLVSRSVSIQERAAASDALNDFLEALVTEKERGEPADD LIGRLIARNRQTPVMTHDEIVGTAVMLLVAGHQTTANMISLGVVALLENPEHKARIAA DSSLLPPAIEEMLRYFSVVENAPARVATEDIAIGGVTIRKNEGIVVSGLAADWDDEVF GHPDRLDFERGARHHVAFGYGVHQCLGQNLARVELEIVFETLLRRVPGLSLAVPAEEL PYKDDAGIYGIYRVPVNC CYP105L1 Streptomyces fradiae GenEMBL AF055922 CDS comp (6507..7769) GenEMBL AF147703 complement(2565..3875) Fouces,R., Mellado,E., Diez,B. and Barredo,J.L. The left edge of the tylosin gene cluster from Streptomyces fradiae Microbiology (1999) In press tylH1 46% to 105A1 42% to 105D1 43% to 105B1 new subfamily in 105 MSSSGDARPSQKGILLPAARANDTDEAAGRRSIAWPVARTCPFS PPEQYAALRAEEPIARAELWDGAPVWLISRQDHVRALLADPRVSIHPAKLPRLSPSDG EAEASRSLLTLDPPDHGALRGHFIPEFGLRRVRDVRPSVEQIVTGLLDDLTARGDEAD LLADFALPMATQVICRLLDIPYEDRDYFQERTEQATRPAAGEEALEALLELRDYLDRL ISGKTGRESGDGMLGSMVAQARGGGLSHADVLDNAVLLLAAGHETTASMVTMSVLVLL QHPTAWRELTVNPGLLPGAVDELLRYLSIADGLRRSATADIEIDGHTIRAGDGLVFLL AAANRDEAVFSEPEAFDIHRSARRHVAFGYGPHQCLGQNLARMELEVALGAVLERLPA LRPTTDVAGLRLKSDSAVFGVYELPVAW CYP105L2 Micromonospora griseorubida GenEMBL AB089954 1490..2641 gene cluster for the polyketide macrolide mycinamicin 54% to 105L1 gene = mycCI MDRTCAWALPEQYAEFRQRATGWPAKVWDGSPTWLVSRYEHVRA LLVDPRVTVDPTRQPRLSEADGDGDGFRSMLMLDPPEHTRLRRMFISAFSVRQVETMR PEIEKIVDGILDRLLALEPPVDILTHLALPMSTQVICHLLGVPYEDREFFQERSELAS RPNDDRSMPALIELVEYLDGLVRTKTAHPDTGLLGTAVTERLLKGEITHQELVNNAVL LLAAGHETSANQVTLSVLTLLRHPETAAELREQPELMPNAVDELLRYHSIADGLRRAA TADIVLGDHTIRAGDGLIILLSSANHDGNTFGAEATFDIHRPARHHVAFGYGPHQCLG QNLARLEMEVTLGKLFRRVPALRLAQEPDALRVRQGSPIFGIDELLVEW CYP105M1 Streptomyces clavuligerus clavulanic GenEMBL AF200819 CDS 136..1359 GenEMBL AY034175 CDS 200..1423 GenEMBL U87786 CDS 13810..15036 function="involved in clavulanic acid biosynthesis" 48% to 105B1 42% to 105A1 41% to 105D1 new subfamily in 105 MNEAAPQSDQVAPAYPMHRVCPVDPPPQLAGLRSQKAASRVTLW DGSQVWLVTSHAGARAVLGDRRFTAVTSAPGFPMLTRTSQLVRANPESASFIRMDDPQ 
HSRLRSMLTRDFLARRAEALRPAVRELLDEILGGLVKGERPVDLVAGLTIPVPSRVIT LLFGAGDDRREFIEDRSAVLIDRGYTPEQVAKARDELDGYLRELVEERIENPGTDLIS RLVIDQVRPGHLRVEEMVPMCRLLLVAGHGTTTSQASLSLLSLLTDPELAGRLTEDPA LLPKAVEELLRFHSIVQNGLARAAVEDVQLDDVLIRAGEGVVLSLSAGNRDETVFPDP DRVDVDRDARRHLAFGHGMHQCLGQWLARVELEEILAAVLRWMPGARLAVPFEELDFR HEVSSYGLGALPVTW CYP105N1 Streptomyces coelicolor St4C2 [Full Sequence] Sanger cosmid CDS 29986-31221 45% to 105A1 new subfamily in 105 cloned and expressed by David Lamb and Steve Kelly CYP105N2 Streptomyces glaucescens cytochrome P450 GenEMBL AF071144 95% to 105N1 only 5 aa diffs 57% to AF071148 56% to AF071146 59% to 105D3 54% to 105A3 LLIAGHETTTSMIALSTLLLLDRPELPAELRNDPDLMPAAVDEL LRVLSVADSIPLRVAAEDIELSGRTVPADDGVIALLAGANHDPEQFDDPERVDFHRTD NHHVAFGYGMHQCLGQNL CYP105N3 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to nomenclature committee Nov. 2, 2003 91% to 107N1 clone name SP0881 CYP105P1 Streptomyces avermitilis GenEMBL AB070949.1 67376-68575 Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV413_pteC low 40% range to 105 subfamilies Gene = pteC CYP105P2 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to nomenclature committee Nov. 2, 2003 92% to 105P1 clone name SP7863 CYP105Q1 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV1611 49% to 105B1 from Streptomyces griseolus 46% to 105D4 and D5 CYP105Q2 Streptomyces sp. GenEMBL BD133549 78% to CYP105Q1 3 LIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHSGLRRVA 182 183 KGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGFGTHQC 350 CYP105Q3 Streptomyces sp.
GenEMBL BD133546 77% to 105Q1 139 MADTLTDAAPDTDGRVPEYPMPRATGCPLAPSPAAAELRGDRPITRVRIWNGSTPWLITR 318 319 HADQRTLLTDPRVSNDDHEPDFPHVNAHRAAIAPHTPKLITNTDAPEHTRLRRSVNAPFL 498 499 VKRIEAMRPAVQKIVDDLIDDMLAGPSPADLLTALALPVPSLVIAELLGVPYEDHHFFQE 678 679 NSNRVLDNSLTAEEAQESSRALGGYLDTLFRTKLEQPGEDVLSEMGSKVKAGEMTHQEAV 858 859 SMGVAMLIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHS 1038 1039 GLRRVAKGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGYGPH 1218 1219 QCLGQNLARLELQVVYGTLYRRVLTLRPAVPVDQLAFNHTGTTYGVKCLPVTW 1377 CYP105Q4 Mycobacterium marinum No accession number Tim Stinear MM4762 52% to 105Q1 CYP105Q4 Mycobacterium ulcerans No accession number Tim Stinear 98% to 105Q4 M. marinum = ortholog CYP105Q5 Streptomyces scabies SCAB11341 David Lamb Submitted to Nomenclature committee Nov. 10, 2006 92% to 105Q3 Streptomyces sp. CYP105Q6 Mycobacterium vanbaalenii PYR-1 ZP_01205830.1, EAS26822.1 50% to 105Q3, 86% to CYP105Q7 MTETLAQEAVSVPEYPMERTAGCPFAPPQQMLEMNQVKPLSRVRIWNGTTPWLVTGHEVARTLFADSRVS VDDRREGFPHWNEHMLSTVDKRPRSVFTSDAEEHTRFRRMLSKPFTFRRVEALRPVIQQVTDECIDEILA GPQPADMVAKLALPVPTRVISDMLGVPYEDHEFFQEHANAGLARYAAADAMQKGAMSLHQYLINLVEEKQ AHPAEDAVSDLAERVTAGEISVKEAAQLGTGLLIAGHETTANMIGIGICALLENPEQAALLRDSDDPKFI ANAVEELMRYLSIIQNGQRRVATEDIEIGGETIRAGEGIILDLAPANWDARAFPEPDKLDLTRDATQQLG FGYGRHQCVGQQLARAELQIVFHTLLRRIPTMKPAIPLEEVPFKHDRLAYGVYELPVTW CYP105Q7 Mycobacterium smegmatis MSMEG4843 TIGR 53% to CYP105Q1, 51% to 105Q3 86% to 105Q6 M. vanbaalenii, 76% to 105Q4 M. 
marinum Formerly CYP105T1 but more similar to CYP105Q sequences MSETLTQPSATDIPGYPMERAAACPFAPPPQMLDMNKAKGLSRVRIWDGS TPWLITGHEEARALFADSRVSVDDRRPGFPHWNEHMLATVHKRPRSVFTS DAEEHTRFRRMLSKPFTFRRVEGLRPAIQKITDECIDAILAGPQPADIVD KLALPVPTVVISEMLGVPYEDHEFFQEHANAGLARYAAADAMQKGAMSLH QYLIDLIEKKQAEPAEDAVSDLAERVTAGELSVKEAAQLGTGLLIAGHET TANMIGIGILALLENPEQADFLRNAEDPKVIANAVEELMRYLSIIQTGQR RVAVEDIEIGGETIKAGEGIIIDLVPANWDAKAFPEPDKLDLTRDAGQQL GFGYGRHQCVGQQLARAELQIVFHTLLRRIPTLRLAIPLEEVPFKHDRLA YGVYELPVAW CYP105R1 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV7186 CYP105S1 Mycobacterium smegmatis MSMEG0758 TIGR fas1 52% to 105E1 76% to 105S2 Mycobacterium vanbaalenii MTQAQALPPLHIRRDAFDPTPELGEIRAGEGVHVTVNPFGMQVYLVTRHE DVKTVLSDHERFSNSRPPGFVLPGAPQISAEEQASNRAGNLLGLDPPEHQ RLRRMLTPEFTIRRIKRLEPRIVEIVDAHLDAMESAGPPADIVADFALPI PSLVICELLGVPYEDRTDFQQRSARQLDLSAPMPERLELQRQGRAYMRGL VERSRTRPGDDILGMLVREHGTELTDDELIGIAGLLLLAGHETTSNMLGL GVLALLRHPDQLACVRDDPDAVGPAIEELLRWLSIVSTALPRITTTDVEL AGVTIPAGHLVFASLPAGNRDPEFIDDPDTLDIRRGAPGHLAFGHGVHHC LGAPLARMEMRIALPALLRRFPTLALAEPFEDVRWRPFHFIYGLQSLAVAW CYP105S2 Mycobacterium vanbaalenii PYR-1 ZP_01208508.1, EAS22074.1 52% to 105E1, 76% to CYP105S1 MSQAVRPELPPVHMRRDGFDPTPQLREIRETEGVRVITSAFGMSAYLVTRHEDVKTVLSDHTRFSNTRPP GFVVPGAPPIDEDEQARSRAGNLLGLDPPEHQRLRRMLTPEFTLRRMRRLQPRIAEIVDAQLDALAAARD GEASADLVQHFALPIPSLVICELLGVPYADRDDFQRRSARQLDLSIPIPERIELAREGRAYMGSLVAGAR TNPGDDILGMLVREHGAELTDDELVGIAGLLLLAGHETTSNMLALGTLALLRHPEQLAAVREDPDAVAPA VEELLRWLSIVHTAIPRITTTDVEIAGVSIPAGQLVFASLPSGNRDDEFIERPEVFDITRGAMGHLAFGH GVHHCLGAPLARMEMQIAFPALLRRFPTLAPAGEFDDVPFRSFHFIYGLKSLEVTW CYP105T1 Burkholderia fungorum GenEMBL NZ_AAAJ02000095 8366..9610 gene = Bcep2217 44% to 105H1 MRKTMTSAINDVRPQTTSTFPFARTGSPLHPPAEYARYRDGQPV TRVQMWDGRYAWIFTRMEDVKAVLSSPHFSVVPSKPGYPFLTPARAATVKSYQTFITM DPPDHTRFRRMLTRDFTQKRMEELRPQIAAYVNRLIDEMLARGSPGDLVSALALKLPV TVVSMLVGVPYEDHEDLVKWSGQRLDLEQNPTVSESAADNMLAYFDGLLQRKERDPGD 
GADMLSRLVIEQIKPGHLSRLEAIHMVNLLYFAGHETTANQIALGTLSFLLDPRQRAL LENNPGLLKNAIEEMLRFHTISHYNSCRVATADVEVGGTLIREGEGAYALIMAANRDP AAFPAPDRFDIERPNSQEHVAFSYGLHMCLGQPLARLELQVCFEALFRRLPRLRLAVP LEELPFKREMYVYGLHALPVTW CYP105U1 Streptomyces hygroscopicus strain NRRL 3602 AY179507 complement(63940..65133) Geldanamycin biosynthesis gene cluster 50% to 105B1 52% to 105B2 not 105S gene = gdmP MDEIRDYPESRAAACPFSPPLGYEELRERSAVTRVRMWDGSTPF LVTGYHEARAALGDSRFSADGTHKAMPRFVKFEVPAEVFNLGRMDDPEHARIRRMLTA NFTIRRTEAMRPMIQGIVDGLLDRLIAQGPPADLVADFAFPLPSQVIGVMLGVSDADF AEFQQASQGVMDFTASAEEMGAALGVMVDYVARMCAAKRADPGDDLLSRLIVDQELTG GLTQQQVVATALVLLLAGHETTANMIALSTVLLLSHPEQLARLRADAGLMGNAVDELL RYITIVQEGTGRVATEDVEVGGVLIPGGEGVIINLPSANRDPHFADAHELDLSRPNAR EHVAFGFGVHQCLGQTLARVELQIALETLLRRLPTLRLEVPFDDLAFLYESMNFGVAR VPVAW CYP105V1 Streptomyces sp. HK803 GenEMBL AY354515 36297..37508 Gene = plmT4 43% to CYP105Q1 MSQLSSELPAFPMSKAKGCPLDPPPEYAQLRSDRPVAKARLWDG KEVWLITGYDEIRSIFTDPRISVDNTQPGYPWLSEQARTVVLTGGVKPVGRMDPPEHT AMRRMLGQGFLVKKIQNMRGDVEALVNELIDDILAGPRPTDLVPSLAMPVPSTALGWV LGVPPADKRLISLVPRLFDEDSGLEGAMEARAELFAYIDELITHRENQPGDDIISHLV GYYQKGELSRVSVLTQSVTLIAAALDTTRSMITNGILALLQHPEQAAALIEDPDLVPA AVEELLRYTVVTEFSSKRVAAADIEIAGETIKAGDGIICLISAGNRDEKVFTDPDTLD VRRDAKQHLGFGAGIHTCIGKQLARMELEVVYGTLFRRIPELRLAVPFDQLVFRNTFD VQGVRALPVTW CYP105W1 Micromonospora echinospora GenEMBL AF497482 84045..85229 Gene = calE10 calicheamicin biosynthetic locus 45% to CYP105K1 47% to 105D4 MPRRCPFGPPAEYARLRTERPVARLPMLGGNTAWVVSRYADVKR VLSDPRMSADRRRAGFPRFAPTTESQRQASFANFRPPLNWMDPPEHTAARRQIVDEFA ARRVRQLRPLVERVVDEHLDAMTAGRSSADLVPSFSYPVPSRVICEMLGVPYGEHAFF ERRSTRMLSRGVPADERARCAREIREFLDGVVTDKERHPGDDVLSRLLAAQRAAGEPD HEAVVSMAFVLLVAGHVTTSNMISLSVLALLTHPERLARLRAEPDRFPAAVEELLRYF TIVEAATARTATADVTVGGVTIRAGEGVVALGQAANRDPAAFDRPDEFDPDRDARHHL AFGYGRHICPGQHLARLELDVALSRLVRRLPGLRLTVDVDDLPLKEDGNIFGLHALPVAW CYP105X1 Pseudonocardia autotrophica same as Amycolata autotrophica GenEMBL AF525299 2766..3974 Gene = pauC P-450 gene cluster 49% to 105A3 
MAEDTLGQDFPMQRQCPFEPPKEYERLRAEQPISRVRMPDGTPA WLVTLHEDVRTVLASPAFSSDLAHPGMPAVNPEIRTIARQQRPPFSRMDPPEHSFFRR MLIPEFTVKRTKTLRAGIQSVVDGLIDDLLRKSPPVDLVDEFALPVPSLVICQLLGVP YSRHEFFQQQARVILSRQSTREQVGAAFTALRAYLDTLVEEKLHTPGDDLTSRLATEH LEPTGDVRRQDLVASCMLLLTAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEA VEELVRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDI HRGNRRHACFGYGVHQCIGQHLARTELEVAFSTLFTRIPTLQIAAPSDELDYDHDGML FGLHELPVTW CYP105X2 Amycolata autotrophica same as Pseudonocardia autotrophica GenEMBL AF071148 99% to 105X3 94% to 105X1 61% to 165B2 LLIAGHETTSHMISLGVTALLERPDQLAALQNDLTLLPEAVEEL LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN RRHVAFGYGVHQCLGQNL CYP105X3 Micromonospora inyoensis GenEMBL AF071146 99% to 105X2 61% to 165B2 60% to 105A3 LLIAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEAVEEL LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN RRHVAFGYGVHQCLGQNL CYP105Y1 Rhodococcus sp. RHA1 No accession number Marianna A. Patrauchan Rha04313 Submitted to nomenclature committee 12/13/04 48% to CYP105X1, 46% to 105D7 CYP105Z1 Streptomyces scabies SCAB17851 David Lamb Submitted to Nomenclature committee Nov. 10, 2006 49% to 105B1 Streptomyces griseolus 51% to 105B2 Streptomyces tubercidicus possible ortholog CYP105AA1 Streptomyces tubercidicus strain R-922 GenEMBL AY549204 Istvan Molnar, Syngenta Biotechnology, Inc. Jungmann,V., Molnar,I., Hammer,P.E., Hill,D.S., Zirkle,R., Buckel,T.G., Buckel,D., Ligon,J.M. and Pachlatko,J.P. Biocatalytic conversion of avermectin to 4'-oxo-avermectin: characterization of biocatalytically active bacterial strains and of cytochrome p450 monooxygenase enzymes and their genes Appl. Environ. Microbiol. 71 (11), 6968-6976 (2005) Submitted to nomenclature committee June 2, 2003 Clone name Cyp230 56% to CYP105AA2 formerly 105S1, but that name was already assigned to a Mycobacterium smegmatis sequence (my error). CYP105AA2 Streptomyces tubercidicus strain I-1529 GenEMBL AY549201 Istvan Molnar, Syngenta Biotechnology, Inc. 
Jungmann,V., Molnar,I., Hammer,P.E., Hill,D.S., Zirkle,R., Buckel,T.G., Buckel,D., Ligon,J.M. and Pachlatko,J.P. Biocatalytic conversion of avermectin to 4'-oxo-avermectin: characterization of biocatalytically active bacterial strains and of cytochrome p450 monooxygenase enzymes and their genes Appl. Environ. Microbiol. 71 (11), 6968-6976 (2005) Submitted to nomenclature committee June 2, 2003 Clone name Cyp234 56% to CYP105AA1 formerly 105S2, but that subfamily was already assigned to a Mycobacterium smegmatis sequence (my error). CYP105AB1 Saccharopolyspora erythraea NRRL23338 SACE_3429, se: 3784062,3785189 (+) STRAND, 52% to CYP105W1, 51% to 105K1 LRSQEPVKRVRTIGGGTAWLVTRHEDVRRVLSDPRMSSDRTMPGFPSLVPGRRAIVAENK QAMIGMDGQEHAEARRAVIGEFTVRRINRMRPRIQEIVDECVDRMLAAGGPVDLVRELSL PVPSLVICELLGVPYSDHDFFQSRSALMISRSTPPERRRDVVLELRRYLDELVAEKVREP ADDLLGRQVAQQSEKGEVDREGLVSLAFLLLIAGHETTANMISLGTLALLDNPDQLARIT EDPARTPAAVEELLRYFSIVDGATSRTALADIEIGGVLIREGEGVVAVGLSANRDPEAFD SPDELDLDRQARNHVAFGFGAHQCLGQNLARVELQIVFDTLFRRIPGLRLADGLDGIRFK DDALVYGAHEMSVTW CYP105AB2 Salinispora tropica (marine actinomycete) Strop_1339 complement(1505346..1506596) 57% TO CYP105AB1 Saccharopolyspora erythraea 48% to CYP105K1 Y18574.1 Streptomyces tendae MTETASIATTRTASGQLTDAEFPVQRGCPFTTPTEYEQIREESSIAKVRLKNGGEAWWIA GHELGRSVLADRRFSSDRRRDNFPFVSTDPETRAQLQSQPTSMLGMDGAEHAQTRRALMG EFTVRRMAGLRPRIQQIVDQHIDEMLATPQRSVDLVEALSLPVPSLVICELLGVPYADHD FFQGLTGPLLRHTTPPEVRLRIQEELNTYLGTLIDRKLTDPTDDLLSRQIAKHRDNGTFD RASMVSLAFLLLVAGHETTANMISLGVVGLLQHPDQLVIIKDDPDKTPLAVEELLRYFTI ADSVTARVATEDVQLGDTTINAGDGVVISGLAADRDPTVFAEPDRLDLERGARHHVAFGF GPHQCIGQTLARMELRIVFDTLFHRIPTLRLAAPLDDIPFKSDAFVYGIEELPVAW CYP105AC1 Saccharopolyspora erythraea NRRL23338 SACE_4243, se: 4724009,4725226 (-) STRAND, 52% to CYP105AA2, 52% to CYP105AA1 MQKHAPHNADDVLESLPRDRPSGCPFDPPEGLAEIRGQRPLTRLVYGDGHVGWLATGHAV VRAVLADRRFSSRYELMHFPVAMPGLPAQIPPAQVGDITGIDPPEHTRYRKLLTGKFTVR RMRALTERVEQITAERLDAMQRLGPPVDLVEAYAQPIPALMICELLGVPYDRLEEFLGLV 
AASGDRDLTPEEQFDAFAKIQEFVRELVPAKRAKPTDDLLSDLTTTELTDQELAGIGGLL LAAGLDTTANILALGTFALLRNPEQIAALREGDADRAVEELLRYLSIAHTGMRSALEDVE IDGTLIRAGETVTLSIQAANRDPRRFTDPDALDLRRHAAGHLSFGHGIHQCLGQQLARVE MRVAFPALFNRFPGLRLAVAPEEVPLRGDMNIYGVHGLPVTWDGA CYP105 fragment Streptoalloteichus hindustanus AF071147 66% to 105AA2, 63% to 105Y1 61% to 105C1 59% to AF040570 CDS 2652..3842 (CYP166A1) LLIAGHETTANMLALGAFALLEHPEQLAELRANPDLMPGAVEEL MRYLSIVHIGPVRTAVADVEIEGQLIRAGESVTVSVPAANWDPAKFPEPERLDLTRRT SGHLAFGHGVHQCLRQNL 106 Family CYP106A1 Bacillus megaterium GenEMBL X16610 Gene BM-1 CYP106A2 Bacillus megaterium GenEMBL Z21972 (4317bp) PIR S32216 (410 amino acids) PIR S39924 (410 amino acids) Swiss Q06069 (410 amino acids) Rauschenbach,R., Isernhagen,M., Noeske-Jungblut,C., Boidol,W. and Siewert,G. Cloning, sequencing and expression of the genes for cytochrome P450meg, the steroid-15beta-monooxygenase from Bacillus megaterium ATCC 13368. Molec. Gen. Genet. 241, 170-176 (1993) CYP106B1 Bacillus anthracis str. Ames Genpept AAP26480 47% to 106A2 47% to 109B1 1 MASPENVILV HEISKLKTKE ELWNPYEWYQ FMRDNHPVHY DDEQDVWNVF LYDDVNRVLS 61 DYSLFSSRRE RRQFAIPPLE TRININSTDP PEHRNVRSIV SKAFTPRSLE QWKPRIQSIA 121 NELVKDIENC SEVDIVEQFA APLPVTVISD LLGVPTTDRK KIKAWSDILF MPYSKEKFND 181 LDAEKGIALN EFKAYLLPIV QEKRYHLTDD IISDLIRAEY EGERLTDEEI VTFSLGLLAA 241 GNETTTNLII NSFYCFLVDS PATYKEVREK PKLISKAVEE VLRYRFPVTL ARRITEDTNI 301 FGPLMKKDQM VVAWVSAANL DEKKFSQASK FNIHRIGNEK HLTFGKGPHF CLGAPLARLE 361 AEIALTTFIN AFEKIALSPS FNIEQCILEN EQTLKFLPIR LKPQ CYP106B2P Bacillus cereus ATCC 14579 GenPept AAP09572 GenEMBL AE017006 83% to 106B1 54% to CYP109B1 YjiB Z99110 Bacillus subtilis I -helix 1 MTSVITDGEI VTFSLGLLAA GNETTTNLII NSFYCFLVDS PGIYEELRKE PNLILKAIEE 61 VLRYRFPVTL TRRITALSER ESPSPLGMG CYP106B3P Bacillus cereus ATCC 14579 GenPept AAP09575 GenEMBL AE017006 87% to 106B1 54% to 106A2 C-term fragment LKEDTNIFGPF 1 MKKNQMIVAW VSAANLDEKK FSQASQFNVH RTGNEKHLTF GKGPHFCLGA PLARLEAEIA 61 LTTFINAFEK IELFPSFCLE KCILENEQTL KYLPIRLKAT 107A 
Subfamily CYP107A1 Saccharopolyspora erythraea GenEMBL X60379 Swiss Q00441 (406 amino acids) Haydock S.F., Dowson J.A., Dhillon N., Roberts G.A., Cortes J., Leadlay P.F. Cloning and sequence analysis of genes involved in erythromycin biosynthesis in Saccharopolyspora erythraea: sequence similarities between eryG and a family of S-adenosylmethionine-dependent methyltransferases. Mol. Gen. Genet. 230, 120-128 (1991). Weber J.M., Leung J.O., Swanson S.J., Idler K.B., Mcalpine J.B. An erythromycin derivative produced by targeted gene disruption in Saccharopolyspora erythraea. Science 252, 114-117 (1991) CYP107A1 Saccharopolyspora erythraea NRRL23338 SACE_0730, 825267,826481 (-) strand EryF 6-deoxyerythronolide B hydroxylase (6-DEB hydroxylase) MTTVPDLESDSFHVDWYRTYAELRETAPVTPVRFLGQDAWLVTG YDEAKAALSDLRLSSDPKKKYPGVEVEFPAYLGFPEDVRNYFATNMGTSDPPTHTRLR KLVSQEFTVRRVEAMRPRVEQITAELLDEVGDSGVVDIVDRFAHPLPIKVICELLGVD EKYRGEFGRWSSEILVMDPERAEQRGQAAREVVNFILDLVERRRTEPGDDLLSALIRV QDDDDGRLSADELTSIALVLLLAGFEASVSLIGIGTYLLLTHPDQLALVRRDPSAL PNAVEEILRYIAPPETTTRFAAEEVEIRGVAIPQYSTVLVANGAANRDPKQFPDPHRF DVTRDTRGHLSFGQGIHFCMGRPLAKLEGEVALRALFGRFPALSLGIDADDVVWRRSL LLRGIDHLPVRLDG CYP107A2 Streptomyces rochei plasmid pSLA2-L NC_004808 complement(44847..46067) AB088224.1 complement(44847..46067) 64% to 107A1 note="ORF26 (406 aa), lankamycin biosynthesis protein similar to M54983-1 Saccharopolyspora erythraea 6-deoxyerythronolide B hydroxylase, EryF CYP107A1 MTTDAHTAVPSLDSDLFHIDQYEAYAALREREPVSKVSFIGREA FLITRHAEAKAALGDLRLSNDFKKQPPGVELPTYHGIPEDVRPYFANNMGSNDPPAHT RLRRLVSREFTARRVESMRTRVAQLAEHLLDGLAGERETDLVERFAYPLPITVISELL GVEERYQGDFGRWSNEFLVIDADRVEQREHAARALVGFILELVDRRRADPGSDLLSAL IHVHDEDEDRLSTDELASVVLILLIAGFETSVSLIAMATYLLLTHPGELAKVRADPSL VPNAVDEVLRFLGPAEITTRGTLEPVEIGGVHIPAHSTVLIAGAAANRDPRRFPDPER FDVTRDTGGHLSFGHGIHFCVGGPLARLEGEIALRALLNRFPGLDLAIPAEQVRWRRS FLRGIESLPVRLGR 107B Subfamily CYP107B1 Saccharopolyspora erythraea GenEMBL M83110 Swiss P33271 (405 amino acids) PIR B42606 (405 amino acids) Andersen J.F., 
Hutchinson C.R. Characterization of Saccharopolyspora erythraea cytochrome P-450 genes and enzymes, including 6-deoxyerythronolide B hydroxylase. J. Bacteriol. 174, 725-735 (1992) CYP107B1 Saccharopolyspora erythraea NRRL23338 SACE_5814, se: 6524921,6526138 (-) STRAND MTTGEVPDLLAFDDAFAQDRHNRYARMREEPVQRIRTVNGLDAWLITRYEDVKQALLDPR IAKDFGRTQQIIEKRLADAERRPGFSPDLGPHMLNTDPPDHTRLRKLVVKAFTARRVEGL RPRIEQITDDLLDRLAGRSEVDLIDEFAFPLPITVISELMGVEDSRRDDFRSWTNVLVDG SQPEAQAQASVAMVEYLTELIAKKRTEPGDDLLTALLEAVEDGDRLSEGELIAMVFLLLV AGHETTVNLIGNCVLSLLGNPDQLAALRNDPSLLPGAIEETLRYESPVANGTFRHTAEAV RFGDVVIPEGELVWVALGAANRDGERFEDPDRFDITRETTGHVAFGHGIHFCVGAALARL EAQIAVGRLLERFPDLRMAASPDDLRWRFSVLMRGLEKLPVRPGA CYP107B2 Streptomyces sp. GenEMBL BD133548 58% to 107B1 3 LIAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDSPVGIATFRFSTE 182 183 ALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFGFGMHHC 344 107C Subfamily CYP107C1 Streptomyces thermotolerans GenEMBL D30759 (3267bp complete sequence of CarA) Arisawa,A., Kawamura,N., Takeda,K., Tsunekawa,N., Okamura,K. and Okamoto,R. Cloning of a macrolide antibiotic biosynthesis gene acyA, which encodes 3-O-acyltransferase, from Streptomyces thermotolerans and its use for direct fermentative production of a hybrid macrolide. Appl. Environ. Microbiol. 60, 2657-2660 (1994) Arisawa,A., Tsunekawa,N., Okamura,K. and Okamoto,R. Nucleotide sequence analysis of carbomycin biosynthetic genes including macrolide antibiotics 3-O-acyltransferase gene from Streptomyces thermotolerans. unpublished (1994) CYP107C1 Streptomyces thermotolerans GenEMBL M80346 (2393bp C-terminal fragment of CarA) Schoner,B.E., Geistlich,M., Rosteck,P., Rao,R.N., Seno,E., Reynolds,P., Cox,K., Burgett,S. and Hershberger,C.L. Sequence similarity between macrolide resistance determinants and ATP binding transport proteins. Gene 115, 93-96 (1992) Note: the P450 fragment called carX is equivalent to the C-terminal of CarA.
107D Subfamily CYP107D1 Streptomyces antibioticus GenEMBL L37200 (1400bp) Rodriguez,A.M., Olano,C., Mendez,C., Hutchinson,C.R. and Salas,J.A. A cytochrome P450-like gene possibly involved in oleandomycin biosynthesis by Streptomyces antibioticus. unpublished (1994) 107E Subfamily CYP107E1 Micromonospora griseorubida GenEMBL D16098 (2168bp) Inouye,M., Takada,Y., Muto,N., Horinouchi,S. and Beppu,T. Cloning and nucleotide sequences of a gene governing mycinamicinIV hydroxylation. unpublished (1993) CYP107E2 Saccharopolyspora erythraea NRRL23338 SACE_1426, se: 1577490,1578686 (+) STRAND 58% to 107E1, 56% to 107N1 MPEPRPYPFSAAERLNLDPFYARLRAQEPMSRVKLPYGEAAWLATRYEDAKVVLADPRFS RAAVLEKDEPRMRPGITGGGILSMDPPDHTRLRRLVAKAFTQRRVERLRPRTQEIADGLV DRMIEHGSPADLVEEFALPLPITVICELLGVPYEDRDDFREWSDAFLSTTKLTPEQVVDY MDRMFGYMAGLIAKRRVDPQDDLMSALIEARDEHDKLTEQEMVQLAAGILVAGHETTATQ IPNFVYVLLTHPDQLEGLLADLDGLPRAVEELTRYVPLGVAAVFARYAVEDVELGGVTVR AGEPVLVSASSANRDEAVFDDPDRLDLTRENNAHIGFGHGPHHCLGAQLARLELQVGLRT LLTRLPGLRFAGGEDDVVWKEGMLVRGPSKLEVAWQSE CYP107E3 Salinispora tropica (marine actinomycete) Strop_2770 complement(3139855..3141042) 59% TO CYP107E2 Saccharopolyspora erythraea 52% to CYP107E1 D16098 Micromonospora griseorubida MTIDQEIRKYPFCESPGIGIDPTYGLLRSTEPLARVQLPYGEVSWLATRYEDVKTVLTDP RFSRAAAQGKDQPRTREEMTYEGIIGLDPPDHTRLRKLAGKALTARRVNAIRADAQRIAN EYVDEMIAKGSPGDLVELFALPYPVTVICELLGVPFEDRAQFRIWTEGLTSTSEQLMVYA EQLFGYMGKLVAQRREEPTDDLLGALVKARDEGDRLTEQELLSIAGVGLLLTGVETVSTH IPNFVYALLTHPELMAQLRADRSLVPAAVEELLRMIPLNPAAMFPRYAVEDVTLSGITVR AGEPVLVSLPGANRDPEVFENPETFDFTREQNPHVAFGHGPHHCLGAQLARMELQVALHT VLDRFPDLSLADGDEGVSWKSGLLVRGPSRLLVAW 107F Subfamily CYP107F1 Streptomyces griseus GenEMBL D45916 (2787bp) AB018074 CDS 341-1561 Ueda,K. and Horinouchi,S. Cloning and Nucleotide Sequence of a Gene Involved in Redbrown Pigment Biosynthesis in S.
griseus Unpublished (1995) CYP107F2 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV1171 55% to 107F1 this subfamily is on the outskirts of CYP107 107G Subfamily CYP107G1 Streptomyces hygroscopicus GenEMBL X86780 (107379bp) complement (91764-92978) rapN 107H Subfamily CYP107H1 Bacillus subtilis GenEMBL U51868 (10153bp) Z99119, AF008220 coding region 7164-8351 pimelic acid biosynthesis gene name bioI 107J Subfamily CYP107J1 Bacillus subtilis GenEMBL Y11043 U93876, Z99117 Belitsky, B. R., M. C. Gustafsson, A. L. Sonenshein, and C. Von Wachenfeldt. An lrp-like gene of Bacillus subtilis involved in branched-chain amino acid transport. J Bacteriol. 179, 5448-57 (1997). gene name cypA 42.6% identical to 107B1 also called yrdE CYP107J2 Bacillus anthracis str. Ames GenPept AAP26475 58% to 107J1 cypA of Bacillus subtilis 1 MAMKNKVGIR IEDGINLASA QFKEDAYEIY KESRKVQPVL FVNKTELGAE WLITRYEDAL 61 PLLKDNRLKK DPANVFSQDT LNVFLTVDNS DYLTTHMLNS DPPNHNRLRS LVQKVFTPKM 121 IAQLEGRIQD IADDLLNEVE RKGSLNLVDD YSFPLPIIVI SEMLGIPKED QAKFRIWSHA 181 VIAYPETPEE IKETEKQLSE FITYLQYLVD MKRKEPKEDL VSALILAESE GHKLSARELY 241 SMIMLLIVAG HETTVNLITN TVLALLENPN QLQLLKENPK LIDAAIEEGL RYYSPVEVTT 301 SRWADEPFQI HDQTIEKGDM VVIALAAANR DETVFENPEV FDITRENNRH IAFGHGSHFC 361 LGAPLARLEA KIAITTLFER MPELQIKGNR EDIKWQGNYL MRSLEELPLT F CYP107J3 Bacillus cereus ATCC 14579 GenPept AAP09568 59% to 107J1 cypA Y11043 Bacillus subtilis 1 MKNKVGLSIE DGINLASAQF KEDAYEIYKE SRKKQPILFV NQVEIGKEWL ITRYEDALPL 61 LKDNRLKKDW TNVFSQDIKN MYLSVDNSDH LTTHMLNSDP PNHSRLRSLV QKAFTPKMIA 121 QLDGRIQRIA DDLISDIERK GTLNLVDDYS FPLPIIVISE MLGIPKEDQA KFRIWSHAVI 181 ASPETPEEIK ETEKQLSEFI TYLQYLVDIK RKEPKEDLVS ALILAESEGH KLSARELYSM 241 IMLLIVAGHE TTVNLITNTV LALLENPNQL QLLKDNPKLI DSAIEEGLRY YSPVEVTTAR 301 WAAEPFQIHH QTIQKGDMVI IALASANRDE TVFENPEIFD ITRENNRHIA FGHGSHFCLG 361 APLARLEAKI AITTLFNRMP ELQIKGNREE IKWQGNYLMR SLEELPLTF CYP107J4P Bacillus cereus ATCC 14579 GenPept AAP09593 46% to 
CYP107J3 in same genomic region 47% to CYP107Y1 SAV2377 AP005030 Streptomyces avermitilis 50% to 107H1 1 MKEPQLQQHL EKFIQYIEAL VNEKRLNPDA DLISELVQTK EQEDKLSNNE LLSTIWLLII 61 AGHETTVNLI SNGLLALLQH PEQMNLIREN PSLIPSAVDE LLRHSGPVMF ISRLASEDMT 121 IHGKRIPKGD LVLLSLTAAN IDPQKFTYPE TLNISREENN HLAFGAGIHH CLGAPLARLE 181 GQIALGTLLQ RLPNLRLAIK PDQLNYNHSK IRSLVNLPVV F CYP107K1 Bacillus subtilis GenEMBL AL009126 Z99113 comp(76702-77832) polyketide hydroxylase pksS just over 41% identical to CYP107J1 CYP107L1 Streptomyces venezuelae GenEMBL AF087022 GenEMBL AF079139 CDS 122..1372 pikC gene function="catalyzes the hydroxylation of YC-17 into methymycin and neomethymycin and narbomycin into pikromycin" 51% to 107B1 47% to 107A1 44% to AF254925 42% to 107J1 41% to AL049754 new CYP107 subfamily CYP107L2 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV1987 60% to 107L1 from Streptomyces venezuelae CYP107L3 Streptomyces tubercidicus strain I-1529 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name CypLA 60% to CYP107L1 91% to 107L4 CYP107L4 Streptomyces tubercidicus strain R-922 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name CypLC 61% to CYP107L1 91% to 107L3 CYP107L5 Streptomyces sp. GenEMBL BD133547 68% to 107L2 3 LIAGHETTVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAE 182 183 PLEIGGTVIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGFGTHRC 344 CYP107L6 Streptomyces sp. 
GenEMBL BD133544 72% to 107L2 MGHEHVIDLGEYGPGFTENPHPVYAELRARGPVHRVRLPKHDAHHEAWLVVGYEEARAAL ADPRLSKDGSTIGVTFLDEELIGKYLLIADPPQHTRLRGLIAREFTGRRVERLRPRVQEI TDSLLDEMLPRGRADLVESFAYPLPLTVICELLGVPEIDRAAFRKLSTEAVAPTSGESEY AAFVQLAAYLEELVEEKRCAPPADDLLSALIRTTDEDGDRLSPAELRGMAFILLIAGHET TVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAEPLEIGGT VIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGHGIHFCLGAPLARLEARVA LRALLERCPGLTPDGAPGEWLPGMLIRGVRSLPVRW* CYP107L7P Streptomyces narbonensis GenEMBL AF521878 13901..14661 desosamine biosynthetic gene cluster 91% to 107L1 gene = nbmL note="frameshift and deletion generates premature stop codon and truncated protein" MSRTHQGTTASRPVLDLAALGQDFAADPYPTYARLRAEGPAHRV RTPEGDEVWLVVGYDTARAVLADPRFSKDWRNSATPPTEAEAALSHNMLESDPRCGPT (deletion) ALRADLTLLDGAVEEMLRYGGPVESATYRFPVEPVDLDGTVLPAGETVLVVLAD AHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCTGAPLARMEARIAVRALLERCPDLALD VSPGELFWYPNPMIRGLESLPIRWRSGREAGRRVPVEPACRP* CYP107L8 Streptomyces sp. HK803 GenEMBL AY354515 complement(72672..73871) Gene = plmS2 56% to CYP107L6 MVTVDLSAYGPGFFTDPYPYYARLREAGPVHEIVLADGDRFWLI VGYDEARAALADPRLAKSLDPPSEDERHVLITDPPDHTRLRRLVSREFTARRVEAMRP RVQEITDGLLDEMVAGRRRADLVPSLGSPLPITVLCELLGVPLADREDFRGWTERVLV PAEPDTIAWWKSRGFAQAGMALTDYLKNMIEDKRRSTPTGDLISSLLRTTAEDNDRLS AAELHSMVFILIVAGHETTANLITNGVRALLAHPEQLAALRTDPEGLIDQAVEEMLRY DGPVETSTKRFTLEAVRYGATKIPPGETLLVSIAATGRDPAQFERPDTFDIHRGTTGT RSGHVAFGHGIHFCLGAGLARMESRVAILTLLRRCPDLALDIDPAGLDWLPGIRVRGV RSLPVRW CYP107L9 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to nomenclature committee Nov. 
2, 2003 62% to 107L6 before frameshift at C-term clone name SP0854 CYP107M1 Actinomadura hibisca GenEMBL D87924 CDS complement(6299..7534) 45% to AF127374 CDS 3226..4458 44% to AF254925 45% to 107D1 44% to 107G1, 107E1 new subfamily in 107 CYP107N1 Streptomyces lavendulae GenEMBL AF127374 CDS 3226..4458 50% to 107D1 52% to AF254925 47% to 107E1 new subfamily in 107 CYP107P1 Streptomyces coelicolor cosmid H10 GenEMBL AL049754 CDS complement(10413..11648) 41% to AF087022 40% to 107B1 40% to 107G1 40% to 107D1 new subfamily in 107 cloned and expressed by David Lamb and Steve Kelly CYP107P2 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV4539 86% to 107P1 from Streptomyces coelicolor CYP107P3 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to nomenclature committee Nov. 2, 2003 78% to 107P2 missing 156 aa at N-term C-term may be frameshifted clone name SP0887 CYP107Q1 Amycolatopsis mediterranei GenEMBL AF040571 CDS complement(781..>2316) 66% to AF040570 comp(68704..69969) 43% to 107C1 41% to 107B1 40% to 107A1 new subfamily in 107 CYP107Q2 Amycolatopsis mediterranei GenEMBL AF040570 CDS comp (68704..69969) 66% to AF040571 complement(781..>2316) new subfamily in 107 CYP107R1 Streptomyces maritimus GenEMBL AF254925 CDS comp (18384..19589) gene="encR" 53% to AF127374 CDS 3226..4458 49% to 107E1 new subfamily in 107 MTTHTQQLRDFPFAPPAELHMEPAFAQLREEEPISRVRLPYGGE AWLVTRYQDIKTVLGDPRFSRAATQHAQAPRIQPDPAGEGVLMSLDPPDHTRLRKTVA GVFTKRRVEDLRPATQRIAEELLEAMEASGAPADLVASYALPLPVTVICDLLGVPGDD REQLRGWSDALLSTTACTPAESAAAAQAMADHFAALVSQRRRQPTDDLLGALVQTWDR EEGLLRDEELVLLTRDLLIAGHETTASQIANCTYLLLQRPHDMDRLRTDPSAMASAVE ELLRFIPLGSGSFRARVATEPVELCGVRIQPGDTVFAPTVAANWDPDVFAEPGRLDID RSPNPHVAFGHGVHHCLGAQLARLELQVALGVLLRRLPRLRLAVDEAEIVWKTGMQVR GPKTLPVKW CYP107S1 Pseudomonas aeruginosa NZ_AABQ07000001 NC_002516 3741011..3742267 locus_tag = PA3331 47% to 107B1 CYP107T1 Streptomyces coelicolor StH63 Sanger cosmid 51% to 
CYP107L1 CDS 16028-17233 cloned and expressed by David Lamb and Steve Kelly CYP107U1 Streptomyces coelicolor StE41 Sanger cosmid comp(7438-8739) 44% to CYP107B1 cloned and expressed by David Lamb and Steve Kelly CYP107U2 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV3536 85% to 107U1 from Streptomyces coelicolor CYP107U3 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to nomenclature committee Nov. 2, 2003 84% to 107U1 missing 90 aa at N-term clone name SP0819 CYP107U4 Streptomyces scabies SCAB54411 David Lamb Submitted to nomenclature committee Nov. 10, 2006 90% to 107U1 Streptomyces coelicolor CYP107V1 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV3519 low 40% range with some 107 subfamilies CYP107W1 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV2894_olmB low 40% to 107 subfamilies CYP107X1 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV6249 49% to 107L1 from Streptomyces venezuelae CYP107X2 Saccharopolyspora erythraea NRRL23338 SACE_1158, se:1279482,1280657 (-) STRAND 57% to 107X1 MRPVEIDDEFVTCPHAAYARLREQGPVHRAVAPDGSRVWLVTRYDDVRAALADSRLSLDK AHATDGYRGLSLPPALDANLLNMDAPEHTRLRRTVTRAFTAHRTELLRPRVQEIADELLA AVAGQERAELMSAFAGPLPITVICELLGVDARDRPDFRAWTDEMLAPSTPDRARDSLRSL YAFLVDLIARKRAEPGADMPSTLVGLRDEDGSLTEDELTSTAFLVLFAGYENTVNLIGNG LAALLARPAQLAAVRSDRGLLPSTVEELLRFDPPPQLSIRRFPKEDLEIGGVRIPAGDTV LLSLVSAHHDPARFTSPGELIPDRADNAHLAFGHGPHFCIGAPLARMEAEVAFSTVLTRF PALSLAVDPAELRWRPSFRNRGLRELPVRLS CYP107Y1 Streptomyces avermitilis No accession number Submitted by David Lamb and Haruo Ikeda 9/3/02 Clone name SAV2377 50% to 107L1 from Streptomyces venezuelae CYP107Z1 Streptomyces rimosus ssp. paromomycinus strain R-2374 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. 
Submitted to nomenclature committee June 2, 2003 Clone name Ema11 96% to CYP107Z2v1 CYP107Z2v1 Streptomyces albofaciens strain C-0083 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema8 96% to 107Z2v2 and CYP107Z1 CYP107Z2v2 Streptomyces rimosus ssp. paromomycinus strain BOEH-4355 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema3 96% to CYP107Z2v1 95% to CYP107Z1 CYP107Z3 Streptomyces sp. strain IHS-0435 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema7 76% to 107Z12 CYP107Z4 Streptomyces lydicus strain NRAB-0114 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema16 82% to 107Z12 CYP107Z5v1 Streptomyces lydicus strain NRRL-2433 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema15 97% to 107Z5v3 CYP107Z5v2 Streptomyces chattanoogensis DSM-40241 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema6 1 aa diff to CYP107Z5v3 CYP107Z5v3 Streptomyces lydicus strain R-401 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema4 100% to S. kasugaensis strain A/96 CYP107Z5v3 Streptomyces kasugaensis strain A/96 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema10 100% to S. lydicus strain R-401 CYP107Z6 Streptomyces sp. strain I-1525 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. 
Submitted to nomenclature committee June 2, 2003 Clone name Ema5 85% to CYP107Z8 CYP107Z7 Streptomyces tubercidicus strain DSM-40261 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema17 90% to CYP107Z8 CYP107Z8 Streptomyces platensis strain Tu-3077 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema13 89% to CYP107Z9 CYP107Z9 Streptomyces tubercidicus strain NRAA-7027 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema12 89% to CYP107Z8 CYP107Z10 Streptomyces tubercidicus strain I-1529 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema2 90% to CYP107Z11 CYP107Z10 Streptomyces platensis strain I-1548 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema14 100% to S. tubercidicus strain I-1529 CYP107Z11 Streptomyces platensis strain NRAA-7479 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. Submitted to nomenclature committee June 2, 2003 Clone name Ema9 92% to 107Z12 CYP107Z12 Streptomyces tubercidicus strain R-922 No accession number Istvan Molnar, Syngenta Biotechnology, Inc. 
Submitted to nomenclature committee June 2, 2003 Clone name Ema1 92% to CYP107Z11 CYP107AA1 Mycobacterium smegmatis MSMEG3142 TIGR RubU (pksS) 45% to 107B1 43% to CYP107AB1P 41% to CYP105S2 Mycobacterium vanbaalenii MTPYSRRDRNH MLRLGNSFVQNPHEVYDRLRRSGPVQRVEMWGGVPVWLVTRYQEARNLLT DPRIGKDGAAASALFPPGTDGSIGTVLGDNMLFRDPPDHTRLRRFVTSAF TAHAVRRLRPTIAGFADALLDDIAASVPGQVDLLQAFAQPLPVQVIGELL GVPERDRELFAALVVPIFTSTDTTVLRRAQKELTQLLTDMLAEKRQSPAD DVLSSLVHRRDGTDQLSEAELLGTAFLLIVAGYETTVNLLANGILALLRN PEQLRAVRADRSLLPRAVEEALRFESPLNTATVRYTSAPVTVGDVEIPSG ELVVIGLLAANHDDEQFPDAHRFDVSRTHNRHLAFGYGVHHCVGAPLARM EAEIGFDRLLSRFEVMELVDSGPPRYRPSTLMRGVERLPVILGYPHDIAS TMREWSGSLPSSGEADSSFAH CYP107AB1P Mycobacterium smegmatis 45% to CYP107B1 43% to CYP107AA1 MILDEQFAQDPEGLYRMLRSEAPVCEVELIGGVRGWLVTRYADVMALLKD PRVSKDHTSALPRLAPDRVRPYISPQLHNHMLNLDPPEHTRLRRLVVQAF TPKALARMQPVIDAIADELLDDIDLRSGDEPIDLMADYAEPLPIQVIAEL LGVAVEYA*PFRAAVTPLLMSVTVEEKAESGRATIEILNAVIDEKIREPG EDLLSGMIGASVDGHGLTRDELMAMCFLLITAGYETTVNLIGNGTLALID NPSQLEKVRENPDLTAGAVEEILRFDGPVNIATWRYATADIDVDGVVIPA NEQIFLSLLSANRDTGRFENADRFDIERNTRGHIAFGHGIHYCLGAPLAR MEGVTAIGRIVQRYDSITLDPTAELRYHNGTLMHGLKSLPVRLTRVPQPRP CYP107AC1 Streptomyces atroolivaceus GenEMBL AF484556 60948..62147 leinamycin biosynthetic gene cluster 48% to 107N1 gene = LnmA MSATRRVHIYPFEGEVDGLEIHPKFAELRETDPLARVRLPYGGE GWMVTRYDDVRAANSDPRFSRAQIGEDTPRTTPLARRSDTILSLDPPEHTRLRRLLSK AFTARRMGAMQSWLEELFAGLLDGVERTGHPADIVRDLAQPFTIAVICRLLGVPYEDR GRFQHWSEVIMSTTAYSKEEAVSADASIRAYLADLVSARRAAPHDDLLGVLVSARDDD DRLTEDELITFGVTLLVAGHETSAHQLGNMVYALLTHEDQLSLLREQPELLPRAVEEL LRFVPLGNGVGNARIALEDVELSGGTVRAGEGVVAAAVNANRDPRAFDDPDRLDITRE KNPHLAFGHGAHYCLGAQLARMELRVAIGGLLERFPGLRLAVPADQVEWKTGGLFRGP QRLPIAW CYP107AD1 Streptomyces hygroscopicus GenEMBL AF521896 4248..5489 ansamycin biosynthesis gene cluster 43% to 107X1 gene = gdnH MSGRHFEQGERGTAMADTPEEELRILDPQSVAQELRKHGPPRQI TMHGTTAWLVSRYEEVRDCLGHPGMSPAAAYAASQGQTNPVSGLFEDTVAGTNPPQHT RLRRLLAKAFTVRRVESLRPRVQEITDTLLDRIAVDGRADLVSALAIPLPMQVICELL 
GVPIADRTEFHQWADLMLTPPLDPDTAARSQDASAKLWTYMEDLAEARRKAPEDDLIS DLMSAHEDDRLSHREVVATARMMLIAGYELTGSFISNAVFSLLSQPDQMELLRKDPEL AGRGLEELLRHAGPGILIVRFANEDVEIGSVSIRAGDQVLLDMDAAHSDPAHFTDGER LDLTRDSAVHLQFGHGIHYCIGAPLARVEGQIALESLVRRFPGLRLSVPAAEISHSKN PFIRSLTALPVEFEAQQPVAG CYP107AE1 Streptomyces sp. GenEMBL BD133545 50% to 107X1 VILLKSLAANGLTASSCFTVSPLPIRSASPSIAFLTSSSERDSGVRNDRPSDAQPAIARF RFPTPPHPRNPTQPHPTPPRPSPTDDPLQAPTFFADPYPTYARLRDTAPVLKVPTGSGGG GRHSYVVTGYAEAREAFTDPRLSKDTASFFAGRPSQRDLHPAVSRNMLATDPPQHARLRA LVTKAFTTGAVARLRPYISSLVDELLDTWPTHGTVDLIADLAVPLPVTVICELLGVPDSD RASVRTWSSDLFAAGDPQRIDAASHAVGDYMTALVAAKRTAPGDSLLDDLIAVRDGQDHL SEDELVSLAVLLLVAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDS PVGIATFRFSTEALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFG HGIHRCLGAPLARAEAELALHAVITRYPQAALATPPETLPWRHTRLTRGLASLPITLRDH PK* CYP107AF1 Streptomyces collinus DSM2012 GenEMBL AF293355 24259..25518 Gene = rubU rubromycin gene cluster 52% to 107B1 MARTDAPQAAPPADLFTPAFHQNPHEALAGLRRTAPAVPVMTPN GLRTWLVTGHEHARALLADPRLSKDMRVGRDLIPRNFVDPDKQREFLAESGERSQFPH VLSVHMLDSDPPDHTRLRRLVGRAFTARRVESLRPRITELTDELLDAMARHERLDLME ALAFPVPFTVICWLLGVPPDDRAAFRRWSNLLVSGAGTDEVREASASMITYLTELIEA KRNEPADDMLTDLVHARDAGDQLSSDELISMAFLLLVAGHETTVNLIGNGALALLTHP EVREQLAADESLWPGAVEEFLRYDGPVTNATWRFTTEPVEVGSVTIPEGEFVTISIGA AGRDPDRYPDPDRLDITRAHSGSVAFGHGIHHCLGAPLARLEGRIVLSRLFARLPGLR LAADPDELSWRSSLMMRGLEELPVFTA CYP107AG1 Streptomyces atroolivaceus GenEMBL AF484556 complement(120436..121638) Gene = LnmZ leinamycin biosynthetic gene cluster 49% to 107E1 MSTEVETEKPAPVAYPFTGSEGLELSQSYAKLFEDGDPIRVQLP FGEPAWLVTRYDDARFVLTDRRFSRHLATQRDEPRMTPRAVPESILTMDPPDHTRLRT LVSKAFTPRRIESKRAWIGELAAGLVADMKAGGAPAELVGSYALAIPVTVICELLGVP EDDRTRLRGWCDAALSTGELTDEECVQSFMDLQKYFEDLVKERRAEPRDDLTSALIEA RDAHDRLAEPELIGLCISILIGGFETTASEISSFVHVLQQRRELWTRLCADPEAIPAA VEELLRFVPFAANGISPRYALEDMTVGGVLVREGEPVIVDTSAVNRDGLVFDNADEVV IDRADNRHMVFGHGAHHCLGAHLARVELQEALKALVEGMPGLRLSGDVEWKADMIIRA PRVMHVEW CYP107AH1 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to 
nomenclature committee Nov. 2, 2003 50% to 107L6 missing about 42 aa at N-term clone name SP0749 CYP107AJ1 Streptomyces peucetius No accession number Niranjan Parajuli Submitted to nomenclature committee Nov. 2, 2003 52% to 107B1 frameshifted C-term clone name SP0908 CYP107AK1 Streptomyces scabies SCAB79691 David Lamb Submitted to nomenclature committee Nov. 10, 2006 48% to 107AF1, 47% to 107B1 CYP107AL1 Streptomyces scabies SCAB63301 David Lamb Submitted to nomenclature committee Nov. 10, 2006 45% to 107AC1, 45% to 107N1, 45% to 107M1 CYP107AM1 Streptomyces scabies SCAB44031 David Lamb Submitted to nomenclature committee Nov. 10, 2006 41% to 107AC1, 42% to 107E1 CYP107AN1 Bradyrhizobium japonicum USDA 110 GenPept BAC51802 NC_004463 complete genome complement(7193424..7194725) 41% to 133B1v1 45% to 107L1 formerly CYP107AA1, but this name was also given to an M. smegmatis sequence so one had to be changed. 1 MVTPGSGAAI GVFVSCGNRF EVTMNEQAQP AGGDPLFNPL SPDFIRNPYP HYDRLRAIDP 61 IHVTPFGQFV ASRHADVSLV MRDKRFGKDF VERSKRRYSE KIMDEPVFRS MSHWMLQADP 121 PDHTRLRGLV VKAFTARRVE DMRPRIQEIV DEAIDAVIDR GHMDLIEDFA FRLPVTIICD 181 MLGIPEDHRE VFYKSSRDGG RLLDPVPLTP EEIAKGNAGN MMAQMYFQQL FELRRRNPAD 241 DLTTQLVQAE EDGNKLTNEE LTANIILLFG AGHETTVNLI GNGLLALHRN PDQLALLKAR 301 PELMVNAIEE FLRYDSSVQM TGRVTLEDID DLGGRKIPKG ETVLCLLGSA NRDPAVYPDR 361 PDRLDVTRPN VKPLSFGGGI HFCLGAQLAR IEAEIAIATL LRRLPDLRID DVENPEWRPT 421 FVLRGLKSLP ASW CYP107AP1 Streptomyces rochei plasmid pSLA2-L NC_004808 87725..88939 49% to 107A1 note="ORF37 lankamycin biosynthesis protein" formerly CYP107AB1 but that name was already assigned to an M. smegmatis seq. 
(my error) MNQPQLPEIPALNSELFHTDQYATYREILEQRPVTRVRFYDGSL VWLVNRHEDVRAALTDPRLSNDPMKQSDIDLSAATGIPADLIEYFQRNMFRSDEPDHG RLRKLVTREFTVRRINALRPRIRQIADDLLEKFAATGGGDLVEALARPLPLTVMCELL GVPEEDRADFQTWSQHIVESSPEFAERNAVSYRSLFECVRSLIRRRRDEPGDDLLSAL VDLRDVADRLSENELISTVFLLLVAGIETTVNVLGTGTFLLLTHPGELARLRADGALL GPAVEEMLRYMAPIEITSRHTLEPVEIGGVSIDAQSTVLINLAAANRDPARFEDPQSF RVDRNDGGHLTFGHGIHYCLGAALARAEAEVTFEALLERFPDLRLAASASDLTWRHAF MRGPVELPVSWG CYP107AQ1 Saccharopolyspora erythraea NRRL23338 SACE_0125, se:152305,153462 (-) STRAND 50% to CYP107P1 MFDTADPAFVADPYPCFAELRRRGEVHRHPGLGMAVAVSHAAASEVLRHRGLGRIWVDAQ PAADFPAFNLLHRTSLLETEGAEHTRLRRSISAAFARGHVERVRPWVAGLADALVGGLVE RGGGDVVEEVAAPLPVQVIAELLGVPESDRNLLRPWSNAIVKMYEPGLPERRRAAAESAA AEFAEYMRALADRRRSAPADDMVSDLVAAEELSADEVVGTAVLLLMAGHEASVNLVANGV LALLRHPGQWRRLVDDPGLVPTAVEELIRYDSPLQLFERTAVEDVVVAGHRVAAGSKIAA LLGAAARDPEVFESPDVLDVGRQPNPHLGFGAGIHYCLGAPLARVEAAAALSALVRLAPR LEQAGEPVRRPEFVIRGLRELPVSV CYP107AR1 Saccharopolyspora erythraea NRRL23338 SACE_0651, se:714930,716198 (-) STRAND 44% to CYP107Y1 VQPDQSPTPRPESRHSPAAACPHAAVHREERPGGLVTWQISAFSEARAALGDSRFSKDPR RLGEALRAGGRSMFAEYGDNLLDNLLNSDPPDHTRLRRLVGKAFSPATIERLGPTTQRLA DELVASMLPAGRADLLAQFAYPFAFGVIARVLGLPPDSYRIFQRWTESMTAPREQGTDRM VAARHLCEHVTELVRRSREWLASAPAETLLDELVSARDDGDRLSENELVATVLLLIIAGH ETTVNLIGNGVHALLQHPGQLALLRDHPDLIDGAVEELLRFQPPISKTTLRVTTTDVEVA GTEIPAGSIVNVLVPAANRDQRQFPDADRLDITRPPSAHMSFGHGIHYCIGAPLARMEGR IAIGALLRGLPGLRLAEPAAEIPWRASNILRGLQRLPVRFDSAGADDEVRDRRHAGAARV HA CYP107AS1 Saccharopolyspora erythraea NRRL23338 SACE_2922, se: 3203418,3204656 (+) STRAND, 48% to CYP107D1 MTESIQRDADSQEAACPHARAYPFGDPGALDLDPDYARVRDEEPLTRIRMPYGEQGWLVT RYDDVRTVLADPRFSRSEVLKRDVPRPTPQQVERPGTLVTTDPPEHGRLRRLVAGAFTHR RAESMRPRIRGVVDELVDEMLAGDKPADLVAAVSMPLPVNVICELLGVPREDRHVFHSGA VLSDYTVPADEREATFKSLADYLAVLIAERRARPEGPGDDVLGALISARDTDGDRLSEDE LIELWVDILVAGYASIMSVTPDMVVTLLTERDRWDGLVADPGGVPDAVEEMLRVMPTIIE SGHSRVATEDVEIAGGTVRAGEAVLPCLPAANRDPAVFDAAEEMRLDRDAGKHIAFGFGP HYCLGASLARVQLQSVLTALVRKVPTLDLVEAVREDSTRVAAAVQGQLLVTW CYP107AT1 
Saccharopolyspora erythraea NRRL23338 SACE_4142, se: 4611465,4612715 (-) STRAND 56% to CYP107AT2, 47% to CYP107AU1 MSVGDIDTPSGEFDFAANLLPFDPLDPAFQADPYPFFRLVRETAPALCTQPGMWVVTGFR ECSAVLRNPKFGHGDGRLVASQITHDAEGNVVRPFVFMDPPDHTRIRSLVTKAFSARMVE RLRPTAERLVGELLAAAMSGPADEPVDLMAELAFALPSNLISELLGMPPQDKPLFEQWSS ALGRGLDPDFMLSPEEMQRRDQARTEFDGYFAELARRRRAEPADDLVSALVAVEEDGRNL SMSELVSTCRLLLSAGYLSTAHLIGNGVNALLRHPEQFEWFRAHPDQVAGVVEELLRYDS PVQTAGMRTALQDTEIGDQPVSAGEGAMLLVGAANRDPAAFPDPDRLDVSRKPERNLGFG IGAHFCVGAPLARLTTQVALTALAGLRVELATDDAPRINNLVLRGFAELPVFLRAA CYP107AT2 Saccharopolyspora erythraea NRRL23338 SACE_4144, se: 4614537,4615769 (-) STRAND 56% to CYP107AT1, 47% to CYP107AU1 MSTTEGAPPVDSNVVRQLLLFDPFDPEFRADPHRVYREIRESGPVTATPGGLWLVSGHRQ VSAVLRDQAFGWGEAELAAGHFTTDDEGNTVRPLTFADPPEHTRIRSLVTSAFSARIVER LRPRAQELARESLAAALAGGGSADVIQQVAYPLTGRLLCELLGVDPEYQERFRAWAEAMG RGLDPDFMQSPDQLARREEARAHFHEYFAELAARRRAEPGDDLVSALVAVEQEGDRLTAT ELVVTCTLLLSAGYATTVHLIGNGMLALLENPDQLAWLRANPGRVGDAVEEVLRFDGPIQ LVSRVALRDTEVDGHAVAAGSPVLLLLAAANRDPAVFDDPDRLDVSRKPGRNLGFGVGIH FCLGAPLARLTAQAALSLLVEHELVLDGPRPAPTGSLVLRGLAELPLRSA CYP107AU1 Saccharopolyspora erythraea NRRL23338 SACE_5309, se: 5939205,5940431 (+) STRAND 47% to CYP107AT1, 47% to CYP107AT2 MTAGTNNAGRLGALAEAVLGYNPVDPEYHANAHEHHRRMAERGPIFRTPGGMWTAVSHAA CSAVLRDDRFGHDPGSAAQNLFDSTQRPSVAQRSFEFMDGPDHSRLRRLVNRAFTARRVE RLRPAVRTLADQLLTDVSGRIDVLADFILPLAMTTIVDMLGAPTEDNHLFRAWAEPIVRG LDPDFLLSSSELAAREQANAEFAEYFDRLVALRRAEPKDDLISALIAVEDDGVVLSGNEL ISMCLLLLAAGHESIMHLVGNGTVALLRDEDQLEHFRGHPGEVTNAVNELLRYDPPVVLL VRTALADAEVLGNRVRRGEIVWLQIGAANRDPAVFPDPDRLDLTRDTGGSLAFGLGIHFC IGASLARLEAAAALSALLHRDVALASEQLVHQKNVVIRGYEEVPVVLR CYP107AV1 Saccharopolyspora erythraea NRRL23338 SACE_5939, se: 6665300,6666601 (+) STRAND 44% to 107L1, 43% to 107AT2, 41% to 107AN1 MTTAEPAETGSLAELNLGMRLVLHGAVTWSIARLGDPVARLLHSPWRRDPYPIYRQLRAR GPLVRSRLGVWAASTYEVCDAVLRDRRFGVRTSDGSYGDPTAAAVGLQLSLLELDPPDHT RLRKLAAPAFRPRKLENYRQRIEDTAHELLDRALAKGEFDLIRDFATPLPIRVICELLGL PELGAERLAVHGAALSGALDGIRSIRHLRRMRASTLELNELFGDLIEQRRRQPGEDIVSD 
LVTALDQDRLDSTELVQMCDLLLVAGFETTVNLIGNGVLALLERPDQWRLLCDDPDQAVG VVEETLRWDPPVQTTMRVAHEPVEVAGRLLPRNSAVLPMLGAAGRDPAVHFAPDRFDITR GTRGDHLAFSSGIHYCLGAPLARLESEIAFRCLATRVPELRRSGALVRRPTSVIHGLSAL PVAASKATAGGRR CYP107AW1 Salinispora tropica (marine actinomycete) Strop_2290 complement(2583447..2584649) 49% to CYP107B1 M83110 Saccharopolyspora erythraea MESVTSTSAPPPVPYIADPYPALARIRANGPVSILHSDEGIPMWVIARYRNVRAALADPR FGQDARRAQTLADNRVAGVTLGGDVIHMLNSDPPDHTRLRHLVQGAFTARRVAAMRPLVE RITTSLLDGVGGRQTVDLVQDFAFPLPMLVICELLGFPAEERDAYRSWSTAILTHNDDPA AFATALRDMTDYIEVQLRHRRARPGEDLLTELLAARDAGQLTDDEIVGMVFLLLIGGHET TVNLLGTATLALVRNPDQHRWLLANPHALSEAIDEFLRYESPVAMATLRFTTAPVTVDDV VIPAGELVLVSLGGANRDPDRFPDADRLILDRRDTGHLAFGHGLHRCLGAFLGKLEGEVA LGALLGRYPGLTLAAEVRQLRWRDTIMLRGLESLPVSLHG CYP107AX1 Salinispora tropica (marine actinomycete) Strop_2367 2663162..2664352 47% to CYP107X1 SAV6249 AP005046 Streptomyces avermitilis MTSRPTAVFDQCLLRDPHSRYNALRDQAPVHHVLTPDGAPAWLVTRYNDVRAAFTDPRLS VDKRFSGTDGEHGSSLPPELDAHLLNRDPPDHTRLRRLAAAACTPRRVADLHPAVERIVS TLLDGLAGHDRAELIGSLASPLPLQVMHELLGLPTQANIDFRTWTNTLLSADANQPAQSR AAMANMRRFLIEQLAHKRAQPGDDLLTGLLAAREDDDRLTDDELVAMVFLLMFAGYDNTA ALIGTVTHALLTNAELHEAVRGGSLALDELIDEVLRWNPAFPLAVRRFAREPITIAGQTI PAGDRIWLCLASANRDPAQFTQPDELGIIGLRRSHLSFGHGIHYCLGAPLARLQTTIAVT SLLNRFPEMRLAVPAHDIRWRESFRLRGLIALPVYL CYP107AY1 Salinispora tropica (marine actinomycete) Strop_4427 complement(5021489..5022637) 49% to CYP107X1 SAV6249 AP005046 Streptomyces avermitilis MSQDPTMRAELAPIPRSGARLGQEYDQLRNAGDVHQVLLPDSSLAWLVTNPKLVSRALTD PRLALNRRHSRGSWSGFALPPALDANLLNLDAPDHTRLRRLVGPAFSPQRVSALRPRIRR TAEHLLDTLVATSGPVDLVTGYCTPLSVQVIADLMGVPEAGRADLRTWTDTMLTSYPPDR DAIRQAVVELHGYVVDLIDTKRQQPGDDLLSALVTIEQDGDRLTRDELTSLAFLILFAGY ENTANLIASTVLRLLDHGGLRGVQLPEAIEETLRLEPPAPAAVRRFPTEEMTIGGATIPA GDIVLLSIAAATRGTAGNAARLAFGNGPHFCLGAALARVEAEEALTVLARRLPDLALALP VAQVRWRPTFRTHGPAELLVTW CYP107AZ1 Roseiflexus sp. RS-1, complete genome. 
GenEMBL CP000686 REGION: 945215..946423 49% to CYP107AN1 MHHPTEPPPELWSAAAISDPYPIYDRLRAEQPIRWTGGDWQIFR YADAQALLRDPRLGADRLQVDPQWLIASGLEPLFKTRDSMMLFADPPDHTRLRTLVHR AFTPRVVESYRPLVQRIVDQLLDAAAARGAIELIGEFAYPLPVTVIAHMLGVPVNMHD QFRRWSDSLAAFIGGTTRPEADVLPAALKAVLEMTDFFLALVAERRRAPRDDLLSALA QAEDGGDRLSEQELVANSILLLLAGHETTTNLIGNGMLALMRHPDQFALLRDHPELTP SAIEELLRYDSPVQVTSRRALTDIEFQGHRIEEGQAVTVFIGAANRDPAQYQDPARLD VTRGDVRHLSFGHGPHYCLGAPLARLEGQVAISALVRRFPHMRTLDEQVVWRDNFALR GLQSLHIELE CYP107BA1 Roseiflexus sp. RS-1, complete genome. GenEMBL CP000686 REGION: 4471138-4472327 (-) strand Frameshift at 4471920 46% to 107B1, 46% to CYP107AZ1 4472327 MTPTIVARLASPEFLADPYPVYRQLIEQTPVFWLPHANAPGGMWCIARYDDIAFVLREAPIFKDT 4472133 4472132 SRIAPPDTLTPLDRAMLQRDPPDHTRLRRLASHAFTPRRVHDLMPRIEQI 4471983 4471982 SLDLIERIGARGEADFIADYA 4471920 4471920 PLPIIVIAELLGVPFEDHEQFSTWSDQIMAGSDSVLGGEEAARQSHQAMASLVDYFTTLI 4471741 4471740 RQRRHSPRDDLISALIAAHDAGDSLSEDELLGMCVLLLIAGHETTVNLIGNGLLTLLRH 4471564 4471563 PDQLNLLRRQSEYLTSAIEEMLRYESPVQRSTPRFAAEPFVIGGEQIEAGQQISLMFGAA 4471384 4471383 NRDPAHFSDPDRFDITRQPNPHLGFGMGIHYCLGAPLARIEARVAFTHILERLPAIRLAT 4471204 4471203 DTPAWKPVTWLRGLKSLPVLV* 4471138 CYP107BB1 Streptomyces sp. Tu6071 ABB69746 PlaO2 1 MDRVLDFLSS ADSAGELQPR LVELSRQETL PRVLLADGQE AWLVTRNEDV RTVLSDRSFT 61 RDVMGERARQ AGETPDGARS VNMDGRPHNE LRALVSKAFT VRRIEAMRPR IQAWTDELID 121 AMEETGPPAD LVAHLAVPLP ALAICELLGF PVEDRQVLSG WCERITRLGE GGPDQRAWQE 181 LSAYIARRVP VERAAARGGL APETSILTRL VHAHDSEDAL SMEELLSLTV VVLAGGLETT 241 QTAIGAGMVR LFRNPAQLDK VRADPDLVVP AVEEILRYQP VIDVNRVQVA TRTVRLGGQE 301 IRAGDLVQVS VNAANRDETV FPDSERCDVT RGPNPHLAFG YGAHHCLGAA LARLELKTAF 361 STLLRRLPDL RPAVPLESLG WRGGHVTLGL EELPVAW CYP107BC1 Streptomyces sp. 
Tu6071 ABB69764 PlaO5 1 MTESLETTSP DPTRSGNSDT GATPGYTVPK QVNDMWREKP VRRFSMRDGR EAWLVTGRAE 61 VRTVLADPRF SRVEARRLDA VMSPAVIFTR PGILDMDPPE HTRLRRLVAG EFSARRMRAL 121 RPRIQQIADE LIGTMKAAGP PADLAEGLSY PLPIAVICEI LGVPYADRER FRAWADRVSA 181 PGTQPQEAMA ALRSLFDYMG GLVDDKHAHP DGSLLHGLVT ARDEQGRLDN EELVTLGCGL 241 LLAGYETTAT MLGKGLLALL DNPDQLAVVR SDPRAVPAAV SEVLRHVTPG VDPHTGLIRA 301 TTADVELGGT VIPAHSVVVA CNTAANFDPA TFRDPDRFDV TRENAAAHLT FGHGMHRCVG 361 AQLAQIELEA AFAALFPAIP GLRLAVPADE ITYTQSTLIR GLRSLPVLW similar to CYP107 family Crocosphaera watsonii WH 8501 ZP_00517718 MTKTKKNKTQNKLVFNPFYRAFHNNPYPIYERLRNEDPIHWSFLKAWIITRYQDVDTILKDNLFQVDDLP LRLEEKSAYLKQGNFLPLAKTIDKWLFFQQPPNHTRLRSLVNKSFSPASVGNMKEEIEAKVNHLLDKVIP TGKMDLIDDLASPLPAMTVTNILGLPPEDYYKLIHWSYELFFVFDQPMSLEGYEKQNKMAMEAREYLLRF IANIDENSQGLIADLVKAKDEENKLDEDEILGFCIMLLIVGQETTKSFISNSILALLQHPEKLQELKDNP EIIKEASEELLRYDTPVQVIARLAREDVEIGGKTILKGDKVILCLGGANRDENKFPNPEKIEFQRSNRNL PFGGGIHFCLGAFLARLQGQISINRIVQRLPNLQLVNQTPDWRESITLRGLKSLPLTFDKNDIKTD similar to CYP107 family Gloeobacter violaceus PCC 7421 NP_924888 MDSVANLNQDAFGNTLPQTEAPFKFNVFDPAFHEDPYPFYDRLRRESPIYRNFMGAWVFTRYSDIKSILR DRRFRVLDKPGWIKNKNRYLTPDQGNFDALVRSSSKFFFFLEPPDHGRLRGLITKAFSASFVDRLRPHVE ATLADLLGKVREQGAMDIMADLACPLPAIVIARLIGVPAADYARLGHLSDELARIFDPVISLEGYLHLNA VVEEFGSYFLDLVAEHKRQPGTDLIDSLIAAQEEGNRLSEEEVVAVCMQLFAGGEETTVNLIGNGMLALL THPEQLELLRSKPEIIAGAVEELLRYDSSIQLVARAAIEDIEIEGCTIGAGEHVHLYLGAANRDPAQFFD PHSLDLTRVDNRHLAFGDGIHHCFGGPLARVEGQVVFQTLVQQFPKLRLAESRRPERREGTLLRGLKTLP VTF CYP107 fragment Streptomyces noursei GenEMBL AF071516 CDS complement(85..>519) putative P450 hydroxylase gene, partial cds. function="may hydroxylate a macrolide antibiotic polyketide moiety C-term only 63% to 107A2, 55% to 107A1 WTTPTRWSCSAPSLICLPRHGRNTAHRRTSRSHHPGRARRHPDR DTLIPARSTVFIAGAAANRDPQKFPNPDTFDITRNTQGHLAFGYGVHHCIGRPLAQME GEVAITALLRRFPHLHLTTPSQNLTWRRSFLRGLTALPVTLN 108 Family CYP108A1 Pseudomonas spp. 
Swiss P33006 (428 amino acids) PIR S27653 A42971 (428 amino acids) Also found a PIR cross-reference to EMBL S39894 but could not retrieve it Peterson J.A., Lu J.-Y., Geisselsoder J., Graham-Lorence S., Carmona C., Witney F., Lorence M.C. Cytochrome P-450 terp: Isolation and purification of the protein and sequencing of its operon. J. Biol. Chem. 267, 14193-14203 (1992) CYP108A1 Pseudomonas spp. GenEMBL M91440 (6620bp) Hasemann,C.A., Ravichandran,K.G., Peterson,J.A. and Deisenhofer,J. Crystal structure and refinement of cytochrome P450terp at 2.3A resolution. J. Molec. Biol. 236, 1169-1185 (1994) CYP108B1 Mycobacterium smegmatis MSMEG1429 TIGR, 71% to CYP108B2 68% to CYP108B3 84% to CYP108B9 Mycobacterium vanbaalenii 72% to CYP108B8 Mycobacterium vanbaalenii MFLTPVKFSPSEVPD MSTPVINDAARVLAEPRAYADEPRLHAALAELRSQTPVAYVDVPGYYPFW AITKHADVMAIERDNELFINAPRPMLITKEKDDLAKANLAAGGGIRTLIH MDDPLHRDIRKIGADWFRPKAMRALKERVDELAKIYVDKLVEKGPECDFV QEVAVNYPLYVILSLLGLPESDFDRMLKLTQELFGNDDDEMGRGSSAEEL NAVILDFFNYFTELTADRRANPTEDLASAIANAKLNGEYLNDVDCLSYYV IVASAGHDTTSAAISGGLLALTENQDQLARLKADMSLMPLATEEIIRWSA PVKEFMRTATRDTEVRGVPIKEGESVLLSYVSANRDEEIFENADKFDVGR DPNKHLSFGYGVHFCLGAALARMEINSFFTELIPRLESIELAGDPEFMAT TFVGGLKHLPIRYSVR CYP108B2 Mycobacterium smegmatis MSMEG1428 TIGR 71% to CYP108B1 68% to CYP108B3 87% to CYP108B8 Mycobacterium vanbaalenii 73% to CYP108B9 Mycobacterium vanbaalenii MSTPTMDDAAKALADPTAYADDARLHEALARLRAENPVAWVDQAPYRPFW AITKHADIMAIERANDLWLSAPRPLLATAEADDLGRSQQEMGIGLRTLIH MDDPHHRKVRAIGADWFRPKAMRELKVRVDELARIYVDKMREIGPECDFV TDIAVNFPLYVILSLLGLPEEDFGRMHMLTQEMFGGDDDEYKRGTTVEEQ MAVLTDFFNYFSALTNSRRENPTDDLASAIANGRVDGELMSDMDTLSYYV IVASAGHDTTKDAISGGLHALIENPGELARLKADPGLMGTAVEEMIRWST PVKEFMRTAAEDTEVRGVPIAKGESVYLAYVSGNRDEEVFTDPFRFDVGR DPNKHLAFGYGVHFCLGAALARMEMNSLFSELLPRLDSIELAGEPELSAT TFVGGLKHLPIRYSIR CYP108B3 Mycobacterium smegmatis MSMEG2261 TIGR 68% to CYP108B1 68% to CYP108B2 73% to CYP108B7 Mycobacterium vanbaalenii 69% to CYP108B8 Mycobacterium vanbaalenii 69% to CYP108B9 Mycobacterium 
vanbaalenii MTARTIDDAAKVFAMPSAYTDEAKFHEALTHLRVNAPVSWVDVPPYRPFW AITRYADIMAIERANDLFTNSPRPVLMTAEEDEQQAAVGISTLIHMDDPQ HRVIRAIGADWFRPKAMRALKIRVDELAKIHVDKMVAAGGECDFVQEITV NYPLYVIMSLLGIPEADFPLMLKLTQELFGNKDDEYQRSADEGDSMAALL EMFQYFTELTASRRANPTDDLASAIANATVNGEPLNDIETVSYYAIVAAA GHDTTSATISGGMLALLEHPDQLERLRNDPSLMGTATEEMIRWVTPVKAF MRTAATDTVVRDVPIAAGESLLLAYPSGNRDEEVFTDPFRFDVGRDPNKH VAFGYGVHFCLGAALARMEINSFFAELIPRLESIELTGSPRHTATTFVGG LKHLPVRYALR CYP108B4 Mycobacterium marinum No accession number Tim Stinear MM3999 72% to 108B3 name changed to CYP108B4 CYP108B4 Mycobacterium ulcerans No accession number Tim Stinear 98% to 108B4 M. marinum = ortholog name changed to CYP108B4 CYP108B5 Mycobacterium avium subsp. paratuberculosis K-10. NP_961305, AAS04688.1, 79% to 108B7, 75% to 108B3 MSTTTMDEAA KLLADPMAYT DEQRLHAALT HLRANAPVSW VEVPNYKPFW AITKHADVMD IERENMLFTN WPRPVLTTAE GDEMQAAAGV RTLIHMDDPQ HRVVRAIGSD WFRPKAMRAL KVRVDELAKI YVDKMLAAGP ECDFVQEVAV NYPLYVIMSL LGLPEADFPR MLKLTQELFG SDDSEFKRGS SNEDQLPALL DMFGYFNGVT AARREHPTED LASAIANARV DGEPLSDIDT VSYYLIVATA GHDTTSATIS GGLQALIENP DQLQRLRDNL DLMPLATEEM IRWVTPVKEF MRTAAKDTVV RGVPIAAGES VLLSYVSANR DEDVFDEPFR FDVGRDPNKH LAFGYGVHFC MGAALARMEV NSFFTELLPR LKSIELTGDP ELVATTFVGG LKHLPVRYSL A CYP108B6 Mycobacterium flavescens PYR-GCK ZP_01192125.1, EAS11507.1 92% to 108B7, 74% to 108B3 MSVRVADEAG KVFADPTAYA DEQRLHAAMT HLRANAPVSW VDVEGYNPFW AITKHADIMA IERDNTVFTN SPRPVLTTAE GDAQHASMGV STLIHMDDPQ HRKVRAIGAD WFRPKAMRAL KVRVDELAKT FVDQMYDRGG ECDFVQEVAV NFPLYVIMSL LGIPESDFGR MLTYTQELFG SDDAELQRGT TMEERGLALF DMFTYFNELT ASRRAQPTED LASAIANARI NGEPLSDIDT VSYYLIVATA GHDTTSATIS GGLQALIENP DQLARLQQAP ELLPLAVEEM IRWVTPVKEF MRTAQQDTEV RGVPIAAGES VLLSYPSGNR DEDVFTDPFR FDIGRDPNKH VAFGYGVHFC LGAALARMEI NSFFSELLPR LTSVELAGRP EHIATIFVGG LKHLPIRYSL TR CYP108B7 Mycobacterium vanbaalenii PYR-1 ZP_01203876.1 EAS24868.1 73% to CYP108B3, 73% to CYP108B9, 70% to CYP108B8 MSVRIADEAARVFADPSAYADEARLHAAMTHLRANAPVSWVEVPGYNPFWAITKHADIMAVERDNLVFTN 
SPRPVLTTAEGDAQHEAMGISTLIHLDDPQHRKVRAIGADWFRPKAMRALKVRVDELAKTFVDQMYERGG ECDFVQEVAVNFPLYVIMSLLGIPESDFQRMLTYTQELFGNDDAELQRGESMEERGLALFDMFTYFNEIT AARRARPTEDLASAIANARIDGAPLSDIDTVSYYLIVATAGHDTTSATISGGLQALIENPDQLQRLQQNP GLMPLAVEEMIRWVTPVKEFMRTAQQDAEVRGVKIAAGESVLLSYPSGNRDEDVFTDPFRFDVGRDPNKH VAFGYGVHFCLGAALARMEINSFFTELLPRLKSVELAGRPEHIATIFVGGLKHLPIRYSLTR CYP108B8 Mycobacterium vanbaalenii PYR-1 ZP_01205329.1, EAS26321.1 87% to 108B2 MSTPTMNQESQEAAKVLADPTAYADDQRLHKALAHLRANDPVAWVDHPPYRPFWAITKHADIMAIERAND LFLSEPRPVLVTAEADDMARAQLEAGFGLRTLIHMDDPHHRKVRAIGADWFRPKAMRDLKIRVDELAKRY VDKMRDIGPECDFVTEIAVNFPLYVILSLLGLPEEDFGRMHMLTQEMFGGDDDEYKRGATVEEQMAVLTD FFNYFGALTASRRANPTDDLASAIANGLVDGELMSDVDTLSYYVIVASAGHDTTKDAISGGLHALVENPG ELARLQGDLDLMPTAVEEMIRWSTPVKEFMRTAAEDTTVRGVPIAKGESVYLAYVSANRDEDIFDDPFRF DVGRDPNKHLSFGYGVHFCLGAALARMEINSLFSELLPRLDSIELAGRPELSATTFVGGLKHLPVRYSLR CYP108B9 Mycobacterium vanbaalenii PYR-1 ZP_01205327.1, EAS26319.1 84% to 108B1 MSTPVIDEAASDAARVLADPKAYTDEARLHAALAHLRAHAPVSYVDVPDYRPFWAVTKHSDIMAIERDNE LWINEPRPLLTTAATDDLSQANLAAGGGIRTLIHMDDPLHRDIRKIGADWFRPKAMRDLKTRVDELAKIY VDKMVEKGPECDFVQEVAVNFPLYVILSLLGLPESDFGRMLKLTQEMFGGDDDELTRGKSPEELHEVITD FFRYFTALTAERRANPTEDLASAIANAKLDGEYLNDIDCLSYYVIVASAGHDTTSAAISGGMLALIENQD QLARLKAQPELMGTAVEEIIRWTTPVKEFMRTATADTEVRGVPIREGESVLLSYVSANRDEDIFDEPAKF DVGRDPNKHLSFGYGVHFCLGAALARMEINSFFTELIPRLESIELAGDPEYIATIFVGGLKHLPIRYSVR CYP108C1 Saccharopolyspora spinosa strain NRRL 18395 No accession number Istvan Molnar Syngenta Biotechnology, Inc. 
47% to CYP108B1 43% to CYP108A1 CYP108D1 Novosphingobium aromaticivorans GenEMBL NZ_AAAV01000137 16805..18166 gene = Saro1710 47% to 108B1 39% to 108C1 MTNTSRLTKRRRPRRSDGKREGFMDSIPMVPAEVGRAVIDPKSY GTWEPLLDRFDALRAEAPVAKVVAPDDEHEPFWLVSSFDGVMKASKDNATFLNNPKST VFTLRVGEMMAKAITGGSPHLVESLVQMDAPKHPKLRRLTQDWFMPKNLARLDGEIRK IANEAIDRMLGAGEEGDFMALVAAPYPLHVVMQILGVPPEDEPKMLFLTQQMFGGQDE DMNKSGLKDLPPEQISQIVAGAVAEFERYFAGLAAERRRNPTDDVATVIANAVVDGEP MSDRDTAGYYIITASAGHDTTSASSAGAALALARDPDLFARVKADRNLLPGIVEEAIR WTTPVQHFMRTAATDTELCGQKIAAGDWLMLNYVAANHDPAQFPEPRKFDPTRPANRH LAFGAGSHQCLGLHLARLEMRVLLDVLLDRVDSLELAGEPKRVNSTFVGGFKSLPMRW KAA CYP108E1 Ralstonia metallidurans GenEMBL NZ_AAAI01000348 46192..47481 gene = Reut4024 41% to 108B1 39% to 108A1 48% to 108C1 MTIASDFDTELASHEIYSDPERMHEMFETLRREDPVHWTTAPGH PPFWAVTKQADVIEVGKHPDVFIASPKSFLMNDVEQRVRIEETAATGGKLVRTMIHMD DPDHKKYRGLTQSYFMPANIKRLESVIQERARALVGRLIEKGTSEFCSEIAVWYPLQI VMTLLDVPESEHPYLLKLTQQFLAPKDPTLRRDGPDERGKGAVAKEYFAYFGKMLAER RAAPLKEDLGSLIAHATVDGEPLPLMEAVSYYVILATAGHDTTSSSMCSGLYYLLTQP GELDRLRARPELMPSAIEEMFRHGSPVKHFVRTATRDFELRGKKIQAGDEVALMYHSA SFDEEVFDEPRSFRIDRGPNKHVAFGFGIHACLGQNLARASMRTFFTELLARTESIEV VGKAEFIASNQVGGMKTLNIRVTPSKQSTTDRIEVAA CYP108F1X Mycobacterium marinum No accession number Tim Stinear MM3999 46% to 108B1 name changed to CYP108B4 CYP108F1X Mycobacterium ulcerans No accession number Tim Stinear 98% to 108B4 M. marinum = ortholog name changed to CYP108B4 CYP108G1 Caulobacter crescentus CB15 GenEMBL AE005918 GenPept AAK24465 NC_002696 complete genome 2703947..2705221 Complete genome sequence of Caulobacter crescentus Proc. Natl. Acad. Sci. U.S.A. 98 (7), 4136-4141 (2001) 47% to CYP108A1 formerly 108B1 but this was already assigned to an M. smegmatis seq. 
(my error) 1 MTISTDIANT IIDPKAYADG DRIDQAFAHL RREAPLAVAQ PDGFDPFWVV TRHADILEVE 61 RQNELFHNGD RATVVTTIEP DKKVREMMGG SPHLVRSLVQ MDNPDHFAYR KITQGALLPQ 121 NLRALEARIR EIARGFVDRM AEHGDRCDFA RDVAFLYPLH VIMEVLGVPE SDEPRMLKLT 181 QELFGNADPD LNRTGKSVTD VGEGVDSIQS VVMDFMMYFN AITEDRRANP RDDLATLIAN 241 GKINGEPMGH LEAMSYYIIA ATAGHDTTSS TTAGALWALA ENPDQFAKVK ADPSLIPGLI 301 EESIRWVTPV KHFMRTATAD AELGGQKIAK GDWIMLSYPS GNRDEAVFED PFTFRVDRTP 361 NKHVAFGYGA HICLGQHLAR MEMRVLWEEL FARLDHVELD GAPTRMVANF VCGPKSVPIR 421 FKMH CYP108G2 Ectocarpus bacterium Genoscope Ectocarpus siliculosus brown algae project A bacterial genome was found with the Ectocarpus DNA 67% to CYP108G1 Caulobacter crescentus AE005918 Ectocarpus sctg_1 159023-157728 CYP108G3 Parvibaculum lavamentivorans DS-1 CP000774 (genome) ABS63163.1 (protein) CDS complement(1684831..1686087) locus_tag="Plav_1544" 56% to CYP108G1 Caulobacter crescentus MTDKTIDNAIVNPKTYAHVDEFHRLFTQLRKEEPVRWTEPDGFRPFWTVSKHADIMEVERQNDKFLNDPR LTLQTIEVEEEVKKFTGGNSKLIRSLVDMDNPDHRNYRGLTQAWFMPPNLKAISARVEALAEKYIDRLEA KGGECDFVSDVAVWYPLRVIMTVLGVPAEDEPIMLKLTQELFGSTDPDMKRPDATETVNTVTEFFNYFTA MTEDRRKNPKDDVASVIANATIDGEPIGHLEAISYYIIVATAGHDTTSSTAAGGLLALMQNPEEFAKLKA NPEGLLGGAIDEMIRWTTPVKHFFRTAAVDYELRGQKIKAGDNLLMCYWSANRDEEAFDDPFSFKIERSP NKHLAFGYGAHLCLGQHLAKMEIRALYKELLARLDHIELAGDPAWVEASFVSGLKRLPIRYSMKRKAA CYP108H1 Ectocarpus bacterium Genoscope Ectocarpus siliculosus brown algae project A bacterial genome was found with the Ectocarpus DNA 49% to CYP108G1 Caulobacter crescentus CB15 AE005918 Ectocarpus sctg_1 1535540-1536841 CYP108H2 Marine metagenome 1093018949056 AACY022454370 57% to CYP108H1 APPRRGPQQSAEHTETWNRVYKEFEEYYEPVIKDRQTCPREDLASLISNGKIDGCPMEHR AQISYFIIASTAGHDTTSATLATAISVLAERPEVLEQLKANLELIPAFIEETIRWASPVK HFLRHATQDYELRGQQIKKGDLMYLSYISGNRDEDLIEDPFEFRIDRKPNRHVAFAFGNH ICLGQHLARLELKIMLEELLPRLESLQLTGKPKLAISDLVCGPKSVPIQYDFKRTA* CYP108-un1 Mycobacterium smegmatis MSMEG4159 TIGR (pseudogene) 47% to 108D1 158 residues. 
39% to Msmeg_CYPXXII
LTDHYAMAHPKALNDTPDTAQPGQPQTRNPTTPGDLPPLQFFARTVHAET SLGGVQFSENQRLVMNLAAANRDPRQFDDPESFDADRPRNPHVAFGGGLH SCQGQHIARAEMRAVLRVLLTRLPDVHLTGEVGEAGVLAGLMAVISLPVA FTPERSQT

109 Family

CYP109A1 Bacillus subtilis GenEMBL M24523 (3187bp)
Lewis,P.J. and Wake,R.G. DNA and protein sequence conservation at the replication terminus in Bacillus subtilis 168 and W23 J. Bacteriol. 171, 1402-1408 (1989)
Ahn,K. and Wake,R.G. A unique open reading frame adjacent to the replication terminus of the Bacillus subtilis W23 chromosome compared with Bacillus subtilis 168 unpublished (1990)
Ahn,K.S. and Wake,R.G. Variations and coding features of the sequence spanning the replication terminus of Bacillus subtilis 168 and W23 chromosomes Gene 98, 107-112 (1991)

CYP109B1 Bacillus subtilis GenEMBL AF015825 Z99110 YjiB also similar to CYP106A, both 106 and 109 are close together on a tree

CYP109C1 Sorangium cellulosum So Ce56 (myxobacterium) no accession number Rolf Muller Submitted to nomenclature committee 8/5/05 Clone name sce_040811_111 49% to 109B1

CYP109C2 Sorangium cellulosum So Ce56 (myxobacterium) no accession number Rolf Muller Submitted to nomenclature committee 8/5/05 Clone name sce_040811_8140 69% to 109C1, 43% to 109B1

CYP109D1 Sorangium cellulosum So Ce56 (myxobacterium) no accession number Rolf Muller Submitted to nomenclature committee 8/5/05 Clone name sce_040811_4257 43% to 109C1 39% to 109A1

110 Family

CYP110A1 Anabaena sp. (a cyanobacterium) Swiss P29980 (354 amino acids) GenEMBL M38044 (5933bp) GenEMBL U38537, M13161
Lammers,P.J., McLaughlin,S., Papin,S., Trujillo-Provencio,C. and Ryncarz,A.J.II. Developmental rearrangement of cyanobacterial nif genes: Nucleotide sequence, open reading frames, and cytochrome P-450 homology of the Anabaena sp. strain PCC 7120 nifD element J. Bacteriol. 172, 6981-6990 (1990)
This sequence was later revised to give a complete P450 sequence of 448 amino acids.

CYP110A1 Nostoc sp. PCC 7120 same as Anabaena sp.
PCC 7120 GenPept BAB73407, C37842 (this entry missing N-term) NC_003272 complete genome 1708114..1709493 1 aa diff to M38044 1 MLTQLPNPIS VPSWWQLINW IADPIGFQKK YSKKYGNIFS MQLAGIGSFV ILGEPQALQE 61 IFTQDSRFDV GRGNTLAEPL IGRTSLMLMD GDRHRRERKL LMPPFHGERL QAYAQQICLI 121 TNQIASEWQI GQPFVARSAM QKLSLEVIIQ IVFGLADGER YQQIKPLFTD WLNMTDSPLR 181 SSMLFLKSLQ KDWGTWTPWG QMKHKQRSIY DLLQAEIEEK RTKENEQRGD VLSLMMAARD 241 ENGQAMTDEE LKDELLTILF AGHETTATTI AWAFYQILKN VNVQEKLQQE LDRLGANPNP 301 MEIAQLPYLT AVSQETLRMY PVLPTLFPRI TKSSINIAGY QLEPDTTLMA SIYLIHYRED 361 LYPNPQQFRP ERFIERQYSP SEYIPFGGGS RRCLGYALAL LEIKLVIATV LSNYQLALAE 421 DKPVNVQRRG FTLAPDGGVR VIMTGKKSLK FEQSSKIFN CYP110A2 Anabaena variabilis (a cyanobacterium) GenEMBL U38478 (1743bp) Lammers, P.J. and Duran, S. possible alkane/fatty acid hydroxylase CYP110B1 Nostoc sp. PCC 7120 Same as Anabaena GenPept BAB75445, AC2274 NC_003272 complete genome complement(4523158..4524546) 45% to CYP110A2 53% to 110E1 49% to 110D1 47% to 110C1 1 MHLPKGPQTP VFVQVLRWVF SPMSFLEDCA KRYGDIFSVK LAKDVPAIVF LSNPKDIQQI 61 LTNDNNQLDS PGDWNDLFEP LLGKRSVITL SGAEHQRQRQ LLMPPFHGER MRGYSQVITD 121 VTEKVISQHQ IGQPFQVRSV TQAITLRVIM QAVFGLYEGS RAEKLQHLLS DLLEKSSSPF 181 SVALLYFPSL RRDFGPIKFW GEQVQIQQQA DELIYQEIQE RRENPDPSRT DILSLLMDAR 241 DADGQPMTDV ELRDELMTLL VAGHETTATA LAWAMYWIHK LPPVKARLLE ELDSLGDNPD 301 STTIFKLPYL NAVYSETLRI YPVAMLTFAR RVIETMALGG YELPPGTPVL GSIYLTHHRE 361 DLYPEPKKFK PERFLERQFS PYEYLPFGGG TRRCLGLAFA QWEMKLALAK ILTSYELELV 421 NNSVEVRPKR RGLVTGPHRP IEMVIKSQRQ ITSRILETTT VS CYP110B2 Nostoc punctiforme NZ_AAAY02000005 GenPept ZP_00111619.1 complement(58895..60277) gene = Npun6097 75% TO 110B1 MKLPKGPQSPAVLQMLRWITSPMSFMETCAKRYGDMFTIRLDSK SPPLIFVSKPEVLEQILTNDIKGLEAPGDTNLVFESLLGKHSVITISGAEHQRQRQLL LPPFHGERMRSYSQIISDITEKVISQYQIGQPFNIRSVTQAITLRVIMQAVFGLDEGP RAEKLQHCLAEMLEKGSSVLSAALLYFPALQRDFGPINFWGKQMRRQQAADKLIYEEI RERQEQPDPSRTDILSLLMAARDEAGQPMTDEKLRDELMTLLVAGHETTATALAWAFY WIQKIPTVRQKLLKELDSLGDNPDPSTIFKLPYLNAVCSETLRIYPVAMLTFARVVRT 
PLSLGGYELEPGIGVIGSIYLTHHREDLYPEPKQFKPERFLERQFSPYEYLPFGGGAR RCIGLAFAQLEMKLALAKILSTRELELVDNSEVRPKRRGLVTGQDRPIQMVVTSQRQV KFPILQTATV CYP110C1 Nostoc sp. PCC 7120 Same as Anabaena GenPept BAB76385, AF2391 NC_003272 complete genome 5587079..5588485 48% to CYP110A2 49% to 110E1 47% to 110B1 1 MKYQIQRPNP LKTHPFLQKL QWIADPVEYM KKASLQHPDM FTAEVIGFGD TVVFVSHPQG 61 IQTLFANDRK KLVAVGEANR ILYPLVGNNS MFLLEGVKHK QRRQLLMPSF HGERMREYGH 121 LIRNITENLF SQLQQDVTFS ALTAMREISM QVILQAVFGF YEGERCQQFK HLLPIFLSEL 181 FQSPLASSIL FFPSLQKDLG NLTPWGRFVR QREKIDKLLY AEIAERRQEI NSDRIDILSL 241 LISARDETGD SMSDKELRDE LITLMISGHE TTGTAMAWSL YWILQTPEVF QRLIQELDSL 301 GDSPDPMSIF RLPYLTAVCN ETLRINPVAM LTLPRVVKEP IELLGNRLET STTVVGCIYL 361 THHREDLYPE SKLFKPERFL KREFSQYEFM PFGGGVRGCI GQALAMFEMK IVLATVLSRY 421 QLALADRKPE RPQRQGFTLT PTNGVKMLIT GQHKRQNYSM AASTTFNA CYP110C2 Nostoc punctiforme GenPept ZP_00108280.1 GenEMBL NZ_AAAY02000070 complement(34550..35941) gene = Npun2703 60% to 110C1 1 MQLPNILKSP SLLQKLHWVS DPIGYMENAA QEYPDIFTGK IVGFGDTVVF VNHPQAIQEI 61 LTNDRKKFTA VGELNGILKP LLGDNSVLML ESDRHKRQRQ LVTPSFHGER MQAYGQLICN 121 VSKKIFNQLP LNKPFVARNL TKEISLQVIL QSIFGFYEGE KIQKLRQLLP LLLELFESPL 181 SSSLFLFSFL QQDLGAWSPW GNFLRVREKI DQFLYTEIAE CQQQADPERI DILSLLISCR 241 DEAGQPMTDQ ELRDQLITLI LAGYDTTATA MAWGLYWIHK QPLVCEKLLQ ELDTLGDSPD 301 PMSISRLPYL TAVCNETLRI HPVTMFSFPR VVQEPLELLG HSLEPGTILL PSIYLTHHRE 361 NLYPQSKQFK PERFIERQFS PYEFLPFGGG VRRCMGEALA LFEIKLALAT IVSHYHLALV 421 DQRPEQPQRR GFNLAPGSGV KMVMTDQRAR KESLINMTTT PLS CYP110C3 Anabaena variabilis ATCC 29413 YP_322498 95% to 110C1 MKYQIKRPNPLKTHPFLQKLQWIADPVEYMEKASLQHRDMFTAEVIGFGDTVVFVSHPQGIQTIFANDRK KLVAVGEANRILYPLVGNNSMFLLEGVKHKQRRQLLMPSFHGERMREYGHLIRNITETLFSQLQQNVTFS ALTAMREISMQVILQAVFGFYEGERCQQFKHLLPVFLSELFQSPLASSILFFPFLQKDLGNLTPWGRFVR QREKIDKLLYEEIAERRQEINSDRIDILSLLISSRDETGNSMSDQELRDELITLMISGHETTGTAMAWSL YWILQTPEVFQRLIQELDSLGDSPDPMSIFRLPYLTAVCNETLRINPVAMLTLPRVVKEPVELLGNRLES GTTVVGCIYLTHHREDLYPESKLFQPERFLKREFSQYEFMPFGGGVRGCIGQAIAMFEMKIVLATVLSRY 
QFALADGKPERPQRQGFTLTPANGVKMLITGKHQRQNYSTAASTTFTT CYP110C4 Nodularia spumigena CCY9414 ZP_01629302 MSTPNRLKTPAFFQQLQWVADPVGYMEKAAQQYPDIFTAQVVGFGNNLVFVNHPQAMQEILTNDRKKLFA GGKENKILQPLLGDYSMIMLDGDRHRKRRQLVMPSFHGDRMRSYGEIISNITEEVWSNLPTDKSFLARNV TQDITLQVMIQAVFGVYQGERSQQLKKQLELMANIFRSPLSSSMLFFSSLQQDLGAWSPWGKFVRDRQEL DNLIYTEIAERRQQNLENRIDILSLLMSAEDESGNPMTVQELRDELMTLLFAGYETTATALAWGLYFIQK HPEVQEKLLQELDTLGDSPDPMSIFRLPYLTAVCNETLRIHPVAMLTFPRTVKEPVEISGYALDPGTILV GSMYLTHQREDLYPEPKQFKPERFLERQFSPYEFIPFGGGVRRCVGEALAVFELKLVLATILSRYELALT DDQPEVPRRRGVTLAPGRGVNMMITGQRLA CYP110C5 Crocosphaera watsonii WH 8501 ZP_00518945 MKTIPTPKTPTLVQQLQWVLNPTGYLQTNHHRYPDLFKAKIIGLGNDIILISNPEIMQYILTHDRQEFTA PSSLNTLLKPLLGDYSVVMLDGDGHRQRRQLVMPSFHGERLKVYGDLTCRITREAMEKLPENQPFLAREV MQDISLKVIMEAVFGVTEGERYEELQYRLKELLDLFDSPITSGFLFFPSLQKDLGNWSPWGYFLRQRQAL DKLIYAEISDRRANPDPERTDILSLLMFAKDEQGESMKDQELRDELITLLMAGHETTASAMAWALYWLHH IPEIKDKLIEELNTLSPDAEGMDIFRLPYLTAVCNETLRLSPSAMLTFTRLAQQTVEVGGYTFKPGDIVA GCLYLTHLREDIYANPKQFNPQRFLDHKYSAYEFIPFGGGSRRCMGEALAKFEMKLVIAIIISEYCLKLA DTQPEKQQRRGLTLSPKRGVKMILEGKRQPQKARELELSTR CYP110C6 Cyanothece sp. CCY0110 ZP_01731952 MKTIPGSKTPKLIQQLQWIFNPTKYLKTNHRRYPDIFKAKIIGFGDKMILTSRPEIMQYILTHDRKQFTS PSGLNAILRPLLGDSSVLMLDGDRHRQRRQLVMPSFHGERLKVYGDLTCRITEEVMAKVPQNQPFLAREI MQDISLKVIMEAVFGVTEGKRYEQLQDRLKKMLDLFNSPLTSAFLFFPFLQKDLGSWSPWGHFLRQRQAI DELIYAEISDRKAHPDSDRTDILSLLMSAKDEQGQGMKDQELRDELMTLLTAGHETTASAMAWALYWIHH TPEVKDKLIEELNTLSPDAEGMDIFRLPYLTAVCNETLRLSPSAMLTFTRVAQEKVEVAGYTFEPGDMIM GCMYLTHLREDLYTNPEQFNPQRFVDRQYTPYEFIPFGGGSRRCVGEALAQFEIKLVIATIMSQYCLKLA DTQPEKQQRRGVTLSPARGVKMILEGKRQPQPVRELELSRQ CYP110C7 Lyngbya sp. 
PCC 8106 ZP_01620519 MKTLNSPKTSPLIQRLQWVFNPLEYMETNVKINRDIFNTQVTGGVGLIFVNSPEGMQELLTRDTKEFYAP GSINEILKPLLGEQSVMLLDGDRHKRQRKLLMPPFHGERMRTYGELILNITQQATAKLKPGQPFIARNAM QEITLAVILQAVFGIYEGSRYDKLKQLITSLLAVTDSPVSSSLLFFTSLQKDWGAWSPWGRFLRMRQKVD QLLFAEIEERRQNWDENRTDILNLMMAARDEDGQPMADEELRDELLTLLVAGHETTATAMAWALYWIHRQ PEVYQKLIQELESLPENADPMTIFRLPYLTAVCNEALRIYPVAMLTFPRVTKEPTQLLGYELEANIGLAG CIYLLHHREDLYPEPKQFKPERFLERKFSPYEFLPFGSGARQCIGMALAQFEMKLALAQILLDYDLTLLE KRPVKAARRGVTLSPVGGIKMMMNGKRTPSKSVAIPATV CYP110D1 Nostoc sp. PCC 7120 Same as Anabaena GenPept BAB76465, AF2401 NC_003272 complete genome 5678382..5679743 48% to CYP110A1 53% to 110E1, 49% to 110B1 1 MTVTQNLPNG PRIPRLLRLF KFITQPIQYV EDFAKVYGDN FTIWGSGESY FVYFSHPQAL 61 EQIFTNVSCF ESSGGGSPLL ELLLGKNSLI LLEGDRHQRQ RQLLTPPFHG ERMRAYGQTI 121 REITQQVTQA WQMGKPFNIR ASMQEITMRV ILRVVFGVDE GELFQELRQL LTTLLDFMGS 181 PLMSSTFFFS FTQKDYGAWS PWGRMVRLIK KIDQLIYALI AQRRAEFGEN RQDILSLLIS 241 ARYDDGQPMS DVELRDELMT MLVAGHETTA SALTWAFYWI DSVPEVREKL FQELDTLNDD 301 SEPSIIAKLP YLTAVCQETL RFYPIVLNAF FRRTKNPMEI MGYKLPKATL VVPSIYLAHH 361 REEVYPQSKQ FRPERFLEKQ FSPYEYLPFG GGNRRCIGLA FAQYEMKIVL ATILSQFQVS 421 RLSKRPVQPV RRGLTLAAPG GMKMVANKRM RNS CYP110D2 Nostoc punctiforme NZ_AAAY02000028 GenPept ZP_00109203.1 52704..54170 gene = Npun3650 68% to 110D1 MNIPLSVTLSNMKSRNNKIQKPSNLQTPMTATYNLPDGPQMPRW LRTIKFISQPVKYVDDFAKTYGDTFTIRSSRSDNHIVYFSQPQALEEIFTADSRHFEV GRGNTGLRFLLGDRSFMLVDGDRHQRQRQLLAPPFHGERMRAYGEDIRKITQQVSHEW KIGKPFNIRESMQEITLRVILRVVFGLNEGELFEELRRSLSDLLDFISSPIMSSAFFF RFIQKDFGAWSPWGRILLQRQKVDLLIYTLLRERRAQTDQNRQDILSLMMAARYDDGQ GMSDEELHDELMTLLVAGHETTASALTWAFYWIDHLPEVREKLLQELNTIGVNPDLSS VAKLPYLTAVCQETLRIYPIAMTAFVRIVKTPITIMGYELREGTAIVPSIYLAHHREE VYPQSKQFKPERFLERQYSPYEYLPFGGGNRRCIGMAFAQYEMKIVLATVLSEFQVSL VNKRPVHPVRRGLTVATPAGMRMVATPQVKRANTPALV CYP110D3 Trichodesmium erythraeum GenPept ZP_00074554.1 GenEMBL NZ_AABK02000068 complement(10019..11407) gene = Tery3870 54% to 110D1 MTLPDGPSLSPLQRRLRTWKFIFSPLSAIEERYSEYGDIFRTNT 
NSLYPFIYFCNPKAIQQIFTADPDTFTSGSINGILKYFVGLNSLLLQDGDRHKRQRKL LMPPFHGDRMRKYGDLIYNITSNVISQWKIEQPFPIRKSTQEISLKVILAAVFGLDQE GKSYEKLRVLMSDLLDSMSSPLSSTFLFFNFLRKDWGPWSPWGRFLRKKQELHELIIA EIQTAKKEGNHRDDILSLLLEARDEAGNAMSDEEIKDELLTMLFAGHETTASALAWAL YWIDMIPSVGEKLMAELATIPSNSDQVAITKLPYLSAICQETLRIYPIAMNAFPRVVQ KPIEIMGYQLEPGMVAIVPIYLTHHREDIYPEPKKFKPERFLERQFSPYEYLPFGGGS RRCIGSAFALFEMKLVLATILSQWELKLLPNQRISPVRRGLTMAPPANMRMVVKPKKS WQKVSQPILTSG CYP110D4 Lyngbya sp. PCC 8106 ZP_01620515 MTLPNGPQTPRVLRMMKFVARPLDYLEDYYRRYGDFIRIGKSATPLVYVNHPAAIEKIFTAGSEQFRTGN AGGVLLFLLGDNSVLMVDGERHERQRKLLMPPFHGERLKTYNQLICEITKEVMSQVKIGQPFRVRTLMQD ITLRVILKAVFGLTEGERYEQLRHLLSAMMESIGSPLAASLMFFPSLRQDWGEWSPWGRFLRYKQQADEM IYAEIRERKQQRDFDGDDILTLLMSARDETGKPMNETELRDELVTLLIAGHETTASSLTWALYWTHYLPE VKDKLCFELANLGENPHLSEIARLPYLTAVCNETLRIYPVTLTSGVRVLKKPLELGGYSFEPGTVLFPCT YLVHQREDIYPEPKKFKPERFLQRQFSPYEFFPFGGGHRRCIGSAMATLEMKIALATILSDWQLKLPHHK AYKPVRRGLTLSPPAQLSLVAVNRLN CYP110D5 Crocosphaera watsonii WH 8501 ZP_00513562 MNLPPTLSQPRLLRLFKLIFYPLDYLEDNYQRYGDIFVAGKSETPFVYISNPQGIQTILTRDKTDFKTGG GSGFLSTLLGDNSLLFLQGERHRRERKLLMPPFHGERLKSYANLIYSISDKVTDKLQINRSFNVRDIMQE ITLKVILKAVFGITEGERYQRLQELLKSWLSFFDSPANAILIFFPWLRKNWGNWTPWGRFLQIKAEIQEL IYTEIRERREQKKYEGTDILTLLMLAKDEEGKPLSDQELHDELITLLIAGHETTASALTWALYWIHFCPD VEDKLRFHFSNLNNNTDLLDIVKLPYLDAVCKETLRIYPVLLTTFIRVLQTPLELMGYQFKPGTVFAPAI YLVHHREDIYPNSQQFRPERFLERNFSPYEYFPFGGGSRRCIGMELAKMEMKIVLYTILSKHKLKLPSSR PLKAVRRGLTVAPPSNFKMILSN CYP110D6 Cyanothece sp. CCY0110 ZP_01726400 MVLPPSISTPRLLRLFKLIFYPLDSLENYYERYGDIFIVGQSETPFVYISNPQGIQEILTKDKTHFRTGG GSGFLTTFLGNNSLLSLKGEKHQRERKLLTPAFHGERLQSYATLIYSISDEVSEKLEINQSFNVREIMQE ITLQVILKAVFGIAEGKRYQKLKNLLTSWLSFFDSPINATIIFFPFLQKDWGNWTTWGRFLRIKAQIDDL IYTEINERRQQKNYQGKDILTLLILARDEDGNPMSDQELHDELITLLIAGHETTASSLTWALYWIHYCPE VEEKLRSHFSILDKNIDLLNIIKLPYLDAVCSETLRIYPVVVNAFIRVLETPLELMGYQFKPGTVFAPAI YLVHHREDIYPNSKQFRPERFLERQFSPYEYLPFGGGSRRCIGMELAKMEMKIVLFTLLSKYKFKLSSSH PLKPVRRGLTIAPPNSFKMIITQKLAYT CYP110E1 Nostoc sp. 
PCC 7120 Same as Anabaena GenPept BAB76532, AI2409 NC_003272 complete genome 5753083..5754450 50% to CYP110A2 53% to CYP110B1 53% to 110D1 1 MKLPDSPKIP KFMQLVQWIY QPLQLMEASA KAHGDSFTLW LTNKRPIVFL SNPQAIQELF 61 TTPLEQLDAR GTAQVLQPLL GENSLLLLSG ETHQRQRKLL TPPFHGDRMR AYGDIITNIT 121 KEVISNWQLG KPFSVRDSMQ EITLRVILQA VFGLREGERY TQLQKRLCDI LDLSGSALRS 181 TLSFLPALQI DLGRWSPWGH FLRQREAIDQ LLYAEIQDRR DHPDPSRTDI LSLMMAARDE 241 NGEAMTDVEL RDELMTLLVA GHETTASALT WALYWIHKLP QVREKLLAEL DNFGDNGDVN 301 EITRLPYLTA VCQETLRIYP IAMVTIPRIT KTNLEIGGHQ FAPGTMLVGC IYLMHRRPDL 361 YPQPQEFKPE RFLEKQYSLY EYLPFGGSNR RCVGMAFALY EMKLILATVL ANVDLALVDN 421 YPVKPTRRGV TLAPSGGKWL IATAQHQKIK NPVEV CYP110E2 Nostoc punctiforme NZ_AAAY02000088 GenPept ZP_00107327.1 complement(18173..19567) gene = Npun1723 58% TO 110E1 55% TO 110B1 MSLLKLPNGPQTHPWIQMYQWLTNPLEYMEACTKRYGDIFTLKL GQNFAHQVFISNPQAIQQIFTTDPKQLDSGESAGIKAPLLGQQSLLALDGKPHQRQRK LLTPPFHGERMLAYGELIREITEQVSSQWQVGETFAVLPSMQAISFQVILKAVFGLED GPRYKKLNELLIKILNPKIPLLRTVLLIFPSMRQDLGAWSPWGKYLRLRQQIDQLIYA QIQERKAQPNLSGTDILSLMMAARDEAGEPMTDLELRDELMTLLVAGHETTATSLSWA LYWIHHRPQVREKLLQELDNLGEKPDPNAIFRLPYLNAVCSETLRLYPVAMSALNRLV KSPLQIGEYNFEPGTILIPSIYLTHHREDLYPESKQFKPERFLERQFSPYEYLPFGGG NRRCIGMAFALFEMKLVLATVLSRWQMELADSKPVRPVRKGLLFSPAGGVQMVVKGKR LQNQPILQTSSSSV CYP110E3 Trichodesmium erythraeum GenPept ZP_00072591.1 GenEMBL NZ_AABK02000017 complement(<3..1016) 53% to 110E1 missing C-terminal 121 aa (runs off end of clone) 1 MIKLPGPKSP ALTQILQWTA KPIKFMEKCA REYGDTFEVK LNYPIVFISH PKAIEEIFKA 61 NPKKFDCGSS NKLAQPLLGD YSLLLLDDIP HQRQRKLLMP PFHGKRMQAY GELICNVAQE 121 VASKWEIGQV FSMREFTAEI SLKVILQAVF GLYEGERYSK LEKLLGSLLE SLSSPLKTSM 181 LFFQFLQIDL GPWSPWGNFI KNREEIYELL CAEISERRQK LDPERSDILT MLLLARDEEG 241 EGMSDIELRD ELMTLLIAGH ETTATSLSWA FYWIHHQPEI YQKLSRELET FGDDLNPMTV 301 INLPYMNAVC SETLRIYPVV IIVSPRKTKL PITIMGQT CYP110E4 Gloeobacter violaceus PCC 7421 GenEMBL AP006578 complement(257348..258724) gene = gll3063 NC_005125 complete genome complement(3256348..3257724) locus_tag = 
gll3063 71% to 110E5 55% to 110E1 MSLPPGPSSPSPFQLMQWIGCPTDYLHTTAARYGDPFTMRVGVF PPLVMFSDPRAIQQLFTAEAGTFDAGASNVALRPTLGANSLLLLDGERHQQQRRLLTP PFHGERMRAYGELIRQVTEEVIVRWQPGKPFLVRNAMQRISLAVILQAVFGLHDGTRL VRLRQALGSMLDAMSSPLSMAMLLMLPEDFGPWSPRARLQAHLGAIDELLYAEIRERR EHFDAGAGDILGLLLAARDEAGAAMGDAELRDELMTLLVAGHETTATAMAWALYWIHY LPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVALIASPRVARHTVRI LERDYEAGTRLAAGIYLAHHRPETYPEPERFRPERFLERTFSPYEFVPFGGGSRRCIG MAFALYEMKLVIATVLLERDLRLVQPRLLRPVRRGVTLAPPEGLYLVPTGERSASRLL SRTSTAGQ CYP110E5 Gloeobacter violaceus PCC 7421 GenEMBL AP006578 complement(258800..260176) gene = gll3064 NC_005125 complete genome complement(3257800..3259176) locus_tag = gll3064 71% to 110E4 55% to 110E2 MSLPAGPASPPPLQLLQWIGRPTDYLERTARRYGDPFTMRLGLH SPVTGVFFSSPEAFQQLFNTEPGLFDSGGANASSTFNLLFGTNSLILLDGERHQQQRR LLTPPFHGERMRSYGELIRTLAEQVTARWNLGTPFQARRSMQRISLGVILKAVFGLHD GTRYLRVCRLLGNLIDASASPLLFGLRLIFPQDAGPMSPMGQLKAQIDAIDELLYAEI RERRERPDPRADDILSLLMAARDEAGQGMGDVELRDELMTLLVAGHETTATAMAWALY WIHRLPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVAMVAFARVPRR PVRILDREYPAGTFLIPNIYLAHRRPEAYPDPERFRPERFLERTFSPYEFVPFGGGSR RCIGVAFALYEMKLVLATVLSRVELRLADPRPRLPVRRGLTLAPPEDLHLIPTALRSG HRDLLPAC CYP110E6 Anabaena variabilis ATCC 29413 YP_322620 MKLPDSPKIPRFMQLVQWIYQPLQLMEASAKAHGDCFTLWLTNKRPIVFLSNPQAIQELFTTPLEQLDAR GTAQVLQPLLGENSLLLLSGETHQRQRKLLTPPFHGDRMRAYGDIITNITQEVISKWQLGEPFSVRDSMQ EITLRVILQAVFGLREGERYTQLQKRLCDILDLSGSALRSTLSFLPALQIDLGSWSPWGHFLRQRAAIDQ LLYAEIQDRRDHPDPSRTDILSLMMAARDENGEAMTDIELRDELMTLLVAGHETTASALTWALYWIHKLP QVREKLLAELDNFGDNGDVNEITRLPYLTAVCQETLRIYPIAMVTIPRIVKTTLEIGGHQFAPGTMLVGC IYLMHRRPDLYPQPQEFKPERFLEKQYSLYEYLPFGGSNRRCVGMAFALYEMKLVLATVLANMDLALVDN YPVKPTRRGVTLAPSGGKWLIATGQHQKVKSPVEV CYP110E7 Nodularia spumigena CCY9414 ZP_01631632 MPALQLPDGPKNHPWLQTYRWLTSPLEYMEDCAKNYGDIFTIRVGPLSTPQVFVSNPQAIQQIFSTDPKY LDSGAAAGFKSPLLGNQSLLSLDGKPHQRQRKLLTPPFHGERMLAYGELIRDISQQVTNKWQVGETVSVL SSMQAISFQVILKAVFGLAEGPRYEKIKEALIAILNPKKPLLRSMLLMFPSLRRDLGAWSPWGEFLRLRQ 
QIDELVYAEIQERKAQLDSSRTDILSLMMATRDEAGEPMTDLELRDELMTLLVAGHETTATALSWALYWI HHQPQVREKLLQELDTLGEKPDPNAIFRLPYLNAVCSETLRLYPVAMLLLSRLVKSPLQIGEYQFEPGTL LIPCVYLTHHREDLYPDSQTFKPERFLERQFSNSEFIPFGGGNRRCIGMAFALFEMKLVLATVLSNWQME LANTQPVLPVRKGLLFGPKGGVQMVVKGRRELS CYP110F1 Nostoc punctiforme NZ_AAAY02000005 GenPept ZP_00111618.1 complement(57031..58407) gene = Npun6096 48% TO 110E1 48% T