P450s that have appeared since the 1993 P450 nomenclature update.
      This is part E of the bibliographic P450 files.  
      This section contains bacterial sequences CYP101 to CYP174.  
      This includes references that were incomplete and duplications
      of sequences that were already in the update.  If a sequence 
      is assigned an accession number that was not in the old update
      it is included in this list.  48 new P450s were added July 27, 2000.
      Four new sequences were added Jan. 9, 2001: CYP102C1, CYP172-174.
      Added CYP175A1 9/17/2001
      Compiled by David R. Nelson
      Last modified June 2, 2003 added 25 new sequences. 
      Last modified Nov. 5, 2003 There are now 501 bacterial P450s
      Last modified Dec. 2, 2003 added 4 seqs in CYP153 family
      Last modified Feb. 23, 2005 added 26 seqs from Rhodococcus sp. RHA1 
      Last modified Feb. 23, 2005 added 155C1 and 155C2
      Last modified Sept. 20, 2005 added 18 Sorangium cellulosum seqs.
      Last modified Nov. 17, 2006 
      Last modified Dec. 31, 2007 

51 Family

101 Family

102 Family

103 Family

104 Family

105A Subfamily

105B Subfamily

105C Subfamily

105D Subfamily

105E Subfamily

106 Family

107A Subfamily

107B Subfamily

107C Subfamily

107D Subfamily

107E Subfamily

107F Subfamily

107G Subfamily

107H Subfamily

107J Subfamily

108 Family

109 Family

110 Family

111 Family

112 Family

113A Subfamily

113B Subfamily

114 Family

115 Family

116 Family

117 Family

118 Family

119 Family

120 Family

121 Family

122 Family

123 Family

124 Family

125 Family

126 Family

127 Family

128 Family

129 Family

130 Family

131 Family

132 Family

133 Family


51 Family

A note on nomenclature.  CYP51s were originally all called CYP51, because only one 
gene was found per species and they all seemed to belong to this one conserved family.
However, rice had many CYP51s in at least two sequence groups, so subfamilies
have been designated for CYP51s.  These are not typical subfamilies: only 
one subfamily is created for each major taxonomic group.  CYP51A is for animals,
CYP51B for bacteria, CYP51C for Chromista, CYP51D for Dictyostelium, CYP51E
for Euglenozoa and CYP51F for fungi.  In groups with only one CYP51 per species, 
all sequences share one name: CYP51A1 is used for all animal CYP51s since they are 
orthologous.  The same is true for CYP51B, C, D, E and F.  CYP51G (green plants) 
and CYP51H (monocots only so far) sequences have individual sequence numbers.
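
The naming rule in the note above can be written as a small lookup.  This is a
minimal sketch in Python and is not part of the original files.  The names
CYP51_SUBFAMILIES, SINGLE_NAME_SUBFAMILIES and cyp51_group are hypothetical, and
the sketch assumes the subfamily letter is the sixth character of the name.

```python
# Subfamily letter -> taxonomic group, as listed in the nomenclature note.
CYP51_SUBFAMILIES = {
    "A": "animals",
    "B": "bacteria",
    "C": "Chromista",
    "D": "Dictyostelium",
    "E": "Euglenozoa",
    "F": "fungi",
    "G": "green plants",
    "H": "monocots",
}

# Subfamilies whose members all share one name (for example CYP51B1).
# CYP51G and CYP51H sequences get individual numbers instead.
SINGLE_NAME_SUBFAMILIES = set("ABCDEF")


def cyp51_group(name):
    """Return (taxonomic group, shares_one_name) for a name such as 'CYP51B1'."""
    name = name.upper()
    if not name.startswith("CYP51") or len(name) < 6:
        raise ValueError("not a CYP51 subfamily name: %r" % name)
    letter = name[5]
    if letter not in CYP51_SUBFAMILIES:
        raise ValueError("unknown CYP51 subfamily: %r" % letter)
    return CYP51_SUBFAMILIES[letter], letter in SINGLE_NAME_SUBFAMILIES


print(cyp51_group("CYP51B1"))  # bacterial entries in this section
print(cyp51_group("CYP51G1"))  # green plants, individually numbered
```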

CYP51B1    Mycobacterium tuberculosis (Actinobacteria)
           GenEMBL Z80226 (34809bp) gi 1550642 Rv0764c
           complement (6140-7495)
           33.7% identical to CYP51 over 439AA overlap
           this is a bacterial CYP51
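
Coordinates such as "complement (6140-7495)" give the gene length directly.
The following is a minimal sketch in Python, not part of the original files.
The function name cds_length is hypothetical, and it assumes a single
start/end pair separated by "-" or "..", as in the entries of this section.

```python
import re


def cds_length(location):
    """Return the length in nucleotides of a location such as
    'complement (6140-7495)' or 'complement(2757924..2759282)'."""
    match = re.search(r"(\d+)\s*(?:\.\.|-)\s*(\d+)", location)
    if match is None:
        raise ValueError("no coordinate pair found in %r" % location)
    start, end = int(match.group(1)), int(match.group(2))
    # Both end points are included, hence the +1.
    return abs(end - start) + 1


length = cds_length("complement (6140-7495)")
codons, remainder = divmod(length, 3)
print(length, codons, remainder)  # 1356 452 0
```

A remainder of 0 means the range is a whole number of codons.  Whether the
last codon is a stop codon is an assumption that the coordinates alone do not
confirm.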

CYP51B1    Mycobacterium marinum
           No accession number
           Tim Stinear
           MM4932
           82% to 51B1 M. tuberculosis

CYP51B1    Mycobacterium ulcerans
           No accession number
           Tim Stinear
           99% to CYP51B1 Mycobacterium marinum = ortholog

CYP51B1    Mycobacterium bovis subsp. bovis AF2122/97 (Actinobacteria)
           NC_002945 complete genome complement(858662..858868)
           CYP51 100% match
           locus_tag = Mb0786c

CYP51B1    Mycobacterium avium (Actinobacteria)
           TIGR contig:3273:m_avium Length = 5,475,738
           79% to CYP51 M. tuberculosis
3021360 TSTVVPRVSGGEEEHGHLEEFRTDPIGLMQRVRDECGDVGWFQLVDKHVILLSGAQANEF 3021539
3021540 FFRSADEDLDQAEAYPFMTPIFGKGVVFDASPERRKEMLHNSALRGEQMKGHASTIEGEV 3021719
3021720 KKMIADWGDEGEIELLDFFAELTIYTSTACLIGLKFREQLDHRFAEYYHDLERGTDPLCY 3021899
3021900 VDPYLPIESFKRRDEARVKLVALVQEIMDQRLANPPKDKADRDMLDVLVSIKDEDGKPRF 3022079
3022080 SADEITGMFISLMFAGHHTSSGTSAWTLIELIRHPDVYAEVLAELEELYADGQEVSFHAL 3022259
3022260 RSIPKLDNVVKETLRLHPPLIILMRVAKGEFEVEGFPIHEGDYVAASPAISNRIPEDFPD 3022439
3022440 PDAFKPDRYNKPEQADIVNRWTWIPFGAGRHRCVGAAFAQMQIKAIFSVLLREYDFEMAQ 3022619
3022620 PADSYRNDHSKMVVQLARPAKVRYRKR 3022700
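
In the translated contig lines above, the flanking numbers are nucleotide
coordinates, so each residue should account for three nucleotides.  The
following is a minimal sketch in Python of that consistency check, not part of
the original files.  The function name check_translated_line is hypothetical,
and it assumes each line has exactly three whitespace-separated fields
(start, residues, end).

```python
def check_translated_line(line):
    """Return True if the coordinate span equals three times the residue count."""
    fields = line.split()
    if len(fields) != 3:
        raise ValueError("expected 'start residues end', got %r" % line)
    start, residues, end = int(fields[0]), fields[1], int(fields[2])
    # abs() lets the same check work for reverse-strand lines,
    # where the coordinates run downward.
    span = abs(end - start) + 1
    return span == 3 * len(residues)


# Last line of the M. avium block: 27 residues over 81 nucleotides.
print(check_translated_line("3022620 PADSYRNDHSKMVVQLARPAKVRYRKR 3022700"))
```

Lines that carry only a start coordinate are rejected by this sketch rather
than guessed at.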

CYP51B1    Mycobacterium smegmatis (Actinobacteria)
           TIGR contig:3439:m_smegmatis Length = 6,989,783
           80% to CYP51 M. tuberculosis
4858809 VPRVSGGEEEHGHLEEFRTDPIGLMKRVRSECGDVGWFQLADKQVVLLSGAEANEFFFRS 4858988
4858989 SDSELNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEQMKGHAATIENEVRRMV 4859168
4859169 ESWGDEGEIDLLEFFAELTIYTSTACLIGVKFRNQLDKRFADYYHLLERGTDPLCYVDPY 4859348
4859349 LPIESFRIRDEARANLVELVQEVMNGRIANPPKDKSDRDLLDVLVSIKDEDGTPRFSANE 4859528
4859529 VTGMFISLMFAGHHTSSGTASWTLIELLRHPEFYAKVQAELDDLYADGQEISFHALRQIP 4859708
4859709 NLDNALKETLRLHPPLIILMRVAQDEFEVAGRPIHKGQMVAASPAISNRIPEDFPDPDTF 4859888
4859889 DPDRYDKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLRDFEFEMAQPSES 4860068
4860069 YRNDHSKMVVQLARPAKVRYRRR 4860137

CYP51B1    Methylococcus capsulatus (Proteobacteria)
           TIGR contig:221:m_capsulatus
           49% to CYP51 M. tuberculosis
           NOTE FUSION PROTEIN EXTENDS C-TERMINAL. 
           SEE J. Biol. Chem., Vol. 277, Issue 49, 46959-46965, December 6, 2002
           A Novel Sterol 14-Demethylase/Ferredoxin Fusion Protein (MCCYP51FX) 
           From Methylococcus capsulatus Represents a New Class of the Cytochrome 
           P450 Superfamily 
           Colin J. Jackson, David C. Lamb, Timothy H. Marczylo, Andrew G. S.
           Warrilow, Nigel J. Manning, David J. Lowe, Diane E. Kelly, and Steven 
           L. Kelly
908332 MSHPPSNTP
908305 PVKPGGLPLLGHILEFGKNPHAFLMALRHEFGDVAEFRMFHQRMVLLTGSQASEAFYRAP 908126
908125 DEVLDQGPAYRIMTPIFGRGVVFDARIERKNQQLQMLMPALRDKPMRTYSEIIVAEVEAM 907946
907945 LRDWKDAGTIDLLELTKELTIYTSSHCLLGAEFRHELNTEFAGIYRDLEMGIQPIAYVFP 907766
907765 NLPLPVFKRRDQARVRLQELVTQIMERRARSQERSTNVFQMLIDASYDDGSKLTPH 907598
907597 EITGMLIATIFAGHHTSSGTTAWVLIELLRRPEYLRRVRAEIDALFETHGRVTFESLRQM 907418
907417 PQLENVIKEVLRLHPPLILLMRKVMKDFEVQGMRIEAGKFVCAAPSVTHRIPELFPNPEL 907238
907237 FDPDRYTPERAEDKDLYGWQAFGGGRHKCSGNAFAMFQIKAIVCVLLRNYEFELAAAPE 907061
907060 SYRDDYRKMVVEPASPCLIRYRRRDAP 906980

CYP51B1     Rhodococcus sp. RHA1 (Actinobacteria)
            No accession number
            Marianna A. Patrauchan
            Rha05830
            Submitted to nomenclature committee 12/13/04
            77% to CYP51B1 M. avium or M. tuberculosis

CYP51B1     Nocardia farcinica IFM 10152 (Actinobacteria)
            GenEMBL AP006618.1
            CDS complement(2757924..2759282)
MTLVKPRRVSGGEHEHGHLEEFRTDPIALMRRVRQECGDVGAFE
LAGKQVILLSGAEANEFFFRSGDEDLDQGAAYPFMKPIFGEGVVFDASPERRKEMLHN
SALRAEQMRGHATTIAAEVDRMIAGWDDEGEIDLLDFFAELTIYTSSACLIGVKFRNE
LDDRFARLYHELERGTDALAYVDPYAPIESFRRRDEARAALVALVQAIMDERAANPPA
DKSDRDLLDVLVSVPNEDGGPRFSASEITGIFISMMFAGHHTTSGTAAWTVIELLRHP
ELRDRVVAELDELFADGKDVSFHALRQIPLLEATLKETLRMHPPLIILMRVAQGDFEV
CGHHIAAGDHVAATPAISNRLPEDFPDPDTFDPGRYIDPNQEDLVNRWTWIPFGAGRH
RCVGAAFALMQLKAIFSILLRDWEFEMAQPSESYRNDHSKMVVQLQQPCRVRYRRRVR
TS

CYP51B1   Mycobacterium vanbaalenii 
          ZP_01207535.1, AAT40578.1, EAS23094.1 
          Q5IZM4 CP51_MYCVN Cytochrome P450 51 (CYPLI) (P450-LIA1) (Sterol 14-alpha 
          80% TO CYP51 Mycobacterium tuberculosis
MTAVKEVPRVSGGEEEHGHLEEFRTDPIGLMKRVREECGDVGWFQLADKQVILLSGAEANEFFFRSSDSE
LNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEHMKGHATTIEAEVRKMIEGWGESGEIDLLEF
FAELTIYTSTACLIGLKFRNQLDSRFANYYHLLERGTDPLCYVDPYLPIESFRIRDEARAGLVELVQDVM
HGRIANPPKDKSDRDMLDVLVSIKDEDGNPRFTANEITGMFISLMFAGHHTSSGTSSWTLIELLRHPEFY
AKVQQELDDLYADGQEVSFHALRQIPSLDNALKETLRLHPPLIILMRVAQDEFEVAGYPIHKGQMVAASP
AISNRIPEDFPNPDDFDPDRYEKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLREYEFEM
AQPPESYQNDHSKMVVQLARPAKVRYRRRVRD

101 Family


CYP101A1    Pseudomonas putida
            GenEMBL D00528 (1950bp)
            Koga,H., Yamaguchi,E., Matsunaga,K., Aramaki,H. and Horiuchi,T.
            Cloning and nucleotide sequences of NADH-putidaredoxin reductase
            gene(camA) and putidaredoxin gene(camB) involved in cytochrome
            P-450cam hydroxylase of Pseudomonas putida
            J. Biochem. 106, 831-836 (1989)
            Note: only the last 93 nucleotides of the cam gene were cloned, along 
            with two downstream genes.

CYP101A1    Pseudomonas putida
            PIR C60886 (last 8 amino acids)
            Romeo, C., Moriwaki, N., Yasunobu, K.T., Gunsalus, I.C.,
            Koga, H.
            Identification of the coding region for the putidaredoxin
            reductase gene from the plasmid of Pseudomonas putida.
            J. Protein Chem. 6, 253-261 (1987)

CYP101B1    Novosphingobium aromaticivorans
            NZ_AAAV01000165.1 
            complement(29626..30870) gene = Saro2804
            43% to CYP101
MLPHDRGQNSTRRITAMEAPAHVPADRVVDIDIYMPPGLAEHGF
HKAWSDLSAGNPAVVWTPRNEGHWIALGGEALQEVQSDPERFSSRIIVLPKSVGEMHG
LIPTTIDPPEHRPYRQLLNAHLNPGAIRGLSESIRQTAVDLIEGFAAQGHCNFTAQYA
EQFPIRVFMALVGIEASEAPRIRHWAECMTRPGMDMTFDEAKAVFFDYVGPLVDARRE
TPGEDMISAMINADLGDGRRLTRDEALSVVTQVLIAGLDTVVNVLGFIMRELAGNPAL
RADLRQRGADILPVVHELFRRFGLVSIAREVRRDIEFHGVHLKAGDMIAIPTQVHGLD
PRVNPDPLAIDPSRKRARHSTFGSGPHMCPGQELARKEVAITLEEWLRRIPDFALGPN
SDLSPVPGIVGALRRVELVWNT

CYP101C1   Novosphingobium aromaticivorans
           NZ_AAAV01000133.1 
           complement(4199..5389) gene = Saro1574
           44% to CYP101A1
MIPAHVPADRVVDFDIFNPPGVEQDYFAAWKTLLDGPGLVWSTA
NGGHWIAARGDVVRELWGDAERLSSQCLAVTPGLGKVMQFIPLQQDGAEHKAFRTPVM
KGLASRFVVALEPKVQAVARKLMESLRPRGSCDFVSDFAEILPLNIFLTLIDVPLEDR
PRLRQLGVQLTRPDGSMTVEQLKQAADDYLWPFIEKRMAQPGDDLFSRILSEPVGGRP
WTVDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQRLLRERPDLIPAAADELMR
RYPTVAVSRNAVADVDADGVTIRKGDLVYLPSVLHNLDPASFEAPEEVRFDRGLAPIR
HTTMGVGAHRCVGAGLARMEVIVFLREWLGGMPEFALAPDKAVTMKGGNVGACTALPL
VWRA

CYP101D1   Novosphingobium aromaticivorans
           NZ_AAAV01000085.1 
           complement(6803..8068) gene = Saro0669
           44% to CYP101
MNAQTSTATQKHRVAPPPHVPGHLIREIDAYDLDGLEQGFHEAW
KRVQQPDTPPLVWTPFTGGHWIATRGTLIDEIYRSPERFSSRVIWVPREAGEAYDMVP
TKLDPPEHTPYRKAIDKGLNLAEIRKLEDQIRTIAVEIIEGFADRGHCEFGSEFSTVF
PVRVFLALAGLPVEDATKLGLLANEMTRPSGNTPEEQGRSLEAANKGFFEYVAPIIAA
RRGGSGTDLITRILNVEIDGKPMPDDRALGLVSLLLLGGLDTVVNFLGFMMIYLSRHP
ETVAEMRREPLKLQRGVEELFRRFAVVSDARYVVSDMEFHGTMLKEGDLILLPTALHG
LDDRHHDDPMTVDLSRRDVTHSTFAQGPHRCAGMHLARLEVTVMLQEWLARIPEFRLK
DRAVPIYHSGIVAAVENIPLEWEPQRVSA

CYP101D2   Novosphingobium aromaticivorans
           NZ_AAAV01000042
           complement(5601..6899) gene = Saro0208
           63% to 101D1
MGTTRMDTFNPQESRLATNFDEAVRAKVERPANVPEDRVYEIDM
YALNGIEDGYHEAWKKVQHPGIPDLIWTPFTGGHWIATNGDTVKEVYSDPTRFSSEVI
FLPKEAGEKYQMVPTKMDPPEHTPYRKALDKGLNLAKIRKVEDKVREVASSLIDSFAA
RGECDFAAEYAELFPVHVFMALADLPLEDIPVLSEYARQMTRPEGNTPEEMATDLEAG
NNGFYAYVDPIIRARVGGDGDDLITLMVNSEINGERIAHDKAQGLISLLLLGGLDTVV
NFLSFFMIHLARHPELVAELRSDPLKLMRGAEEMFRRFPVVSEARMVAKDQEYKGVFL
KRGDMILLPTALHGLDDAANPEPWKLDFSRRSISHSTFGGGPHRCAGMHLARMEVIVT
LEEWLKRIPEFSFKEGETPIYHSGIVAAVENVPLVWPIAR

CYP101D3   Sphingomonas sp. SKA58
           ZP_01304513
           78% to CYP101D2
  1 MPEHVPETMS RARAPRPEHI PEQYVHEIDM YALEGIEQGY HEAWKNIPKP DMPDLIWTPF
 61 TGGHWIATNG DTVREVYSDP TRFSSEVIFL PKEAGEKYEM VPTRMDPPEH TPYRKALDKG
121 LSLAQIRKVE SKVRKVAVDL IDSFVSRGEC DFSAEYANVF PVRVFMALAD LPESDVPTLS
181 RFAKMMTRPE GNTPEEMAKH LEEGNKGFFA YVEPIIQARR GKEGEDLITV MVNAEINGER
241 ITHDKALGLI SLLLLGGLDT VVNFLSFMMI HLAKNPQVVE ELRADPLKLM RSAEEMFRRF
301 PVVSEARMVA KDQDFRGIEL KRGDMILLPT ALHGLDDQLN DDPWRINLER RGISHSTFGG
361 GPHRCAGLHL ARMEVIVTIE EWLKRIPTFA MKPGAQPIYH SGIVAAVDNV PLIWSER
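
Numbered blocks like the one above (position, then groups of ten residues) can
be joined into a plain sequence, using the leading number as a check.  The
following is a minimal sketch in Python, not part of the original files.  The
function name join_numbered_block is hypothetical, and it assumes each line
starts with the 1-based position of its first residue.

```python
def join_numbered_block(lines):
    """Join numbered, space-grouped sequence lines into one string.

    Raises ValueError if a line's leading number does not match the
    number of residues read so far plus one.
    """
    sequence = ""
    for line in lines:
        fields = line.split()
        if not fields:
            continue  # skip blank lines
        position = int(fields[0])
        if position != len(sequence) + 1:
            raise ValueError(
                "line starts at %d, expected %d" % (position, len(sequence) + 1)
            )
        sequence += "".join(fields[1:])
    return sequence


# First two lines of the CYP101D3 block.
block = [
    "  1 MPEHVPETMS RARAPRPEHI PEQYVHEIDM YALEGIEQGY HEAWKNIPKP DMPDLIWTPF",
    " 61 TGGHWIATNG DTVREVYSDP TRFSSEVIFL PKEAGEKYEM VPTRMDPPEH TPYRKALDKG",
]
print(len(join_numbered_block(block)))  # 120
```

The position check catches dropped or duplicated lines, which a plain join
would silently accept.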

102 Family

CYP102A1   Bacillus megaterium
           Ruettinger,R.T.,Wen, L.-P. and Fulco, A.J. 
           Coding Nucleotide, 5'-Regulatory, and Deduced Amino Acid Sequences of 
           P450BM-3, a Single Peptide Cytochrome P450:NADPH-P450 Reductase from 
           Bacillus megaterium. 
           J. Biol. Chem. 264, 10987-10995 (1989)

CYP102A1    Bacillus megaterium
            GenEMBL J04832 (4957bp)
            Ravichandran,K.G., Boddupalli, S.S., Hasemann,C.A.,
            Peterson,J.A. and Deisenhofer,J. 
            Crystal structure of hemoprotein domain of P450BM-3, a prototype
            for microsomal P450s.
            Science 261, 731-736 (1993)
            P450 is N-terminal

CYP102A2    Bacillus subtilis
            GenEMBL D87979
            Yamamoto, H., S. Uchiyama, F. A. Nugroho, and J. Sekiguchi.
            A 23.4 kb segment at the 69 degrees-70 degrees region of the 
            Bacillus subtilis genome. 
            Microbiology. 143, 1317-20 (1997)
            Gene name yfnJ, also called YetO.  66.4% identical to CYP102A1 (P450 part only).
            Fusion of P450 and reductase like CYP102A1; the P450 part is N-terminal.

CYP102A3    Bacillus subtilis
            GenEMBL U93874, Z99117
            Sorokin, A., A. Bolotin, B. Purnelle, H. Hilbert, J. Lauber, A. 
            Dusterhoft, and S. D. Ehrlich.  
            Sequence of the Bacillus subtilis genome region in the vicinity of 
            the lev operon reveals two new extracytoplasmic function RNA 
            polymerase sigma factors SigV and SigZ. 
            Microbiology. 143, 2939-43 (1997)
            Gene name yrhJ, most similar to CYP102A2
            (fusion of P450 and reductase like CYP102A1; P450 part is N-terminal)

CYP102A4    Bacillus anthracis str. Ames
            GenPept AAP27014                
            bifunctional P-450:NADPH-P450 reductase 1 
            79% to 102A2
   1 MDKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKLAEEYG PIFRMQTLSD TIIVVSGHEL
  61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETQEPNWQ KAHNILMPTF SQRAMKDYHA
 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
 181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSE NQEENDLLSR
 241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
 301 VLTDSTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
 361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
 421 MLLQHFEFID YEEYQLDVKQ TLTLKPGDFK IRIVPRNQTI SHTTVLAPTE EKLKNHEIKQ
 481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVAAL NDRIGSLPKE
 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKG DELKGVQYAV FGCGDHNWAS TYQRIPRYID
 601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQRMWSDAMK VFGLELNKNM EKERSTLSLQ
 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYK EGDHLGVLPI
 721 NSEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVR LYDLLSYSVE VQEAATRAQI
 781 REMVTFTACP PHKKELESLL EDGVYQEQIL KKRISMLDLL EKYEACEIRF EPFLELLPAL
 841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
 901 QSNFQLPENP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNVGEAHLYF GCRHPEKDYL
 961 YRTELENDER DGLISLHTAF SRLEGQAKTY VQHVIKEDRI HLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR LQEEGRYGKD VWAGI

CYP102A5    Bacillus cereus ATCC 14579
            GenPept AAP10153 
            NADPH-cytochrome P450 reductase/P450 fusion
            79% to 102A2 Bacillus subtilis
  1 MEKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKIAEEYG PIFQIQTLSD TIIVVSGHEL
  61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETHEPNWK KAHNILMPTF SQRAMKDYHA
 121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
 181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSG DQEENDLLSR
 241 MLNVPDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
 301 VLTDPTPTYQ QVMKLKYMRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
 361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
 421 MLLQHFELID YQNYQLDVKQ TLTLKPGDFK IRILPRKQTI SHPTVLAPTE DKLKNDEIKQ
 481 HVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE
 541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKP DELKGVQYAV FGCGDHNWAS TYQRIPRYID
 601 EQMAQKGATR FSKRGEADAS GDFEEQLEQW KQNMWSDAMK AFGLELNKNM EKERSTLSLQ
 661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSDRSTRHIE VSLPEGATYK EGDHLGVLPV
 721 NSEKNINRIL KRFGLNGKDQ VILSASGRSI NHIPLDSPVS LLALLSYSVE VQEAATRAQI
 781 REMVTFTACP PHKKELEALL EEGVYHEQIL KKRISMLDLL EKYEACEIRF ERFLELLPAL
 841 KPRYYSISSS PLVAHNRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
 901 QSNFELPKDP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MNLGQAHLYF GCRHPEKDYL
 961 YRTELENDER DGLISLHTAF SRLEGHPKTY VQHLIKQDRI NLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR VQDEGRYGKD VWAGI

CYP102A6    Bradyrhizobium japonicum USDA 110
            GenPept BAC48147                
            NC_004463 complete genome 3173438..3176674
            NADPH-cytochrome P450 reductase/P450 fusion
            54% to 102A2
   1 MSSKNRLDPI PQPPTKPVVG NMLSLDSAAP VQHLTRLAKE LGPIFWLDMM GSPIVVVSGH
  61 DLVDELSDEK RFDKTVRGAL RRVRAVGGDG LFTADTREPN WSKAHNILLQ PFGNRAMQSY
 121 HPSMVDIAEQ LVQKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
 181 SLVRSLETIM MTRGLPFEQI WMQKRRKTLA EDVAFMNKMV DEIIAERRKS AEGIDDKKDM
 241 LAAMMTGVDR STGEQLDDVN IRYQINTFLI AGHETTSGLL SYTLYALLKH PDILKKAYDE
 301 VDRVFGPDVN AKPTYQQVTQ LTYITQILKE ALRLWPPAPA YGISPLADET IGGGKYKLRK
 361 GTFITILVTA LHRDPSVWGP NPDAFDPENF SREAEAKRPI NAWKPFGNGQ RACIGRGFAM
 421 HEAALALGMI LQRFKLIDHQ RYQMHLKETL TMKPEGFKIK VRPRADRERG AYGGPVAAVS
 481 SAPRAPRQPT ARPGHNTPML VLYGSNLGTA EELATRMADL AEINGFAVHL GALDEYVGKL
 541 PQEGGVLIIC ASYNGAPPDN ATQFVKWLGS DLPKDAFANV RYAVFGCGNS DWAATYQSVP
 601 RFIDEQLSGH GARAVYPRGE GDARSDLDGQ FQKWFPAAAQ VATKEFGIDW NFTRTAEDDP
 661 LYAIEPVAVT AVNTIVAQGG AVAMKVLVND ELQNKSGSNP SERSTRHIEV QLPSNITYRV
 721 GDHLSVVPRN DPTLVDSVAR RFGFLPADQI RLQVAEGRRA QLPVGEAVSV GRLLSEFVEL
 781 QQVATRKQIQ IMAEHTRCPV TKPKLLAFVG EEAEPAERYR TEILAMRKSV YDLLLEYPAC
 841 ELPFHVYLEM LSLLAPRYYS ISSSPSVDPA RCSITVGVVE GPAASGRGVY KGICSNYLAN
 901 RRASDAIYAT VRETKAGFRL PDDSSVPIIM IGPGTGLAPF RGFLQERAAR KAKGASLGPA
 961 MLFFGCRHPD QDFLYADELK ALAASGVTEL FTAFSRADGP KTYVQHVLAA QKDKVWPLIE
1021 QGAIIYVCGD GGQMEPDVKA ALVAIRHEKS GSDTATAARW IEEMGATNRY VLDVWAGG

CYP102A7  Bacillus licheniformis ATCC 14580
          GenEMBL AAU24352
          Rey,M.W., Ramaiya,P., Nelson,B.A., Brody-Karpin,S.D.,
          Zaretsky,E.J., Tang,M., de Leon,A.L., Xiang,H., Gusti,V.,
          Clausen,I.G., Olsen,P.B., Rasmussen,M.D., Andersen,J.T.,
          Jorgensen,P.L., Larsen,T.S., Sorokin,A., Bolotin,A., Lapidus,A.,
          Galleron,N., Ehrlich,S.D. and Berka,R.M.
          Complete genome sequence of the industrial bacterium Bacillus
          licheniformis and comparisons with closely related Bacillus species
          Genome Biol. 5 (10), R77 (2004)
          74% to CYP102A3 for the P450 domain
   1 MNKLDGIPIP KTYGPLGNLP LLDKNRVSQS LWKIADEMGP IFQFKFADAI GVFVSSHELV
  61 KEVSEESRFD KNMGKGLLKV REFSGDGLFT SWTEEPNWRK AHNILLPSFS QKAMKGYHPM
 121 MQDIAVQLIQ KWSRLNQDES IDVPDDMTRL TLDTIGLCGF NYRFNSFYRE GQHPFIESMV
 181 RGLSEAMRQT KRFPLQDKLM IQTKRRFNSD VESMFSLVDR IIADRKQAES ESGNDLLSLM
 241 LHAKDPETGE KLDDENIRYQ IITFLIAGHE TTSGLLSFAI YLLLKHPDKL KKAYEEADRV
 301 LTDPVPSYKQ VQQLKYIRMI LNESIRLWPT APAFSLYAKE ETVIGGKYLI PKGQSVTVLI
 361 PKLHRDQSVW GEDAEAFRPE RFEQMDSIPA HAYKPFGNGQ RACIGMQFAL HEATLVLGMI
 421 LQYFDLEDHA NYQLKIKESL TLKPDGFTIR VRPRKKEAMT AMPGAQPEEN GRQEERPSAP
 481 AAENTHGTPL LVLYGSNLGT AEEIAKELAE EAREQGFHSR TAELDQYAGA IPAEGAVIIV
 541 TASYNGNPPD CAKEFVNWLE HDQTDDLRGV KYAVFGCGNR SWASTYQRIP RLIDSVLEKK
 601 GAQRLHKLGE GDAGDDFEGQ FESWKYDLWP LLRTEFSLAE PEPNQTETDR QALSVEFVNA
 661 PAASPLAKAY QVFTAKISAN RELQCEKSGR STRHIEISLP EGAAYQEGDH LGVLPQNSEV
 721 LIGRVFQRFG LNGNEQILIS GRNQASHLPL ERPVHVKDLF QHCVELQEPA TRAQIRELAA
 781 HTVCPPHQRE LEDLLKDDVY KDQVLNKRLT MLDLLEQYPA CELPFARFLA LLPPLKPRYY
 841 SISSSPQLNP RQTSITVSVV SGPALSGRGH YKGVASNYLA GLEPGDAISC FIREPQSGFR
 901 LPEDPETPVI MVGPGTGIAP YRGFLQARRI QRDAGVKLGE AHLYFGCRRP NEDFLYRDEL
 961 EQAEKDGIVH LHTAFSRLEG RPKTYVQDLL REDAALLIHL LNEGGRLYVC GDGSRMAPAV
1021 EQALCEAYRI VQGASREESQ SWLSALLEEG RYAKDVWDGG VSQHNVKADC IART

CYP102A8   Bacillus thuringiensis serovar konkukian str. 97-27
           AAT62301
           98% to CYP102A4, 96% to CYP102A5
  1 MDKKVSAIPQ PKTYGPLGNL PLIDKDKPTL SFIKLAEEYG PIFQIQTLSD TIIVVSGHEL
 61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETDEPNWK KAHNILMPTF SQRAMKDYHA
121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSFYR ETPHPFITSM
181 TRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSE NQEENDLLSR
241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
301 VLTDSTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
361 LIPQLHRDKD AWGDDVEEFQ PERFEELDKV PHHAYKPFGN GQRACIGMQF ALHEATLVMG
421 MLLQHFEFID YEDYQLDVKQ TLTLKPGDFK IRIVPRNQTI SHTTVLAPTE EKLKKHEIKK
481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE
541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKP DELKGVQYAV FGCGDHNWAS TYQRIPRYID
601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQSMWSDAMK AFGLELNKNM EKERSTLSLQ
661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYK EGDHLGVLPI
721 NNEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVR LYDLLSYSVE VQEAATRAQI
781 REMVTFTACP PHKKELESLL EDGVYQEQIL KKRISMLDLL EKYEACEIRF ERFLELLPAL
841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
901 QSNFQLPENP ETPIIMVGPG TGIAPFRGFL QARRVQKQKG MKVGEAHLYF GCRHPEKDYL
961 YRTELENDER DGLISLHTAF SRLEGHPKTY VQHVIKEDRI HLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR LQEEGRYGKD VWAGI

CYP102A9   Bacillus weihenstephanensis KBAB4
           ZP_01184381
           96% to CYP102A5
  1 MDKKVSAIPQ PKTYGLLGNL PLIDKDKPTL SFIKIAEEYG PIFRIQTLSD TIIVVSGHEL
 61 VAEVCDETRF DKSIEGALAK VRAFAGDGLF TSETHEPNWK KAHNILMPTF SQRAMKDYHA
121 MMVDIAVQLV QKWARLNPNE NVDVPEDMTR LTLDTIGLCG FNYRFNSYYR ETPHPFITSM
181 SRALDEAMHQ LQRLDIEDKL MWRTKRQFQH DIQSMFSLVD NIIAERKSSG NQEENDLLSR
241 MLNVQDPETG EKLDDENIRF QIITFLIAGH ETTSGLLSFA IYFLLKNPDK LKKAYEEVDR
301 VLTDPTPTYQ QVMKLKYIRM ILNESLRLWP TAPAFSLYAK EDTVIGGKYP IKKGEDRISV
361 LIPQLHRDKD AWGDNVEEFQ PERFEELDKI PHHAYKPFGN GQRACIGMQF ALHEATLVMG
421 MLLQHFEFID YQDYQLDVKQ TLTLKPGDFK IRILPRNQTI SHTTVLAPIE EKLKNDEIEQ
481 QVQKTPSIIG ADNLSLLVLY GSDTGVAEGI ARELADTASL EGVQTEVVAL NDRIGSLPKE
541 GAVLIVTSSY NGKPPSNAGQ FVQWLEELKS DELKGVQYAV FGCGDHNWAS TYQRIPRYID
601 EQMAQKGATR FSTRGEADAS GDFEEQLEQW KQSMWSDAMK AFGLELNKNI EKERSTLSLQ
661 FVSRLGGSPL ARTYEAVYAS ILENRELQSS SSERSTRHIE ISLPEGATYQ EGDHLGVLPI
721 NSEKNVNRIL KRFGLNGKDQ VILSASGRSV NHIPLDSPVS LFDLLSYSVE VQEAATRAQI
781 REMVTFTACP PHKKELELLL EEGVYHEQIL KKRMSMLDLL EKYEACEIRF ERFLELLPAL
841 KPRYYSISSS PLVAQDRLSI TVGVVNAPAW SGEGTYEGVA SNYLAQRHNK DEIICFIRTP
901 QSNFQLPENP ETPIIMVGPG TGVAPFRGFL QARRVQKQKG INLGQAHLYF GCRHPEKDYL
961 YRTELENDER DGLISLHIAF SRLEGYPKTY VQHLIKQDRI NLISLLDNGA HLYICGDGSK
1021 MAPDVEDTLC QAYQEIHEVS EQEARNWLDR VQEEGRYGKD VWAGI

CYP102A10   Erythrobacter litoralis HTCC2594
            YP_456909
            60% to 102A6
  1 MDAPTALAPI PQPPGKPIVG NAFTVDSSRL IQSLMELAEE YGPIFQLEVM GTPLVFVSGA
 61 DMVAEICDES RFDKTVRGPL KRLRLIAGDG LFTGDTDDPN WAKAHHILLP SFSQKAMGSY
121 LPMMTDIASQ LMLKWERLNS DDVIDVPMDM VRLTLDTIGV CGFGYRFNSF YREDFHPFIE
181 ALNRTLDTTQ KMRGLPGEKL LKRQQIEQLN EDAAYMNNLV DEIIRERRQT GESGQGDLLD
241 FMLSGRDPVT GERLSDENIR YQINTFLIAG HETTSGLLSF TLYYLLKNRD VLQRAYAEVD
301 EVLGRNIDQT PTLSQIGRLP YIRAILSEAL RLWPTAPAMG LAPFEDEVLG GKYAIAKGTF
361 TTVLIPSLHR DKLVWGENPE AFNPDNFSPK AEAARPPHAY KPFGNGQRAC IGRQFAIQES
421 ILVLGMLLQR FELFDHADYQ LRIKETLSIK PDGFTIKARL RHDVERGGVA TVEPESKTPD
481 QAAAVPSHGT PLLVLYGSNL GSSEGFAREL AQRGEFSGFD VTMAPLDAHV AKLPTDGAVA
541 IACASYNGMP PDNAAKFVDW LEQADAADAP LSNVSYLVLG CGNSDWAATF QVVPRKIDAL
601 MEQHGAERLV PAEELDARGD LDTQFHDWLD GLIPQLGDAF DIDLESGFDA VFEPLYTVEI
661 TDSITGNTVA DRVGAREVEV VANRELKDTS KDEGRSTRHL EVRLPEGMEY EPGDHLCVVP
721 VNDPAVVDRL LKRFGLDRDT FVRIESRSDM RGPFPSGSTF SVLNLAETAG ELQAVATRKD
781 IATLARYSEC PNSRAALEAL AAPPSADGTD RYTSEVLEKR RSVLDMLEEF PACDVPLAVF
841 LELIPFLSPR YYSISSAPEA NQGLCSITVG VVKGPALAGT GEFKGTCSAY LADLPPGDRF
901 RAVVRKPTAQ FRLPDNPETP VIMIGPGTGV APFRAFLQRR DHLQEDGAVL GEAMLFFGCR
961 HPDIDYLYRE ELDDYDQRGV ATVHAAFSRH DGSRTYVQDL IAREADRVWE LIEQDARIYV
1021 CGDGARMEPD VRKALMAIYA EKKSSDEASA KAWIDDLVAQ DRYLLDVWVG

CYP102A11   Erythrobacter sp. NAP1
            ZP_01041731
            75% to CYP102A10
  1 MATNATLTPI PQPPGKPLIG NALTVDASQQ IQSLMELAEE YGPIFQLDMM GTPIVIISGA
 61 DLVAEVCDEK RFDKSVRGPL KRLRLIGGDG LFTGDTDAPN WSKAHNILLP SFSQKAMGSY
121 LPMMTDIATQ LVMKWERMNS DDVIDVPKDM IRLTLDTIGV CGFGYRFNSF YREDFHPFIR
181 ALTRTLETTQ KIRGIPGEKL LKGDAVKQLH RDAKYMNNLV DEIIRERQRS GGDGPEDLLD
241 FMLSGRDPLT GERLSDENIR YQINTFLIAG HETTSGLLSF TLYYLLKNRD VLTRAYAEVD
301 TVLGRNIDQP PSLKQIGQLP YIRAILFEAL RLWPTAPAFG LAPFEDEVLG GKYLIPKGTF
361 TTVLIPSLHR DKSVWGENPE VFDPENFTAE AEAARPPHAY KPFGNGQRAC IGRQFAIQES
421 ILVLAMILQR FELFDHSDYK LDIKETLSIK PDEFTIKARM RKDVERGGKA TEDAEEASTE
481 PAEPKVPKHD TPLLVLYGSN LGSSESFARE VAQKGEFSGF EVEMAPLDDY VGKLPEDGAV
541 AIACASYNGM PPDNAAKFID WIEGKDGAAP DLSGVSYMML GCGNSDWAAT FQAVPRLIDA
601 RMEELGAKRI VPTTELDARS DVDTQFHTWL DALMPQLGEH FDLDLSGEAA GIAEPLYKVE
661 VTQSVTANTV ASRVGAHAVK MVANRELKNT DIGEGRSTRH IEVELAQGET YQPGDHLCVV
721 PENDDAVVER LLRRFNLDAD TYVRIESRSE MRGPFPSGST FSVYNLAKTA GELQAVATRK
781 DIATLALYTE CPNSKPALEK LAQPPQEDGT DQYATDVLAK RKSVLDLLED YPACDLPLAV
841 FLEMIPFLSP RYYSISSAPG DTPQTCSITV GVVKGPALSG KGTFKGTCSN YLAELEPGAS
901 FNAVVREPTA NFRLPDDPKV PLIMVGPGTG LAPFRGFLQE RDALASSGEE LGPARLYFGC
961 RTPDEDFLYR DELEDYDKRG IVTLRTAFSR VDEGKCYVQD HIADDADAIW EMLEAGGRIY
1021 VCGDGARMEP DVRAALAKIH SDKTGSTPAE AQSWVGDLIT NERYSLDVWV G

CYP102A12   Rhodopseudomonas palustris HaA2
            YP_487251
            84% to CYP102A6
  1 MSSSNKLAPI PHPPKQPVVG NMLSIDTKAP VQHLVRLAEE LGPIFWLDMM GAPIVIVSGY
 61 DLVDEISDEK RFDKAVRGAL RRVRTVGGDG LFTADTSEPN WSKAHNILLT PFGGRAMQSY
121 HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
181 SLVRSLETIM MTRGLPLENL WMKKRRDTLA EDVAFMNAMV DEIIAERRKA AAVADKMDML
241 GAMMTGVDKV TGEPLDDVNI RYQINTFLIA GHETTSGLLS CAIYALLKHP EVLQKAYDEV
301 DRVLGADTSV EPSYQQVNQL GYITQILKET LRLWPPAPAY GVAPIQDETI GGQYHLKRGT
361 FTTVLVLALH RDPSIWGPNP DAFDPENFSR EAESKRPANA WKPFGNGQRA CIGRGFAMHE
421 AALALGMILQ RFKLIDHTRY RMVLKETLTI KPEGFKIKVR PRSDKDRATR IASGVSHSVA
481 PAPAAPRARP GHNTPLLVLY GSNLGTAEEL AHRVADLADL NGFATRLGAL DQYVGQLPEE
541 GGVLIFAASY NGAPPDNATQ FVRWLSGDLP PDAFAKLRYA VFGCGNRDWT ATYQAIPRLI
601 DERLAAHGGR NIFVRGEGDA RDDLEGQFEA WFATLGPLAV KEFGIDAAFD RGADDTPLYG
661 IEPLAPAASQ PLAATGVAVA MRVLENRELQ DRAASGRSTR HIEIALPQGM SYRVGDHLSV
721 IPRNDPALVA AVAQRFGFAP DDQIRLSAAP GRRAQLPVGE AVSIGGLLGD HVELQQVATR
781 KQIVALAAHT RCPQTRPKLQ ALAGGDGAAD DAYRAEVLGK RRSVFDLLQE HPACELPFAA
841 YLEMLTPLQP RYYSISSSPA RDPARASVTV AVVEGPALSG RGIYRGACSS WLAGRGSGDT
901 VQATVRATKA CFRLPDDDRV PLIMIGPGTG VAPFRGFLQE RSARKVGGAT LGPALLFFGC
961 RHPAQDYLYA DELQGFAADG IVELHAAFSR GDGPKTYVQH LIAAQKDRVF ALIEQGAIVY
1021 VCGDGGRMEP DVKAALCAIH RERSGADATA AAAWIADLGA RDRYVLDVWA SV

CYP102A13   Rhodopseudomonas palustris HaA2
            YP_568957
            87% to CYP102A12
  1 MPSTNKLDPI PHPPKKPVVG NMLSLDTTAP VQHLVRLAKE LGPIFWLDMM GAPLVIVSGY
 61 DLVDEISDEK RFDKAVRGAL RRARAVGGDG LFTADTKEPN WSKAHNILLT PFGGRAMQSY
121 HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
181 SLVRSLETIM MTRGLPLENL WMKKRRETLA DDVVFMNAMV DEIIAERRKA SESAADKKDM
241 LGAMLAGVDR ATGEPLDDVN IRYQINTFLI AGHETTSGLL SCAIYALLKH PDVLQKAYDE
301 VDRVLGSDTA VRPSYQQVNQ LSYITQILKE TLRMWPPAPA YGVAPIKDEV IGGKYHLKRG
361 TFVTVLVLAL HRDPAIWGPN PDAFDPENFS REAESKRPAN AWKPFGNGQR ACIGRGFAMH
421 EAALALGMIL QRFQLIDHQR YRMVLKETLT IKPEGFKIKV RPRSDKDRGD FVAAGASQVS
481 TPALAQAAPR ARPDHNTPLL VLYGSNLGTA EELATRVADL AELNGFSTRL GALDQYVGHL
541 PEEGGVLIFT ASYNGAPPDN ATQFVQWLSG DLPKDAFAKL RYAVFGCGNR DWTATYQAIP
601 RLVDERLAAH GGRNIFLRGE GDARDDLEGQ FESWFAKLGP LAVKEFGIDA KFARAVDDAP
661 LYRIEPVAPA AGNAVAAAGG AVPMKVLANR ELQDCAASGR STRHIEIALP EGISYRVGDH
721 LSVMPRNDPA LVAAVAQRLG FAPDDQIKLQ VAPGRRAQLP IGEAISVGRL LGDFVELQQV
781 ATRKQIAVMA EHTRCPQTRP KLQALAGGDG AADEAYRAGV LAKRKSVYDL MQEHPACELP
841 LHAYLEMLSP LAPRYYSISS SPLRDPSRAA ITVAVVDGPA LSGRGHYRGV CSTWLAGRSV
901 GDTIHATVRA TKAGFRLPDD DRVPLIMIGP GTGLAPFRGF LQERAARQQN GATLGPALLF
961 FGCRHPAQDY LYADELQGFA AEGVVELHTA FSRGEGPKTY VQHLIAAQKD RVFTLIEQGA
1021 IIYVCGDGGK MEPDVRAALM AIHRERSGAD AAAASTWIDD LGACNRYVLD VWASA

CYP102A14   uncultured soil bacterium
            ABD83817
            74% to 102A12
  1 MASNNKMSPI PQPPTRPVVG NMLSLDSAAP VQDLTRLAKE LGPIFWLDMM GAPIVIVSGY
 61 TLVDELSVET RLDKVVRGAL RRVRAIGGDG LFTADTAEPN WSKARNILLQ PFGNRAMQSY
121 HPSMVDIAEQ LVKKWERLNA DDEIDVVHDM TALTLDTIGL CGFDYRFNSF YRRDYHPFVE
181 SLVRSLETIM MIRGLPLENF WMRRRRSDLA TDVAFMNKMV DEIVAERRKS AEASDGKKDM
241 LNAMMSGVDR STGEQLDDVN IRYQINTFLI AGHETTSGLL SYAIYALLKH PDVLKKAYAE
301 VDRVLGADIE ARPSYQQVTQ LTYITQILKE ALRLWPPAPA YGIAPLKDET IAGGKYSLKK
361 NTFISILVTA LHHDPAVWGP NPDLFDPENF SPEAEAKRPV NAWRPFGNGQ RACIGRGFAM
421 HEAALALGMI LQRFKLIDHQ RYQIRLKETL TIKPDGFKIK VRPRSGHDRT VHAEAATAAV
481 ATGAALPRAR PRPGHNTPLL VLYGSNLGTA EDLATRVADL AEVNGFATRL APLDDCAGQL
541 PDSGGVLIFC ASYNGAPPDN ATKFVGWLRG ELPNDAFAKL RYAVFGCGNR DWAATYQSVP
601 RLIDETLSAH GGKRVFPRGE GDARSDLDGQ FESWFAALGA AAVKEFGLES RFSRSADDAP
661 LYSVEPVAPS AVNAVAALGG TVPMTILVSR ELQNKSGPDA SERSTRHIEV QLPGGMTYRV
721 GDHLSVVPCN APALVDRVAR RFGFLPADQI RLAVAEGRRA QLPVGEAVSI GQLLTDFVEL
781 QQVATRKQIQ IMSEHTRCPV TKPKLVAYVG DDADSSERYR ADILSRRKSV YDLLEEFPAI
841 ELPFPAYLEM LSLLAPRYYS ISSSPTGDAS RCSITVGVVS CPASSTGRGL YRGVCPYYLA
901 SRREGESVFA TVRETKAGFR LPDDPSVPII MIGPGTGLAP FRGFLQERAA RKAGGATLGP
961 AMLFFGCRHP EQDYLYADEL KAFADEGITE LFVAFSRSEG PKTYVQHLLA TQKARVWDLI
1021 EQGAVIFVCG DGSKMEPDVK ATLVQIYRDC TGADANGGAK WIADLGAQNR YVLDVWAGG

CYP102B1    Streptomyces coelicolor cosmid F43.
            GenEMBL AL136502 CDS 10570..12153 gene="SCF43.12"
            Highly similar to the N-terminal P450 domain of Bacillus
            megaterium: 41.9% identity in 497 aa overlap. 
            45% to 102A1 over 433 amino acids
            cloned and expressed by David Lamb and Steve Kelly

CYP102B2   Streptomyces avermitilis
           GenEMBL AP005050
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV7426 
           78% to 102B1 from Streptomyces coelicolor

CYP102B3    Rhodococcus sp. RHA1 
            No accession number
            Marianna A. Patrauchan
            Rha05872
            Submitted to nomenclature committee 12/13/04
            62% to CYP102B2 

CYP102B4    Streptomyces scabies
            SCAB9321  
            David Lamb
            Submitted to Nomenclature committee Nov. 10, 2006
            80% to CYP102B2 Streptomyces avermitilis

CYP102C1    Rhodococcus sp. X309 
            GenEMBL AF059700.1 complement(3619-4584) runs off end of sequence
            partial gene 48% to 102B1 

CYP102C2   Rhodococcus erythropolis PR4
           YP_345116
           88% to CYP102C1 
  1 MNGPRGSRTR PRRASLVNLA VLGVVLVHTV LMSADKCPYP KSATRAGEIT AVQPQVFESI
 61 PSPAWRLPLL GDLLTVDSEK PIQKEMALAS KLGPIFEWKI VNNRVTVVSG VDLVAEVNNE
121 ALWAKSVGLP ILKLRKVAED GLFTAFNSEP NWRKAHNILS EGFSRSALRN YHPSMLRALG
181 GLTDSWDRVA DAGETIDASS DANKLALDVI GLAGFGYDFA SFIGEEHPFV GAMSRVLAHV
241 NSTSNDIPFL RKLRGNGADL QNEKDIALLR TVVDNVIAER QSKPGEHQDD LLDLMLHSAD
301 AETGEKLDPV NIRNQVFTFL VAGNETTAGT LAFALYFLSR HPDVADTARA EVADVTAGET
361 PAFEDVARMR YLRRVVDETL RLWPSAPGYF RKVRTDTTLG GRYDMPKGSW VFVLLPQLHR
421 DPVWGEDPES FDPDRFKPEN VKKRPAHAYR PFGTGPRACI GRQFALHEAV LALAIILQRY
481 NFQSDPEYKL DIRETLSLKP VGFELSLQRR

CYP102D1   Streptomyces avermitilis
           GenEMBL AP005023
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV575 47% to 102A3 
           40% to 102B1, 44% to 102C1 partial seq 

CYP102E1    Ralstonia metallidurans
            GenEMBL NZ_AAAI01000371 
            104500-107000 region
            51% to 102D1
MSTATPAAALEPIPRDPGWPIFGNLFQITPGEVGQHLLARSRHHDGIFELDFAGKRVPFVS
SVALASELCDATRFRKIIGPPLSYLRDMAGDGLFTAHSDEPNWGCAHRILMPAFSQRAM
KAYFDVMLRVANRLVDKWDRQGPDADIAVADDMTRLTLDTIALAGFGYDFASFASDELDP 
FVMAMVGALGEAMQKLTRLPIQDRFMGRAHRQAAEDIAYMRNLVDDVIRQRRVSPTSGMD 
LLNLMLEARDPETDRRLDDANIRNQVITFLIAGHETTSGLLTFALYELLRNPGVLAQAY 
AEVDTVLPGDALPVYADLARMPVLDRVLKETLRLWPTAPAFAVAPFDDVVLGGRYRLRKD 
RRISVVLTALHRDPKVWANPERFDIDRFLPENEAKLPAHAYMPFGQGERACIGRQFALTE 
AKLALALMLRNFAFQDPHDYQFRLKETLTIKPDQFVLRVRRRRPHERFV
TRQASQAVADAAQTDVRGHGQAMTVLCASSLGTARELAEQIHAGAIAAGFDAKLADLDDA
VGVLPTSGLVVVVAATYNGRAPDSARKFEAMLDADDASGYRANGMRLALLGCGNSQWATY
QAFPRRVFDFFITAGAVPLLPRGEADGNGDFDQAAERWLAQLWQALQADGAGTGGLGVDV
QVRSMAAIRAETLPAGTQAFTVLSNDELVGDPSGLWDFSIEAPRTSTRDIRLQLPPGITY
RTGDHIAVWPQNDAQLVSELCERLDLDPDAQATISAPHGMGRGLPIDQALPVRQLLTHFI
ELQDVVSRQTLRALAQATRCPFTKQSIEQLASDDAEHGYA

CYP102F1   Actinosynnema pretiosum subsp. auranticum
           AF453501 complement(6501..9518)
           maytansinoid antitumor agent ansamitocin biosynthetic gene cluster I
           49% to 102A3
           gene = asm30
MVATGTRIPGPKPLPLVGNLLDVLTSDLDTDVDFLDRCHREHGG
IVALTFAGQRQVFASSHELVARMCSDPSWGKAVHPALEQVRDFAGDGLFTARGDEPNW
GKAHRLLMPAFGPTAMRDHFPAMLDIAEQMLVRWRRFGPDHRIDVADDMTRLTLDTIA
LCAFGARFNSFYRDRAHPFVDAMVRSLVEAGERAERLPGVQPFLVGRNQRYRDDIATM
NRIADGIVAARAALPAGERPDDLLERMLTCADPVTGERLSARNVRYQLATFLIAGHET
TSGLLSFAVHRLLAHPEVLRKAKDAVDGVLGDRVPAFEDLARLDYLGQVLRETLRLHP
TAPAFALAPDEPAELGGHAIGAGEPVLVMLPTLHRDPAVWRDPDVFDPERFAPERMDE
IPACAWMPFGHGARACIGRPFALQEATLVLALVLQRFDLALADPDHRLTIKQTLTLKP
DSLVVRARPRADRPGATATVETVVPHQVPATHRHGTPLHVFYGSNGGSGEGLARTIAG
DGAARGWATSVAPLDDAVRALPASGPVVIVSSSYNGAPPDNAAHFVRWLTQDGPDLSG
VDYLVLGCGNLDWSATYQRVPTLIDEAMAAAGARRLRERGATDARADFFGDWERWYEP
LWPLLSAECGVEVGEIGPRFRVVESDAADGLGDLASAVVLENRELVRGPDAGSKRHLE
LRLPDGTSYRTGDYLSVLPQNHPDLVRRAVARLGTRAERVVTVESSAPTGLVPVGRAL
RVDELLTRCVDLSAPAGAGVVARLAERCPCPPERAELAATTGATLLELLERFPSCAVD
LALALELLPAPRTRLYSISSAAEEQRAEVALTVSVTGVTSGYLSRVRPGDRVAVGIAS
PPESFRPPADNTVPVVLIAAGTGIAPFRGFLRARAALGGEPGPALLLFGCRGPELDDL
YAEEFAALGDWLEVDRAYSRHPDGEVRHVQHRLWQRRDRVRELVDAGARVYLCGDATR
VGPAVEEVLGRIGPGAGWLDALRAGGRYATDVF
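
Note on coordinates. Many entries give a nucleotide span such as complement(6501..9518). The following is a minimal Python sketch, not part of the original compilation, for turning such a span into a base count and an expected protein length. The function names span_length and expected_residues are hypothetical, and the sketch assumes a simple start..end range that includes the stop codon; joins, hyphenated ranges and comma-separated ranges used elsewhere in this file are not handled.

```python
import re


def span_length(location: str) -> int:
    """Return the number of bases covered by a simple 'start..end' location.

    An optional complement(...) wrapper is tolerated because only the
    two numbers are read. Compound locations are out of scope here.
    """
    match = re.search(r"(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"no start..end range in {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


def expected_residues(location: str) -> int:
    """Codons in the span minus one, assuming the span ends with a stop codon."""
    length = span_length(location)
    if length % 3:
        raise ValueError("span length is not a multiple of three")
    return length // 3 - 1
```

For the CYP102F1 span above, span_length("complement(6501..9518)") gives 3018 bases. Comparing expected_residues with the length of the listed translation is a quick way to spot a truncated sequence or a mistyped coordinate.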

CYP102G1    Streptomyces scabies
            SCAB5931  
            David Lamb
            Submitted to Nomenclature committee Nov. 10, 2006
            52% to 102A5 Bacillus cereus P450 part
            52% to 102A3 Bacillus subtilis, 45% to 102D1

CYP102G2    Saccharopolyspora erythraea NRRL23338
            SACE_4205, 4676800,4679985 (-) STRAND
            56% to CYP102G1 Streptomyces scabies, 51% to CYP102B2
MTQTPLHHDDVPVADVSGTGLTATPTQQAMELARRHGPVFRRRTREFQSLLVSDVDLVAE
LSDEQRFAKAVGPALENVREFAADGLFTAYNDEFNWAKAHDILMPAFALGSMRTYHPVML
RVARRLLDSWDRAAAASAPVDVPDDMTRMTLDTIGLAGFGFDFGSFGRAEPHPFVGAMVR
CLDWSMTRLSRVPGTDHSERDEAFRADARYLASVVDEVINTRAAEGDTSGEDLLGLMLGA
RHPADGTTLDAANIRNQIITFLIAGHETTSGTLSFALHYLAKNPTVLRLVQREADELWGD
SPDPEPSFEDIGRLTYTRQVLNETLRLWPSAPAFGRQARHDTVLGGRIPMRAGEAAAVLI
PMLHRSPVRGDNPELFDPARFAPEAEAARGPHAFKPFGTGERACIGRQFALDEATMVLAM
LAHRYRLVDHAGYRLKVKETLTLKPEGLTLAVRARTAADRVTNRLALPVGLPSAAPGEPA
DAARRPGRVLPGTGLLLLHGSNYGTCRDFAAQLALAAGELGCDTAVAPLDEYAGNLPSDR
PVIVVAASYNGRPTDDAVSFSRWLDEAEPGAADGVDFAVLGVGDRNWAATYQHVPTRIDA
RLAELGGTRILERGEADASGDLAGAVRRFSAALETALLERSGDPDAVAAAPEGDGPAYTV
SEVTGGALDSLAARHGMVEMTVTEVADLTAPDYPRTKRFVRLALPEGTAYRTADHLAVLP
VHDAALVERAAGVLGVDLDTVLDIRAKRPGRLTFDRSLTVRELLSHHVELQDPPTPDGLD
ALAALNPCPPERAALRGLAEEARSGTADHRTLWDLIEDHPALRDALSWSALLELLPATRP
RQYSVSSSPAVDPRHVDLMVSVLRAPARSGRGEFRGAGSRHLSEVRPGDTVLARVQPCRE
DFRVAPDEPLIMVAAGTGLAPFRGVIADRRERVANGARQAPALCYFGCDAPDADYLHSAE
LRAAESAGAVAMRPAFNEAPVGGQCFVQHRIAAEAGEVWALLESGARVLVCGDGRHMAPG
VREAFRGIYRERTPGADDASAHEWLQAMIAGGCYVEDVYAG 

CYP102H1    Nocardia farcinica IFM 10152 plasmid pNF1 
            GenEMBL AP006619 177679..179100
            51% to 102C1
MAVTTSTTSGGHSNPPLPHPKWRLPIINDLLTINPIKPTLTSLR
DAEQLGGIFERRLVDWPMIVVSDSELITEICDERNWAKHLGVPLRKMRHIARDGLFTA
RNDEPNWAKAHAVLAPAFTKEAMRSYHQTMLTTIGELLDYWAKRDGQWVDVGEDMNKL
TLEIIARTGFDYTFDSFTRSEVHPFVAAMLRGLTYISRNSNMPPFLQKTIGARAAARH
SRDITYVRTVVDDVIKARQASGTVGDHHDLLDRMLTVPDPASDELLDTTNVRSQILTM
LVAGHETSAGVLAFALYELSRRPELVAAARAEIETRFADGDLSTIAYDDVAKLRTLRR
IVDETLRLYPVAPGFFREARHETTIGGGRYRFGPGDWVLVLTLHAHRDPATWGPDAGE
FQPDRWLPERMRSLDGRQVFKPFGTGLRACIGRQFAYHEIVLALAHILHTFEFTPDPG
YELDIAEQITLKPHRFRLRLNHR

CYP102J1   Burkholderia sp. 383
           ABB05850
           50% to CYP102A4 
  1 MKSSSLVPQP PLKPVIGHLM EVLGPSPLAK MMDLARTYGP VYWFEVFGQG YYVVSGQTLV
 61 DEVCDETRFQ KCVHQSLLEL RPAIGDGLFT AFGDEPNWAK AHRVLMQAFG PLSIWSMFDK
121 MVDIADQMFL HWERFGPETP VDVSDHMTRL TLDTIALCAF DCRFNSFYRE DQHPFVDAMV
181 NTLSEAGKRE LRPKLVSKLM VKRSRQFDAD IEVMRSLATK MIEDRRKNPH VNEESMDLLD
241 RMLNGIDPVT GEKLDDENIV FQMITFLIAG HETTSGLLSF ATYFLLKNPD ILQKARDMVD
301 EVVGSETPRI EHLARLRYVE QILMETLRIW PTAPGFAVKP LADTTFGGKY AVSPDDIIMI
361 LTPMLHRDVS VWGEDVEAFR PERFAPENAE QLPPNSWKPF GNGARACIGR PFAMQEAHLV
421 LIMLLQRFDF SFADPDYELD VAETLTLKPA GFRVNVKPRA RGALKVPDTA LHARSNAQSP
481 SVAPVQSIAP GEDLSNLLVL FGGNTGSAES FARRIAGDAS RHGFHATCAP LDDFAGKLGG
541 YPAIVIVTAS YEGQPPDNAK SFVPYVEALD EGALDGVHFS VLGCGNKQWA RTYQAIPKRV
601 DEALEKAGAT RVHFRGELDS GGDFFGEFDR WYTEMWNRFA ISAGKEIPVI QHDDVALKVS
661 FAGSSREKVL NLGDMAHASI VDNRELVDIS VAGSRSKKHI ELKLPEAMTY RSGDYLAVLP
721 RNAKNNVDRV LRRFRVSWDT QVVIEGTSSN PRLPLGQAIG CGELFSSYVE LAVPATRSQV
781 SSLAAATRCP PEKVELERLS ADGFECEILG KRTTVMDLLE RFGSVDLSLE KFLDMLPALK
841 ARQYSISSSP LWKADHVTLT VAVVDAPALS GNGRHEGVAS SYLARLNTGD SLSVAVRPSN
901 ARFRPPAEPD LPMILICAGS GIAPFRGFLQ ERALQKQRGE NVGTSLLFFG IDDPDVDFLY
961 RDELDEWARC GVVEVMPAYS NRPEEGARFV QDKVWLERER ISALFSQGAT VFVCGDGKNM
1021 APAVRATLGR IYQETTGEND ESASAWIDTM EREHGRYVAD VFA
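
Note on numbered sequences. Some entries, like CYP102J1 above, are shown in a numbered layout with ten-residue blocks, while others are plain lines. The sketch below is an illustration only (the helper name strip_numbering is hypothetical). It removes position numbers and whitespace so that both layouts can be compared as plain strings. It relies on the fact that one-letter amino acid codes never contain digits.

```python
import re


def strip_numbering(block: str) -> str:
    """Remove position numbers and all whitespace from a numbered sequence block."""
    return re.sub(r"[\d\s]+", "", block)


# First two lines of the CYP102J1 entry, copied as they appear in the listing.
sample = """
  1 MKSSSLVPQP PLKPVIGHLM EVLGPSPLAK MMDLARTYGP VYWFEVFGQG YYVVSGQTLV
 61 DEVCDETRFQ KCVHQSLLEL RPAIGDGLFT AFGDEPNWAK AHRVLMQAFG PLSIWSMFDK
"""

plain = strip_numbering(sample)
print(len(plain), plain[:10])
```

The same call also handles entries that carry nucleotide coordinates at both ends of each line, since every digit is discarded.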

103 Family

CYP103A1     Agrobacterium tumefaciens
             GenEMBL M19352, AF242881 CDS 141158..142426
             gene="virH1"

CYP103A2     Agrobacterium tumefaciens
             GenEMBL AF034769
             GenEMBL AB016260 CDS 124584..125759

CYP103A3   Agrobacterium tumefaciens plasmid pTiAB2/73 vir region
           GenEMBL AF329849 892..2148
           gene = virH
           61% to 103A1
MNARGPEKVSQTSGPIISASLDPDNVSVSDLDRSGHAIFAEWRP
KRPFLRRQDGVYVLLRADDVLGLSSDPRTRQIETELMLNRGINEGAVFDFVRYSMLFS
NNEVHSRRRSPFTRTFAFRMIENLRPQVSQLTETLFQDLKELDSFNFVEEFASKLPAV
AIAGLLGLPPSDIPYFTQLVYRVARCLSPSWRDADLPDIEASAAEFKNYVQAVIDDRR
SNPRDDFLSSFIRATREAEDLSPDEGLAQLMLIVLAGTDTTKTGLTALTGQLLRHRHV
WEALLKDESLVPAAVEEGLRFEPPVGSYPRLALADIDLEGFILPKGSLLALCTMSALR
DEKHFAHPELFDIHRKQMHWHMVFGAGAHRCLGEALARLELQEGLATVLRYAPTLSIE
GEWPTVQGHGGVRRIAEMRVGFRRQI

104 Family

CYP104A1     Agrobacterium tumefaciens
             GenEMBL M19352, AF242881 CDS 142447..143670 
             gene="virH2"

CYP104A2    Agrobacterium tumefaciens
            GenEMBL AB016260 
            103A2 CDS 124584..125759 and 
            104A2 CDS 125919..127094 83% to 104A1

105A Subfamily

CYP105A1    Streptomyces griseolus
            GenEMBL M36480 (1629bp) Y18556 CDS 2447..3703
            Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
            Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
            Genes for two herbicide-inducible cytochromes P-450 from
            Streptomyces griseolus
            J. Bacteriol. 172, 3335-3345 (1990)
            Gene suaC

CYP105A2    Amycolata autotrophica
            GenEMBL D26543 (1197bp)
            Kawauchi,H., Sasaki,J., Adachi,T., Hanada,K., Beppu,T. and 
            Horinouchi,S. 
            Cloning and nucleotide sequence of a bacterial cytochrome
            P-450VD25 gene encoding vitamin D-3 25-hydroxylase
            Biochim. Biophys. Acta 1219, 179-183 (1994)

CYP105A3    Streptomyces carbophilus
            GenEMBL D30815 PIR JC4287
            Watanabe,I., Nara,F. and Serizawa,N.
            Cloning, characterization and expression of the gene encoding
            cytochrome P-450sca-2 from Streptomyces carbophilus involved in
            production of pravastatin, a specific HMG-CoA reductase inhibitor
            Gene 163 (1), 81-85 (1995)

105B Subfamily

CYP105B1    Streptomyces griseolus
            GenEMBL M36481 (1688bp) M32239
            Omer,C.A., Lenstra,R., Litle,P.J., Dean,C.R., Tepperman,J.M.,
            Leto,K.J., Romesser,J.A. and O'Keefe,D.P.
            Genes for two herbicide-inducible cytochromes P-450 from
            Streptomyces griseolus
            J. Bacteriol. 172, 3335-3345 (1990)
            Gene subC, SU-2

CYP105B2    Streptomyces tubercidicus strain R-922 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp229
            78% to 105B1

CYP105B3    Saccharopolyspora erythraea NRRL23338
            SACE_2842, se: 3099326,3100528 (+) STRAND
            55% to CYP105B2, 54% to 105Q1, 53% to 105B1 
MTSTLDESTPDYPMPRARGCPFDPPPALRDLQRETPMARVRLWDGSTPWLVTRYADQRAV
LRDSHVSADMNHPTYPRQAPGGGTLSFIGMDDPEHARLRRMVGSAFAVKNVERMRPWVQR
IVDEAVDELLAGPRPADLVEEFALPVPSLVICGLLDVPYADHAFFQSNSKTMINRDSTPE
QRSQASGRLAEYLSDLLSSKMDTRGDDLRSRLCGRIEAGDLTLRQATEMAVLLLIAGHET
TANMIALSTLLLLRHPDQLALLRESDDPDVARRAVEEMLRYLNITHGGRRRVALEDVEVA
GQRVRAGEGLVLPNEIANRDPDAFPDPDRLDITREARHHVAFGFGVHQCLGQPLARLELE
IVYRTLYRRAPRLALAAGIEDIPFKHDGFVYGVYELPVTW 

105C Subfamily

CYP105C1    Streptomyces sp.
            GenEMBL M31939 PIR S19629 (381 amino acids)
            Horii, M., Ishizaki, T., Paik, S.Y., Manome, T. and Murooka, Y.
            An operon containing the genes for cholesterol oxidase and a
            cytochrome P-450-like protein from a Streptomyces sp.
            J. Bacteriol. 172, 3644-3653 (1990)
            Gene choP

105D Subfamily

CYP105D1    Streptomyces griseus
            GenEMBL S45823 X63601 (1700bp) PIR S24750 (412 amino acids)
            Trower,M.K., Lenstra,R., Omer,C., Buchholz,S.E., and 
            Sariaslani,F.S.
            Cloning, nucleotide sequence determination and expression
            of the genes encoding cytochrome P-450soy (soyC) and 
            ferredoxinsoy (soyB) from Streptomyces griseus.
            Mol. Microbiol. 6, 2125-2134 (1992)
            PIR S35901 (412 amino acids)
            Erratum. Cloning, nucleotide sequence determination and
            expression of the genes encoding cytochrome P-450(soy)
            (soyC) and ferredoxin(soy) (soyB) from Streptomyces griseus.
            Mol. Microbiol. 7, 1024-1025 (1993)

CYP105D2    Streptomyces griseus
            GenEMBL AF071145
            84% identical to 105D1

CYP105D3    Streptomyces sclerotialus
            GenEMBL AF071149
            68% identical to 105D1

CYP105D4    Streptomyces lividans 
            GenEMBL AF072709 CDS complement(1593..2813)
            69% to 105D1 67% to 105D2 82% to 105D3 57% to 105A1 

CYP105D5    Streptomyces coelicolor 
            3StF60 [Full Sequence] Sanger cosmid 
            CDS comp(2106-3344) 98% identical to CYP105D4
            cloned and expressed by David Lamb and Steve Kelly

CYP105D6   Streptomyces avermitilis
           GenEMBL AB070949.1 69121-70371
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV412_pteD 55% to 105D1 from Streptomyces griseus,
           53% to 105D4, 54% to 105D5 (if first 17aa left off 105D5)
           Gene = pteD

CYP105D7   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV7469 73% to 105D4 from Streptomyces lividans

CYP105D8    Streptomyces tubercidicus strain I-1529 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp233
            68% to 105D7

CYP105D9   Streptomyces sp. JP95
           GenEMBL AF509565 11774..13024
           griseorhodin biosynthesis gene cluster
           55% to 105D6
           gene = grhO3
MTDTLDEPQTLADGAEDAPAYPVKRTCPYRMPPGYEELREKGPI
SRVTLWNGRTAWLVTGNDLGRRLFPDARLSSDVLDPRFPLLAPRIEAQRQQAAAPPLV
GVDDPVHARQRRMVLPSFGIRQINALRPEIQKYADDLLDTMLAKGPGVTVDLLTEYAL
PMPSAVICMLLGVPYEDHHYFDERSRHVLSSSGEEQAAQAQQAFTEILAYLDDLIVRK
QAEPGDTLLDELIARQLEEGKVDRQELAMIATVLLVSGHETTSNMIALSTMALLADPD
QLAALRADESLMPRAVDELMRFSSIGDMLMRVAKEDIEIEGHLIRAGDGVILSTMLMN
RDPGAFERPDELDIRRPAGRHVAFGYGIHQCIGQNLARAEMEIALATLFRRVPTLKLA
VPAEQVPVNAPFVLQGVSELPVTW

105E Subfamily

CYP105E1    Rhodococcus fascians
            GenEMBL Z29635 (7139bp) PIR S42052 (399 amino acids)
            Crespi,M., Vereecke,D.M., Temmerman,W.G., Van Montagu,M.
            and Desomer,J.
            The fas operon of Rhodococcus fascians encodes new genes required 
            for efficient fasciation of host plants.
            J. Bacteriol. 176, 2492-2501 (1994)
MAGTADLPLEMRRNGLNPTEELAQVRDRDGVIPVGELYGAPAFL
VCRYEDVRRIFADSNRFSNAHTPMFAIPSGGDVIEDELAAMRAGNLIGLDPPDHTRLR
HILAAEFSVHRLSRLQPRIAEIVDSALDGLEQAGQPADLMDRYALPVSLLVLCELLGV
PYADRDELRDRTARLLDLSASAEQRAVAQREDRRYMATLVTRAQEQPGDDLLGILARK
IGDNLSTDELISIISLIMLGGHETTASMIGLSVLALLHHPEQAAMMIEDPNCVNSGIE
ELLRWLSVAHSQPPRMAVTEVQIAGVTIPAGSFVIPSLLAANRDSNLTDRPDDLDITR
GVAGHLAFGHGVHFCLGHSLARMTLRTAVPAVLRRFPDLALSPSHDVRLRSASIVLGL
EELQLTW

CYP105F1    Streptomyces lavendulae 
            GenEMBL AF127374 CDS 2006..3229
            48% to 105C1 42% to 105B1 40% to 105D1 new subfamily in 105

CYP105F2   Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           85% to 105F1
           clone name SP8812

CYP105G1    Amycolatopsis mediterranei 
            GenEMBL AF040571 CDS complement(5011..6066)
            49% to 105C1, 105B1 new subfamily in 105 
            looks like an insertion in the seq from 80-120

CYP105H1    Streptomyces noursei ATCC 11455 nyst 
            GenEMBL AF263912 CDS comp (58637..59833)
            gene="nysN" 47% to 105B1 46% to 105A1 46% to 105D1 
            function="presumably involved in modification of the
            nystatin macrolactone ring"

CYP105H2    Streptomyces albus
            GenEMBL AF071143 
            77% to 105H1
LLIAGHETTANNIGLGVVTLLSHPQWAGDERAVEELLRLHSVAD
MVALRVAVDDVEIAGQVIRKGEGIVPLLAAANHDTEVFGCPHAFDPERSERRHVAFGY
GVHQCLGQNL

CYP105H3   Streptomyces natalensis 
           GenEMBL AJ278573 52789..53985
           pimaricin biosynthetic gene cluster.
           68% to 105H1
           gene = pimG
MTYTDPAAPETDPPAVDFPQRKPGVPFPPPDYADYRDRKGLVLS
QLSDGKRVWLVTRHEDVRAVLTSPSISSNPEHKGFPNVGNLGVPKQDQIPGWFVGMDS
PEHDRFRKALIPEFTVRRVRAMKPAIERTVDAQLDAMLAAGNTADLVADFALPIPSLV
ISALLGVPPADREFFESRTRVLVSLRSSTDDDRMAAAKDLLRYINRLVEIKQKWGGDD
LITRLLATGAIAPHEMSGVLMLLLIAGHETTANNIALGVVTLLANPQWIGDDRAVEET
LRFHSVADLVSLRVAVQDVEIAGQLIKAGEGIVPLVAAANHDENAFECPHAFDPSRSA
RHHVAFGYGVHQCLGQNLVRIEMEVAYRKLFERIPNLELAVPTDGLDIKYDGVLYGLN
ELPVRW

CYP105H4   Streptomyces nodosus 
           GenEMBL AF357202 complement(62051..63250)
           amphotericin biosynthetic gene cluster
           84% to 105H1
MTAETEMTTFAPGCPVAFPLRRPGRPFPPPEYADYRAGEGLVRS
ELPASGPVWLVTRHEDVRTVLTDPRISADPSRPGFPRARRTGGAPSQSEIPGWFVALD
PPEHDRFRKTLIPEFTVRKVRELRPAIQQIVDERIDALLAAGNSADLIADFALSVPSL
VISDLLGVPKADRDFFEAKTKVLVTLSSTDEQRDEASKALLRYLNRLIQIKGRRPGED
LISRLLQAGTMNRQELSGVSMLLLIAGHETTANNIGLGVVQLLTNPQWIGDDRIVEEM
LRYYSVADLVSFRVAVEDVEIGGQLIKAGEGIVPLIAAANHDGSVFDKPEEFNPERSA
RSHVAFGYGVHQCLGQNLVRVEMEIAYRTLFERIPTLELAVPVEELPLKYDGVLFGLH
ELPVTWS

CYP105H5   Streptomyces griseus 
           GenEMBL AJ300302 10678..11859
           Gene = canC
           72% to 105H3
MTTSPGPTVVDFPRRTPREPLPLSQYAEHRKQNGLVQTHLPNGR
PIWLVTRHEDVRAVLTHPRISANPDNEGFPNVGETMGVPKQEQIPGWFVGLDSPEHDR
FRKVLIPEFTVRRVRELRPAIERTVDERIDAMLAGGNTADLVNDFALPVPSLVISALL
GVPSADRDFFESRTRTLVAIRTSTDEERAEATRQLLRYINRLIVIKKKWRGEDLISRL
LSTGKLSDEELSGVLLLLLIAGHETTANNIGLGVVTLLSHREWIGDDRLVEELLRLHS
VADMVALRVAVDDVEIAGQTIRKGEGIVPLLASANHDTEAFGCPHAFNPERTERRHVA
FGYGVHQCLGQNLVRVEMEIAYRKLFERIPELRLAVPEDQLAYKYDGILFGLHELPVR
W

CYP105J1   Amycolatopsis mediterranei rifamycin 
           GenEMBL AF040570 CDS comp (67462..68673)
           52% to AF072709 105D4 50% to 105D1 new subfamily in 105

CYP105K1   Streptomyces tendae strain Tue901 
           GenEMBL Y18574 CDS 6325..7557
           45% to 105A3 46% to 105D1 43% to 105B1 new subfamily in 105
           gene="nikF"

CYP105K2   Streptomyces ansochromogenes
           GenEMBL AF469953  14..1246
           95% to 105K1
           note="involved in nikkomycin biosynthesis"
MTEAFDHDIPSFPMARECPMHPPAEYRELRGQEPVSRVRMPDGQ
VAWLVLKHALARKLLADPRVSADRLHPAFPGRLTAEQRAATERVRRLTTRRSMIHLDG
DEHGAHRRILTGEFSLRRIAAQRPRVQEIVDRSIDEMLAAPQPADLVEHVSQAVPSLV
ICELLGVPHEQRRDFHEWAGMLVSRSVSIQERAAASDALNDFLEALVTEKERGEPADD
LIGRLIARNRQTPVMTHDEIVGTAVMLLVAGHQTTANMISLGVVALLENPEHKARIAA
DSSLLPPAIEEMLRYFSVVENAPARVATEDIAIGGVTIRKNEGIVVSGLAADWDDEVF
GHPDRLDFERGARHHVAFGYGVHQCLGQNLARVELEIVFETLLRRVPGLSLAVPAEEL
PYKDDAGIYGIYRVPVNC

CYP105L1    Streptomyces fradiae 
            GenEMBL AF055922 CDS comp (6507..7769)
            GenEMBL AF147703 complement(2565..3875)
            Fouces,R., Mellado,E., Diez,B. and Barredo,J.L.
            The left edge of the tylosin gene cluster from Streptomyces 
            fradiae
            Microbiology (1999) In press
            tylH1
            46% to 105A1 42% to 105D1 43% to 105B1 new subfamily in 105
MSSSGDARPSQKGILLPAARANDTDEAAGRRSIAWPVARTCPFS
PPEQYAALRAEEPIARAELWDGAPVWLISRQDHVRALLADPRVSIHPAKLPRLSPSDG
EAEASRSLLTLDPPDHGALRGHFIPEFGLRRVRDVRPSVEQIVTGLLDDLTARGDEAD
LLADFALPMATQVICRLLDIPYEDRDYFQERTEQATRPAAGEEALEALLELRDYLDRL
ISGKTGRESGDGMLGSMVAQARGGGLSHADVLDNAVLLLAAGHETTASMVTMSVLVLL
QHPTAWRELTVNPGLLPGAVDELLRYLSIADGLRRSATADIEIDGHTIRAGDGLVFLL
AAANRDEAVFSEPEAFDIHRSARRHVAFGYGPHQCLGQNLARMELEVALGAVLERLPA
LRPTTDVAGLRLKSDSAVFGVYELPVAW

CYP105L2   Micromonospora griseorubida
           GenEMBL AB089954 1490..2641
           gene cluster for the polyketide macrolide mycinamicin
           54% to 105L1
           gene = mycCI
MDRTCAWALPEQYAEFRQRATGWPAKVWDGSPTWLVSRYEHVRA
LLVDPRVTVDPTRQPRLSEADGDGDGFRSMLMLDPPEHTRLRRMFISAFSVRQVETMR
PEIEKIVDGILDRLLALEPPVDILTHLALPMSTQVICHLLGVPYEDREFFQERSELAS
RPNDDRSMPALIELVEYLDGLVRTKTAHPDTGLLGTAVTERLLKGEITHQELVNNAVL
LLAAGHETSANQVTLSVLTLLRHPETAAELREQPELMPNAVDELLRYHSIADGLRRAA
TADIVLGDHTIRAGDGLIILLSSANHDGNTFGAEATFDIHRPARHHVAFGYGPHQCLG
QNLARLEMEVTLGKLFRRVPALRLAQEPDALRVRQGSPIFGIDELLVEW

CYP105M1    Streptomyces clavuligerus clavulanic 
            GenEMBL AF200819 CDS 136..1359
            GenEMBL AY034175 CDS 200..1423
            GenEMBL U87786 CDS 13810..15036 
            function="involved in clavulanic acid biosynthesis"
            48% to 105B1 42% to 105A1 41% to 105D1 new subfamily in 105
MNEAAPQSDQVAPAYPMHRVCPVDPPPQLAGLRSQKAASRVTLW
DGSQVWLVTSHAGARAVLGDRRFTAVTSAPGFPMLTRTSQLVRANPESASFIRMDDPQ
HSRLRSMLTRDFLARRAEALRPAVRELLDEILGGLVKGERPVDLVAGLTIPVPSRVIT
LLFGAGDDRREFIEDRSAVLIDRGYTPEQVAKARDELDGYLRELVEERIENPGTDLIS
RLVIDQVRPGHLRVEEMVPMCRLLLVAGHGTTTSQASLSLLSLLTDPELAGRLTEDPA
LLPKAVEELLRFHSIVQNGLARAAVEDVQLDDVLIRAGEGVVLSLSAGNRDETVFPDP
DRVDVDRDARRHLAFGHGMHQCLGQWLARVELEEILAAVLRWMPGARLAVPFEELDFR
HEVSSYGLGALPVTW

CYP105N1    Streptomyces coelicolor 
            St4C2 [Full Sequence] Sanger cosmid 
            CDS 29986-31221 45% to 105A1 new subfamily in 105
            cloned and expressed by David Lamb and Steve Kelly

CYP105N2    Streptomyces glaucescens cytochrome P450
            GenEMBL AF071144
            95% to 105N1 only 5 aa diffs
            57% to AF071148 56% to AF071146 59% to 105D3 54% to 105A3 
LLIAGHETTTSMIALSTLLLLDRPELPAELRNDPDLMPAAVDEL
LRVLSVADSIPLRVAAEDIELSGRTVPADDGVIALLAGANHDPEQFDDPERVDFHRTD
NHHVAFGYGMHQCLGQNL

CYP105N3   Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           91% to 105N1
           clone name SP0881

CYP105P1   Streptomyces avermitilis
           GenEMBL AB070949.1 67376-68575
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV413_pteC low 40% range to 105 subfamilies 
           Gene = pteC

CYP105P2   Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           92% to 105P1
           clone name SP7863

CYP105Q1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV1611 49% to 105B1 from Streptomyces griseolus 
           46% to 105D4 and D5

CYP105Q2   Streptomyces sp. 
           GenEMBL BD133549
           78% to CYP105Q1 
  3 LIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHSGLRRVA 182
183 KGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGFGTHQC 350

CYP105Q3   Streptomyces sp.
           GenEMBL BD133546 
           77% to 105Q1
 139 MADTLTDAAPDTDGRVPEYPMPRATGCPLAPSPAAAELRGDRPITRVRIWNGSTPWLITR 318
 319 HADQRTLLTDPRVSNDDHEPDFPHVNAHRAAIAPHTPKLITNTDAPEHTRLRRSVNAPFL 498
 499 VKRIEAMRPAVQKIVDDLIDDMLAGPSPADLLTALALPVPSLVIAELLGVPYEDHHFFQE 678
 679 NSNRVLDNSLTAEEAQESSRALGGYLDTLFRTKLEQPGEDVLSEMGSKVKAGEMTHQEAV 858
 859 SMGVAMLIAGHETTATMISLGTLALFEHPDQLAVLRDTEDPKVVAGAVDELLRYLSIVHS 1038
1039 GLRRVAKGDIEIDGRLIRKGDGLLFDLQTANWDPNAFPGAERLDLARPARQHNAFGYGPH 1218
1219 QCLGQNLARLELQVVYGTLYRRVLTLRPAVPVDQLAFNHTGTTYGVKCLPVTW 1377

CYP105Q4   Mycobacterium marinum
           No accession number
           Tim Stinear
           MM4762
           52% to 105Q1

CYP105Q4   Mycobacterium ulcerans
           No accession number
           Tim Stinear
           98% to 105Q4 M. marinum = ortholog

CYP105Q5   Streptomyces scabies
           SCAB11341
           David Lamb
           Submitted to Nomenclature committee Nov. 10, 2006
           92% to 105Q3 Streptomyces sp.

CYP105Q6   Mycobacterium vanbaalenii PYR-1 
           ZP_01205830.1, EAS26822.1 
           50% to 105Q3, 86% to CYP105Q7
MTETLAQEAVSVPEYPMERTAGCPFAPPQQMLEMNQVKPLSRVRIWNGTTPWLVTGHEVARTLFADSRVS VDDRREGFPHWNEHMLSTVDKRPRSVFTSDAEEHTRFRRMLSKPFTFRRVEALRPVIQQVTDECIDEILA GPQPADMVAKLALPVPTRVISDMLGVPYEDHEFFQEHANAGLARYAAADAMQKGAMSLHQYLINLVEEKQ AHPAEDAVSDLAERVTAGEISVKEAAQLGTGLLIAGHETTANMIGIGICALLENPEQAALLRDSDDPKFI ANAVEELMRYLSIIQNGQRRVATEDIEIGGETIRAGEGIILDLAPANWDARAFPEPDKLDLTRDATQQLG FGYGRHQCVGQQLARAELQIVFHTLLRRIPTMKPAIPLEEVPFKHDRLAYGVYELPVTW

CYP105Q7   Mycobacterium smegmatis
           MSMEG4843 TIGR 
           53% to CYP105Q1, 51% to 105Q3
           86% to 105Q6 M. vanbaalenii, 76% to 105Q4 M. marinum
           Formerly CYP105T1 but more similar to CYP105Q sequences
MSETLTQPSATDIPGYPMERAAACPFAPPPQMLDMNKAKGLSRVRIWDGS 
TPWLITGHEEARALFADSRVSVDDRRPGFPHWNEHMLATVHKRPRSVFTS 
DAEEHTRFRRMLSKPFTFRRVEGLRPAIQKITDECIDAILAGPQPADIVD 
KLALPVPTVVISEMLGVPYEDHEFFQEHANAGLARYAAADAMQKGAMSLH 
QYLIDLIEKKQAEPAEDAVSDLAERVTAGELSVKEAAQLGTGLLIAGHET 
TANMIGIGILALLENPEQADFLRNAEDPKVIANAVEELMRYLSIIQTGQR 
RVAVEDIEIGGETIKAGEGIIIDLVPANWDAKAFPEPDKLDLTRDAGQQL 
GFGYGRHQCVGQQLARAELQIVFHTLLRRIPTLRLAIPLEEVPFKHDRLA
YGVYELPVAW

CYP105R1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV7186

CYP105S1   Mycobacterium smegmatis
           MSMEG0758 TIGR 
           fas1
           52% to 105E1
           76% to 105S2 Mycobacterium vanbaalenii
MTQAQALPPLHIRRDAFDPTPELGEIRAGEGVHVTVNPFGMQVYLVTRHE
DVKTVLSDHERFSNSRPPGFVLPGAPQISAEEQASNRAGNLLGLDPPEHQ
RLRRMLTPEFTIRRIKRLEPRIVEIVDAHLDAMESAGPPADIVADFALPI
PSLVICELLGVPYEDRTDFQQRSARQLDLSAPMPERLELQRQGRAYMRGL
VERSRTRPGDDILGMLVREHGTELTDDELIGIAGLLLLAGHETTSNMLGL
GVLALLRHPDQLACVRDDPDAVGPAIEELLRWLSIVSTALPRITTTDVEL
AGVTIPAGHLVFASLPAGNRDPEFIDDPDTLDIRRGAPGHLAFGHGVHHC
LGAPLARMEMRIALPALLRRFPTLALAEPFEDVRWRPFHFIYGLQSLAVAW

CYP105S2   Mycobacterium vanbaalenii PYR-1 
           ZP_01208508.1, EAS22074.1 
           52% to 105E1, 76% to CYP105S1
MSQAVRPELPPVHMRRDGFDPTPQLREIRETEGVRVITSAFGMSAYLVTRHEDVKTVLSDHTRFSNTRPP GFVVPGAPPIDEDEQARSRAGNLLGLDPPEHQRLRRMLTPEFTLRRMRRLQPRIAEIVDAQLDALAAARD GEASADLVQHFALPIPSLVICELLGVPYADRDDFQRRSARQLDLSIPIPERIELAREGRAYMGSLVAGAR TNPGDDILGMLVREHGAELTDDELVGIAGLLLLAGHETTSNMLALGTLALLRHPEQLAAVREDPDAVAPA VEELLRWLSIVHTAIPRITTTDVEIAGVSIPAGQLVFASLPSGNRDDEFIERPEVFDITRGAMGHLAFGH GVHHCLGAPLARMEMQIAFPALLRRFPTLAPAGEFDDVPFRSFHFIYGLKSLEVTW

CYP105T1    Burkholderia fungorum
            GenEMBL NZ_AAAJ02000095 
            8366..9610 gene = Bcep2217
            44% to 105H1
MRKTMTSAINDVRPQTTSTFPFARTGSPLHPPAEYARYRDGQPV
TRVQMWDGRYAWIFTRMEDVKAVLSSPHFSVVPSKPGYPFLTPARAATVKSYQTFITM
DPPDHTRFRRMLTRDFTQKRMEELRPQIAAYVNRLIDEMLARGSPGDLVSALALKLPV
TVVSMLVGVPYEDHEDLVKWSGQRLDLEQNPTVSESAADNMLAYFDGLLQRKERDPGD
GADMLSRLVIEQIKPGHLSRLEAIHMVNLLYFAGHETTANQIALGTLSFLLDPRQRAL
LENNPGLLKNAIEEMLRFHTISHYNSCRVATADVEVGGTLIREGEGAYALIMAANRDP
AAFPAPDRFDIERPNSQEHVAFSYGLHMCLGQPLARLELQVCFEALFRRLPRLRLAVP
LEELPFKREMYVYGLHALPVTW

CYP105U1   Streptomyces hygroscopicus strain NRRL 3602
           AY179507 complement(63940..65133)
           Geldanamycin biosynthesis gene cluster
           50% to 105B1 52% to 105B2 not 105S
           gene = gdmP
MDEIRDYPESRAAACPFSPPLGYEELRERSAVTRVRMWDGSTPF
LVTGYHEARAALGDSRFSADGTHKAMPRFVKFEVPAEVFNLGRMDDPEHARIRRMLTA
NFTIRRTEAMRPMIQGIVDGLLDRLIAQGPPADLVADFAFPLPSQVIGVMLGVSDADF
AEFQQASQGVMDFTASAEEMGAALGVMVDYVARMCAAKRADPGDDLLSRLIVDQELTG
GLTQQQVVATALVLLLAGHETTANMIALSTVLLLSHPEQLARLRADAGLMGNAVDELL
RYITIVQEGTGRVATEDVEVGGVLIPGGEGVIINLPSANRDPHFADAHELDLSRPNAR
EHVAFGFGVHQCLGQTLARVELQIALETLLRRLPTLRLEVPFDDLAFLYESMNFGVAR
VPVAW

CYP105V1   Streptomyces sp. HK803
           GenEMBL AY354515 36297..37508
           Gene = plmT4 
           43% to CYP105Q1
MSQLSSELPAFPMSKAKGCPLDPPPEYAQLRSDRPVAKARLWDG
KEVWLITGYDEIRSIFTDPRISVDNTQPGYPWLSEQARTVVLTGGVKPVGRMDPPEHT
AMRRMLGQGFLVKKIQNMRGDVEALVNELIDDILAGPRPTDLVPSLAMPVPSTALGWV
LGVPPADKRLISLVPRLFDEDSGLEGAMEARAELFAYIDELITHRENQPGDDIISHLV
GYYQKGELSRVSVLTQSVTLIAAALDTTRSMITNGILALLQHPEQAAALIEDPDLVPA
AVEELLRYTVVTEFSSKRVAAADIEIAGETIKAGDGIICLISAGNRDEKVFTDPDTLD
VRRDAKQHLGFGAGIHTCIGKQLARMELEVVYGTLFRRIPELRLAVPFDQLVFRNTFD
VQGVRALPVTW

CYP105W1   Micromonospora echinospora
           GenEMBL AF497482 84045..85229
           Gene = calE10
           calicheamicin biosynthetic locus
           45% to CYP105K1 47% to 105D4
MPRRCPFGPPAEYARLRTERPVARLPMLGGNTAWVVSRYADVKR
VLSDPRMSADRRRAGFPRFAPTTESQRQASFANFRPPLNWMDPPEHTAARRQIVDEFA
ARRVRQLRPLVERVVDEHLDAMTAGRSSADLVPSFSYPVPSRVICEMLGVPYGEHAFF
ERRSTRMLSRGVPADERARCAREIREFLDGVVTDKERHPGDDVLSRLLAAQRAAGEPD
HEAVVSMAFVLLVAGHVTTSNMISLSVLALLTHPERLARLRAEPDRFPAAVEELLRYF
TIVEAATARTATADVTVGGVTIRAGEGVVALGQAANRDPAAFDRPDEFDPDRDARHHL
AFGYGRHICPGQHLARLELDVALSRLVRRLPGLRLTVDVDDLPLKEDGNIFGLHALPVAW

CYP105X1   Pseudonocardia autotrophica same as Amycolata autotrophica
           GenEMBL AF525299 2766..3974
           Gene = pauC
           P-450 gene cluster
           49% to 105A3
MAEDTLGQDFPMQRQCPFEPPKEYERLRAEQPISRVRMPDGTPA
WLVTLHEDVRTVLASPAFSSDLAHPGMPAVNPEIRTIARQQRPPFSRMDPPEHSFFRR
MLIPEFTVKRTKTLRAGIQSVVDGLIDDLLRKSPPVDLVDEFALPVPSLVICQLLGVP
YSRHEFFQQQARVILSRQSTREQVGAAFTALRAYLDTLVEEKLHTPGDDLTSRLATEH
LEPTGDVRRQDLVASCMLLLTAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEA
VEELVRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDI
HRGNRRHACFGYGVHQCIGQHLARTELEVAFSTLFTRIPTLQIAAPSDELDYDHDGML
FGLHELPVTW

CYP105X2   Amycolata autotrophica same as Pseudonocardia autotrophica
           GenEMBL AF071148
           99% to 105X3 94% to 105X1 61% to 165B2
LLIAGHETTSHMISLGVTALLERPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL

CYP105X3   Micromonospora inyoensis 
           GenEMBL AF071146
           99% to 105X2 61% to 165B2 60% to 105A3
LLIAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEAVEEL
LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN
RRHVAFGYGVHQCLGQNL
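
Note on percent identity. The identity values quoted in these entries were presumably taken from sequence alignments. For two fragments of equal length that need no gaps, such as CYP105X2 and CYP105X3 above, a position-by-position count is enough. The sketch below is illustrative only (the function name percent_identity is hypothetical) and performs no alignment, so it should not be used on sequences that differ in length or contain insertions.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two equal-length, ungapped sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length (no alignment is performed)")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


# Fragments copied from the CYP105X2 and CYP105X3 entries, line breaks removed.
cyp105x2 = (
    "LLIAGHETTSHMISLGVTALLERPDQLAALQNDLTLLPEAVEEL"
    "LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN"
    "RRHVAFGYGVHQCLGQNL"
)
cyp105x3 = (
    "LLIAGHETTSHMISLGVTALLEHPDQLAALQNDLTLLPEAVEEL"
    "LRYLSIADYVPSRVALEDVVIGGTVIRAGEGVVPLLAAADWDPKVFDNPGTLDIHRGN"
    "RRHVAFGYGVHQCLGQNL"
)

print(round(percent_identity(cyp105x2, cyp105x3)))
```

The two fragments differ at a single position out of 120, which agrees with the 99% figure given in both entries.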

CYP105Y1    Rhodococcus sp. RHA1 
            No accession number
            Marianna A. Patrauchan
            Rha04313
            Submitted to nomenclature committee 12/13/04
            48% to CYP105X1, 46% to 105D7 

CYP105Z1    Streptomyces scabies
            SCAB17851
            David Lamb
            Submitted to Nomenclature committee Nov. 10, 2006
            49% to 105B1 Streptomyces griseolus
            51% to 105B2 Streptomyces tubercidicus possible ortholog

CYP105AA1   Streptomyces tubercidicus strain R-922
            GenEMBL AY549204
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Jungmann,V., Molnar,I., Hammer,P.E., Hill,D.S., Zirkle,R.,
            Buckel,T.G., Buckel,D., Ligon,J.M. and Pachlatko,J.P.
            Biocatalytic conversion of avermectin to 4'-oxo-avermectin:
            characterization of biocatalytically active bacterial strains and
            of cytochrome p450 monooxygenase enzymes and their genes
            Appl. Environ. Microbiol. 71 (11), 6968-6976 (2005)
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp230
            56% to CYP105AA2
            formerly 105S1, but that name was already assigned to 
            a Mycobacterium smegmatis sequence (my error).

CYP105AA2   Streptomyces tubercidicus strain I-1529
            GenEMBL AY549201
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Jungmann,V., Molnar,I., Hammer,P.E., Hill,D.S., Zirkle,R.,
            Buckel,T.G., Buckel,D., Ligon,J.M. and Pachlatko,J.P.
            Biocatalytic conversion of avermectin to 4'-oxo-avermectin:
            characterization of biocatalytically active bacterial strains and
            of cytochrome p450 monooxygenase enzymes and their genes
            Appl. Environ. Microbiol. 71 (11), 6968-6976 (2005)
            Submitted to nomenclature committee June 2, 2003
            Clone name Cyp234
            56% to CYP105AA1
            formerly 105S2, but that subfamily was already assigned to 
            a Mycobacterium smegmatis sequence (my error).

CYP105AB1   Saccharopolyspora erythraea NRRL23338
            SACE_3429, se: 3784062,3785189 (+) STRAND, 
            52% to CYP105W1, 51% to 105K1
LRSQEPVKRVRTIGGGTAWLVTRHEDVRRVLSDPRMSSDRTMPGFPSLVPGRRAIVAENK
QAMIGMDGQEHAEARRAVIGEFTVRRINRMRPRIQEIVDECVDRMLAAGGPVDLVRELSL
PVPSLVICELLGVPYSDHDFFQSRSALMISRSTPPERRRDVVLELRRYLDELVAEKVREP
ADDLLGRQVAQQSEKGEVDREGLVSLAFLLLIAGHETTANMISLGTLALLDNPDQLARIT
EDPARTPAAVEELLRYFSIVDGATSRTALADIEIGGVLIREGEGVVAVGLSANRDPEAFD
SPDELDLDRQARNHVAFGFGAHQCLGQNLARVELQIVFDTLFRRIPGLRLADGLDGIRFK
DDALVYGAHEMSVTW

CYP105AB2    Salinispora tropica (marine actinomycete)
             Strop_1339 complement(1505346..1506596)
             57% to CYP105AB1 Saccharopolyspora erythraea
             48% to CYP105K1 Y18574.1 Streptomyces tendae
MTETASIATTRTASGQLTDAEFPVQRGCPFTTPTEYEQIREESSIAKVRLKNGGEAWWIA
GHELGRSVLADRRFSSDRRRDNFPFVSTDPETRAQLQSQPTSMLGMDGAEHAQTRRALMG
EFTVRRMAGLRPRIQQIVDQHIDEMLATPQRSVDLVEALSLPVPSLVICELLGVPYADHD
FFQGLTGPLLRHTTPPEVRLRIQEELNTYLGTLIDRKLTDPTDDLLSRQIAKHRDNGTFD
RASMVSLAFLLLVAGHETTANMISLGVVGLLQHPDQLVIIKDDPDKTPLAVEELLRYFTI
ADSVTARVATEDVQLGDTTINAGDGVVISGLAADRDPTVFAEPDRLDLERGARHHVAFGF
GPHQCIGQTLARMELRIVFDTLFHRIPTLRLAAPLDDIPFKSDAFVYGIEELPVAW

CYP105AC1   Saccharopolyspora erythraea NRRL23338
            SACE_4243, se: 4724009,4725226 (-) STRAND, 
            52% to CYP105AA2, 52% to CYP105AA1
MQKHAPHNADDVLESLPRDRPSGCPFDPPEGLAEIRGQRPLTRLVYGDGHVGWLATGHAV
VRAVLADRRFSSRYELMHFPVAMPGLPAQIPPAQVGDITGIDPPEHTRYRKLLTGKFTVR
RMRALTERVEQITAERLDAMQRLGPPVDLVEAYAQPIPALMICELLGVPYDRLEEFLGLV
AASGDRDLTPEEQFDAFAKIQEFVRELVPAKRAKPTDDLLSDLTTTELTDQELAGIGGLL
LAAGLDTTANILALGTFALLRNPEQIAALREGDADRAVEELLRYLSIAHTGMRSALEDVE
IDGTLIRAGETVTLSIQAANRDPRRFTDPDALDLRRHAAGHLSFGHGIHQCLGQQLARVE
MRVAFPALFNRFPGLRLAVAPEEVPLRGDMNIYGVHGLPVTWDGA

CYP105 fragment  Streptoalloteichus hindustanus
            AF071147 
            66% to 105AA2, 63% to 105Y1
            61% to 105C1 59% to AF040570 CDS 2652..3842 (CYP166A1)
LLIAGHETTANMLALGAFALLEHPEQLAELRANPDLMPGAVEEL
MRYLSIVHIGPVRTAVADVEIEGQLIRAGESVTVSVPAANWDPAKFPEPERLDLTRRT
SGHLAFGHGVHQCLRQNL

106 Family

CYP106A1    Bacillus megaterium 
            GenEMBL X16610
            Gene BM-1

CYP106A2    Bacillus megaterium
            GenEMBL Z21972 (4317bp) PIR S32216 (410 amino acids)
            PIR S39924 (410 amino acids) Swiss Q06069 (410 amino acids)
            Rauschenbach,R., Isernhagen,M., Noeske-Jungblut,C., Boidol,W.
            and Siewert,G.
            Cloning, sequencing and expression of the genes for cytochrome
            P450meg, the steroid-15beta-monooxygenase from Bacillus
            megaterium ATCC 13368.
            Molec. Gen. Genet. 241, 170-176 (1993)

CYP106B1    Bacillus anthracis str. Ames
            Genpept AAP26480                 
            47% to 106A2 47% to 109B1
  1 MASPENVILV HEISKLKTKE ELWNPYEWYQ FMRDNHPVHY DDEQDVWNVF LYDDVNRVLS
 61 DYSLFSSRRE RRQFAIPPLE TRININSTDP PEHRNVRSIV SKAFTPRSLE QWKPRIQSIA
121 NELVKDIENC SEVDIVEQFA APLPVTVISD LLGVPTTDRK KIKAWSDILF MPYSKEKFND
181 LDAEKGIALN EFKAYLLPIV QEKRYHLTDD IISDLIRAEY EGERLTDEEI VTFSLGLLAA
241 GNETTTNLII NSFYCFLVDS PATYKEVREK PKLISKAVEE VLRYRFPVTL ARRITEDTNI
301 FGPLMKKDQM VVAWVSAANL DEKKFSQASK FNIHRIGNEK HLTFGKGPHF CLGAPLARLE
361 AEIALTTFIN AFEKIALSPS FNIEQCILEN EQTLKFLPIR LKPQ

CYP106B2P  Bacillus cereus ATCC 14579
           GenPept AAP09572  GenEMBL AE017006
           83% to 106B1 54% to CYP109B1 YjiB Z99110 Bacillus subtilis I-helix
1 MTSVITDGEI VTFSLGLLAA GNETTTNLII NSFYCFLVDS PGIYEELRKE PNLILKAIEE
61 VLRYRFPVTL TRRITALSER ESPSPLGMG

CYP106B3P  Bacillus cereus ATCC 14579
           GenPept AAP09575 GenEMBL AE017006
           87% to 106B1 54% to 106A2 C-term fragment
   LKEDTNIFGPF
 1 MKKNQMIVAW VSAANLDEKK FSQASQFNVH RTGNEKHLTF GKGPHFCLGA PLARLEAEIA
61 LTTFINAFEK IELFPSFCLE KCILENEQTL KYLPIRLKAT

107A Subfamily

CYP107A1    Saccharopolyspora erythraea
            GenEMBL X60379 Swiss Q00441 (406 amino acids)
            Haydock S.F., Dowson J.A., Dhillon N., Roberts G.A., Cortes J.,
            Leadlay P.F.
            Cloning and sequence analysis of genes involved in erythromycin 
            biosynthesis in Saccharopolyspora erythraea: sequence similarities 
            between eryG and a family of S-adenosylmethionine-dependent 
            methyltransferases.
            Mol. Gen. Genet. 230, 120-128 (1991).

            Weber J.M., Leung J.O., Swanson S.J., Idler K.B., Mcalpine J.B.
            An erythromycin derivative produced by targeted gene disruption in
            Saccharopolyspora erythraea.
            Science 252, 114-117 (1991)

CYP107A1    Saccharopolyspora erythraea NRRL23338
            SACE_0730, 825267,826481 (-) strand
            EryF 6-deoxyerythronolide B hydroxylase (6-DEB hydroxylase)
MTTVPDLESDSFHVDWYRTYAELRETAPVTPVRFLGQDAWLVTG
YDEAKAALSDLRLSSDPKKKYPGVEVEFPAYLGFPEDVRNYFATNMGTSDPPTHTRLR
KLVSQEFTVRRVEAMRPRVEQITAELLDEVGDSGVVDIVDRFAHPLPIKVICELLGVD
EKYRGEFGRWSSEILVMDPERAEQRGQAAREVVNFILDLVERRRTEPGDDLLSALIRV
QDDDDGRLSADELTSIALVLLLAGFEASVSLIGIGTYLLLTHPDQLALVRRDPSAL
PNAVEEILRYIAPPETTTRFAAEEVEIRGVAIPQYSTVLVANGAANRDPKQFPDPHRF
DVTRDTRGHLSFGQGIHFCMGRPLAKLEGEVALRALFGRFPALSLGIDADDVVWRRSL
LLRGIDHLPVRLDG

CYP107A2   Streptomyces rochei plasmid pSLA2-L
           NC_004808 complement(44847..46067)
           AB088224.1 complement(44847..46067)
           64% to 107A1
           note="ORF26 (406 aa), lankamycin biosynthesis protein
           similar to M54983-1 Saccharopolyspora erythraea
           6-deoxyerythronolide B hydroxylase, EryF CYP107A1"
MTTDAHTAVPSLDSDLFHIDQYEAYAALREREPVSKVSFIGREA
FLITRHAEAKAALGDLRLSNDFKKQPPGVELPTYHGIPEDVRPYFANNMGSNDPPAHT
RLRRLVSREFTARRVESMRTRVAQLAEHLLDGLAGERETDLVERFAYPLPITVISELL
GVEERYQGDFGRWSNEFLVIDADRVEQREHAARALVGFILELVDRRRADPGSDLLSAL
IHVHDEDEDRLSTDELASVVLILLIAGFETSVSLIAMATYLLLTHPGELAKVRADPSL
VPNAVDEVLRFLGPAEITTRGTLEPVEIGGVHIPAHSTVLIAGAAANRDPRRFPDPER
FDVTRDTGGHLSFGHGIHFCVGGPLARLEGEIALRALLNRFPGLDLAIPAEQVRWRRS
FLRGIESLPVRLGR

107B Subfamily

CYP107B1    Saccharopolyspora erythraea
            GenEMBL M83110 Swiss P33271 (405 amino acids) PIR B42606 (405 
            amino acids)
            Andersen J.F., Hutchinson C.R.
            Characterization of Saccharopolyspora erythraea cytochrome P-450 
            genes
            and enzymes, including 6-deoxyerythronolide B hydroxylase.
            J. Bacteriol. 174, 725-735 (1992)

CYP107B1    Saccharopolyspora erythraea NRRL23338
            SACE_5814, se: 6524921,6526138 (-) STRAND
MTTGEVPDLLAFDDAFAQDRHNRYARMREEPVQRIRTVNGLDAWLITRYEDVKQALLDPR
IAKDFGRTQQIIEKRLADAERRPGFSPDLGPHMLNTDPPDHTRLRKLVVKAFTARRVEGL
RPRIEQITDDLLDRLAGRSEVDLIDEFAFPLPITVISELMGVEDSRRDDFRSWTNVLVDG
SQPEAQAQASVAMVEYLTELIAKKRTEPGDDLLTALLEAVEDGDRLSEGELIAMVFLLLV
AGHETTVNLIGNCVLSLLGNPDQLAALRNDPSLLPGAIEETLRYESPVANGTFRHTAEAV
RFGDVVIPEGELVWVALGAANRDGERFEDPDRFDITRETTGHVAFGHGIHFCVGAALARL
EAQIAVGRLLERFPDLRMAASPDDLRWRFSVLMRGLEKLPVRPGA 

CYP107B2   Streptomyces sp.
           GenEMBL BD133548 
           58% to 107B1
3   LIAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDSPVGIATFRFSTE 182
183 ALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFGFGMHHC 344

107C Subfamily

CYP107C1    Streptomyces thermotolerans
            GenEMBL D30759 (3267bp complete sequence of CarA) 
            Arisawa,A., Kawamura,N., Takeda,K., Tsunekawa,N.,
            Okamura,K. and Okamoto,R.
            Cloning of a macrolide antibiotic biosynthesis gene acyA, which
            encodes 3-O-acyltransferase, from Streptomyces thermotolerans and 
            its use for direct fermentative production of a hybrid macrolide.
            Appl. Environ. Microbiol. 60, 2657-2660 (1994)

            Arisawa,A., Tsunekawa,N., Okamura,K. and Okamoto,R.
            Nucleotide sequence analysis of carbomycin biosynthetic genes
            including macrolide antibiotics 3-O-acyltransferase gene from
            Streptomyces thermotolerans.
            unpublished (1994)

CYP107C1    Streptomyces thermotolerans
            GenEMBL M80346 (2393bp C-terminal fragment of CarA)
            Schoner,B.E., Geistlich,M., Rosteck,P., Rao,R.N., Seno,E.,
            Reynolds,P., Cox,K., Burgett,S. and Hershberger,C.L. 
            Sequence similarity between macrolide resistance determinants and
            ATP binding transport proteins.
            Gene 115, 93-96 (1992)
            Note: P450 fragment called carX is equivalent to the C-terminal of CarA.

107D Subfamily

CYP107D1    Streptomyces antibioticus
            GenEMBL L37200 (1400bp)
            Rodriguez,A.M., Olano,C., Mendez,C., Hutchinson,C.R. and 
            Salas,J.A.
            A cytochrome P450-like gene possibly involved in oleandomycin 
            biosynthesis by Streptomyces antibioticus.
            unpublished (1994)

107E Subfamily

CYP107E1    Micromonospora griseorubida
            GenEMBL D16098 (2168bp)
            Inouye,M., Takada,Y., Muto,N., Horinouchi,S. and Beppu,T.
            Cloning and nucleotide sequences of a gene governing mycinamicinIV
            hydroxylation.
            unpublished (1993)

CYP107E2    Saccharopolyspora erythraea NRRL23338
            SACE_1426, se: 1577490,1578686 (+) STRAND
            58% to 107E1, 56% to 107N1
MPEPRPYPFSAAERLNLDPFYARLRAQEPMSRVKLPYGEAAWLATRYEDAKVVLADPRFS
RAAVLEKDEPRMRPGITGGGILSMDPPDHTRLRRLVAKAFTQRRVERLRPRTQEIADGLV
DRMIEHGSPADLVEEFALPLPITVICELLGVPYEDRDDFREWSDAFLSTTKLTPEQVVDY
MDRMFGYMAGLIAKRRVDPQDDLMSALIEARDEHDKLTEQEMVQLAAGILVAGHETTATQ
IPNFVYVLLTHPDQLEGLLADLDGLPRAVEELTRYVPLGVAAVFARYAVEDVELGGVTVR
AGEPVLVSASSANRDEAVFDDPDRLDLTRENNAHIGFGHGPHHCLGAQLARLELQVGLRT
LLTRLPGLRFAGGEDDVVWKEGMLVRGPSKLEVAWQSE

CYP107E3    Salinispora tropica (marine actinomycete)
            Strop_2770 complement(3139855..3141042)
            59% to CYP107E2 Saccharopolyspora erythraea
            52% to CYP107E1 D16098 Micromonospora griseorubida
MTIDQEIRKYPFCESPGIGIDPTYGLLRSTEPLARVQLPYGEVSWLATRYEDVKTVLTDP
RFSRAAAQGKDQPRTREEMTYEGIIGLDPPDHTRLRKLAGKALTARRVNAIRADAQRIAN
EYVDEMIAKGSPGDLVELFALPYPVTVICELLGVPFEDRAQFRIWTEGLTSTSEQLMVYA
EQLFGYMGKLVAQRREEPTDDLLGALVKARDEGDRLTEQELLSIAGVGLLLTGVETVSTH
IPNFVYALLTHPELMAQLRADRSLVPAAVEELLRMIPLNPAAMFPRYAVEDVTLSGITVR
AGEPVLVSLPGANRDPEVFENPETFDFTREQNPHVAFGHGPHHCLGAQLARMELQVALHT
VLDRFPDLSLADGDEGVSWKSGLLVRGPSRLLVAW

107F Subfamily

CYP107F1    Streptomyces griseus
            GenEMBL D45916 (2787bp) AB018074 CDS 341-1561
            Ueda,K. and Horinouchi,S.
            Cloning and Nucleotide Sequence of a Gene Involved in Redbrown
            Pigment Biosynthesis in S. griseus
            Unpublished (1995)

CYP107F2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV1171 55% to 107F1 
           this subfamily is on the outskirts of CYP107

107G Subfamily

CYP107G1   Streptomyces hygroscopicus
           GenEMBL X86780 (107379bp)
           complement (91764-92978)
           rapN

107H Subfamily

CYP107H1  Bacillus subtilis
          GenEMBL U51868 (10153bp) Z99119, AF008220
          coding region 7164-8351
          pimelic acid biosynthesis
          gene name bioI

107J Subfamily

CYP107J1  Bacillus subtilis
          GenEMBL Y11043 U93876, Z99117
          Belitsky, B. R., M. C. Gustafsson, A. L. Sonenshein, and C. Von
          Wachenfeldt. 
          An lrp-like gene of Bacillus subtilis involved in
          branched-chain amino acid transport. J Bacteriol. 179, 5448-57 
          (1997).
          gene name cypA 42.6% identical to 107B1
          also called yrdE

CYP107J2  Bacillus anthracis str. Ames 
           GenPept AAP26475
           58% to 107J1 cypA of Bacillus subtilis
  1 MAMKNKVGIR IEDGINLASA QFKEDAYEIY KESRKVQPVL FVNKTELGAE WLITRYEDAL
 61 PLLKDNRLKK DPANVFSQDT LNVFLTVDNS DYLTTHMLNS DPPNHNRLRS LVQKVFTPKM
121 IAQLEGRIQD IADDLLNEVE RKGSLNLVDD YSFPLPIIVI SEMLGIPKED QAKFRIWSHA
181 VIAYPETPEE IKETEKQLSE FITYLQYLVD MKRKEPKEDL VSALILAESE GHKLSARELY
241 SMIMLLIVAG HETTVNLITN TVLALLENPN QLQLLKENPK LIDAAIEEGL RYYSPVEVTT
301 SRWADEPFQI HDQTIEKGDM VVIALAAANR DETVFENPEV FDITRENNRH IAFGHGSHFC
361 LGAPLARLEA KIAITTLFER MPELQIKGNR EDIKWQGNYL MRSLEELPLT F

CYP107J3   Bacillus cereus ATCC 14579
           GenPept AAP09568            
           59% to 107J1 cypA Y11043 Bacillus subtilis
  1 MKNKVGLSIE DGINLASAQF KEDAYEIYKE SRKKQPILFV NQVEIGKEWL ITRYEDALPL
 61 LKDNRLKKDW TNVFSQDIKN MYLSVDNSDH LTTHMLNSDP PNHSRLRSLV QKAFTPKMIA
121 QLDGRIQRIA DDLISDIERK GTLNLVDDYS FPLPIIVISE MLGIPKEDQA KFRIWSHAVI
181 ASPETPEEIK ETEKQLSEFI TYLQYLVDIK RKEPKEDLVS ALILAESEGH KLSARELYSM
241 IMLLIVAGHE TTVNLITNTV LALLENPNQL QLLKDNPKLI DSAIEEGLRY YSPVEVTTAR
301 WAAEPFQIHH QTIQKGDMVI IALASANRDE TVFENPEIFD ITRENNRHIA FGHGSHFCLG
361 APLARLEAKI AITTLFNRMP ELQIKGNREE IKWQGNYLMR SLEELPLTF

CYP107J4P  Bacillus cereus ATCC 14579
           GenPept AAP09593                 
           46% to CYP107J3 in same genomic region
           47% to CYP107Y1 SAV2377 AP005030 Streptomyces avermitilis
           50% to 107H1
  1 MKEPQLQQHL EKFIQYIEAL VNEKRLNPDA DLISELVQTK EQEDKLSNNE LLSTIWLLII
 61 AGHETTVNLI SNGLLALLQH PEQMNLIREN PSLIPSAVDE LLRHSGPVMF ISRLASEDMT
121 IHGKRIPKGD LVLLSLTAAN IDPQKFTYPE TLNISREENN HLAFGAGIHH CLGAPLARLE
181 GQIALGTLLQ RLPNLRLAIK PDQLNYNHSK IRSLVNLPVV F

CYP107K1   Bacillus subtilis
           GenEMBL AL009126 Z99113 comp(76702-77832)
           polyketide hydroxylase pksS
           just over 41% identical to CYP107J1

CYP107L1   Streptomyces venezuelae 
           GenEMBL AF087022
           GenEMBL AF079139 CDS 122..1372
           pikC gene
           function="catalyzes the hydroxylation of YC-17 into
           methymycin and neomethymycin and narbomycin into
           pikromycin"
           51% to 107B1 47% to 107A1 44% to AF254925 42% to 107J1 
           41% to AL049754 new CYP107 subfamily

CYP107L2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV1987 60% to 107L1 from Streptomyces venezuelae

CYP107L3    Streptomyces tubercidicus strain I-1529
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name CypLA
            60% to CYP107L1 91% to 107L4

CYP107L4    Streptomyces tubercidicus strain R-922
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name CypLC
            61% to CYP107L1 91% to 107L3

CYP107L5   Streptomyces sp.
           GenEMBL BD133547 
           68% to 107L2 
3 LIAGHETTVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAE 182
183 PLEIGGTVIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGFGTHRC 344

CYP107L6   Streptomyces sp.
           GenEMBL BD133544 
           72% to 107L2
MGHEHVIDLGEYGPGFTENPHPVYAELRARGPVHRVRLPKHDAHHEAWLVVGYEEARAAL
ADPRLSKDGSTIGVTFLDEELIGKYLLIADPPQHTRLRGLIAREFTGRRVERLRPRVQEI
TDSLLDEMLPRGRADLVESFAYPLPLTVICELLGVPEIDRAAFRKLSTEAVAPTSGESEY
AAFVQLAAYLEELVEEKRCAPPADDLLSALIRTTDEDGDRLSPAELRGMAFILLIAGHET
TVNLITGAVHALLTHPGQLAQVRGDMSLVDAVVEETLRHEGPVENATFRFAAEPLEIGGT
VIPAGDPVLIGLAAADRDGARYPGPDRFDIHRDTRGHLAFGHGIHFCLGAPLARLEARVA
LRALLERCPGLTPDGAPGEWLPGMLIRGVRSLPVRW*

CYP107L7P   Streptomyces narbonensis
            GenEMBL AF521878  13901..14661
            desosamine biosynthetic gene cluster
            91% to 107L1
            gene= nbmL
            note="frameshift and deletion generates premature 
            stop codon and truncated protein"
MSRTHQGTTASRPVLDLAALGQDFAADPYPTYARLRAEGPAHRV
RTPEGDEVWLVVGYDTARAVLADPRFSKDWRNSATPPTEAEAALSHNMLESDPRCGPT
(deletion)
ALRADLTLLDGAVEEMLRYGGPVESATYRFPVEPVDLDGTVLPAGETVLVVLAD
AHRTPERFPDPHRFDIRRDTAGHLAFGHGIHFCTGAPLARMEARIAVRALLERCPDLALD
VSPGELFWYPNPMIRGLESLPIRWRSGREAGRRVPVEPACRP*

CYP107L8   Streptomyces sp. HK803
           GenEMBL AY354515 complement(72672..73871)
           Gene = plmS2
           56% to CYP107L6
MVTVDLSAYGPGFFTDPYPYYARLREAGPVHEIVLADGDRFWLI
VGYDEARAALADPRLAKSLDPPSEDERHVLITDPPDHTRLRRLVSREFTARRVEAMRP
RVQEITDGLLDEMVAGRRRADLVPSLGSPLPITVLCELLGVPLADREDFRGWTERVLV
PAEPDTIAWWKSRGFAQAGMALTDYLKNMIEDKRRSTPTGDLISSLLRTTAEDNDRLS
AAELHSMVFILIVAGHETTANLITNGVRALLAHPEQLAALRTDPEGLIDQAVEEMLRY
DGPVETSTKRFTLEAVRYGATKIPPGETLLVSIAATGRDPAQFERPDTFDIHRGTTGT
RSGHVAFGHGIHFCLGAGLARMESRVAILTLLRRCPDLALDIDPAGLDWLPGIRVRGV
RSLPVRW

CYP107L9   Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           62% to 107L6 before frameshift at C-term
           clone name SP0854

CYP107M1   Actinomadura hibisca 
           GenEMBL D87924 CDS complement(6299..7534)
           45% to AF127374 CDS 3226..4458 44% to AF254925 
           45% to 107D1 44% to 107G1, 107E1 new subfamily in 107

CYP107N1   Streptomyces lavendulae 
           GenEMBL AF127374 CDS 3226..4458
           50% to 107D1 52% to AF254925 47% to 107E1 new subfamily in 107

CYP107P1   Streptomyces coelicolor cosmid H10 
           GenEMBL AL049754 CDS complement(10413..11648)
           41% to AF087022 40% to 107B1 40% to 107G1 
           40% to 107D1 new subfamily in 107
           cloned and expressed by David Lamb and Steve Kelly

CYP107P2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV4539 86% to 107P1 from Streptomyces coelicolor

CYP107P3   Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           78% to 107P2 missing 156 aa at N-term
           C-term may be frameshifted
           clone name SP0887

CYP107Q1   Amycolatopsis mediterranei 
           GenEMBL AF040571 CDS complement(781..>2316)
           66% to AF040570 comp(68704..69969) 43% to 107C1 
           41% to 107B1 40% to 107A1 new subfamily in 107

CYP107Q2   Amycolatopsis mediterranei 
           GenEMBL AF040570 CDS comp (68704..69969)
           66% to AF040571 complement(781..>2316) new subfamily in 107

CYP107R1   Streptomyces maritimus 
           GenEMBL AF254925 CDS comp (18384..19589) 
           gene="encR" 
           53% to AF127374 CDS 3226..4458 49% to 107E1 new subfamily in 107
MTTHTQQLRDFPFAPPAELHMEPAFAQLREEEPISRVRLPYGGE
AWLVTRYQDIKTVLGDPRFSRAATQHAQAPRIQPDPAGEGVLMSLDPPDHTRLRKTVA
GVFTKRRVEDLRPATQRIAEELLEAMEASGAPADLVASYALPLPVTVICDLLGVPGDD
REQLRGWSDALLSTTACTPAESAAAAQAMADHFAALVSQRRRQPTDDLLGALVQTWDR
EEGLLRDEELVLLTRDLLIAGHETTASQIANCTYLLLQRPHDMDRLRTDPSAMASAVE
ELLRFIPLGSGSFRARVATEPVELCGVRIQPGDTVFAPTVAANWDPDVFAEPGRLDID
RSPNPHVAFGHGVHHCLGAQLARLELQVALGVLLRRLPRLRLAVDEAEIVWKTGMQVR
GPKTLPVKW

CYP107S1   Pseudomonas aeruginosa
           NZ_AABQ07000001
           NC_002516 3741011..3742267
           locus_tag = PA3331
           47% to 107B1

CYP107T1   Streptomyces coelicolor  
           StH63 [Full Sequence] Sanger cosmid 
           51% to CYP107L1 CDS 16028-17233
           cloned and expressed by David Lamb and Steve Kelly

CYP107U1   Streptomyces coelicolor 
           StE41 [Full Sequence] Sanger cosmid 
           comp(7438-8739) 44% to CYP107B1 
           cloned and expressed by David Lamb and Steve Kelly

CYP107U2   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV3536 85% to 107U1 from Streptomyces coelicolor

CYP107U3   Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           84% to 107U1 missing 90 aa at N-term
           clone name SP0819

CYP107U4   Streptomyces scabies
           SCAB54411
           David Lamb
           Submitted to Nomenclature committee Nov. 10, 2006
           90% to 107U1 Streptomyces coelicolor

CYP107V1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV3519 low 40% range with some 107 subfamilies

CYP107W1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV2894_olmB low 40% to 107 subfamilies

CYP107X1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV6249 49% to 107L1 from Streptomyces venezuelae

CYP107X2    Saccharopolyspora erythraea NRRL23338
            SACE_1158, se:1279482,1280657 (-) STRAND
            57% to 107X1
MRPVEIDDEFVTCPHAAYARLREQGPVHRAVAPDGSRVWLVTRYDDVRAALADSRLSLDK
AHATDGYRGLSLPPALDANLLNMDAPEHTRLRRTVTRAFTAHRTELLRPRVQEIADELLA
AVAGQERAELMSAFAGPLPITVICELLGVDARDRPDFRAWTDEMLAPSTPDRARDSLRSL
YAFLVDLIARKRAEPGADMPSTLVGLRDEDGSLTEDELTSTAFLVLFAGYENTVNLIGNG
LAALLARPAQLAAVRSDRGLLPSTVEELLRFDPPPQLSIRRFPKEDLEIGGVRIPAGDTV
LLSLVSAHHDPARFTSPGELIPDRADNAHLAFGHGPHFCIGAPLARMEAEVAFSTVLTRF
PALSLAVDPAELRWRPSFRNRGLRELPVRLS

CYP107Y1   Streptomyces avermitilis
           No accession number
           Submitted by David Lamb and Haruo Ikeda 9/3/02
           Clone name SAV2377 50% to 107L1 from Streptomyces venezuelae

CYP107Z1    Streptomyces rimosus ssp. paromyceticus strain R-2374 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema11
            96% to CYP107Z2v1

CYP107Z2v1  Streptomyces albofaciens strain C-0083
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema8
            96% to 107Z2v2 and CYP107Z1

CYP107Z2v2  Streptomyces rimosus ssp. paromyceticus strain BOEH-4355
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema3
            96% to CYP107Z2v1 95% to CYP107Z1

CYP107Z3    Streptomyces sp. strain IHS-0435
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema7
            76% to 107Z12

CYP107Z4    Streptomyces lydicus strain NRAB-0114 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema16
            82% to 107Z12

CYP107Z5v1  Streptomyces lydicus strain NRRL-2433 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema15
            97% to 107Z5v3

CYP107Z5v2  Streptomyces chattanoogensis DSM-40241 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema6
            1 aa diff to CYP107Z5v3

CYP107Z5v3  Streptomyces lydicus strain R-401
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema4
            100% to S. kasugaensis strain A/96

CYP107Z5v3  Streptomyces kasugaensis strain A/96
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema10
            100% to S. lydicus strain R-401

CYP107Z6    Streptomyces sp. strain I-1525 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema5
            85% to CYP107Z8

CYP107Z7    Streptomyces tubercidicus strain DSM-40261 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema17
            90% to CYP107Z8

CYP107Z8    Streptomyces platensis strain Tu-3077 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema13
            89% to CYP107Z9

CYP107Z9    Streptomyces tubercidicus strain NRAA-7027 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema12
            89% to CYP107Z8

CYP107Z10   Streptomyces tubercidicus strain I-1529 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema2
            90% to CYP107Z11

CYP107Z10   Streptomyces platensis strain I-1548
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema14
            100% to S. tubercidicus strain I-1529

CYP107Z11   Streptomyces platensis strain NRAA-7479 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema9
            92% to 107Z12

CYP107Z12   Streptomyces tubercidicus strain R-922 
            No accession number
            Istvan Molnar, Syngenta Biotechnology, Inc.
            Submitted to nomenclature committee June 2, 2003
            Clone name Ema1
            92% to CYP107Z11

CYP107AA1   Mycobacterium smegmatis
            MSMEG3142 TIGR 
            RubU (pksS)
            45% to 107B1 43% to CYP107AB1P
            41% to CYP105S2 Mycobacterium vanbaalenii
MTPYSRRDRNH
MLRLGNSFVQNPHEVYDRLRRSGPVQRVEMWGGVPVWLVTRYQEARNLLT 
DPRIGKDGAAASALFPPGTDGSIGTVLGDNMLFRDPPDHTRLRRFVTSAF 
TAHAVRRLRPTIAGFADALLDDIAASVPGQVDLLQAFAQPLPVQVIGELL 
GVPERDRELFAALVVPIFTSTDTTVLRRAQKELTQLLTDMLAEKRQSPAD 
DVLSSLVHRRDGTDQLSEAELLGTAFLLIVAGYETTVNLLANGILALLRN 
PEQLRAVRADRSLLPRAVEEALRFESPLNTATVRYTSAPVTVGDVEIPSG 
ELVVIGLLAANHDDEQFPDAHRFDVSRTHNRHLAFGYGVHHCVGAPLARM 
EAEIGFDRLLSRFEVMELVDSGPPRYRPSTLMRGVERLPVILGYPHDIAS 
TMREWSGSLPSSGEADSSFAH

CYP107AB1P    Mycobacterium smegmatis
              45% to CYP107B1
              43% to CYP107AA1
MILDEQFAQDPEGLYRMLRSEAPVCEVELIGGVRGWLVTRYADVMALLKD 
PRVSKDHTSALPRLAPDRVRPYISPQLHNHMLNLDPPEHTRLRRLVVQAF 
TPKALARMQPVIDAIADELLDDIDLRSGDEPIDLMADYAEPLPIQVIAEL 
LGVAVEYA*PFRAAVTPLLMSVTVEEKAESGRATIEILNAVIDEKIREPG 
EDLLSGMIGASVDGHGLTRDELMAMCFLLITAGYETTVNLIGNGTLALID 
NPSQLEKVRENPDLTAGAVEEILRFDGPVNIATWRYATADIDVDGVVIPA 
NEQIFLSLLSANRDTGRFENADRFDIERNTRGHIAFGHGIHYCLGAPLAR 
MEGVTAIGRIVQRYDSITLDPTAELRYHNGTLMHGLKSLPVRLTRVPQPRP

CYP107AC1   Streptomyces atroolivaceus
            GenEMBL AF484556 60948..62147
            leinamycin biosynthetic gene cluster
            48% to 107N1
            gene = LnmA
MSATRRVHIYPFEGEVDGLEIHPKFAELRETDPLARVRLPYGGE
GWMVTRYDDVRAANSDPRFSRAQIGEDTPRTTPLARRSDTILSLDPPEHTRLRRLLSK
AFTARRMGAMQSWLEELFAGLLDGVERTGHPADIVRDLAQPFTIAVICRLLGVPYEDR
GRFQHWSEVIMSTTAYSKEEAVSADASIRAYLADLVSARRAAPHDDLLGVLVSARDDD
DRLTEDELITFGVTLLVAGHETSAHQLGNMVYALLTHEDQLSLLREQPELLPRAVEEL
LRFVPLGNGVGNARIALEDVELSGGTVRAGEGVVAAAVNANRDPRAFDDPDRLDITRE
KNPHLAFGHGAHYCLGAQLARMELRVAIGGLLERFPGLRLAVPADQVEWKTGGLFRGP
QRLPIAW

CYP107AD1   Streptomyces hygroscopicus
            GenEMBL AF521896 4248..5489
            ansamycin biosynthesis gene cluster
            43% to 107X1
            gene = gdnH
MSGRHFEQGERGTAMADTPEEELRILDPQSVAQELRKHGPPRQI
TMHGTTAWLVSRYEEVRDCLGHPGMSPAAAYAASQGQTNPVSGLFEDTVAGTNPPQHT
RLRRLLAKAFTVRRVESLRPRVQEITDTLLDRIAVDGRADLVSALAIPLPMQVICELL
GVPIADRTEFHQWADLMLTPPLDPDTAARSQDASAKLWTYMEDLAEARRKAPEDDLIS
DLMSAHEDDRLSHREVVATARMMLIAGYELTGSFISNAVFSLLSQPDQMELLRKDPEL
AGRGLEELLRHAGPGILIVRFANEDVEIGSVSIRAGDQVLLDMDAAHSDPAHFTDGER
LDLTRDSAVHLQFGHGIHYCIGAPLARVEGQIALESLVRRFPGLRLSVPAAEISHSKN
PFIRSLTALPVEFEAQQPVAG

CYP107AE1   Streptomyces sp.
            GenEMBL BD133545 
            50% to 107X1
VILLKSLAANGLTASSCFTVSPLPIRSASPSIAFLTSSSERDSGVRNDRPSDAQPAIARF
RFPTPPHPRNPTQPHPTPPRPSPTDDPLQAPTFFADPYPTYARLRDTAPVLKVPTGSGGG
GRHSYVVTGYAEAREAFTDPRLSKDTASFFAGRPSQRDLHPAVSRNMLATDPPQHARLRA
LVTKAFTTGAVARLRPYISSLVDELLDTWPTHGTVDLIADLAVPLPVTVICELLGVPDSD
RASVRTWSSDLFAAGDPQRIDAASHAVGDYMTALVAAKRTAPGDSLLDDLIAVRDGQDHL
SEDELVSLAVLLLVAGHETTTNFIGNAALALLRHPESLAHLRAEPQLLGGALDELLRYDS
PVGIATFRFSTEALTLGGTEIPEGVPVLIAPGAANRDPDRFPDPDRLDLTRGATGHLAFG
HGIHRCLGAPLARAEAELALHAVITRYPQAALATPPETLPWRHTRLTRGLASLPITLRDH
PK*

CYP107AF1   Streptomyces collinus DSM2012
            GenEMBL AF293355 24259..25518
            Gene = rubU
            rubromycin gene cluster
            52% to 107B1
MARTDAPQAAPPADLFTPAFHQNPHEALAGLRRTAPAVPVMTPN
GLRTWLVTGHEHARALLADPRLSKDMRVGRDLIPRNFVDPDKQREFLAESGERSQFPH
VLSVHMLDSDPPDHTRLRRLVGRAFTARRVESLRPRITELTDELLDAMARHERLDLME
ALAFPVPFTVICWLLGVPPDDRAAFRRWSNLLVSGAGTDEVREASASMITYLTELIEA
KRNEPADDMLTDLVHARDAGDQLSSDELISMAFLLLVAGHETTVNLIGNGALALLTHP
EVREQLAADESLWPGAVEEFLRYDGPVTNATWRFTTEPVEVGSVTIPEGEFVTISIGA
AGRDPDRYPDPDRLDITRAHSGSVAFGHGIHHCLGAPLARLEGRIVLSRLFARLPGLR
LAADPDELSWRSSLMMRGLEELPVFTA

CYP107AG1   Streptomyces atroolivaceus
            GenEMBL AF484556 complement(120436..121638)
            Gene = LnmZ
            leinamycin biosynthetic gene cluster
            49% to 107E1
MSTEVETEKPAPVAYPFTGSEGLELSQSYAKLFEDGDPIRVQLP
FGEPAWLVTRYDDARFVLTDRRFSRHLATQRDEPRMTPRAVPESILTMDPPDHTRLRT
LVSKAFTPRRIESKRAWIGELAAGLVADMKAGGAPAELVGSYALAIPVTVICELLGVP
EDDRTRLRGWCDAALSTGELTDEECVQSFMDLQKYFEDLVKERRAEPRDDLTSALIEA
RDAHDRLAEPELIGLCISILIGGFETTASEISSFVHVLQQRRELWTRLCADPEAIPAA
VEELLRFVPFAANGISPRYALEDMTVGGVLVREGEPVIVDTSAVNRDGLVFDNADEVV
IDRADNRHMVFGHGAHHCLGAHLARVELQEALKALVEGMPGLRLSGDVEWKADMIIRA
PRVMHVEW

CYP107AH1  Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           50% to 107L6 missing about 42 aa at N-term
           clone name SP0749

CYP107AJ1  Streptomyces peucetius
           No accession number
           Niranjan Parajuli
           Submitted to nomenclature committee Nov. 2, 2003
           52% to 107B1 frameshifted C-term
           clone name SP0908

CYP107AK1  Streptomyces scabies
           SCAB79691
           David Lamb
           Submitted to Nomenclature committee Nov. 10, 2006
           48% to 107AF1, 47% to 107B1

CYP107AL1  Streptomyces scabies
           SCAB63301
           David Lamb
           Submitted to Nomenclature committee Nov. 10, 2006
           45% to 107AC1, 45% to 107N1, 45% to 107M1

CYP107AM1  Streptomyces scabies
           SCAB44031
           David Lamb
           Submitted to Nomenclature committee Nov. 10, 2006
           41% to 107AC1, 42% to 107E1

CYP107AN1   Bradyrhizobium japonicum USDA 110
            GenPept BAC51802                 
            NC_004463 complete genome complement(7193424..7194725)
            41% to 133B1v1 45% to 107L1
            formerly CYP107AA1, but this name was also given to 
            an M. smegmatis sequence so one had to be changed.  
  1 MVTPGSGAAI GVFVSCGNRF EVTMNEQAQP AGGDPLFNPL SPDFIRNPYP HYDRLRAIDP
 61 IHVTPFGQFV ASRHADVSLV MRDKRFGKDF VERSKRRYSE KIMDEPVFRS MSHWMLQADP
121 PDHTRLRGLV VKAFTARRVE DMRPRIQEIV DEAIDAVIDR GHMDLIEDFA FRLPVTIICD
181 MLGIPEDHRE VFYKSSRDGG RLLDPVPLTP EEIAKGNAGN MMAQMYFQQL FELRRRNPAD
241 DLTTQLVQAE EDGNKLTNEE LTANIILLFG AGHETTVNLI GNGLLALHRN PDQLALLKAR
301 PELMVNAIEE FLRYDSSVQM TGRVTLEDID DLGGRKIPKG ETVLCLLGSA NRDPAVYPDR
361 PDRLDVTRPN VKPLSFGGGI HFCLGAQLAR IEAEIAIATL LRRLPDLRID DVENPEWRPT
421 FVLRGLKSLP ASW

CYP107AP1   Streptomyces rochei plasmid pSLA2-L
            NC_004808 87725..88939
            49% to 107A1 
            note="ORF37 lankamycin biosynthesis protein"
            formerly CYP107AB1 but that name was already assigned
            to an M. smegmatis seq. (my error)
MNQPQLPEIPALNSELFHTDQYATYREILEQRPVTRVRFYDGSL
VWLVNRHEDVRAALTDPRLSNDPMKQSDIDLSAATGIPADLIEYFQRNMFRSDEPDHG
RLRKLVTREFTVRRINALRPRIRQIADDLLEKFAATGGGDLVEALARPLPLTVMCELL
GVPEEDRADFQTWSQHIVESSPEFAERNAVSYRSLFECVRSLIRRRRDEPGDDLLSAL
VDLRDVADRLSENELISTVFLLLVAGIETTVNVLGTGTFLLLTHPGELARLRADGALL
GPAVEEMLRYMAPIEITSRHTLEPVEIGGVSIDAQSTVLINLAAANRDPARFEDPQSF
RVDRNDGGHLTFGHGIHYCLGAALARAEAEVTFEALLERFPDLRLAASASDLTWRHAF
MRGPVELPVSWG

CYP107AQ1   Saccharopolyspora erythraea NRRL23338
            SACE_0125, se:152305,153462 (-) STRAND
            50% to CYP107P1
MFDTADPAFVADPYPCFAELRRRGEVHRHPGLGMAVAVSHAAASEVLRHRGLGRIWVDAQ
PAADFPAFNLLHRTSLLETEGAEHTRLRRSISAAFARGHVERVRPWVAGLADALVGGLVE
RGGGDVVEEVAAPLPVQVIAELLGVPESDRNLLRPWSNAIVKMYEPGLPERRRAAAESAA
AEFAEYMRALADRRRSAPADDMVSDLVAAEELSADEVVGTAVLLLMAGHEASVNLVANGV
LALLRHPGQWRRLVDDPGLVPTAVEELIRYDSPLQLFERTAVEDVVVAGHRVAAGSKIAA
LLGAAARDPEVFESPDVLDVGRQPNPHLGFGAGIHYCLGAPLARVEAAAALSALVRLAPR
LEQAGEPVRRPEFVIRGLRELPVSV

CYP107AR1   Saccharopolyspora erythraea NRRL23338
            SACE_0651, se:714930,716198 (-) STRAND
            44% to CYP107Y1
VQPDQSPTPRPESRHSPAAACPHAAVHREERPGGLVTWQISAFSEARAALGDSRFSKDPR
RLGEALRAGGRSMFAEYGDNLLDNLLNSDPPDHTRLRRLVGKAFSPATIERLGPTTQRLA
DELVASMLPAGRADLLAQFAYPFAFGVIARVLGLPPDSYRIFQRWTESMTAPREQGTDRM
VAARHLCEHVTELVRRSREWLASAPAETLLDELVSARDDGDRLSENELVATVLLLIIAGH
ETTVNLIGNGVHALLQHPGQLALLRDHPDLIDGAVEELLRFQPPISKTTLRVTTTDVEVA
GTEIPAGSIVNVLVPAANRDQRQFPDADRLDITRPPSAHMSFGHGIHYCIGAPLARMEGR
IAIGALLRGLPGLRLAEPAAEIPWRASNILRGLQRLPVRFDSAGADDEVRDRRHAGAARV
HA

CYP107AS1   Saccharopolyspora erythraea NRRL23338
            SACE_2922, se: 3203418,3204656 (+) STRAND
            48% to CYP107D1
MTESIQRDADSQEAACPHARAYPFGDPGALDLDPDYARVRDEEPLTRIRMPYGEQGWLVT
RYDDVRTVLADPRFSRSEVLKRDVPRPTPQQVERPGTLVTTDPPEHGRLRRLVAGAFTHR
RAESMRPRIRGVVDELVDEMLAGDKPADLVAAVSMPLPVNVICELLGVPREDRHVFHSGA
VLSDYTVPADEREATFKSLADYLAVLIAERRARPEGPGDDVLGALISARDTDGDRLSEDE
LIELWVDILVAGYASIMSVTPDMVVTLLTERDRWDGLVADPGGVPDAVEEMLRVMPTIIE
SGHSRVATEDVEIAGGTVRAGEAVLPCLPAANRDPAVFDAAEEMRLDRDAGKHIAFGFGP
HYCLGASLARVQLQSVLTALVRKVPTLDLVEAVREDSTRVAAAVQGQLLVTW 

CYP107AT1   Saccharopolyspora erythraea NRRL23338
            SACE_4142, se: 4611465,4612715 (-) STRAND
            56% to CYP107AT2, 47% to CYP107AU1
MSVGDIDTPSGEFDFAANLLPFDPLDPAFQADPYPFFRLVRETAPALCTQPGMWVVTGFR
ECSAVLRNPKFGHGDGRLVASQITHDAEGNVVRPFVFMDPPDHTRIRSLVTKAFSARMVE
RLRPTAERLVGELLAAAMSGPADEPVDLMAELAFALPSNLISELLGMPPQDKPLFEQWSS
ALGRGLDPDFMLSPEEMQRRDQARTEFDGYFAELARRRRAEPADDLVSALVAVEEDGRNL
SMSELVSTCRLLLSAGYLSTAHLIGNGVNALLRHPEQFEWFRAHPDQVAGVVEELLRYDS
PVQTAGMRTALQDTEIGDQPVSAGEGAMLLVGAANRDPAAFPDPDRLDVSRKPERNLGFG
IGAHFCVGAPLARLTTQVALTALAGLRVELATDDAPRINNLVLRGFAELPVFLRAA 

CYP107AT2   Saccharopolyspora erythraea NRRL23338
            SACE_4144, se: 4614537,4615769 (-) STRAND
            56% to CYP107AT1, 47% to CYP107AU1
MSTTEGAPPVDSNVVRQLLLFDPFDPEFRADPHRVYREIRESGPVTATPGGLWLVSGHRQ
VSAVLRDQAFGWGEAELAAGHFTTDDEGNTVRPLTFADPPEHTRIRSLVTSAFSARIVER
LRPRAQELARESLAAALAGGGSADVIQQVAYPLTGRLLCELLGVDPEYQERFRAWAEAMG
RGLDPDFMQSPDQLARREEARAHFHEYFAELAARRRAEPGDDLVSALVAVEQEGDRLTAT
ELVVTCTLLLSAGYATTVHLIGNGMLALLENPDQLAWLRANPGRVGDAVEEVLRFDGPIQ
LVSRVALRDTEVDGHAVAAGSPVLLLLAAANRDPAVFDDPDRLDVSRKPGRNLGFGVGIH
FCLGAPLARLTAQAALSLLVEHELVLDGPRPAPTGSLVLRGLAELPLRSA

CYP107AU1   Saccharopolyspora erythraea NRRL23338
            SACE_5309, se: 5939205,5940431 (+) STRAND
            47% to CYP107AT1, 47% to CYP107AT2
MTAGTNNAGRLGALAEAVLGYNPVDPEYHANAHEHHRRMAERGPIFRTPGGMWTAVSHAA
CSAVLRDDRFGHDPGSAAQNLFDSTQRPSVAQRSFEFMDGPDHSRLRRLVNRAFTARRVE
RLRPAVRTLADQLLTDVSGRIDVLADFILPLAMTTIVDMLGAPTEDNHLFRAWAEPIVRG
LDPDFLLSSSELAAREQANAEFAEYFDRLVALRRAEPKDDLISALIAVEDDGVVLSGNEL
ISMCLLLLAAGHESIMHLVGNGTVALLRDEDQLEHFRGHPGEVTNAVNELLRYDPPVVLL
VRTALADAEVLGNRVRRGEIVWLQIGAANRDPAVFPDPDRLDLTRDTGGSLAFGLGIHFC
IGASLARLEAAAALSALLHRDVALASEQLVHQKNVVIRGYEEVPVVLR

CYP107AV1   Saccharopolyspora erythraea NRRL23338
            SACE_5939, se: 6665300,6666601 (+) STRAND
            44% to 107L1, 43% to 107AT2, 41% to 107AN1
MTTAEPAETGSLAELNLGMRLVLHGAVTWSIARLGDPVARLLHSPWRRDPYPIYRQLRAR
GPLVRSRLGVWAASTYEVCDAVLRDRRFGVRTSDGSYGDPTAAAVGLQLSLLELDPPDHT
RLRKLAAPAFRPRKLENYRQRIEDTAHELLDRALAKGEFDLIRDFATPLPIRVICELLGL
PELGAERLAVHGAALSGALDGIRSIRHLRRMRASTLELNELFGDLIEQRRRQPGEDIVSD
LVTALDQDRLDSTELVQMCDLLLVAGFETTVNLIGNGVLALLERPDQWRLLCDDPDQAVG
VVEETLRWDPPVQTTMRVAHEPVEVAGRLLPRNSAVLPMLGAAGRDPAVHFAPDRFDITR
GTRGDHLAFSSGIHYCLGAPLARLESEIAFRCLATRVPELRRSGALVRRPTSVIHGLSAL
PVAASKATAGGRR

CYP107AW1    Salinispora tropica (marine actinomycete)
             Strop_2290 complement(2583447..2584649)
             49% to CYP107B1 M83110 Saccharopolyspora erythraea
MESVTSTSAPPPVPYIADPYPALARIRANGPVSILHSDEGIPMWVIARYRNVRAALADPR
FGQDARRAQTLADNRVAGVTLGGDVIHMLNSDPPDHTRLRHLVQGAFTARRVAAMRPLVE
RITTSLLDGVGGRQTVDLVQDFAFPLPMLVICELLGFPAEERDAYRSWSTAILTHNDDPA
AFATALRDMTDYIEVQLRHRRARPGEDLLTELLAARDAGQLTDDEIVGMVFLLLIGGHET
TVNLLGTATLALVRNPDQHRWLLANPHALSEAIDEFLRYESPVAMATLRFTTAPVTVDDV
VIPAGELVLVSLGGANRDPDRFPDADRLILDRRDTGHLAFGHGLHRCLGAFLGKLEGEVA
LGALLGRYPGLTLAAEVRQLRWRDTIMLRGLESLPVSLHG

CYP107AX1    Salinispora tropica (marine actinomycete)
             Strop_2367 2663162..2664352
             47% to CYP107X1 SAV6249 AP005046 Streptomyces avermitilis
MTSRPTAVFDQCLLRDPHSRYNALRDQAPVHHVLTPDGAPAWLVTRYNDVRAAFTDPRLS
VDKRFSGTDGEHGSSLPPELDAHLLNRDPPDHTRLRRLAAAACTPRRVADLHPAVERIVS
TLLDGLAGHDRAELIGSLASPLPLQVMHELLGLPTQANIDFRTWTNTLLSADANQPAQSR
AAMANMRRFLIEQLAHKRAQPGDDLLTGLLAAREDDDRLTDDELVAMVFLLMFAGYDNTA
ALIGTVTHALLTNAELHEAVRGGSLALDELIDEVLRWNPAFPLAVRRFAREPITIAGQTI
PAGDRIWLCLASANRDPAQFTQPDELGIIGLRRSHLSFGHGIHYCLGAPLARLQTTIAVT
SLLNRFPEMRLAVPAHDIRWRESFRLRGLIALPVYL

CYP107AY1    Salinispora tropica (marine actinomycete)
             Strop_4427 complement(5021489..5022637)
             49% to CYP107X1 SAV6249 AP005046 Streptomyces avermitilis
MSQDPTMRAELAPIPRSGARLGQEYDQLRNAGDVHQVLLPDSSLAWLVTNPKLVSRALTD
PRLALNRRHSRGSWSGFALPPALDANLLNLDAPDHTRLRRLVGPAFSPQRVSALRPRIRR
TAEHLLDTLVATSGPVDLVTGYCTPLSVQVIADLMGVPEAGRADLRTWTDTMLTSYPPDR
DAIRQAVVELHGYVVDLIDTKRQQPGDDLLSALVTIEQDGDRLTRDELTSLAFLILFAGY
ENTANLIASTVLRLLDHGGLRGVQLPEAIEETLRLEPPAPAAVRRFPTEEMTIGGATIPA
GDIVLLSIAAATRGTAGNAARLAFGNGPHFCLGAALARVEAEEALTVLARRLPDLALALP
VAQVRWRPTFRTHGPAELLVTW

CYP107AZ1    Roseiflexus sp. RS-1, complete genome.
             GenEMBL CP000686 REGION: 945215..946423
             49% to CYP107AN1 
MHHPTEPPPELWSAAAISDPYPIYDRLRAEQPIRWTGGDWQIFR
YADAQALLRDPRLGADRLQVDPQWLIASGLEPLFKTRDSMMLFADPPDHTRLRTLVHR
AFTPRVVESYRPLVQRIVDQLLDAAAARGAIELIGEFAYPLPVTVIAHMLGVPVNMHD
QFRRWSDSLAAFIGGTTRPEADVLPAALKAVLEMTDFFLALVAERRRAPRDDLLSALA
QAEDGGDRLSEQELVANSILLLLAGHETTTNLIGNGMLALMRHPDQFALLRDHPELTP
SAIEELLRYDSPVQVTSRRALTDIEFQGHRIEEGQAVTVFIGAANRDPAQYQDPARLD
VTRGDVRHLSFGHGPHYCLGAPLARLEGQVAISALVRRFPHMRTLDEQVVWRDNFALR
GLQSLHIELE

CYP107BA1    Roseiflexus sp. RS-1, complete genome.
             GenEMBL CP000686 REGION: 4471138-4472327 (-) strand
             Frameshift at 4471920
             46% to 107B1, 46% to CYP107AZ1 
4472327  MTPTIVARLASPEFLADPYPVYRQLIEQTPVFWLPHANAPGGMWCIARYDDIAFVLREAPIFKDT  4472133
4472132  SRIAPPDTLTPLDRAMLQRDPPDHTRLRRLASHAFTPRRVHDLMPRIEQI  4471983
4471982  SLDLIERIGARGEADFIADYA  4471920
4471920  PLPIIVIAELLGVPFEDHEQFSTWSDQIMAGSDSVLGGEEAARQSHQAMASLVDYFTTLI  4471741
4471740  RQRRHSPRDDLISALIAAHDAGDSLSEDELLGMCVLLLIAGHETTVNLIGNGLLTLLRH  4471564
4471563  PDQLNLLRRQSEYLTSAIEEMLRYESPVQRSTPRFAAEPFVIGGEQIEAGQQISLMFGAA  4471384
4471383  NRDPAHFSDPDRFDITRQPNPHLGFGMGIHYCLGAPLARIEARVAFTHILERLPAIRLAT  4471204
4471203  DTPAWKPVTWLRGLKSLPVLV*  4471138

CYP107BB1   Streptomyces sp. Tu6071
            ABB69746
            PlaO2
  1 MDRVLDFLSS ADSAGELQPR LVELSRQETL PRVLLADGQE AWLVTRNEDV RTVLSDRSFT
 61 RDVMGERARQ AGETPDGARS VNMDGRPHNE LRALVSKAFT VRRIEAMRPR IQAWTDELID
121 AMEETGPPAD LVAHLAVPLP ALAICELLGF PVEDRQVLSG WCERITRLGE GGPDQRAWQE
181 LSAYIARRVP VERAAARGGL APETSILTRL VHAHDSEDAL SMEELLSLTV VVLAGGLETT
241 QTAIGAGMVR LFRNPAQLDK VRADPDLVVP AVEEILRYQP VIDVNRVQVA TRTVRLGGQE
301 IRAGDLVQVS VNAANRDETV FPDSERCDVT RGPNPHLAFG YGAHHCLGAA LARLELKTAF
361 STLLRRLPDL RPAVPLESLG WRGGHVTLGL EELPVAW

CYP107BC1   Streptomyces sp. Tu6071
            ABB69764
            PlaO5
  1 MTESLETTSP DPTRSGNSDT GATPGYTVPK QVNDMWREKP VRRFSMRDGR EAWLVTGRAE
 61 VRTVLADPRF SRVEARRLDA VMSPAVIFTR PGILDMDPPE HTRLRRLVAG EFSARRMRAL
121 RPRIQQIADE LIGTMKAAGP PADLAEGLSY PLPIAVICEI LGVPYADRER FRAWADRVSA
181 PGTQPQEAMA ALRSLFDYMG GLVDDKHAHP DGSLLHGLVT ARDEQGRLDN EELVTLGCGL
241 LLAGYETTAT MLGKGLLALL DNPDQLAVVR SDPRAVPAAV SEVLRHVTPG VDPHTGLIRA
301 TTADVELGGT VIPAHSVVVA CNTAANFDPA TFRDPDRFDV TRENAAAHLT FGHGMHRCVG
361 AQLAQIELEA AFAALFPAIP GLRLAVPADE ITYTQSTLIR GLRSLPVLW
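
Note on sequence layouts: entries in this file give protein sequences in
several layouts (plain residue lines, GenPept-style lines with a leading
position number and 10-residue blocks, and lines flanked by genomic
coordinates with a trailing * for the stop codon). The following is a
minimal sketch, not part of the original compilation, of how such lines
might be reduced to a bare residue string before comparison. The function
name clean_sequence is hypothetical, and the sketch assumes residues are
always uppercase letters and that everything else on a sequence line is
layout.

```python
import re


def clean_sequence(lines):
    """Join sequence lines into one string of residue letters.

    Assumption: residues are uppercase A-Z; digits, blanks and a
    trailing '*' stop marker are layout and are dropped.
    """
    residues = []
    for line in lines:
        # Remove everything that is not an uppercase letter.
        residues.append(re.sub(r"[^A-Z]", "", line))
    return "".join(residues)


# GenPept-style numbered line (last line of the CYP107BB1 entry).
numbered = ["361 STLLRRLPDL RPAVPLESLG WRGGHVTLGL EELPVAW"]
print(clean_sequence(numbered))

# Coordinate-flanked line (last line of the CYP107BA1 entry).
flanked = ["4471203  DTPAWKPVTWLRGLKSLPVLV*  4471138"]
print(len(clean_sequence(flanked)))
```

For the coordinate-flanked line, the span 4471138 to 4471203 covers 66
nucleotides (22 codons), which agrees with 21 residues plus the stop codon.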

similar to CYP107 family   Crocosphaera watsonii WH 8501
ZP_00517718 
MTKTKKNKTQNKLVFNPFYRAFHNNPYPIYERLRNEDPIHWSFLKAWIITRYQDVDTILKDNLFQVDDLP
LRLEEKSAYLKQGNFLPLAKTIDKWLFFQQPPNHTRLRSLVNKSFSPASVGNMKEEIEAKVNHLLDKVIP
TGKMDLIDDLASPLPAMTVTNILGLPPEDYYKLIHWSYELFFVFDQPMSLEGYEKQNKMAMEAREYLLRF
IANIDENSQGLIADLVKAKDEENKLDEDEILGFCIMLLIVGQETTKSFISNSILALLQHPEKLQELKDNP
EIIKEASEELLRYDTPVQVIARLAREDVEIGGKTILKGDKVILCLGGANRDENKFPNPEKIEFQRSNRNL
PFGGGIHFCLGAFLARLQGQISINRIVQRLPNLQLVNQTPDWRESITLRGLKSLPLTFDKNDIKTD

similar to CYP107 family   Gloeobacter violaceus PCC 7421
NP_924888 
MDSVANLNQDAFGNTLPQTEAPFKFNVFDPAFHEDPYPFYDRLRRESPIYRNFMGAWVFTRYSDIKSILR
DRRFRVLDKPGWIKNKNRYLTPDQGNFDALVRSSSKFFFFLEPPDHGRLRGLITKAFSASFVDRLRPHVE
ATLADLLGKVREQGAMDIMADLACPLPAIVIARLIGVPAADYARLGHLSDELARIFDPVISLEGYLHLNA
VVEEFGSYFLDLVAEHKRQPGTDLIDSLIAAQEEGNRLSEEEVVAVCMQLFAGGEETTVNLIGNGMLALL
THPEQLELLRSKPEIIAGAVEELLRYDSSIQLVARAAIEDIEIEGCTIGAGEHVHLYLGAANRDPAQFFD
PHSLDLTRVDNRHLAFGDGIHHCFGGPLARVEGQVVFQTLVQQFPKLRLAESRRPERREGTLLRGLKTLP
VTF

CYP107 fragment  Streptomyces noursei 
          GenEMBL AF071516 CDS complement(85..>519)
          putative P450 hydroxylase gene, partial cds.
          function="may hydroxylate a macrolide antibiotic polyketide moiety"
          C-term only 63% to 107A2, 55% to 107A1
WTTPTRWSCSAPSLICLPRHGRNTAHRRTSRSHHPGRARRHPDR
DTLIPARSTVFIAGAAANRDPQKFPNPDTFDITRNTQGHLAFGYGVHHCIGRPLAQME
GEVAITALLRRFPHLHLTTPSQNLTWRRSFLRGLTALPVTLN

108 Family

CYP108A1    Pseudomonas spp.
            Swiss P33006 (428 amino acids) PIR S27653 A42971 (428 amino acids)
            Also found a PIR cross-reference to EMBL S39894 but could not 
            retrieve it
            Peterson J.A., Lu J.-Y., Geisselsoder J., Graham-Lorence S.,
            Carmona C., Witney F., Lorence M.C.
            Cytochrome P-450 terp: Isolation and purification of the protein 
            and sequencing of its operon.
            J. Biol. Chem. 267, 14193-14203 (1992)

CYP108A1    Pseudomonas spp.
            GenEMBL M91440 (6620bp)
            Hasemann,C.A., Ravichandran,K.G., Peterson,J.A. and
            Deisenhofer,J.
            Crystal structure and refinement of cytochrome P450terp at 2.3A
            resolution.
            J. Molec. Biol. 236, 1169-1185 (1994)

CYP108B1    Mycobacterium smegmatis
            MSMEG1429 TIGR
            71% to CYP108B2 68% to CYP108B3
            84% to CYP108B9 Mycobacterium vanbaalenii
            72% to CYP108B8 Mycobacterium vanbaalenii
MFLTPVKFSPSEVPD
MSTPVINDAARVLAEPRAYADEPRLHAALAELRSQTPVAYVDVPGYYPFW
AITKHADVMAIERDNELFINAPRPMLITKEKDDLAKANLAAGGGIRTLIH
MDDPLHRDIRKIGADWFRPKAMRALKERVDELAKIYVDKLVEKGPECDFV
QEVAVNYPLYVILSLLGLPESDFDRMLKLTQELFGNDDDEMGRGSSAEEL
NAVILDFFNYFTELTADRRANPTEDLASAIANAKLNGEYLNDVDCLSYYV
IVASAGHDTTSAAISGGLLALTENQDQLARLKADMSLMPLATEEIIRWSA
PVKEFMRTATRDTEVRGVPIKEGESVLLSYVSANRDEEIFENADKFDVGR
DPNKHLSFGYGVHFCLGAALARMEINSFFTELIPRLESIELAGDPEFMAT
TFVGGLKHLPIRYSVR

CYP108B2    Mycobacterium smegmatis
            MSMEG1428 TIGR 
            71% to CYP108B1
            68% to CYP108B3
            87% to CYP108B8 Mycobacterium vanbaalenii
            73% to CYP108B9 Mycobacterium vanbaalenii
MSTPTMDDAAKALADPTAYADDARLHEALARLRAENPVAWVDQAPYRPFW
AITKHADIMAIERANDLWLSAPRPLLATAEADDLGRSQQEMGIGLRTLIH
MDDPHHRKVRAIGADWFRPKAMRELKVRVDELARIYVDKMREIGPECDFV
TDIAVNFPLYVILSLLGLPEEDFGRMHMLTQEMFGGDDDEYKRGTTVEEQ
MAVLTDFFNYFSALTNSRRENPTDDLASAIANGRVDGELMSDMDTLSYYV
IVASAGHDTTKDAISGGLHALIENPGELARLKADPGLMGTAVEEMIRWST
PVKEFMRTAAEDTEVRGVPIAKGESVYLAYVSGNRDEEVFTDPFRFDVGR
DPNKHLAFGYGVHFCLGAALARMEMNSLFSELLPRLDSIELAGEPELSAT
TFVGGLKHLPIRYSIR

CYP108B3    Mycobacterium smegmatis
            MSMEG2261 TIGR
            68% to CYP108B1 68% to CYP108B2
            73% to CYP108B7 Mycobacterium vanbaalenii
            69% to CYP108B8 Mycobacterium vanbaalenii
            69% to CYP108B9 Mycobacterium vanbaalenii
MTARTIDDAAKVFAMPSAYTDEAKFHEALTHLRVNAPVSWVDVPPYRPFW 
AITRYADIMAIERANDLFTNSPRPVLMTAEEDEQQAAVGISTLIHMDDPQ 
HRVIRAIGADWFRPKAMRALKIRVDELAKIHVDKMVAAGGECDFVQEITV 
NYPLYVIMSLLGIPEADFPLMLKLTQELFGNKDDEYQRSADEGDSMAALL 
EMFQYFTELTASRRANPTDDLASAIANATVNGEPLNDIETVSYYAIVAAA 
GHDTTSATISGGMLALLEHPDQLERLRNDPSLMGTATEEMIRWVTPVKAF 
MRTAATDTVVRDVPIAAGESLLLAYPSGNRDEEVFTDPFRFDVGRDPNKH 
VAFGYGVHFCLGAALARMEINSFFAELIPRLESIELTGSPRHTATTFVGG
LKHLPVRYALR

CYP108B4   Mycobacterium marinum
           No accession number
           Tim Stinear
           MM3999
           72% to 108B3
           formerly CYP108F1 (name changed to CYP108B4)

CYP108B4   Mycobacterium ulcerans
           No accession number
           Tim Stinear
           98% to 108B4 M. marinum = ortholog
           formerly CYP108F1 (name changed to CYP108B4)

CYP108B5   Mycobacterium avium subsp. paratuberculosis K-10.
           NP_961305, AAS04688.1, 
           79% to 108B7, 75% to 108B3
MSTTTMDEAA KLLADPMAYT DEQRLHAALT HLRANAPVSW VEVPNYKPFW AITKHADVMD
IERENMLFTN WPRPVLTTAE GDEMQAAAGV RTLIHMDDPQ HRVVRAIGSD WFRPKAMRAL
KVRVDELAKI YVDKMLAAGP ECDFVQEVAV NYPLYVIMSL LGLPEADFPR MLKLTQELFG
SDDSEFKRGS SNEDQLPALL DMFGYFNGVT AARREHPTED LASAIANARV DGEPLSDIDT
VSYYLIVATA GHDTTSATIS GGLQALIENP DQLQRLRDNL DLMPLATEEM IRWVTPVKEF
MRTAAKDTVV RGVPIAAGES VLLSYVSANR DEDVFDEPFR FDVGRDPNKH LAFGYGVHFC
MGAALARMEV NSFFTELLPR LKSIELTGDP ELVATTFVGG LKHLPVRYSL A

CYP108B6   Mycobacterium flavescens PYR-GCK
           ZP_01192125.1, EAS11507.1 
           92% to 108B7, 74% to 108B3
MSVRVADEAG KVFADPTAYA DEQRLHAAMT HLRANAPVSW VDVEGYNPFW AITKHADIMA
IERDNTVFTN SPRPVLTTAE GDAQHASMGV STLIHMDDPQ HRKVRAIGAD WFRPKAMRAL
KVRVDELAKT FVDQMYDRGG ECDFVQEVAV NFPLYVIMSL LGIPESDFGR MLTYTQELFG
SDDAELQRGT TMEERGLALF DMFTYFNELT ASRRAQPTED LASAIANARI NGEPLSDIDT
VSYYLIVATA GHDTTSATIS GGLQALIENP DQLARLQQAP ELLPLAVEEM IRWVTPVKEF
MRTAQQDTEV RGVPIAAGES VLLSYPSGNR DEDVFTDPFR FDIGRDPNKH VAFGYGVHFC
LGAALARMEI NSFFSELLPR LTSVELAGRP EHIATIFVGG LKHLPIRYSL TR

CYP108B7   Mycobacterium vanbaalenii PYR-1 
           ZP_01203876.1 EAS24868.1 
           73% to CYP108B3, 73% to CYP108B9, 70% to CYP108B8
MSVRIADEAARVFADPSAYADEARLHAAMTHLRANAPVSWVEVPGYNPFWAITKHADIMAVERDNLVFTN
SPRPVLTTAEGDAQHEAMGISTLIHLDDPQHRKVRAIGADWFRPKAMRALKVRVDELAKTFVDQMYERGG
ECDFVQEVAVNFPLYVIMSLLGIPESDFQRMLTYTQELFGNDDAELQRGESMEERGLALFDMFTYFNEIT
AARRARPTEDLASAIANARIDGAPLSDIDTVSYYLIVATAGHDTTSATISGGLQALIENPDQLQRLQQNP
GLMPLAVEEMIRWVTPVKEFMRTAQQDAEVRGVKIAAGESVLLSYPSGNRDEDVFTDPFRFDVGRDPNKH
VAFGYGVHFCLGAALARMEINSFFTELLPRLKSVELAGRPEHIATIFVGGLKHLPIRYSLTR

CYP108B8   Mycobacterium vanbaalenii PYR-1 
           ZP_01205329.1, EAS26321.1 
           87% to 108B2
MSTPTMNQESQEAAKVLADPTAYADDQRLHKALAHLRANDPVAWVDHPPYRPFWAITKHADIMAIERAND
LFLSEPRPVLVTAEADDMARAQLEAGFGLRTLIHMDDPHHRKVRAIGADWFRPKAMRDLKIRVDELAKRY
VDKMRDIGPECDFVTEIAVNFPLYVILSLLGLPEEDFGRMHMLTQEMFGGDDDEYKRGATVEEQMAVLTD
FFNYFGALTASRRANPTDDLASAIANGLVDGELMSDVDTLSYYVIVASAGHDTTKDAISGGLHALVENPG
ELARLQGDLDLMPTAVEEMIRWSTPVKEFMRTAAEDTTVRGVPIAKGESVYLAYVSANRDEDIFDDPFRF
DVGRDPNKHLSFGYGVHFCLGAALARMEINSLFSELLPRLDSIELAGRPELSATTFVGGLKHLPVRYSLR

CYP108B9   Mycobacterium vanbaalenii PYR-1 
           ZP_01205327.1, EAS26319.1 
           84% to 108B1
MSTPVIDEAASDAARVLADPKAYTDEARLHAALAHLRAHAPVSYVDVPDYRPFWAVTKHSDIMAIERDNE
LWINEPRPLLTTAATDDLSQANLAAGGGIRTLIHMDDPLHRDIRKIGADWFRPKAMRDLKTRVDELAKIY
VDKMVEKGPECDFVQEVAVNFPLYVILSLLGLPESDFGRMLKLTQEMFGGDDDELTRGKSPEELHEVITD
FFRYFTALTAERRANPTEDLASAIANAKLDGEYLNDIDCLSYYVIVASAGHDTTSAAISGGMLALIENQD
QLARLKAQPELMGTAVEEIIRWTTPVKEFMRTATADTEVRGVPIREGESVLLSYVSANRDEDIFDEPAKF
DVGRDPNKHLSFGYGVHFCLGAALARMEINSFFTELIPRLESIELAGDPEYIATIFVGGLKHLPIRYSVR

CYP108C1    Saccharopolyspora spinosa strain NRRL 18395
            No accession number
            Istvan Molnar
            Syngenta Biotechnology, Inc.
            47% to CYP108B1 43% to CYP108A1

CYP108D1    Novosphingobium aromaticivorans
            GenEMBL NZ_AAAV01000137
            16805..18166 gene = Saro1710
            47% to 108B1 39% to 108C1 
MTNTSRLTKRRRPRRSDGKREGFMDSIPMVPAEVGRAVIDPKSY
GTWEPLLDRFDALRAEAPVAKVVAPDDEHEPFWLVSSFDGVMKASKDNATFLNNPKST
VFTLRVGEMMAKAITGGSPHLVESLVQMDAPKHPKLRRLTQDWFMPKNLARLDGEIRK
IANEAIDRMLGAGEEGDFMALVAAPYPLHVVMQILGVPPEDEPKMLFLTQQMFGGQDE
DMNKSGLKDLPPEQISQIVAGAVAEFERYFAGLAAERRRNPTDDVATVIANAVVDGEP
MSDRDTAGYYIITASAGHDTTSASSAGAALALARDPDLFARVKADRNLLPGIVEEAIR
WTTPVQHFMRTAATDTELCGQKIAAGDWLMLNYVAANHDPAQFPEPRKFDPTRPANRH
LAFGAGSHQCLGLHLARLEMRVLLDVLLDRVDSLELAGEPKRVNSTFVGGFKSLPMRW
KAA

CYP108E1    Ralstonia metallidurans
            GenEMBL NZ_AAAI01000348
            46192..47481 gene = Reut4024
            41% to 108B1 39% to 108A1 48% to 108C1 
MTIASDFDTELASHEIYSDPERMHEMFETLRREDPVHWTTAPGH
PPFWAVTKQADVIEVGKHPDVFIASPKSFLMNDVEQRVRIEETAATGGKLVRTMIHMD
DPDHKKYRGLTQSYFMPANIKRLESVIQERARALVGRLIEKGTSEFCSEIAVWYPLQI
VMTLLDVPESEHPYLLKLTQQFLAPKDPTLRRDGPDERGKGAVAKEYFAYFGKMLAER
RAAPLKEDLGSLIAHATVDGEPLPLMEAVSYYVILATAGHDTTSSSMCSGLYYLLTQP
GELDRLRARPELMPSAIEEMFRHGSPVKHFVRTATRDFELRGKKIQAGDEVALMYHSA
SFDEEVFDEPRSFRIDRGPNKHVAFGFGIHACLGQNLARASMRTFFTELLARTESIEV
VGKAEFIASNQVGGMKTLNIRVTPSKQSTTDRIEVAA

CYP108F1X  Mycobacterium marinum
           No accession number
           Tim Stinear
           MM3999
           46% to 108B1
           name changed to CYP108B4

CYP108F1X  Mycobacterium ulcerans
           No accession number
           Tim Stinear
           98% to 108B4 M. marinum = ortholog
           name changed to CYP108B4

CYP108G1    Caulobacter crescentus CB15
            GenEMBL AE005918 GenPept AAK24465
            NC_002696 complete genome 2703947..2705221
            Complete genome sequence of Caulobacter crescentus
            Proc. Natl. Acad. Sci. U.S.A. 98 (7), 4136-4141 (2001)
            47% to CYP108A1
            formerly 108B1 but this was already assigned to 
            an M. smegmatis seq. (my error)
  1 MTISTDIANT IIDPKAYADG DRIDQAFAHL RREAPLAVAQ PDGFDPFWVV TRHADILEVE
 61 RQNELFHNGD RATVVTTIEP DKKVREMMGG SPHLVRSLVQ MDNPDHFAYR KITQGALLPQ
121 NLRALEARIR EIARGFVDRM AEHGDRCDFA RDVAFLYPLH VIMEVLGVPE SDEPRMLKLT
181 QELFGNADPD LNRTGKSVTD VGEGVDSIQS VVMDFMMYFN AITEDRRANP RDDLATLIAN
241 GKINGEPMGH LEAMSYYIIA ATAGHDTTSS TTAGALWALA ENPDQFAKVK ADPSLIPGLI
301 EESIRWVTPV KHFMRTATAD AELGGQKIAK GDWIMLSYPS GNRDEAVFED PFTFRVDRTP
361 NKHVAFGYGA HICLGQHLAR MEMRVLWEEL FARLDHVELD GAPTRMVANF VCGPKSVPIR
421 FKMH

CYP108G2    Ectocarpus bacterium
            Genoscope Ectocarpus siliculosus brown algae project
            A bacterial genome was found with the Ectocarpus DNA
            67% to CYP108G1 Caulobacter crescentus AE005918
            Ectocarpus sctg_1 159023-157728

CYP108G3    Parvibaculum lavamentivorans DS-1 
            CP000774 (genome) ABS63163.1 (protein)
            CDS complement(1684831..1686087) locus_tag="Plav_1544"
            56% to CYP108G1 Caulobacter crescentus
MTDKTIDNAIVNPKTYAHVDEFHRLFTQLRKEEPVRWTEPDGFRPFWTVSKHADIMEVERQNDKFLNDPR
LTLQTIEVEEEVKKFTGGNSKLIRSLVDMDNPDHRNYRGLTQAWFMPPNLKAISARVEALAEKYIDRLEA
KGGECDFVSDVAVWYPLRVIMTVLGVPAEDEPIMLKLTQELFGSTDPDMKRPDATETVNTVTEFFNYFTA
MTEDRRKNPKDDVASVIANATIDGEPIGHLEAISYYIIVATAGHDTTSSTAAGGLLALMQNPEEFAKLKA
NPEGLLGGAIDEMIRWTTPVKHFFRTAAVDYELRGQKIKAGDNLLMCYWSANRDEEAFDDPFSFKIERSP
NKHLAFGYGAHLCLGQHLAKMEIRALYKELLARLDHIELAGDPAWVEASFVSGLKRLPIRYSMKRKAA

CYP108H1    Ectocarpus bacterium
            Genoscope Ectocarpus siliculosus brown algae project
            A bacterial genome was found with the Ectocarpus DNA
            49% to CYP108G1  Caulobacter crescentus CB15 AE005918
            Ectocarpus sctg_1 1535540-1536841

CYP108H2    Marine metagenome 1093018949056
            AACY022454370
            57% to CYP108H1
APPRRGPQQSAEHTETWNRVYKEFEEYYEPVIKDRQTCPREDLASLISNGKIDGCPMEHR
AQISYFIIASTAGHDTTSATLATAISVLAERPEVLEQLKANLELIPAFIEETIRWASPVK
HFLRHATQDYELRGQQIKKGDLMYLSYISGNRDEDLIEDPFEFRIDRKPNRHVAFAFGNH
ICLGQHLARLELKIMLEELLPRLESLQLTGKPKLAISDLVCGPKSVPIQYDFKRTA*

CYP108-un1   Mycobacterium smegmatis
             MSMEG4159 TIGR (pseudogene)
             47% to 108D1
             158 residues. 39% to Msmeg_CYPXXII
LTDHYAMAHPKALNDTPDTAQPGQPQTRNPTTPGDLPPLQFFARTVHAET
SLGGVQFSENQRLVMNLAAANRDPRQFDDPESFDADRPRNPHVAFGGGLH
SCQGQHIARAEMRAVLRVLLTRLPDVHLTGEVGEAGVLAGLMAVISLPVA
FTPERSQT 

109 Family

CYP109A1    Bacillus subtilis
            GenEMBL M24523 (3187bp)
            Lewis,P.J. and Wake,R.G.
            DNA and protein sequence conservation at the replication terminus
            in Bacillus subtilis 168 and W23
            J. Bacteriol. 171, 1402-1408 (1989)

            Ahn,K. and Wake,R.G.
            A unique open reading frame adjacent to the replication terminus 
            of the Bacillus subtilis W23 chromosome compared with Bacillus
            subtilis 168
            unpublished (1990)

            Ahn,K.S. and Wake,R.G.
            Variations and coding features of the sequence spanning the
            replication terminus of Bacillus subtilis 168 and W23 chromosomes
            Gene 98, 107-112 (1991)

CYP109B1    Bacillus subtilis
            GenEMBL AF015825 Z99110  
            YjiB
            also similar to CYP106A, both 106 and 109 are close 
            together on a tree

CYP109C1    Sorangium cellulosum So Ce56 (myxobacterium)
            no accession number
            Rolf Muller
            Submitted to nomenclature committee 8/5/05
            Clone name sce_040811_111
            49% to 109B1

CYP109C2    Sorangium cellulosum So Ce56 (myxobacterium)
            no accession number
            Rolf Muller
            Submitted to nomenclature committee 8/5/05
            Clone name sce_040811_8140
            69% to 109C1, 43% to 109B1

CYP109D1    Sorangium cellulosum So Ce56 (myxobacterium)
            no accession number
            Rolf Muller
            Submitted to nomenclature committee 8/5/05
            Clone name sce_040811_4257
            43% to 109C1 39% to 109A1

110 Family

CYP110A1    Anabaena sp. (a cyanobacterium)
            Swiss P29980 (354 amino acids) GenEMBL M38044 (5933bp)
            GenEMBL U38537, M13161
            Lammers,P.J., McLaughlin,S., Papin,S., Trujillo-Provencio,C. and
            Ryncarz,A.J.II.
            Developmental rearrangement of cyanobacterial nif genes: 
            Nucleotide sequence, open reading frames, and cytochrome P-450 
            homology of the Anabaena sp. strain PCC 7120 nifD element
            J. Bacteriol. 172, 6981-6990 (1990)
            This sequence was later revised to give a complete P450 sequence 
            of 448 amino acids.

CYP110A1    Nostoc sp. PCC 7120 same as Anabaena sp. PCC 7120
            GenPept BAB73407, C37842 (this entry missing N-term)
            NC_003272 complete genome 1708114..1709493
            1 aa diff to M38044
  1 MLTQLPNPIS VPSWWQLINW IADPIGFQKK YSKKYGNIFS MQLAGIGSFV ILGEPQALQE
 61 IFTQDSRFDV GRGNTLAEPL IGRTSLMLMD GDRHRRERKL LMPPFHGERL QAYAQQICLI
121 TNQIASEWQI GQPFVARSAM QKLSLEVIIQ IVFGLADGER YQQIKPLFTD WLNMTDSPLR
181 SSMLFLKSLQ KDWGTWTPWG QMKHKQRSIY DLLQAEIEEK RTKENEQRGD VLSLMMAARD
241 ENGQAMTDEE LKDELLTILF AGHETTATTI AWAFYQILKN VNVQEKLQQE LDRLGANPNP
301 MEIAQLPYLT AVSQETLRMY PVLPTLFPRI TKSSINIAGY QLEPDTTLMA SIYLIHYRED
361 LYPNPQQFRP ERFIERQYSP SEYIPFGGGS RRCLGYALAL LEIKLVIATV LSNYQLALAE
421 DKPVNVQRRG FTLAPDGGVR VIMTGKKSLK FEQSSKIFN

CYP110A2    Anabaena variabilis (a cyanobacterium)
            GenEMBL U38478 (1743bp)
            Lammers, P.J. and Duran, S.
            possible alkane/fatty acid hydroxylase

CYP110B1   Nostoc sp. PCC 7120 Same as Anabaena
           GenPept BAB75445, AC2274 
           NC_003272 complete genome complement(4523158..4524546)
           45% to CYP110A2 53% to 110E1 49% to 110D1 47% to 110C1
  1 MHLPKGPQTP VFVQVLRWVF SPMSFLEDCA KRYGDIFSVK LAKDVPAIVF LSNPKDIQQI
 61 LTNDNNQLDS PGDWNDLFEP LLGKRSVITL SGAEHQRQRQ LLMPPFHGER MRGYSQVITD
121 VTEKVISQHQ IGQPFQVRSV TQAITLRVIM QAVFGLYEGS RAEKLQHLLS DLLEKSSSPF
181 SVALLYFPSL RRDFGPIKFW GEQVQIQQQA DELIYQEIQE RRENPDPSRT DILSLLMDAR
241 DADGQPMTDV ELRDELMTLL VAGHETTATA LAWAMYWIHK LPPVKARLLE ELDSLGDNPD
301 STTIFKLPYL NAVYSETLRI YPVAMLTFAR RVIETMALGG YELPPGTPVL GSIYLTHHRE
361 DLYPEPKKFK PERFLERQFS PYEYLPFGGG TRRCLGLAFA QWEMKLALAK ILTSYELELV
421 NNSVEVRPKR RGLVTGPHRP IEMVIKSQRQ ITSRILETTT VS

CYP110B2   Nostoc punctiforme
           NZ_AAAY02000005 GenPept ZP_00111619.1
           complement(58895..60277) gene = Npun6097
           75% to 110B1
MKLPKGPQSPAVLQMLRWITSPMSFMETCAKRYGDMFTIRLDSK
SPPLIFVSKPEVLEQILTNDIKGLEAPGDTNLVFESLLGKHSVITISGAEHQRQRQLL
LPPFHGERMRSYSQIISDITEKVISQYQIGQPFNIRSVTQAITLRVIMQAVFGLDEGP
RAEKLQHCLAEMLEKGSSVLSAALLYFPALQRDFGPINFWGKQMRRQQAADKLIYEEI
RERQEQPDPSRTDILSLLMAARDEAGQPMTDEKLRDELMTLLVAGHETTATALAWAFY
WIQKIPTVRQKLLKELDSLGDNPDPSTIFKLPYLNAVCSETLRIYPVAMLTFARVVRT
PLSLGGYELEPGIGVIGSIYLTHHREDLYPEPKQFKPERFLERQFSPYEYLPFGGGAR
RCIGLAFAQLEMKLALAKILSTRELELVDNSEVRPKRRGLVTGQDRPIQMVVTSQRQV
KFPILQTATV

CYP110C1   Nostoc sp. PCC 7120 Same as Anabaena
           GenPept BAB76385, AF2391         
           NC_003272 complete genome 5587079..5588485
           48% to CYP110A2 49% to 110E1 47% to 110B1
  1 MKYQIQRPNP LKTHPFLQKL QWIADPVEYM KKASLQHPDM FTAEVIGFGD TVVFVSHPQG
 61 IQTLFANDRK KLVAVGEANR ILYPLVGNNS MFLLEGVKHK QRRQLLMPSF HGERMREYGH
121 LIRNITENLF SQLQQDVTFS ALTAMREISM QVILQAVFGF YEGERCQQFK HLLPIFLSEL
181 FQSPLASSIL FFPSLQKDLG NLTPWGRFVR QREKIDKLLY AEIAERRQEI NSDRIDILSL
241 LISARDETGD SMSDKELRDE LITLMISGHE TTGTAMAWSL YWILQTPEVF QRLIQELDSL
301 GDSPDPMSIF RLPYLTAVCN ETLRINPVAM LTLPRVVKEP IELLGNRLET STTVVGCIYL
361 THHREDLYPE SKLFKPERFL KREFSQYEFM PFGGGVRGCI GQALAMFEMK IVLATVLSRY
421 QLALADRKPE RPQRQGFTLT PTNGVKMLIT GQHKRQNYSM AASTTFNA


CYP110C2   Nostoc punctiforme
           GenPept ZP_00108280.1
           GenEMBL NZ_AAAY02000070 complement(34550..35941)
           gene = Npun2703
           60% to 110C1
  1 MQLPNILKSP SLLQKLHWVS DPIGYMENAA QEYPDIFTGK IVGFGDTVVF VNHPQAIQEI
 61 LTNDRKKFTA VGELNGILKP LLGDNSVLML ESDRHKRQRQ LVTPSFHGER MQAYGQLICN
121 VSKKIFNQLP LNKPFVARNL TKEISLQVIL QSIFGFYEGE KIQKLRQLLP LLLELFESPL
181 SSSLFLFSFL QQDLGAWSPW GNFLRVREKI DQFLYTEIAE CQQQADPERI DILSLLISCR
241 DEAGQPMTDQ ELRDQLITLI LAGYDTTATA MAWGLYWIHK QPLVCEKLLQ ELDTLGDSPD
301 PMSISRLPYL TAVCNETLRI HPVTMFSFPR VVQEPLELLG HSLEPGTILL PSIYLTHHRE
361 NLYPQSKQFK PERFIERQFS PYEFLPFGGG VRRCMGEALA LFEIKLALAT IVSHYHLALV
421 DQRPEQPQRR GFNLAPGSGV KMVMTDQRAR KESLINMTTT PLS

CYP110C3   Anabaena variabilis ATCC 29413
           YP_322498 
           95% to 110C1
MKYQIKRPNPLKTHPFLQKLQWIADPVEYMEKASLQHRDMFTAEVIGFGDTVVFVSHPQGIQTIFANDRK
KLVAVGEANRILYPLVGNNSMFLLEGVKHKQRRQLLMPSFHGERMREYGHLIRNITETLFSQLQQNVTFS
ALTAMREISMQVILQAVFGFYEGERCQQFKHLLPVFLSELFQSPLASSILFFPFLQKDLGNLTPWGRFVR
QREKIDKLLYEEIAERRQEINSDRIDILSLLISSRDETGNSMSDQELRDELITLMISGHETTGTAMAWSL
YWILQTPEVFQRLIQELDSLGDSPDPMSIFRLPYLTAVCNETLRINPVAMLTLPRVVKEPVELLGNRLES
GTTVVGCIYLTHHREDLYPESKLFQPERFLKREFSQYEFMPFGGGVRGCIGQAIAMFEMKIVLATVLSRY
QFALADGKPERPQRQGFTLTPANGVKMLITGKHQRQNYSTAASTTFTT

CYP110C4   Nodularia spumigena CCY9414
           ZP_01629302 
MSTPNRLKTPAFFQQLQWVADPVGYMEKAAQQYPDIFTAQVVGFGNNLVFVNHPQAMQEILTNDRKKLFA
GGKENKILQPLLGDYSMIMLDGDRHRKRRQLVMPSFHGDRMRSYGEIISNITEEVWSNLPTDKSFLARNV
TQDITLQVMIQAVFGVYQGERSQQLKKQLELMANIFRSPLSSSMLFFSSLQQDLGAWSPWGKFVRDRQEL
DNLIYTEIAERRQQNLENRIDILSLLMSAEDESGNPMTVQELRDELMTLLFAGYETTATALAWGLYFIQK
HPEVQEKLLQELDTLGDSPDPMSIFRLPYLTAVCNETLRIHPVAMLTFPRTVKEPVEISGYALDPGTILV
GSMYLTHQREDLYPEPKQFKPERFLERQFSPYEFIPFGGGVRRCVGEALAVFELKLVLATILSRYELALT
DDQPEVPRRRGVTLAPGRGVNMMITGQRLA

CYP110C5    Crocosphaera watsonii WH 8501
            ZP_00518945 
MKTIPTPKTPTLVQQLQWVLNPTGYLQTNHHRYPDLFKAKIIGLGNDIILISNPEIMQYILTHDRQEFTA
PSSLNTLLKPLLGDYSVVMLDGDGHRQRRQLVMPSFHGERLKVYGDLTCRITREAMEKLPENQPFLAREV
MQDISLKVIMEAVFGVTEGERYEELQYRLKELLDLFDSPITSGFLFFPSLQKDLGNWSPWGYFLRQRQAL
DKLIYAEISDRRANPDPERTDILSLLMFAKDEQGESMKDQELRDELITLLMAGHETTASAMAWALYWLHH
IPEIKDKLIEELNTLSPDAEGMDIFRLPYLTAVCNETLRLSPSAMLTFTRLAQQTVEVGGYTFKPGDIVA
GCLYLTHLREDIYANPKQFNPQRFLDHKYSAYEFIPFGGGSRRCMGEALAKFEMKLVIAIIISEYCLKLA
DTQPEKQQRRGLTLSPKRGVKMILEGKRQPQKARELELSTR

CYP110C6    Cyanothece sp. CCY0110
            ZP_01731952 
MKTIPGSKTPKLIQQLQWIFNPTKYLKTNHRRYPDIFKAKIIGFGDKMILTSRPEIMQYILTHDRKQFTS
PSGLNAILRPLLGDSSVLMLDGDRHRQRRQLVMPSFHGERLKVYGDLTCRITEEVMAKVPQNQPFLAREI
MQDISLKVIMEAVFGVTEGKRYEQLQDRLKKMLDLFNSPLTSAFLFFPFLQKDLGSWSPWGHFLRQRQAI
DELIYAEISDRKAHPDSDRTDILSLLMSAKDEQGQGMKDQELRDELMTLLTAGHETTASAMAWALYWIHH
TPEVKDKLIEELNTLSPDAEGMDIFRLPYLTAVCNETLRLSPSAMLTFTRVAQEKVEVAGYTFEPGDMIM
GCMYLTHLREDLYTNPEQFNPQRFVDRQYTPYEFIPFGGGSRRCVGEALAQFEIKLVIATIMSQYCLKLA
DTQPEKQQRRGVTLSPARGVKMILEGKRQPQPVRELELSRQ

CYP110C7    Lyngbya sp. PCC 8106
            ZP_01620519 
MKTLNSPKTSPLIQRLQWVFNPLEYMETNVKINRDIFNTQVTGGVGLIFVNSPEGMQELLTRDTKEFYAP
GSINEILKPLLGEQSVMLLDGDRHKRQRKLLMPPFHGERMRTYGELILNITQQATAKLKPGQPFIARNAM
QEITLAVILQAVFGIYEGSRYDKLKQLITSLLAVTDSPVSSSLLFFTSLQKDWGAWSPWGRFLRMRQKVD
QLLFAEIEERRQNWDENRTDILNLMMAARDEDGQPMADEELRDELLTLLVAGHETTATAMAWALYWIHRQ
PEVYQKLIQELESLPENADPMTIFRLPYLTAVCNEALRIYPVAMLTFPRVTKEPTQLLGYELEANIGLAG
CIYLLHHREDLYPEPKQFKPERFLERKFSPYEFLPFGSGARQCIGMALAQFEMKLALAQILLDYDLTLLE
KRPVKAARRGVTLSPVGGIKMMMNGKRTPSKSVAIPATV

CYP110D1   Nostoc sp. PCC 7120 Same as Anabaena
           GenPept BAB76465, AF2401
           NC_003272 complete genome 5678382..5679743
           48% to CYP110A1 53% to 110E1, 49% to 110B1
  1 MTVTQNLPNG PRIPRLLRLF KFITQPIQYV EDFAKVYGDN FTIWGSGESY FVYFSHPQAL
 61 EQIFTNVSCF ESSGGGSPLL ELLLGKNSLI LLEGDRHQRQ RQLLTPPFHG ERMRAYGQTI
121 REITQQVTQA WQMGKPFNIR ASMQEITMRV ILRVVFGVDE GELFQELRQL LTTLLDFMGS
181 PLMSSTFFFS FTQKDYGAWS PWGRMVRLIK KIDQLIYALI AQRRAEFGEN RQDILSLLIS
241 ARYDDGQPMS DVELRDELMT MLVAGHETTA SALTWAFYWI DSVPEVREKL FQELDTLNDD
301 SEPSIIAKLP YLTAVCQETL RFYPIVLNAF FRRTKNPMEI MGYKLPKATL VVPSIYLAHH
361 REEVYPQSKQ FRPERFLEKQ FSPYEYLPFG GGNRRCIGLA FAQYEMKIVL ATILSQFQVS
421 RLSKRPVQPV RRGLTLAAPG GMKMVANKRM RNS

CYP110D2   Nostoc punctiforme
           NZ_AAAY02000028 GenPept ZP_00109203.1
           52704..54170 gene = Npun3650
           68% to 110D1
MNIPLSVTLSNMKSRNNKIQKPSNLQTPMTATYNLPDGPQMPRW
LRTIKFISQPVKYVDDFAKTYGDTFTIRSSRSDNHIVYFSQPQALEEIFTADSRHFEV
GRGNTGLRFLLGDRSFMLVDGDRHQRQRQLLAPPFHGERMRAYGEDIRKITQQVSHEW
KIGKPFNIRESMQEITLRVILRVVFGLNEGELFEELRRSLSDLLDFISSPIMSSAFFF
RFIQKDFGAWSPWGRILLQRQKVDLLIYTLLRERRAQTDQNRQDILSLMMAARYDDGQ
GMSDEELHDELMTLLVAGHETTASALTWAFYWIDHLPEVREKLLQELNTIGVNPDLSS
VAKLPYLTAVCQETLRIYPIAMTAFVRIVKTPITIMGYELREGTAIVPSIYLAHHREE
VYPQSKQFKPERFLERQYSPYEYLPFGGGNRRCIGMAFAQYEMKIVLATVLSEFQVSL
VNKRPVHPVRRGLTVATPAGMRMVATPQVKRANTPALV

CYP110D3   Trichodesmium erythraeum
           GenPept ZP_00074554.1 GenEMBL NZ_AABK02000068
           complement(10019..11407) gene = Tery3870
           54% to 110D1
MTLPDGPSLSPLQRRLRTWKFIFSPLSAIEERYSEYGDIFRTNT
NSLYPFIYFCNPKAIQQIFTADPDTFTSGSINGILKYFVGLNSLLLQDGDRHKRQRKL
LMPPFHGDRMRKYGDLIYNITSNVISQWKIEQPFPIRKSTQEISLKVILAAVFGLDQE
GKSYEKLRVLMSDLLDSMSSPLSSTFLFFNFLRKDWGPWSPWGRFLRKKQELHELIIA
EIQTAKKEGNHRDDILSLLLEARDEAGNAMSDEEIKDELLTMLFAGHETTASALAWAL
YWIDMIPSVGEKLMAELATIPSNSDQVAITKLPYLSAICQETLRIYPIAMNAFPRVVQ
KPIEIMGYQLEPGMVAIVPIYLTHHREDIYPEPKKFKPERFLERQFSPYEYLPFGGGS
RRCIGSAFALFEMKLVLATILSQWELKLLPNQRISPVRRGLTMAPPANMRMVVKPKKS
WQKVSQPILTSG

CYP110D4    Lyngbya sp. PCC 8106
            ZP_01620515 
MTLPNGPQTPRVLRMMKFVARPLDYLEDYYRRYGDFIRIGKSATPLVYVNHPAAIEKIFTAGSEQFRTGN
AGGVLLFLLGDNSVLMVDGERHERQRKLLMPPFHGERLKTYNQLICEITKEVMSQVKIGQPFRVRTLMQD
ITLRVILKAVFGLTEGERYEQLRHLLSAMMESIGSPLAASLMFFPSLRQDWGEWSPWGRFLRYKQQADEM
IYAEIRERKQQRDFDGDDILTLLMSARDETGKPMNETELRDELVTLLIAGHETTASSLTWALYWTHYLPE
VKDKLCFELANLGENPHLSEIARLPYLTAVCNETLRIYPVTLTSGVRVLKKPLELGGYSFEPGTVLFPCT
YLVHQREDIYPEPKKFKPERFLQRQFSPYEFFPFGGGHRRCIGSAMATLEMKIALATILSDWQLKLPHHK
AYKPVRRGLTLSPPAQLSLVAVNRLN

CYP110D5    Crocosphaera watsonii WH 8501
            ZP_00513562 
MNLPPTLSQPRLLRLFKLIFYPLDYLEDNYQRYGDIFVAGKSETPFVYISNPQGIQTILTRDKTDFKTGG
GSGFLSTLLGDNSLLFLQGERHRRERKLLMPPFHGERLKSYANLIYSISDKVTDKLQINRSFNVRDIMQE
ITLKVILKAVFGITEGERYQRLQELLKSWLSFFDSPANAILIFFPWLRKNWGNWTPWGRFLQIKAEIQEL
IYTEIRERREQKKYEGTDILTLLMLAKDEEGKPLSDQELHDELITLLIAGHETTASALTWALYWIHFCPD
VEDKLRFHFSNLNNNTDLLDIVKLPYLDAVCKETLRIYPVLLTTFIRVLQTPLELMGYQFKPGTVFAPAI
YLVHHREDIYPNSQQFRPERFLERNFSPYEYFPFGGGSRRCIGMELAKMEMKIVLYTILSKHKLKLPSSR
PLKAVRRGLTVAPPSNFKMILSN

CYP110D6    Cyanothece sp. CCY0110
            ZP_01726400 
MVLPPSISTPRLLRLFKLIFYPLDSLENYYERYGDIFIVGQSETPFVYISNPQGIQEILTKDKTHFRTGG
GSGFLTTFLGNNSLLSLKGEKHQRERKLLTPAFHGERLQSYATLIYSISDEVSEKLEINQSFNVREIMQE
ITLQVILKAVFGIAEGKRYQKLKNLLTSWLSFFDSPINATIIFFPFLQKDWGNWTTWGRFLRIKAQIDDL
IYTEINERRQQKNYQGKDILTLLILARDEDGNPMSDQELHDELITLLIAGHETTASSLTWALYWIHYCPE
VEEKLRSHFSILDKNIDLLNIIKLPYLDAVCSETLRIYPVVVNAFIRVLETPLELMGYQFKPGTVFAPAI
YLVHHREDIYPNSKQFRPERFLERQFSPYEYLPFGGGSRRCIGMELAKMEMKIVLFTLLSKYKFKLSSSH
PLKPVRRGLTIAPPNSFKMIITQKLAYT

CYP110E1    Nostoc sp. PCC 7120 Same as Anabaena
            GenPept BAB76532, AI2409
            NC_003272 complete genome 5753083..5754450
            50% to CYP110A2 53% to CYP110B1 53% to 110D1
  1 MKLPDSPKIP KFMQLVQWIY QPLQLMEASA KAHGDSFTLW LTNKRPIVFL SNPQAIQELF
 61 TTPLEQLDAR GTAQVLQPLL GENSLLLLSG ETHQRQRKLL TPPFHGDRMR AYGDIITNIT
121 KEVISNWQLG KPFSVRDSMQ EITLRVILQA VFGLREGERY TQLQKRLCDI LDLSGSALRS
181 TLSFLPALQI DLGRWSPWGH FLRQREAIDQ LLYAEIQDRR DHPDPSRTDI LSLMMAARDE
241 NGEAMTDVEL RDELMTLLVA GHETTASALT WALYWIHKLP QVREKLLAEL DNFGDNGDVN
301 EITRLPYLTA VCQETLRIYP IAMVTIPRIT KTNLEIGGHQ FAPGTMLVGC IYLMHRRPDL
361 YPQPQEFKPE RFLEKQYSLY EYLPFGGSNR RCVGMAFALY EMKLILATVL ANVDLALVDN
421 YPVKPTRRGV TLAPSGGKWL IATAQHQKIK NPVEV

CYP110E2   Nostoc punctiforme
           NZ_AAAY02000088 GenPept ZP_00107327.1
           complement(18173..19567) gene = Npun1723
           58% to 110E1 55% to 110B1
MSLLKLPNGPQTHPWIQMYQWLTNPLEYMEACTKRYGDIFTLKL
GQNFAHQVFISNPQAIQQIFTTDPKQLDSGESAGIKAPLLGQQSLLALDGKPHQRQRK
LLTPPFHGERMLAYGELIREITEQVSSQWQVGETFAVLPSMQAISFQVILKAVFGLED
GPRYKKLNELLIKILNPKIPLLRTVLLIFPSMRQDLGAWSPWGKYLRLRQQIDQLIYA
QIQERKAQPNLSGTDILSLMMAARDEAGEPMTDLELRDELMTLLVAGHETTATSLSWA
LYWIHHRPQVREKLLQELDNLGEKPDPNAIFRLPYLNAVCSETLRLYPVAMSALNRLV
KSPLQIGEYNFEPGTILIPSIYLTHHREDLYPESKQFKPERFLERQFSPYEYLPFGGG
NRRCIGMAFALFEMKLVLATVLSRWQMELADSKPVRPVRKGLLFSPAGGVQMVVKGKR
LQNQPILQTSSSSV

CYP110E3   Trichodesmium erythraeum
           GenPept ZP_00072591.1
           GenEMBL NZ_AABK02000017 complement(<3..1016)
           53% to 110E1 missing C-terminal 121 aa (runs off end of clone)
  1 MIKLPGPKSP ALTQILQWTA KPIKFMEKCA REYGDTFEVK LNYPIVFISH PKAIEEIFKA
 61 NPKKFDCGSS NKLAQPLLGD YSLLLLDDIP HQRQRKLLMP PFHGKRMQAY GELICNVAQE
121 VASKWEIGQV FSMREFTAEI SLKVILQAVF GLYEGERYSK LEKLLGSLLE SLSSPLKTSM
181 LFFQFLQIDL GPWSPWGNFI KNREEIYELL CAEISERRQK LDPERSDILT MLLLARDEEG
241 EGMSDIELRD ELMTLLIAGH ETTATSLSWA FYWIHHQPEI YQKLSRELET FGDDLNPMTV
301 INLPYMNAVC SETLRIYPVV IIVSPRKTKL PITIMGQT

CYP110E4   Gloeobacter violaceus PCC 7421
           GenEMBL AP006578 complement(257348..258724) 
           gene = gll3063
           NC_005125 complete genome complement(3256348..3257724)
           locus_tag = gll3063
           71% to 110E5 55% to 110E1
MSLPPGPSSPSPFQLMQWIGCPTDYLHTTAARYGDPFTMRVGVF
PPLVMFSDPRAIQQLFTAEAGTFDAGASNVALRPTLGANSLLLLDGERHQQQRRLLTP
PFHGERMRAYGELIRQVTEEVIVRWQPGKPFLVRNAMQRISLAVILQAVFGLHDGTRL
VRLRQALGSMLDAMSSPLSMAMLLMLPEDFGPWSPRARLQAHLGAIDELLYAEIRERR
EHFDAGAGDILGLLLAARDEAGAAMGDAELRDELMTLLVAGHETTATAMAWALYWIHY
LPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVALIASPRVARHTVRI
LERDYEAGTRLAAGIYLAHHRPETYPEPERFRPERFLERTFSPYEFVPFGGGSRRCIG
MAFALYEMKLVIATVLLERDLRLVQPRLLRPVRRGVTLAPPEGLYLVPTGERSASRLL
SRTSTAGQ

CYP110E5   Gloeobacter violaceus PCC 7421
           GenEMBL AP006578 complement(258800..260176) gene = gll3064
           NC_005125 complete genome complement(3257800..3259176)
           locus_tag = gll3064
           71% to 110E4 55% to 110E2
MSLPAGPASPPPLQLLQWIGRPTDYLERTARRYGDPFTMRLGLH
SPVTGVFFSSPEAFQQLFNTEPGLFDSGGANASSTFNLLFGTNSLILLDGERHQQQRR
LLTPPFHGERMRSYGELIRTLAEQVTARWNLGTPFQARRSMQRISLGVILKAVFGLHD
GTRYLRVCRLLGNLIDASASPLLFGLRLIFPQDAGPMSPMGQLKAQIDAIDELLYAEI
RERRERPDPRADDILSLLMAARDEAGQGMGDVELRDELMTLLVAGHETTATAMAWALY
WIHRLPQVRERLLAELDSLGSDPDPEAIARLPYLGAVCSETLRIYPVAMVAFARVPRR
PVRILDREYPAGTFLIPNIYLAHRRPEAYPDPERFRPERFLERTFSPYEFVPFGGGSR
RCIGVAFALYEMKLVLATVLSRVELRLADPRPRLPVRRGLTLAPPEDLHLIPTALRSG
HRDLLPAC

CYP110E6   Anabaena variabilis ATCC 29413
           YP_322620 
MKLPDSPKIPRFMQLVQWIYQPLQLMEASAKAHGDCFTLWLTNKRPIVFLSNPQAIQELFTTPLEQLDAR
GTAQVLQPLLGENSLLLLSGETHQRQRKLLTPPFHGDRMRAYGDIITNITQEVISKWQLGEPFSVRDSMQ
EITLRVILQAVFGLREGERYTQLQKRLCDILDLSGSALRSTLSFLPALQIDLGSWSPWGHFLRQRAAIDQ
LLYAEIQDRRDHPDPSRTDILSLMMAARDENGEAMTDIELRDELMTLLVAGHETTASALTWALYWIHKLP
QVREKLLAELDNFGDNGDVNEITRLPYLTAVCQETLRIYPIAMVTIPRIVKTTLEIGGHQFAPGTMLVGC
IYLMHRRPDLYPQPQEFKPERFLEKQYSLYEYLPFGGSNRRCVGMAFALYEMKLVLATVLANMDLALVDN
YPVKPTRRGVTLAPSGGKWLIATGQHQKVKSPVEV

CYP110E7   Nodularia spumigena CCY9414
           ZP_01631632 
MPALQLPDGPKNHPWLQTYRWLTSPLEYMEDCAKNYGDIFTIRVGPLSTPQVFVSNPQAIQQIFSTDPKY
LDSGAAAGFKSPLLGNQSLLSLDGKPHQRQRKLLTPPFHGERMLAYGELIRDISQQVTNKWQVGETVSVL
SSMQAISFQVILKAVFGLAEGPRYEKIKEALIAILNPKKPLLRSMLLMFPSLRRDLGAWSPWGEFLRLRQ
QIDELVYAEIQERKAQLDSSRTDILSLMMATRDEAGEPMTDLELRDELMTLLVAGHETTATALSWALYWI
HHQPQVREKLLQELDTLGEKPDPNAIFRLPYLNAVCSETLRLYPVAMLLLSRLVKSPLQIGEYQFEPGTL
LIPCVYLTHHREDLYPDSQTFKPERFLERQFSNSEFIPFGGGNRRCIGMAFALFEMKLVLATVLSNWQME
LANTQPVLPVRKGLLFGPKGGVQMVVKGRRELS

CYP110F1   Nostoc punctiforme
           NZ_AAAY02000005 GenPept ZP_00111618.1
           complement(57031..58407) gene = Npun6096
           48% TO 110E1 48% T