Micromonas P450 sequences (Photosynthetic marine
picoeukaryotes)
Micromonas sp. RCC299 (v3.0 at JGI)
Micromonas pusilla CCMP1545 (v2.0at JGI)
Closely related to but different from Micromonas sp. CCMP490 (some ESTs available)
D. Nelson,† April 13, 2009
>CYP51G1 Micromonas sp. RCC299
EuGene.1200010398|MicpuN3
81% to CYP51G1 Ostreococcus lucimarinus, ortholog
81% to CYP51G1 Ostreococcus Ostreococcus tauri, ortholog
MSVELIADAIGVPVWALVPVALAAAAVLVFILDGLSHSVLKPGKSPPVIGTTPVFGGMLEFLKGPIGLMARAYPKYGEAF
TVPVFHKRITFLIGPKVSEHFFKARDQEMSQKEVYEFNVPTFGKGVVFDVDHITRAEQFRFFANSLKSDRLRQYVGMMVK
EAEDYFAKWGDEGEVDLLDALSELIVLTASRCLLGREIRETLFSEVTTLVHDLDKGMVPLSVFFPYAPIEAHRKRDKARK
ELAAIFDKVIQGRRESGAVEPDVLQTFIDARYKDGSRLSNDQVLGMLIAVLFAGQHTSSITSTWTGLLSIANKERIFPNL
EKEQKDVMAKHGDKIDFDILAEMDELHFCIKEALRMHPPLIMLLRQCHVPFEVETTKGKKFVVPKGHIVATSPAFAHRMD
EVYSEPNVYKPERFKGESPEDKRAYASFIGFGSGRHGCMGETFAYMQIKTIWSHLLRNFDFELVGKLPEPDYEGMVVGPK
HPCTVRYKRRKL*
>CYP51G1 Micromonas pusilla CCMP1545
estExt_fgenesh1_pg.C_100222|MicpuC2
83% to CYP51G1 EuGene.1200010398|MicpuN3
MDVLAQVTEFVGVPAWGLLPVALVAAVVLFVVFDALSHVVLKPFKSPPVIGTWPVIGGMIQFLKGPIGLMNNAFPKYGEV
FTVPVFHKRITFLIGPHVSEHFFKARDADMSQKEVYEFNVPTFGKGVVFDVDHLTRAEQFRFFADSLKSDRLRSYVGMMV
KEAEDYFGKWGESGEVDLLDALSELIVLTASRCLLGREIRETLFSEVTTLVHDLDKGMVPLSVFFPYAPIEAHRKRDAAR
RDLAAIFDRVIQARREANAHEPDVLQTFIDARYRDGSRLTNDQVLGMLIAVLFAGQHTSSITSTWTGLLTIANKERVMPT
LEGEQKKVMEKHGGKIDFDVLAEMDELHYAIKEALRMHPPLIMLLRQCHKPFKVTTSKGKEFVVPKGHIVATSPAFSHRL
NNVFSDADTYKPSRFRQPSPEDKEKFASFIGFGGGRHGCMGETFAFMQIKTIWSILLRNFEFELVGKLPEPDYEGMVVGP
KHPCTIRYKRRKL*
>CYP97A17 Micromonas pusilla CCMP1545
estExt_Genewise1.C_50257|MicpuC2
83% to CYP97A18 EuGene.0600010504|MicpuN3 probable ortholog
MPVAAGDIREIAGQPVFVPLYKLFLAYGEMFVLAIGPKKFVVVSDNDVAREMLKDQATSFSKGLLSEILEFVMGTGLIPA
DGETWKVRRRTVVPSLHKKYVASMVDMFGDCGLNGSAQLARSEMNGDTVEMENFYSRLALDIIGKAVFNYDFNSLKMDDP
VIKAVYTVLREAEYRSVTFIPYWKVPPLRWLVPRQKACQEALVVVNDTLNMLIARTKKLVEEEDEEFVEEYLNKADPSIL
HFLIASGDDVTSKQLRDDLMTLLIAGHETTAAVLTWTTYLLATHPEIKARVQAEVDEVCGDRNPTIADMMDLKFTTRVIN
ESMRLYPQPPVLIRRALEPVTLDGYKIDAGTDFFISVWNLHRNPRLWENPDKFDPDRFPIDQKMPNEITENFAYLPFGGG
QRKCVGDQFALFESIITLAMVCRRFDFELDAKFHPDGECGMTTGATIHTTGGLHVKLKRRDGAGGNEMLGSIDCGDGVKC
ASLGELSDVNTGDGTPESGAASFDEAERAQARDLKEAAVVLGAKTAGASELKSGSGGAKGEKPSIDGAALEEAVKEAEAL
YEAEAEREAAMKKELEQSL*
>CYP97A17 Micromonas pusilla CCMP1545
EuGene.0000050221|MicpuC2
77% to CYP97A18, probable ortholog
same sequence as CYP97A17
estExt_Genewise1.C_50257|MicpuC2 with N-term extension
MLSLAASATTLVGANAGASAAAASTRRGGASLSARDATATTKPLRFVARGASSDRGRALTTRVNAKSLEERIASGEFTQK
RSTPAESVLNGIRDVIKDVPQSRGLSYQLAKLSRKWRKESMSK
MPVAAGDIREIAGQPVFVPLYKLFLAYGEMFVLAIGP
KKFVVVSDNDVAREMLKDQATSFSKGLLSEILEFVMGTGLIPADGETWKVRRRTVVPSLHKKYVASMVDMFGDCGLNGSA
QLARSEMNGDTVEMENFYSRLALDIIGKAVFNYDFNSLKMDDPVIKAVYTVLREAEYRSVTFIPYWKVPPLRWLVPRQKA
CQEALVVVNDTLNMLIARTKKLVEEEDEEFVEEYLNKADPSILHFLIASGDDVTSKQLRDDLMTLLIAGHETTAAVLTWT
TYLLATHPEIKARVQAEVDEVCGDRNPTIADMMDLKFTTRVINESMRLYPQPPVLIRRALEPVTLDGYKIDAGTDFFISV
WNLHRNPRLWENPDKFDPDRFPIDQKMPNEITENFAYLPFGGGQRKCVGDQFALFESIITLAMVCRRFDFELDAKFHPDG
ECGMTTGATIHTTGGLHVKLKRRDGAGGNEMLGSIDCGDGVKCASLGELSDVNTGDGTPESGAASFDEAERAQARDLKEA
AVVLGAKTAGASELKSGSGGAKGEKPSIDGAALEEAVKEAEALYEAEAEREAAMKKELEQSL*
>CYP97A18 Micromonas sp. RCC299
EuGene.0600010504|MicpuN3
83% to CYP97A15 Ostreococcus
tauri estExt_fgenesh1_pm.C_Chr_13.00010043
80% to CYP97A15 Ostreococcus
lucimarinus estExt_GenewiseEukaryote.C_Chr_130084
probable ortholog
MQSSLATGRAPTLGRARVAPSASSRSNLGAGRAMCGFGSESLRGSGRAAGVTLPRRRAATPCRAKTLEERIASGEFTKPR
SSPAEDLLNGFRGLIRNVDNPQARGLSVSLAKLSRKWRNESMSRMPVAAGDIREIAGQPVFVPLYKLFLAYGEMFILAIG
PKKFVVVSDNEVAKEMLLTQANSFSKGLLSEILDFVMGTGLIPANGETWKIRRRTVVPSLHKKYVASMVDMFGDCGVHGS
AQLAKSEREGKTVEMENFYSRLALDIIGKAVFNYDFDSLKKDDPVIKAVYTVLREAEYRSVTFIPYWKVPPLRWLVPRQK
ACQEALVVVNDTLNMLIERTKKIVEDSDEEFVEEYLSGDDPSILNFLIASGDDVTSKQLRDDLMTLLIAGHETTAAVLTW
TTYLLATHPEQMRKVQEEVDRVVGDRRPTIQDMMELKYTTRVINESMRLYPQPPVLIRRALEPVTLDGYKIETGTDFFIS
VWNLHRNPRLWPEPDKFIPERFPLDQKMPNEVTENFAYLPFGGGQRKCVGDQFALFESIITLAMVCRRFDIDLDPAFHPD
GECGMTTGATIHTTGGLHVKLTRRAGMGGDEMGYDETTRSLDELEDVNTRGGTPEAAVGSFDEREIEEARDVKEAAVVVG
AKTAGATEVRSGSGGAKGETPAIDGERFDQAVAEAEDILAKEEAARVEAQKAR*
>CYP97C18 Micromonas sp. RCC299
estExt_Genewise2Plus.C_Chr_140254|MicpuN3
84% to CYP97C18 Ostreococcus tauri
82% to CYP97C18 eugene.0900010237†† Ostreococcus lucimarinus
probable ortholog
MAAPDWLTQLNRLWGGNSEIPVADAKLEDITGLLGGGLFQPLFKWMKEAGPVYLLPTGPVTSYVVVSDPDCIKQILFNYG
SKYIKGTIAEAGEFLFGLGVALQENEAWKIRRKAVAPSLHRRYVEAMVDRCFGPCAERMVELVESAIDSEGGEKKRLNME
SKFSQAALDIIGISVFNYDFKALTSAAPVIQATYTALKEVETRSMDLLPTWRLPEPFLRVVSPRQKAAQDAVKIIQEVTT
KLVDDCKRMVEEEEKVGGAEEWARDYLNDSNPSVLRYLIAAREEVSSTQLRDDLLSLLVAGHETTASVLTWGTYELLKPE
NAEQLRLLRAELDEVLGDKPFPTYEDMTKMPYLERCFHESMRLYPQPPVYTRRAVVEDVLPKGLGVVPAGQDLLVSIYNL
HRSPENWGPNSQVFEPMRFGPLALGQPNELNTGYRYTPFSAGPRRCPGDKFAVLEGMAIWAVMFRRLDLTLVAGHDVVMT
SGATIHTRDGMLVNARRRRGEKGQKVDWAQLKPEKDIGAGWFEKGIEAKGSSGGGKCPVAH*
>CYP97C18 Micromonas pusilla CCMP1545
estExt_Genewise1Plus.C_30232|MicpuC2
88% to CYP97C18 Micromonas sp. RCC299, probable ortholog
MAAPDWMTQLNRLWGGKSEIPVADAKLDDITGLLGGGLFQPLFKWMKEAGPVYLLPTGPITSYVVVSDPDCIKQVLFNYG
SKYIKGTIAEAGAFLFGLGVALQENEAWKIRRKAVAPSLHRRYVEAMVDRCFGPCADRMVSLVEDQINADGGRRERVNME
SKFSQAALDIIGISVFNYDFKALTSAAPVIQATYTALKEVETRSMDLLPTWRLPEQFLRIVSPRQKAAQDAVTVIQEVTT
KLVDDCKKMVEEEEAVGGAEAWARDYLNDANPSVLRYLIAAREEVSSTQLRDDLLSLLVAGHETTASVLTWGTFELLKPE
NAEQLRLLRAELDEVLGDKPFPDYADMLKLPYLERCFHESMRLYPQPPVYTRRAVVEDVLPHGLGTIPAGQDLLVSIYNL
HRSPANWGPASQAFEPMRFGPLSAGQPNELNTGYRYTPFSAGPRRCPGDKFAVLEGMAIWAVLFRRLDMELVAGHDVVMT
SGATIHTRDGMLVNATRRETRRKGGGDAVDWANLRPAKDIGEGWWERGIETKGGSGADAKSKCPMPFVK*
>CYP97E5 Micromonas sp. RCC299
estExt_Genewise2Plus.C_Chr_160324|MicpuN3
80% to CYP97E5 Ostreococcus tauri estExt_gwp_GeneWisePlus.C_Chr_01.00010469
82% to CYP97E5 Ostreococcus lucimarinus eugene.0100010571
probable ortholog
MPQPPSTEEDIDDDFVPSQLGIKDIISLWITQILQTYGDEESKDGAPVCEGSVDDLVGGPIFLALYPYFLRYGGVFKLAF
GPKVFMVLSDPVVVREVLKEKPFAFSKGVLAEILEPIMGQGLIPAPYAVWKNRRRQLVPGFHKAWLDHMVGLFGDCSTQL
VKNLDAEIAKGNGSAIVDMEERFCSVSLDIIGLAVFNYDFGSTTRESPIIKAVYTCLQEAAHRSTFYFPYWNLPLADVLV
PRQREFKNNMNLINETLNGLIKKAQAFEGTEDLEELQNRDYSKVKDPSLLRFLVDIRGADVTDSQLRDDLMTMLIAGHET
TAAVLTWCLYCLAQDRELMARVVAEIDDVMGPADGETPTAPNYEQIQKMELVRLCLAEALRLYPEPPILIRRCLEDVPLP
KGAGDANVTLIKGMDVFISVWNLHRHPDCWEEPLKFDPTRFKRPFQNPGVKDWAGYNPDLISGLYPNEVTSDFAFIPFGA
GARKCIGDQFAMLEATSCLAMTLRRYDFEMTKDASEVGMEMGATIHTAGGLPMKVTRR*
>CYP97E5 Micromonas pusilla CCMP1545
e_gw1.13.59.1|MicpuC2
86% to CYP97E5 ortholog
cyan probable intron seq
MERPSQTYGDEESKDGAPVCEGSVDDLVGGPIFLALYPYFLKYGGVFKLAFGPKVFMVLSDPVIVRRVLKEKPFAFSKGV
LAEILEPIMGQGLIPAPYAVWKNRRRQLVPGFHKAWLDHMVGLFGDCSAQLVKNLGASHLTLTDASIAAGNGVARIDMEE
RFCSVSLDIIGLAVFNYDFGSTTRESPIIKAVYTCLQEAAHRSTFYFPYWNIPFMCDIVPRQREFKANMKLINDTLNGLI
TQAQQFEGTEDLEELQNRDYSKVKDPSLLRFLVDIRGADVTDLQLRDDLMTMLIAGHETTAAVLTWCLFCLVRDKPLMKK
VVEEIDSVMGPVAEEARAPNYEEIQKLELVRLCLAEALRLYPEPPILIRRCLEDVPLPKGAGDADVTLIKGMDVFISVWN
LHRHPDCWEEPLKFDPFRFKKPYSNPGVKDWAGYNPDLISGMYPNEVTSDFAFVPFGAGARKCIGDQARSCLHWSPYDRF
AMLEATSCLAMTLQRYDFELDKDAAEVGMEMGATIHTAGGLPMRVTRRK*
>CYP97E6 Micromonas sp. RCC299
e_gw2.16.55.1|MicpuN3
77% to CYP97E6 Ostreococcus
tauri estExt_fgenesh1_pm.C_Chr_15.00010014
73% to CYP97E6 fgenesh1_pg.C_Chr_14000053†† Ostreococcus lucimarinus
probable ortholog
MPAPTRFTRAPRRAHPGSRPAERGARVARRAVEEPKTRGGDSAPGRIYRGALFPGVEVPEAGADVWRGISRVFPWGNGAP
VTEGVLGDLLKEEVRAAPLFVPLYDYYRQYGGVYNLGAGPKWFVVVSDPVVVRHMFKDNADAFSKGILTDIMEPIMGDGL
IPAPKEIWAKRRPTVGAGFHGAWLKHMTNLFGASATNLADKLEREWCDKDVAVNLEDELYAMALDVIGKAVFNYEFGALR
EETPLIKAVYRVLRESEHRSTFPLQYWNIPGAMDVVPRQKQFKEDIAAINAELSKLIADALADRNETDLAEMESRDYANV
EDASLLRFLVDVRGETVSSTQLRDDLMTMLIAGHETTAAVLTWTMYLLATHPEEAELARAEVDAIVADPSGVPTVEEIRK
LERTRLCLAEGMRMYPAPPILIRRALEDVTLPAGGMGREITLKKGTDCFVAVWNLHRSPDLWDRPDVFDPARFKREFKNP
KIEGWNGLSPELVTGLYPNEQSTDFAYVPFGGGQRRCAGDMFAMMEATVALSVLLKRFEFELGCDESEVEMITGATIHTK
AGMPVKLRSRSAK*
>CYP97E6 Micromonas pusilla CCMP1545
EuGene.0000090537|MicpuC2
79% to CYP97E6, ortholog
MSGRRRRRRRSASTTTTTRAAIEEPKPKALVPPYRGALFPGVEIPEAGQDAFAAISKALFPWGNGAPITEGVLGDLLKEE
VRAAPLFVPLYDYYRKYGGVYNLGAGPKWFVVVSDPVAVRTMFKDDADSFSKGILTDIMEPIMGDGLIPAPKEVWAKRRP
VVGAGFHGAWLKHMVSLFGDSANNLAAKLAPDAAGGKTVEIESKLYAMALDVIGKAVFNYEFNSLAEETPLIKAVYRVLR
ESEHRSTFPLQYWNIPGAMELVPRQKRFKEDIEMINDELSVLIAAALKSRNETDLAEMEARDYANVDDASLLRFLVDVRG
EEATGTQLRDDLMTMLIAGHETTAAVLTWTTYLLATHPEEARKIQEEIDAVVSDPGGAPTVEEIRAMEKTRLALAESMRL
YPAPPILIRRALRDVTLPRGGMGKAITLKKGTDCFVAVWNLHRSPDLWENPEKFDPSRFKRPFQNPAVEGWRGLQPELAT
GLYPNETSTDFAYVPFGGGQRRCAGDMFAMMEATVALSVLMKRFDVELACEKEDVEMITGATIHTKAGMPVKLTPRR*
>CYP745B1 Micromonas sp. RCC299
e_gw2.06.182.1|MicpuN3
65% to CYP745B1 Ostreococcus
tauri. 62% to CYP745B1 Ostreococcus lucimarinus
probable ortholog
MIALYLILKVVVGAVASVLGFVRKELVLARAPMAPGFVPFIGHTITLFKAVGSYPCTWDLFAQWSTATAPKPVRVQIFTQ
HCVVIADPSTMKRVMSSNLKNYSKDLEFSYAPFLEILGTGLVTSGGDTWRKMRGHISKALRVEILDEIIAIATRAVDRLC
VKLDAIKGTGESIDMENEFRLLTLQVIGEAILSLSPEESDELFPSLYLPIMDECNARSLSPWRTYLPTPEWFAHRKRVRQ
LDAAIIKIVRDRWDKKRSGQNVPDDILERVLDQELNEKRSEVERQLCYEIKTFLLAGHETSAAMLTWTMHELSNNGEATE
KIRKESDKVFGRLKKDALPTRESLDTLDYSLAALKETLRLYSVVPVVNRVAEEDDDLGGVRIPKGTTVIMSLQGVHHRED
LWPDPLVYKPERFTDPGFNENEGFKFLPFIQGPRNCLGQYLALLEARVVLCTLIRKYKWTKASAENGKKHTKAIPIAPAN
GMHFTVA*
>CYP745B1 Micromonas pusilla CCMP1545
EuGene.0000050037|MicpuC2
73% to CYP745B1†† e_gw2.06.182.1|MicpuN3, ortholog
MAVAPPSEPVLIATSVALAVLTVAFYVSRAVAGFLYGVARWHRQAFVLRHTPTAPGYVPLIGHTIALFRAVGNYPCTWDL
FAMWATATAPKPARVQIFDRHCVVIADPSTMKRVMATNLKNYQKDLEFSYAPFLEILGTGLVTSGGETWRKMRGHISKAL
RVEILDDIIAIATRAVERLCVKLDAAKASAAAVDMEQEFRLLTLQVIGEAILSLSPEDSDELFPSLYLPIMDECNARSLS
PWRAWIPTREWFAHKARSIHWFPYDRVGVVNASRVRELDDAIISIVRARWRKKQAGEDVPDDVLERVLEQVREDEYGADV
ETQLCFEIKTFLLAGHETSAAMLTWTLHELSKAPDMMREVKRESDRVFGRTRKGSLPTRDQLASMEYTLAAFKETLRLYS
VVPVVTRVAVEDDELGGTRVPAGTTVIMSLQGVHHRADLWPEPLKYDPARFVKADENDEMRDFKFLPFIQGPRNCLGQYL
ALLEARVVLGTLVRRYAFAPSKAQGKKHTKAIPIAPANGMHFTVS*
>CYP746A1 Micromonas sp. RCC299
estExt_Genewise2Plus.C_Chr_040155|MicpuN3
56% to CYP746A1 Ostreococcus
tauri, probable ortholog
56% to CYP746A1
eugene.0300010366 Ostreococcus lucimarinus
52% to CYP746A1 Volvox
51% to CYP746A1 Chlamydomonas
48% to CYP746B1 moss
MQTAALRGVADVPSQLAKQLQRAAAPPPAGFPPGPPDDVAVALGADPLAFIVDTQRRHGDVVGLSLAGERVVLVSDPKVA
ADVMIDRAGLFVKEGTAFFPGSSLAGEGLLVSDGESWARQRRLSNPAFRAAAVDAYATAMARAGAELLRNEWGARSVRDC
YEDFNDLTLRIVAEALFGADVRGKRAREINGAIKEAFEFFGRRSATGMIVPEWVPIPDNFAYNAAVTRLDKAVYSLIAER
RQKRVNGLESVENPDLLDRLLDARDDGEGGDGGGMDDVSLRDELMTLMVAGQETSAILLSWCCVNVASNPRCAAKLSEEA
AAVIGPDPLNDLPNASHYSRLKYAEAAVLETMRMQPPAYMVGRCCAEDVFIAGGKYALPKGTTVLIAPYLLHNDARYWDA
PGEFRPERWLEPGGGMGPGGVYVPFGAGPRVCIGTGFAMMESALLLAMVARAVDVRLRPGADPPRPRALITLRPENVRLD
VVPKRRWW*
>CYP746A1 Micromonas pusilla CCMP1545
EuGene.0000120135|MicpuC2
69% to CYP746A1
estExt_Genewise2Plus.C_Chr_040155|MicpuN3, ortholog
yellow regions are questionable
MSAATAPASRGFAPSRALRCRGRGEETAAVARSRRVAARDLVASSRRFERPRATASDDASTSSSSSSSSSSSSTSTAPPA
PKPTPEDIASVAAFFLETTAAATASVPLVGDVARAFARAFEPTPADFPPGPPGDAAVELAADPIAFIVKTQREHGDVVGL
RLAGEHVVLVSDPDVVRDVAIDRADLFVKEGTAFFPGSSLAGEGLLVSDGEKWARQRRLSNPAFRAAAVDAYATAMTRAG
ERLLRGPWSARTIRDAYADFNDLTLEIVADALFGADVRGARAKEINAAIAEAFEFFGKRSSSGFIVPEWAPTPENAGYAA
AVARLDDAVYALIAERR
AKREKPRYERLELLDEDREDEFVEEEEEEEEEEVPGGSSSTTPSRGP
DLLDRLLDARDDGDGG
DGGGMTDASLRDELMTLMVAGQETSAILLSWCCALLAQHPEVAERCAAEARAVLGDGSGGGDGDDEIEIASPTAENVGEM
KYVEAVVLETMRLYPPAYMVGRCAARDVVVGGHSLRKGTTILIAPYLLHRDARRWDDPDAFDPERWL
RPGGAPKKSESESESESESATDDRPMTDAPP
MAKGALRGTGPGGVYLPFGAGPRVCIGTGFAMMEATILVAMIASTVDLKTLPGRAPPRPRAL
ITLRPDNVELDVIPKRRGARARTGVGGVARSE*
>CYP746A1 gw1.12.369.1|MicpuC2
66% to CYP746A1
estExt_Genewise2Plus.C_Chr_040155|MicpuN3
same seq as CYP746A1 EuGene.0000120135|MicpuC2 with some deleted
SPTAENVGEMKYVEAVVLETMRLYPPAYMVGRCAARDVVVGGHSLRKGTTILIAPYLLHRDARRWDDPDAFDPER
GTGPGGVYLPFGAGPRVCIGTGFAMME
>CYP747A1 Chlorella sp. NC64A EST
59% to CYP747A1 Chlamydomonas, probable ortholog, N-term
GAAQQQEEGDGAMQTPGPSPFSXQSLMDVSLIFTEGLHEAMLRFSARYGPVSRFANPSAL
NGAAGWVFLNSPGDIAHVCAANPKNYAERFLPDIYKFVTHEKGLLGSGGDYNRRHRRLCG
PPFRSGELLRKFGEVVVDRVSRVADTFVSQPGPFATNVAVQTQRLTLDVVGLTAFSHDFG
ECQAIARDVGGRAQPSESDQDRLLWAVNAFGQVLGEVFITPMPLLRLMDAVGLRQLRTLR
EAVGVMRSSMLGVIAERRQHL
>CYP800A2 Micromonas sp. RCC299
EuGene.0400010034|MicpuN3
59% to EST from Micromonas sp CCMP490 EC847127.1
61% to gwEuk.7.488.1
Ostreococcus lucimarinus CP000587.1 60-62kb range
61% to CYP800A1
fgenesh1_pg.C_Chr_07.0001000025 Ostreococcus tauri
40% to CYP747A1 in three pieces
47% to EuGene.0300010745|MicpuN3
cyan possible intron
MMLATPRLSAATSAGPSSRRVSGSNRLVRRFTTRRAAVVRRVAADRENESSDAVTSRDLVDLSCGAPPCDDVVHEVPADV
PTPVEAGKSGDCPYTAAKDAVSSLLPGQRMRNKMPEGGPVMLPSNAFFKSLLKAGSSPVGMPEAMLEWVNMTGHETVGIK
NAVGPMCVSTVDPDIVEYVCHTNAKNYRLRMLPDAFRYVIKNKGITGSDGAYNREHRLMC QKPFMNSFSLDQFSRTVEER
VGLLCDAWAKAAARTDEGNLAIDIDDHSQRLTLDIVTSLAFNMDFKQVEGIDAQLNGGEEEWRKRSSGGDLDYDIIDKVL
HAYNNTSEIMGELFITPVPILKLQNLLGIGRVRELREGYAVLED
VGINHIIGRRRKELEENKQRGDNE
DYCLLDVLLKAT
DADGNPLPKEDVWGDVNDIMAAGHRTTASNLTVNLHHLARLPEVQRKVEQEVAALGGRAPTFRDVQEGRLQYTQRVVKES
LRKYAPINLFPRVVEGDDTLPTGHEVKQGDFILLSSWAMGRNPRVWDKPDEFNPDRFTEESLRQNAERLAKESCGPDATE
EELRLQMERMSRRILAGRDFTYTPFGSGPRSCIGGTFALLATTVALATVVQRFKWSTVATKDPGFEIPFLYDTTITFPKG
VHVTATPRAVPLGAERGSESQAGVGEESMAPAR*
>CYP800A2 Micromonas pusilla CCMP1545
est_orfs.12_7471_4272609:1|MicpuC2
75% to EuGene.0400010034|MicpuN3, probable ortholog
MERPVEIDVDDHSQRLTLDIVTSLAFNKDFKQVEGIDAQLNGGAGPDDSEISAMLDAYNNTSEIMGQLFITPVPLLKLQN
VLGIGRVRELKEGYAVLEKGIDNIINERRTQLEINSRVGDDEDYCLLDVLLRAKDSEGNPLPQEDIWGDVNDIMAAGHRT
TASNLTVNLHHVARMPEVQAAIEREVATLRGRAPTFKDVQEGRLQYTQRVVKESLRKYAPINLFPRIVESDDVLPSGHAV
EAGDFILLSSWAMGRNPRVWARPNAFDPERFTEENLRVNAERLARESAGPDADEETLRIQMERMSRRIAGGRDFTYTPFG
SGPRSCIGGTFALLATTVSLASVVQRFSFEKSSDPRKDHGFEIPFVYDTTITFPKGVHVCAVPRKVRLGADAEGGVGVGV
EEGSASEAAARDFARQAPAPAR*
>CYP800A2 EST from Micromonas sp CCMP490 EC847127.1
N-term seq.
59% to EuGene.0400010034|MicpuN3
DTVGVSARGASSRPRVNGVSMPPRSRHTRVRRLAIASAPSDAGDGVGAYAVADTSEQRAR
GAAVCDEEFDTTTTTAVNLETSKESTDCPYTSLNASLGGILPTKSKVPIGGPTMLDNSVF
FKSLLKAGSSPVGMPEAMLEWCDLTGQETVGIKNLVGPLCVSTIDPDIVEYVCHTNAKNY
KLRMLPDAFRYVIKNKGITGSDGRYNREHRRMC
>CYP800B1 Micromonas sp. RCC299
EuGene.0300010745|MicpuN3
45% to CYP800A1 fgenesh1_pg.C_Chr_07.0001000025
Ostreococcus tauri
46% to CYP800A1 gwEuk.7.488.1
Ostreococcus lucimarinus CP000587.1 60-62kb range
45% to CYP747A1 Chamydomonas
from I-helix to PERF
42% to CYP747A1 from heme
signature to end
47% to Micromonas CP001325.1,
EuGene.0400010034|MicpuN3
63% to Ostreococcus lucimarinus CCE9901 XM_001418784.1 I-helix to PERF
MSAIHSAVTGVVAGAVPAANARASARRAPCPRALAKSSSSSPLIIDGVSLRAGRSRARLARGARTRALSPGKEGDKTDAG
SGDSQAASGGECPVGFGKNGRFSAVGEAINGVKNAAYLSFMSSSTPLTKEEAAASAAAANPDDVVILENKIFLNALRQAI
ADPVGMPAAMLEWHRLVGHDTVGIENPIGPGCVSTIDPRTVEYICKTNAANYRDRLLPDIFRLVLKDLGVTGSQGEYNRQ
HRKVCQKPFINNNFLKTFSTVVEKGVGHLIESFDMASAKAGPGVGIVEDVDLHSQHLLLDIISPISFDYNFNLLDKSRTV
ITGEGKFEANPMLEAYHRSAEIMGQVFITPLPLLKLGAKFGVGRLQELIDAYDSLERMGDDIVKQRREMHKARRERGEEI
EQVCLLDTMLYLEDDQGNLAYTDEELWGDMNDIMAAGHQTQAATMTMSLLYVSRNPDVKAKIEEELASLGGRAPTFEDVW
DGRLTYTQSVVKETLRLHPPIHMFPRIATDKDVMPSGHKVEPGDLILLSTWAMGRNPKVWEEPTKFDPERFTDERLERLA
REQNPGADADEIERAVTMLKSGRDFIYTPFGAGPRSCIGGLFSLLTVTTIIASCIQRYDFEPDDDFLPADGEIPLRYDVT
MCFPKGLKMKLTKRDVEPGSRVQTPPAVASAR*
>CYP800B1 Micromonas pusilla CCMP1545
est_orfs.17_5534_4271487:1|MicpuC2
70% to EuGene.0300010745|MicpuN3, probable ortholog
MTGVAVDVSNRAQGEKRSAERDRNPSTRADWLESRSMLTVDFTVSPASRARASVTPGPRRALPHSPPRAPSKTMRACAAT
PAASAAASPRRRASSSSSSSSSSPSSLRSHRKTSVVGRRSTTTRAIDRARLRATSTDDTTASSSSSSSSSSGDEKSDNSG
GECPLGFGKNGRITNAVNGVKNAAYLAFMSSSTPLTMEEARASAAAADPNDITILENKVFLNSLKEAISDPVGMPAAMLD
WHKYYDRDTVGIVNPLGPGCVSTIDPRTVEYVCKTNAANYRDRLLPDVFRVVLKDLGVTGSQGEYNRQHRKVCQKPFINA
NFLKTFSSVVESGVGHLCDSWDMAAAMPGNEKGFTQDVDLHSQHLLLDIISPISFDYSFDLLGKSRNVITGVGTFKPNKM
LEAYHRSAEIMGEVILVPLPLLKLGERFGIGRLRELIEAYDSLEVMGEEIIKRRRDAHKSAEAEGKVFEQNCGREDGNLA
YTDEELWGDVNDIMAAGHQTQAATMSTALLYVSRDEKVKRAIEREVAALGGRAPTFEDVSEGRLAYTQSVIKETLRLHPP
IHMFPRLAAEDDVMPTGHEVKAGDLILLSTWAMGRNPSVWESPLEFDPDRFTDERLVKLARDQNPGADDEEIDRAVTMLK
SGRDFIYTPFGAGPRSCIGGLFSLLTVTTIVASCVQRFDFTPDEESLPADAEIPLRYDVTMCFPKGLKMRLRRRDLDGAA
EAATPGPAVAAAR*
>CYP801A2 Micromonas sp. RCC299
EuGene.0200010345|MicpuN3
C-term 52% to
estExt_fgenesh1_pg.C_Chr_07.00010215 Ostreococcus tauri,
C-term 52% to eugene.0700010342
Ostreococcus lucimarinus
Shows some fragmentary
similarity to CYP769A1 in Chamydomonas (EXXR to PERF and heme signature)
33% to Thalassiosira pseudonana
XM_002288591.1 with some gaps so actually it is
better than 33%
38% to Phaeodactylum tricornutum XM_002178008.1 C-term part
MLAGAPGTAAAAAAASVLALEGGEHALAWGVALYVICVATWTLADAVGARLVFRGKIPSVPWRPNLMFNVYATPRRNLLD
RISRRMAAARDGNDATTKHRHRHRAFATVVGNCPFAHVGGAALARAALDRQPVKSPLYRAFEAFAGVGIFTAEGDDWAAK
RSEVLGAFATAGLEPLAAASMRAAAGLTAEIDAQLAAAARGKGETRRSPLDDAEKGEGGEGRGGVGLEVDTAVEMDMLPR
LQRATLRATFEYLAGVDLPTAAAAEAGDALHDATDQPARPAGVWEDEYLAAATDLRALIPARARSVWMLSDWAYALSPLG
RLERTRIREARRLPELALRAAVPGSPLDHLRRGPAHSRRRLAGRSGGSIFGDWFLGLFRGLFRLGSRDPLLDEATTLLFA
GHDTQSATLSWALLRLAGDPAAQRELRASLADDPAAMESLGLAAGDKAASASTTRTEPVQPAWRTSAGAPVLEACLRETL
RLHPVAPFVARKLTSDAVASAGADGGDALTLPAGCAAGVWLHAVHRDPAVWVDPDSFDPSRWLITDSNAGGQGAGFSPRA
DSIHHSQGSNTPGVRVRFKGSGFMPFATGPRACVGQHLAWVYMRVVLARLVCAYEVRLAEGEGEGDALTPSVGFTVTPAN
AARVRLVPVGG*
>CYP801A2 Micromonas pusilla CCMP1545
fgenesh1_pg.C_scaffold_4000274|MicpuC2
52% to EuGene.0200010345|MicpuN3, probable ortholog
MIAGTPGAAATSAAVVILGAEGGARGLVLGAGCYAILCVVALVVDLARARLALRGRVPIVPWRPNLMFRVYATSKRSLLD
RVSRRMANPARVGVGDGDGDGDGATTKGGGGALDADPPARPHPPRAFGTVIGTCPFAHVGGASLARAALRAQPVKAPLYR
AFEAFAGGGIFTEEGERWEAKRAEVLRSFAVVGLDALAEASTRVATELARDIEDSASSSSGVETEMLPRLQRATLRATFE
YLTGVTVPEAAAAAAAVAAARDRAKRATSSSSRGDPLEVPLRGAPASFADAEEDEARRTAARWEDEYLAAATALRHLIPA
RARSVWMASDWAYRLSPVGRIERRSIRAARKLPTLAVRVAKPGSPLDVLSKGVAHGGGGGGGGGKGAIRLRRRLAPRRDD
DDGFGPPPPKALVDEAVTLLFAGHDTQSATLSWALLRLASDEKTQTRLRDSILADADAARDLGLDDLLPADAPPVDRAVS
RPSPTSGDANRRQRQPAWSAAARAPVLEAVLRETLRLHPVAPLVVRKLTSDAVDDGDATSSGGTTTLPEGCAVGVWLHAV
HRDPAVWSEPETFSIGRWLTTARAWRAERAGADASREDDGGSGGVGGGDDEHGDDVVVRFKGAGFMPFASGPRACVGQHL
AWVFMRLTLARLACAFDVRPGSRREGEEEGEEEDPLTPSVGFTVTPANAARVRLIPRRAGRA*
>CYP802A2 Micromonas sp. RCC299
EuGene.0800010406|MicpuN3
54% to ost_07_008_042|Ost9901_3
Ostreococcus lucimarinus XM_001418626.1
49% to fgenesh1_pg.C_Chr_07.0001000106 Ostreococcus tauri
MTRGQRRERTDTSQTPSPRLGHVPLEFTSRMRVSVYSTSVHSTSCLGAYTAAARDARRLRHRRLATTVCRNKREISSKLN
LDLDLGQVLGTVKRNEGLYDNLPPGKVGLFGLRETFAYLNDPNKFIRTRVEKYGPVFKTAFFFKPAVVFGSPEAIREFKD
FEGDLPADAALPETFRELHTEYGALRMSGERHKATRANFGKVLGRSALTHYTPILAKLTRDFVKGELLEKRTLQPGYDCR
QFCLKALFQLFLGTVPPQDIMEKMYFYNEGLLALGKLSPEFTEGKKALEDLQEFCLKHFRTVRAQGKLDDPEYFFLKQYS
QATDENGDLFTDERVAVTTILMIWGAFIEAAASMGHTTWLLMRNPDKAKKVRAECRSSFSREELDSGKLTLDDVYTKLTF
TECAIKEALRVMPQTAGGLRVNPETRTLAGFTVPSGYVLTADPRIAFLNPDFFPDPEDYRPERFLPAENPAITPDNFFPG
GMGQHKCPGISLSNLMVSIYLLYLYSCFDKWEPDMSAEEMDSEDPQYIQVPIVIIDDRYQLKLERNWQYEM*
>CYP802A2 Micromonas pusilla CCMP1545
fgenesh1_pg.C_scaffold_5000331|MicpuC2
62% to EuGene.0800010406|MicpuN3, probable ortholog
MPTSATAAGALASARALAPPPPRIRRAPRDSSPARQRRRPRRVIATRAAGFETDIGQAIGSIVRKEVTLNLPPGRVGLSG
IRETFEYLQDPRGFIERRVEAYGPVFKTGFFFKPAIVFGSAEAIEEYKRFEGELPADEALPETFRELHTAYGALRQSGAQ
YKATRANFGKVLGRAALTHYAPIIARQTRDFVRGDLLASGTLQPGYECRQHCLRSLFELFLGAIPPEDTMMKMYDYNEGL
LSLGKLTNEFEQGKAALETLTEFVLQTYRRVRASGELETDPRYFFLRQYSTATDENDELFPDDRVATTVVLMVWGAYIEA
AASMGHCAWLLMRNPDAAAKVRAECKRVLSPDALASGNVPIETLMELKYTEACVKEALRTVPQTAGGLRINPTTRTLAGY
DVPAGYVLTADPRIPFLDAKNYPEREKFQPERFLPEGAAAGNVVNGETYFPGGMGQHQCPGISLSTLMTQTFLAYMTTTF
DGWTPDLSGEGSEDPAYVQIPIVIIDDRYRLKLEKNWQFDTFDAK*
>CYP803A1 Micromonas sp. RCC299
EuGene.0800010288|MicpuN3
no good hit, 28% to CYP747A1 Chlamydomonas
69% to Micromonas sp. CCMP490 EST EC847008.1
MSALVANAVGPVAVARRPPRRMRHAPRERVSAPRASRADDLPGILLGAAAKKLEEDVNSFIGLFDEDAPTHERPPTLPVA
GNTLDIAQGGHRQLLEWAETYGVADGVHEVKMLSQTILHLTDPKLARELMFERSDSFPDRGVSAMAKFFREDQAAFVNTS
GEQWMAYRKMGTATVNGGALDRLAGKVAERSEALVTRWVRDASASGDGRNATEVDISDASQAVTLEVIHEALFSEQLDVI
DGERNAVALARSFREFNVANQDLLNDFLTLYQRFETPERARRDTHRRRLRAHFDERADARRAAIARDGAAAAPRDLLTAL
LTARDPATGAALTRDDVNLTLTEMMVAGHDTTAATVACMMCLLASHPEVRASVTGEVDEFRRNNGGRLPSSVADANALVK
LDDAMKETMRLYPAVLIVVRKAEEGTGGVFTKGPGREVRIPEGSGMWVSPYVLGRLARHWGGDEEDVKRFRPSRFEEARE
RGDSLDAYMPFGGGPRVCLGSRFAMLEGKVLAAHILADWDVELAAETRDAIARNDGELPIAYAAGLMSFPEPLRLRVRRR
GAASGPR*
>CYP803A1 Micromonas pusilla CCMP1545
EuGene.0000020033|MicpuC2
57% to EuGene.0800010288|MicpuN3,
probable ortholog
yellow may be part of an intron
MKEAMRLYPAVLVVVREDG
LFFPADLSAHQPSVSIPALGAFQLRLTPLNSTPTFASLVWTLDPQ
AEELGDGGDIVVEPGP
DDSARGRRTIRVPPGTGLWMSPYVYGRLPKCWGEDSDEAVARYRPERWAAMRENGESAPDAYMPFGGGPRVCLGSRFATL
EGKARAISHWSPYDRVRVVNADP*
>CYP803A1 Micromonas sp. CCMP490 EST EC847008.1
69% to EuGene.0800010288|MicpuN3, probable ortholog
45% to CYP769A1
Chlamydomonas, 42% to CYP746A1 Chalmydomonas
GAQMWISPYVMGRLPHLWGGDVEDTKRFRPERFAEFRDQGIKEP
EAFMPFGGGPRVCLGSRFAMLEGKVTAAAILRDFD
>CYP804A1 Micromonas sp. RCC299
EuGene.0900010042|MicpuN3
no good match
Hyaloperonospora parasitica
CU855855.1 7300-8000 range ~36% match in two pieces
(a stramenopile; Oomycete)
60% to Micromonas sp. CCMP490
EST EC846298.1
41% to Phytophthora infestans CV943686.1
MGGYFTSDRAAKYGWGVPRLALVTLFAFEIVPRAFLAGSRAPVDALVSTPLPFTARAAAAPFAPLTLVPYLHSLRWIAHA
YTGSIGAGVNVQVLGLLGNLAIMAVAGGLCLWDLTKDISLVVYFLKTFCGDQLAGKLDTAEWKLVLAFTFALPLAMWGAA
GANGAAFAMFLPYARIAPILFAIQAVCEFGDAHLEYHPVIGKFFRHRYGFEAFTLCALTMMPGGITAPELRVVQFDLVIC
LFYRVANLGIILHKQGFAVAAAASLAVTIRKGCFRVFGALSGQRIVNVTDAEVATAVMRASDVKGDALERHVATPAWRPL
LSLESVDHELYRNMLRDFHAVVKACPPPQRVGEIARAKVDELMYRTYSEEAEEASPVRGGAEPSPPPSPERPVVDVTLAD
SPVDSPHPHGGKGGDVGKCPFVQMQRTMRGGASGQSNAAGRSTPARSDAAPVIDADDVARLSLSVFIEYLFGREWEPKFE
TLLAASWEWRKEIAVRGRADPGVKKAAVELVVDDLIKNSHLWDLFGEKWREPRYYSLIMQPFLVSPAINVGDIAVAMKAH
PDLALEPAMRRMHPFPIFERWVDKDVVVDGRIAVRADTQVIMFTSDFANSKHLWPAFGTGPRACAGTSMALGVLNAIHQK
MLGRPGFEPERGHKFSGRNNDGVTSLSEVWYFAKTVLPVVFGFGGEKTTEAAALERAAAAALE*
>CYP804A1 Micromonas pusilla CCMP1545
EuGene.0000070606|MicpuC2
60% to EuGene.0900010042|MicpuN3, probable ortholog
MSGASRGDGGGYFTNDRAVKYGYGVPRLALVTLFAFEIVPRLFLAAERGGSATSAATSSASAADAAFEFFTSSPPADPRR
WALTLAPYLASRRFIGHVYIGSIGAGVNVQVLGFFGNILMLLLAAFACVWDAKRDLSIVIYLMKTFAGDQLAGKLDTAEW
KLVLFFTFGCMAPWVYFASGSMDAAASAFALYARILPILFLVQAVCEYGDAHYERFVFFRHRYGFEFFTLLALTSLPGGI
TPEELRVVQYDLVICLMYRVSNLIIIANKNGLCSGVLGCFTIAARKLCFAAFGVLRGQRLVNVTDPEVATAVMRASVVKG
DALERHVATPAWRPLLSLESVDGELHSSMMRDFHDLARRLPSPGRLGEIATRRVTELIRAHELKARIAKAAKAKELGVDV
DVAIDARAVARLSLTVFVEYVFGRRWEPAFEPLLDATWEWRKEIAVRGKADVKLKRIAVDVVVNDLLRKHPVLWHVHGED
WAKPRYYSLILQPYLVSPAINVGDIAVAMARHPNLSLENAMRRMHPFPIFERYVDRDVLIRGKVVVRAHTQVIMFTTDFA
STRYRNAQWPVFGAGPRACAGTQLALGILAAIRNNMVGHELFEPTIGHRHSGRHNDGVTTPAEAFYSAATILPIVMGWRG
DEFSDDAESLELAAAAALSGRVR*
>CYP804A1 Micromonas sp.
CCMP490 EST EC846298.1
60% to EuGene.0900010042|MicpuN3, probable ortholog
56% to EuGene.0000070606|MicpuC2
DAEIKKKAVELVVNDLLRNCDKLWAIHGEDWQEPRYYSLIMQPFLISPAINVGDVAVSLK
RNPHLKLEHAMRASHPFPIFERFVDEDVFIGSGLKKKLAVRKNTQVVMFTSDFCGSAIPW
PVFGAGPRMCAGTGMALGVLRAVATGFQNTDRFEPERGHKYSGRHNERVASFAELAYFVK
TVVPIVLGLG