Cyanidioschyzon
merolae P450s 5 sequences
These two algal genomes have P450s that cluster into five groups.
#331 is a CYP51 as is contig 1016. #4765 and #4211 are both CYP710 sequences. Both algae seem to have two that are recent duplications, not in the common ancestor. #444 is probably the ortholog of contig 1062. #201 is the probable ortholog of contig 1041.
Contigs 454, 981 and 989 are unique to Galdiera. They cluster together on a tree. Contig 981 and 989 are quite similar (40%) and are a recent duplication. Any orthologs to these sequences are lost in Cyanidioschyzon. The alignment below suggests some revision may be needed to some of the sequences.
Alignment of the 13 sequences (Phylip format)
13 628
331
-------MGT ILAALANQFQ ALLARARDGD TDAIGWLALG VAALVWLMTR
CYP51con10
---------- ---MLSQDSI ALSTLTSSLE AYCWALVYIL STILFFGILW
4211
-MIRRPRYAL DASGSSRVGG SVARHCLRRA TLRWDRR-LG YRQPCAGVRC
4765
MIETASLTRS TPVAAAEWVV SVARHCLRRA DAALGIAGWG TASRALAYVA
710B_gen ---------- ---------- ----------
MDIVSFNSLS SGNLIILLVV
710B1_like
---------- ---------- -------MQL TEFDSFNKFL SGNLVFLGVS
con981 ---------- ----------
---------- ---------- -MLQLWIVLV
con989 ---------- ---------M
MMSCLAVSLL QLSNLSQDWS RVFKLFILAA
con454 ---------- ----------
---------- ---MASPNEI ARWFLQTRTK
con1062 ---------- ----------
---------- ---------- -MWIGLLLFF
444
---------- ---------- --MFPVFIWF VIVAIAIVTF TFGVPGTLLR
201
---------- ---------- -----MILAR NLLHVPRPSL YVVAALGAFS
con1041 ---------- ------MSLM
KRVMFLGGQV ARWLVNGGLL SLVFVDLAFS
AAQALATLLS
PAKPAEGLEL AYPPLYTEGF PVVGNVAAFA R---NPLQLA
RITGSFFLSK LGIAREVKGQ QLPPTYKEGL PLVGNLIAFA K---GPLNVV
AAGARPRDRR QVRLRRWAGY GRIPGPRFVI PLIGSIVEMV L---SPYGFW
LLVLGLAIAE QVRLRRWAGY GRIPGPRFVI PLIGSIVEMV L---SPYGFW
ITMICYFILE
QLHYFWWKRS SKLPGPSFTL PFLGSIIEMV K---NPYQFW
IALVCYLLFE QLRYFWWKRS SKLRGPSFTL PFLGSIIEMV K---NPYEFW
TFSCLFLYV- FILPKWRNRH IPGPRPSLLL GNVSELSRQG G---TAPLVF
LFWTVFKFLK YVYPYWRFRN IPGPPPKWPV GNIFELLRKP G---QEHRIL
LHSYLFSYCF MFLAPRIALV SPEAAKHVMV KNVRNYVKPP M---VRQGLS
VLVFTLYLVR QNTCTGNKAN LSYSPVCKGL PLLGSALEFG K---NPLKFL
RQRRKQLRGD TFPVRDVSAE RLPPCIRGWC PYVRAGFQFA FGAAPPQSFL
LFQVGSTVAK YALWFWKQKR AGRPVSRGLL EVVTALGPVP VPG-FGNALY
RWSLEDLWCT PPSQSFNCIG QGRDHSMDHK SKQVIFVYII FDCNLPSLPG
KRAYAQLGDV FTLRVGPKRF TFLVGPRAHE VFFRANDDEL DQG---PVYG
QRGYQSCGDI FTFKVFHKHI TFLVGPKAHE IFFQGTDDEL DQN---EVYA
ERQRRLNPCG LSWNALAGFF VLFVTDTDLS RYVLSENGPH AFE---MILH
ERQRRLNPCG LSWNALAGFF VLFVTDTDLS RYVLSENGPH AFE---MILH
EKQRLLDPQG VSANFLVGRI TLFVTDSALV RAILNNNSAR TFL---LALH
EKQRLLDPQG VSANFSLGRI TLFVTDSALV RAILNNNSAK TFL---LALH
ERFRKQYGDV FQIWSFYRQI VVISHPDDIK YIIVTKNFPK A-----EEFN
LQYAKQYGPT FQLWYLNRRT IIVANPEDAK FVLATRNYPK S-----PIFC
N-LLGNKGIL LAEGDDHARQ RRIILPAFHF DALVHLGPIF R--------A
QECRKQYGDV FTVLLPGRRM TFIFAPTQEL RKIFFNGSPN LIS-----FT
EAARKQHGDV FTVEMFGRRM TFLFSKNGIG QFFDSAPSKV SFIRAVEPFT
LSRVGFFRAL YESCLERQVT VFWLSSTPLL IVSAPDLVEQ VLT----SRT
PSPWPIVGNC IPLSSNLYQT LYQYVEQPIS LYFIASTPFV VVTDEAAVRK
FSVPVFGRGV
VYDAPLAIRL EQFRFVSTAL RAARLREYVP LMVAEAETYF
FSVPIFGKGV VYDAPLEKRL QQLRIMSAAL RPARMYGYVD QMVLEAVQFF
PNGWRILGRN NIAFKSGEEH KLLRQSFLRL FTPKALGVYV SIQERLIREH
PNGWRILGRN NIAFKSGEEH KLLRQSFLRL FTPKALGMYV SIHERLIGEH
PSARLILGKN
NIAFMHGQEH KELRKSFLSL FTRKALGVYL TLQETSIKSH
PSARLILGKN NIAFMHGPEH KELRKSFLAL FTRKALGVYL TLQETTIKSH
LSLSPLAGRG LLTVGKSQHQ ERRRAISKHF NEDFLRQLHR HMRVELMILL
RCFSPLG-HG LLTLSQEEHP VQRKAISQRF NEEFLQSLHH HLTAELEVFM
QGQQVVQRWL NRPEEAIDVH LDMTQVTMNV IALAAFGYDP NTDSGQELYR
AGVEPLTCRI FGISKKGFSM AHRSLLTTLR SELGAKHIPQ LAHRLINRYL
DGIFGLAPTY FEKILHMLLV QLREELHLDK GSDRGLARHL TGFAEALRMA
FEKPQYFGYR SRTVKTALEL HQRAELLREK IETADPDQQQ RVREDPSRAA
VLGSGMYQKP KYFGYRSSTI RYSVEMNQKL ILTNEQMRQQ QADSSRKALK
KRHWHG---- TDGTAD--LL KSLSELIILT ASRCLMGREV REQLFEEVST
RKWGD----- -QGQVD--IL ESLSDLIILT ASRCLMGREV REQLFEKVSK
LAAWLS---- GRGTDGTTAA FEVRPRIRDL NLRTSQTVFV GPYLRD-RER
LAAWLS---- GRGTDGTAAA FEVRPRIRDL NLRTSQTVFV GPYLRD-RER
LQKWIQ---- LSKEN---DE MEMSFLCRDL NLETSQYVFA GPYIGEQRDQ
LQRWIE---- LCKEK---SP LEMSFFCRDL NLETSQYVFA GPYIGERRDE
SKLQQV---- TERKESIDFD KEATSYTLDV MCRTGFGCTA NTQEDA-SHP
AQMDAL---- CDTERVVDLD ALISALTLDV IARTAFGVSF TAQTSQ-HHP
AYRDIF---- TQRPPSRMLA MLFSLLPSWL LQSMPLSRLL RRQQSN-VRL
FTFRTV---- WGKEDEKEAS NLLTETLSDA SLRVIFGDEF ANASPS---L
MRQQMASLDG QAVAHVDDLF DLCGRLIFTA SFQTVFGREC ANALNK-DGR
LLDLIDR--- SLTEIASSTE KFLAELRAED NVGKDASKLI QRFYVQLNCK
VMIDS----- -KVSDIIDGM IEAAEAVVHA VDGREQVENI RRKVIELNLN
LYHQLDQGMQ
PLSVFAPHFP CKAHW----- QRDRARREMR RLFASIIANR
LYHDLDQGMQ PISVFAPYLP ISAHR----- KRDKAREEMV QLFRTVIQNR
FCADYLLITQ GFLSFPLALP GTGLW----- RAIRARERVL RDLTACVRAS
FCADYLLITQ GFLSFPLALP GTGLW----- RAIRARERVL RDLTACVRAS
FCHWYITVTK
AFISAPVFLP GTNLW----- KAYFARKKIV ALLENAVIQS
FCSWYITVTK AFISAPVFFP GTNLW----- KAYFARKKIV ALLENAVIQS
ISRAVNVSLR EMYHNLVAYP IRNCFGLYSS PALKNATGVI REFASQVIEA
MPHAVLTLLD ELVNNMIFYP YRFWLSHITQ KRLNEAINVI RKFCNMVIDL
VKKKVTEIVQ KRREEYEALL VKDSN----- AMG--KSTTN RDLLDMLVAA
FKDFVDFDEW FELAATPLLP HFLLR----- PFVKSRRKLL DTISQNWKYT
LYETFVAFDR EFELAALPIP QFLLR----- SFSRARRQLL RAMQRAVRLV
VLFGLVVDNA EATRIASAIE KAGAEFSR-- RMILPQRALY AWFANVSYIY
VLFGYKNDKD VGSLSHIIFE AGKEFILR-- -TVNPFRIGW RWMANFRFFQ
RKRYQEIAAQ AESVGQDPQQ ALEAAKEVDV LQVFMDSQYR DGS-RLTDDQ
RR-------- ---------- --RNVKEDDM LQTFMDASYR DGS-RPSEYE
KERF----RK DAEAEPQCLL DFWTVSVLEE VAAAKRENRA PPK-YSADHE
KERF----RK DAEAEPQCLL DFWTVSVLEE VAAAKRENRA PPK-YSADHE
KK------YI GNGGTPRCLL DFWTQRVLEE MEEATQQDKE MPS-YSNNRK
KR------YM ADGGSPRCLL DFWTQRVLEE VEEAAQQGR- SVS-YANNRK
RRT------- ------ESEE D-KTRRPLDL LDIFLKMDN- -----LSDQN
RLQ------- ------ESRE E-KSNRVRDL LDIFLESDE- -----TRD-N
RD-------- ---------- --------PE LEKKSSHLP- ----YLTDEE
KN-------- ---------- --------AP IHKLTEAYG- ------NDGN
EP-------- ---------- -------ATP AGKLLAKLD- -----SDEKV
HTS------- --------VL LIFGQKILRH LRISSNSWIN GWLGKAGRLR
YVFS------ ---------L ITIGRRVCQH MDSQPATWVH GWVGKVGKIG
ITGLLIAVLF AGQHTSTITG TWTGLLMLRK PELVTRVRAE QEQVLYDDDG
VAGLLIALLF AGQHTSSITG SWTGMLLLRN KDVFERVKKE QDTIIEEHG-
MADAMLDFLF ASQDASTASL VWTTVLVAER PDVLQRVREE QQRLRPHDE-
MADAMLDFLF ASQDASTASL VWTTVLVAER PDVLQRVREE QQRLRPHDE-
MAETLMDFLF ASQDASTASL TWTLALMSDY PDVLKKVQEE QKRLRPNNE-
MAETMMDFLF ASQDASTASL TWTLALMADH PDILKRVQEE QKRLRPNNE-
IIAEIATFLV AGHDTTSHTM SWLIYEVCQH PEIEQKIQQE VDTIWGDRQD
VIAHVATFML AGHDSTSHTL SFCMYEIAQN RDIERKLQEE SD-RFIVAQD
ITSQALTFMA
AGQVTTAVLL SWTLFELSIH PSAQEKLRQE LQTMETTLST
VPSLLLSALW ATWSNVSPTS FWTLTHILAD EKAKVKVLAE VEKSCPLLLS
CASVLLTLLW ASSANTLLSA GWLIVLSAEW CNRARPVTES MA--------
RLAKVLGLLM AATQTVPIAA SWALILVSVH EDVREKLACE ARRLLSEDLS
KLGKVVGLIM ASSQTVPTTC LWLLFLLSKY PQVVEKIREE TSRVLHSTKK
C--------- ---------- ---------- --FKIDYDAL LR-LDVMHRC
---------- ---------- ---------- --DELNYDVL SK-MNLLHLC
---------- ---------- ---------- ---PITYELL EQ-MVYTRAV
---------- ---------- ---------- ---PITYELL EQ-MVYTRAV
---------- ---------- ---------- ---PLSFELV ES-MTYTRQV
---------- ---------- ---------- ---PLSFELV EN-MTFTRQV
---------- ---------- ---------- --WMLSFEEI GQ-LEYLNKV
---------- ---------- ---------- --RIVPFDQV GH-LDYTRMV
QD-------- ---------- ---------- --ITEMVQHL DK-LEYLDVV
SK-------- ---------- ---------- --TELSLEWI FSNLPFTAYC
---------- ---------- ---------- ---------- WE-------L
VNGGRGVAQQ GSQTSQTTPA QKANLTASDI ATDVNRLSKL LKKHSYFDAV
QS-------- ---------- ---------- --MEEFTVDD LNELAYVDCV
IKEALRMYPP LIFLMREVVI PRTYRDYV-- -IPKGDIVVV SPPLAMSLPE
IKETLRMYPP LILLMRKVLK PKFYKEYV-- -IPENDIVMV SPAASGRLEN
VLEVLRFRPP PVMVP-QVAS KRVQLPNG-- -YEVPRGALV VPSLWTACMQ
VLEVLRFRPP PVMVP-QVAS KRVQLPNG-- -YEVPRGALV VPSLWTACMQ
VKEILRYRPP AVMVP-QNAM GSVPLTEN-- -VTVPKGSFV MPSIWSSCMQ
VKEILRYRPP A--------- ---------- ---------- ----------
WKETLRKHPV AATGTLRRLD TDVTLPSCGM LLRKNTAILV PIYLVHRNPE
WNEALRTHPA AANTSVRCAD RDDVLPGSGI PITKGTGLMV SSYLIHHLPQ
LHESLRLHPP
VLFITRQAVQ DDEILGFP-- -ISQGAIVNI PIVALHRDPE
VSETLRLYAS VVDIR-KVVE NLEFREFI-- -IRKGDYLCI SPAVSHRETT
VTETLRLASS GIAVR-IGCE PLYVDDFR-- -IPAGDYLCI SPWLAHQDEH
IAETLRLFPP FPLIQRQAQC DTHLGDVF-- -VPGGTLVAA VPWLQHHHPA
VKECLRLYPP FPLLQREPEM DDILENVK-- -IPARTPVYI VPWLLHHHPK
VF-ADPDRYD PDRYA---PP REED--KRVP FSHIGFGGGR HACMGEQFAY
VF-KNPNAWD PDRFG---PN REED--KKAP FSFIGFGGGR HGCMGEQFAY
GF-PSPERFD PERML---PP RQEE--QKYR KHFLTFGCGP HMCVGRNYAI
GF-PSPERFD PERML---PP RQEE--QKYR KHFLTFGCGP HMCVGRNYAI
GF-PDAYKFD PDRMS---PE RQED--IKYR QNFLTFGIGP HVCVGREYAI
---------- ---------- ---------- ---------- ----------
FW-PDPETFE PERFT---RE NTM---KRHP FAFQAFSNGP RNCIGQFFAT
YW-ENPDHFI PERHT---KE AVR---QRSP YYFLPFSRGS RNCIGQFVAN
QWGPDAESFR PERFLSSDKN NVVI--QRHA MAWLPFLYGT RACTGQRFAM
LF-PQSEDFI PDRFQ--KQG THPN--AVFD KDLLTFGGGF YKCPGQSFAM
RG---GKRFD PCRYQHIEDK KALF--RGRT RQLYTFGGGM YRCPGQEYAL
YW-KQAESFN PERWISPTDN VARHGDAPSD YCYIPFGRGR RMCAGNPLAM
YWK-QPEDFI PDRFMY---- NASHGDAPSD FVYIPFGRGN KMCAGYHLAL
LQIKTIWSVI LRDYDLEPVG PL----PLPD YSAMVVGPKP PCLVRWRRRT
LQIKTIWTVL VRSFDLEPIG DL----SQPD YNAMVVGPRP PCLLKYRKKK
NHLMCYLAVL ATTVDWTRVR TV----HSDE IIYLPTIYPA DSVIRARWRV
NHLMCYLAVL ATTVDWTRVR TV----HSDE IIYLPTIYPA DSVIRARWRV
NNLIAFLALI STECKFQRYR TK----KSDD IIYLPTIYPG DCLMKFV---
---------- ---------- ---------- ---------- ----------
HEALTTLSSL YHFFTFRLAC RA----EDVK PYHAMTMKPS VGKVSEDAKG
HEALTILSTI YKRYEIRLAV GA----QEVE EYFRVTMKPH CRFYVQGKKD
LEAKTILFEL
LTKVSVRLQP GC----EVKG YGMVSVPRDV RLQVVDLHKE
VEIVLLIALV FYLYDIQLVD RV----PKMK ESQSVGIKKP SCSCRIHYLW
HELVLFIQIF FEFVERVELG VAGDPLTSMD AYRLVGIKRP TRPFPGKLFL
VELKCLLLLV ALQEPDLLFD LEP---QETS PEPQAGMRFP PLTMRPPRFA
LELKILTIYV CQYYDWKCSF PQ----GKEP VSKKYPIETI THSSCNRFFF
P--------- ---------- --------
DSFLDRVSLY A--------- --------
ADASTGAGTG AGTDTACTSE AVPVVVAS
ADASTGAGTG AGTDTACTSE AVPVVVAS
---------- ---------- --------
---------- ---------- --------
--VSEYVKLP VWVTPRNTMA HLREE---
PSLDAHLGLP VKIYSRKCYS --------
---------- ---------- --------
KRRLAGMEEI ---------- --------
ISSCCDLRCG DPHPSERW--
--------
VRQEEPIHRK LCQQNSNEL- --------
IMQLLSIGNV S--------- --------