CYP34A8

 

>CYP34A8 B0213.14 499 aa

         Length = 499

 

 Score = 2442 (859.6 bits), Expect = 4.5e-257, P = 4.5e-257

 Identities = 484/499 (96%), Positives = 484/499 (96%)

 

Query:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

             MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK

Sbjct:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

 

Query:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

             QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF

Sbjct:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

 

Query:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

             WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG

Sbjct:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

 

Query:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

             SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP

Sbjct:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

 

Query:   241 ---------------VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTYVLSII 285

                            VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT     

Sbjct:   241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT------ 294

 

Query:   286 RKFSIACFSLETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345

                      LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT

Sbjct:   295 ---------LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345

 

Query:   346 RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405

             RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML

Sbjct:   346 RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405

 

Query:   406 HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465

             HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD

Sbjct:   406 HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465

 

Query:   466 LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ 499

             LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ

Sbjct:   466 LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ 499

 

Compare to seq below

 

 

>CYP34A7 B0213.12 499 aa

         Length = 499

 

 Score = 2193 (772.0 bits), Expect = 1.1e-230, P = 1.1e-230

 Identities = 405/498 (81%), Positives = 456/498 (91%)

 

Query:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

             ML+ILIL+A+AAVLT NLWRARQKLP GPTPLP+IGNFHQLFY  WK G LVA F++ +K

Sbjct:     1 MLIILILVAIAAVLTVNLWRARQKLPNGPTPLPIIGNFHQLFYNGWKYGGLVAGFDQFRK 60

 

Query:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

             QYGKVFTVWMGP P+V ICD+D+AHETHVK+A+ FG RY+ G MEYIREG+GIIGSNGDF

Sbjct:    61 QYGKVFTVWMGPIPAVQICDFDVAHETHVKKAHTFGHRYTFGAMEYIREGKGIIGSNGDF 120

 

Query:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

             WLEHRRFALMTLRNFG+GR I+EDKIM+EYRYRF+DFK+T+FK+GAIQVNASS+FDLLVG

Sbjct:   121 WLEHRRFALMTLRNFGLGRNIIEDKIMEEYRYRFEDFKKTNFKDGAIQVNASSLFDLLVG 180

 

Query:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

             SIINQLLVSERFEQDD+EFE+LKT+LA ALEN SIIEG LPLW+LKS  MKWRTK TFAP

Sbjct:   181 SIINQLLVSERFEQDDEEFEELKTNLAMALENGSIIEGVLPLWMLKSRFMKWRTKTTFAP 240

 

Query:   241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTLETLAI 300

             FDF+FE+G +GIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKK+GIDSSFTLETLA+

Sbjct:   241 FDFVFEVGKKGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKDGIDSSFTLETLAV 300

 

Query:   301 DLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGTRGVSLTDRTKTPYLN 360

             DLFDLWQAGQETTSTTLTWAC CLLNHPEVVEKLRKELTEVTGG RGVSLTDRTKTPYLN

Sbjct:   301 DLFDLWQAGQETTSTTLTWACACLLNHPEVVEKLRKELTEVTGGARGVSLTDRTKTPYLN 360

 

Query:   361 ANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSMLHTDEEIFKNPQEFRP 420

             A INE QRI+SILNVNL R+LEED  ID  PVPAG   TT L++LHTDEE FKN +EF P

Sbjct:   361 ATINEVQRISSILNVNLLRILEEDAVIDGHPVPAGTAFTTQLALLHTDEETFKNHKEFIP 420

 

Query:   421 ERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYDLEPVGTLPKIETTTP 480

             ERF+ENNNLEKRLIPFGIGKR+CPGESLA+AEL+LI GN+++D+DL+PVG +PKIE+ TP

Sbjct:   421 ERFLENNNLEKRLIPFGIGKRSCPGESLAKAELYLIIGNLVIDFDLKPVGAIPKIESPTP 480

 

Query:   481 FAPMKRPPVYDIRFVPRS 498

             F+P+KRPPVYDIRF+ R+

Sbjct:   481 FSPVKRPPVYDIRFISRA 498

 

CYP23A1

 

>CYP23A1 B0304.3    U39472 529 aa (revised 12/8/2003)

         Length = 529

 

 Score = 2756 (970.2 bits), Expect = 2.4e-290, P = 2.4e-290

 Identities = 529/534 (99%), Positives = 529/534 (99%)

 

Query:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

             MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP

Sbjct:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

 

Query:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

             VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD

Sbjct:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

 

Query:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

             SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS

Sbjct:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

 

Query:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

             LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV

Sbjct:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

 

Query:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

             MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED

Sbjct:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

 

Query:   301 LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 360

             LTYAYMIEVEKRKRNG     FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ

Sbjct:   301 LTYAYMIEVEKRKRNG-----FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 355

 

Query:   361 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 420

             KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK

Sbjct:   356 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 415

 

Query:   421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 480

             KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR

Sbjct:   416 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 475

 

Query:   481 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 534

             AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK

Sbjct:   476 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 529

 

Compare to the sequence below:

 

>cb25.fpc3052 ACC=CAAC01000064 86% to CYP23A1

            Length = 534

 

 Score = 2430 (855.4 bits), Expect = 8.4e-256, P = 8.4e-256

 Identities = 454/534 (85%), Positives = 493/534 (92%)

 

Query:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

             MPS++ AILL++ T+FLYFRIM++   D+Y  TYIY   C   Y  YE NYKRRRLPNGP

Sbjct:     1 MPSIKYAILLSVVTSFLYFRIMKSFEMDSYTTTYIYLFFCTISYIFYESNYKRRRLPNGP 60

 

Query:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

             +PWLVAGNMPSFINV NVD LF  WKQ+YGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD

Sbjct:    61 MPWLVAGNMPSFINVKNVDDLFLYWKQRYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

 

Query:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

             +FSNRWRNFVTDSIMEGSNGIVQIDG+KWREQRRFALHTLRDFGVG+PLMEQMITLEVT+

Sbjct:   121 AFSNRWRNFVTDSIMEGSNGIVQIDGDKWREQRRFALHTLRDFGVGRPLMEQMITLEVTT 180

 

Query:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

             LMNHM KSCGL  KE++LCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLH LLDDQSHTV

Sbjct:   181 LMNHMAKSCGLSTKEVNLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHSLLDDQSHTV 240

 

Query:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

             MQPIMGAYIAFPVT+K+P INGEWNRLMGIK ELLEFLE QI+ HR NWK+EM+EQEPED

Sbjct:   241 MQPIMGAYIAFPVTTKVPFINGEWNRLMGIKKELLEFLEGQIQKHRENWKEEMMEQEPED 300

 

Query:   301 LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 360

             LTYAYMIEVEKRKR GEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLM+KN +VQ

Sbjct:   301 LTYAYMIEVEKRKRAGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMSKNPRVQ 360

 

Query:   361 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 420

             + VQ ELDSI QPM+EIQHRTRLPY+QATINEIQRIANILPINLLRTVAEDI+IDGY FK

Sbjct:   361 RKVQEELDSIAQPMVEIQHRTRLPYIQATINEIQRIANILPINLLRTVAEDIDIDGYQFK 420

 

Query:   421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 480

             KGDL IPQISILMNDPEIF+NP++F P RFLDE+ NVKKIDEFLPFSIGRRQCLGESLAR

Sbjct:   421 KGDLTIPQISILMNDPEIFKNPKDFCPERFLDENLNVKKIDEFLPFSIGRRQCLGESLAR 480

 

Query:   481 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 534

             AELYL+FANL+QNF FEV++DVTTERVLGLTVSPV+Y+CKI+RRGLDHN+N VK

Sbjct:   481 AELYLIFANLMQNFKFEVSEDVTTERVLGLTVSPVQYTCKISRRGLDHNENCVK 534

 

CYP29A4

 

>CYP29A4 B0331 500 aa 4 small changes, probably errors in my old sequence

         Length = 500

 

 Score = 2532 (891.3 bits), Expect = 1.3e-266, P = 1.3e-266

 Identities = 481/486 (98%), Positives = 481/486 (98%)

 

Query:     1 MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTTKDFV 60

             MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKT  DFV

Sbjct:     1 MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTXSDFV 60

 

Query:    61 ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG 120

             ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG

Sbjct:    61 ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG 120

 

Query:   121 GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSESKILIDCLEKIAETQETVDLFPF 180

             GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES ILIDCLEKIAETQETVDLFPF

Sbjct:   121 GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES-ILIDCLEKIAETQETVDLFPF 179

 

Query:   181 FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVEYSLNPFLWNRFVYWALGYQ 240

             FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTV YSLNPFLWNRFVYWALGYQ

Sbjct:   180 FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVSYSLNPFLWNRFVYWALGYQ 239

 

Query:   241 KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 300

             KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR

Sbjct:   240 KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 299

 

Query:   301 KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE 360

             KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE

Sbjct:   300 KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE 359

 

Query:   361 YTERVLKESKRMFPPVPGFQRKLTKDIVIDGITIPSEGNITISPTVLHCNPFVYQNPEKF 420

             YTERVLKESKRMFPPVPGFQRKLTKDIVI GITIPSEGNITISPTVLHCNPFVYQNPEKF

Sbjct:   360 YTERVLKESKRMFPPVPGFQRKLTKDIVIGGITIPSEGNITISPTVLHCNPFVYQNPEKF 419

 

Query:   421 DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 480

             DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET

Sbjct:   420 DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 479

 

Query:   481 KPLFEV 486

             KPLFEV

Sbjct:   480 KPLFEV 485

 

CYP35A1

 

C03G6.14; CE07888. This gene is badly  misassembled

MFLVLIFLALSCWLIIRQYQKVSRLPPGP skipped sequence here

KYGNIFTLWVGPVPHVSICDY  50

ETSHEVFVKGANKYADIAHAPLFRELRQEMGVLVTNGSHWSTMKRFALHT 100

FRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVGSI 150

INSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRS 200

RYDMMMTTQNTAKRFAAAEAVK skipped seq here

SIETLKTMIIDLWMTGQETTTTTLISGF 250

TQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHA 300

SILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP 350

ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYL

C-term is wrong seq, but my C-term may also be wrong

ISLLCICGTSNKSFLHKYLPLWNSVALLLPVKLSRPAFAALISSLPLDKLLPHI

 

>C03G6.14    C13B3.B    CYP35A1 502 aa

yellow is at an intron boundary

MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVS

TLDLFRKKYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFR

ELRRW*MGVLVTNGSHWSTMKRFALHTFRDMGVGKDLMETRIMEELDARC

ADTDKSATDGVTVAQAGDFFDLTVGSIINSILVGKRFEEHNKDDFLKIKEAMG

AAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTTQNTAKRFAAAEAVKRIED

IKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTMIIDLWMTGQETTTT

TLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHASI

LNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDPERFIRDEE

LLQKVIPFGVGKRSCIGESLARAELYL VRHLQYRISNFLFRLLGTFCSAINSNRMEH

CRQQSCCRIAPEKDHSSWK*

18318 IIGNLLLRYKFEPHGTLSTTELLPYSAGKRPFKLEMKFVKI 18196 correct C-term

 

Compare my 35A1 assembly to 35A2 below

>CYP35A2 C03G6.15    C13B3.A 495 aa

         Length = 496

 

 Score = 2005 (705.8 bits), Expect = 9.2e-211, P = 9.2e-211

 Identities = 386/505 (76%), Positives = 431/505 (85%)

 

Query:     1 MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVSTLDLFRK 60

             MF VL F  L  +LI+RQYQKVSRLPPGP+S P+IGNLP IIYYLW+TGGIVSTLDLFRK

Sbjct:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

 

Query:    61 KYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFRELRRW*MGVLVTNGS 120

             +YGNIFTLWVGP+PHVSI DYETSHEVFVK A KYAD  HAP+ R++R   +GVL+TNG

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSK-IGVLITNGD 119

 

Query:   121 HWSTMKRFALHTFRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVG 180

             HW  M+RF+L  FR+MGVGKD+METRIMEELDARC+D DK AT+GVT+  A +FFDLTVG

Sbjct:   120 HWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVG 179

 

Query:   181 SIINSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTT 240

             SIINSILVGKRFEE  K +FLKIKE M A+FE FSPFDM  PVWFL+TFF+ RYD + +

Sbjct:   180 SIINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSA 239

 

Query:   241 QNTAKRFAAAEAVKRIEDIKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTM 300

             Q TAK FAAAEA+KR+E IKSG Y IDE+N++DYTDAFLLKIQK+GE  DFNIETLKTM

Sbjct:   240 QETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTM 298

 

Query:   301 IIDLWMTGQETTTTTLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLN 360

             IIDLWMTGQETTTTTLISGF QLLLHPEVM+KAREEILKITENGSRHLSLTDRTSTPY+N

Sbjct:   299 IIDLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVN 358

 

Query:   361 AMIGEIQRHASILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP 420

             A+IGEIQRHASILNVSFWKINKE TYMGGHPVDAGALVT+QLSALHVN+T+FKNPQEF+P

Sbjct:   359 AVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNP 418

 

Query:   421 ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYLVRHLQYRISNFLFRLLGTFCSAINS 480

             ERFIRD +LLQKVIPFGVGKR+C+GESLA+AELYLVR + YR S        TF  A NS

Sbjct:   419 ERFIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLVRFVFYRSSV-------TFSFATNS 471

 

Query:   481 NRMEHCRQQSCCRIAPEKDHSSWK* 505

             N M +    S C  A EKDHSSWK*

Sbjct:   472 NNMANYPLPSLCHTALEKDHSSWK* 496

 

 

CYP35A2

 

>CYP35A2 C03G6.15    C13B3.A 495 aa

         Length = 496

 

 Score = 2342 (824.4 bits), Expect = 1.8e-246, P = 1.8e-246

 Identities = 451/454 (99%), Positives = 453/454 (99%)

 

My C-term may be wrong see below.  By comparison to the briggsae seqs

It looks like all my C-term exons for elegans 35As are wrong.  They may be in an alternative reading frame from the correct sequences. 

 

Query:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

             MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK

Sbjct:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

 

Query:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120

             RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVLITNGDH

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVLITNGDH 120

 

Query:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

             WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS

Sbjct:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

 

Query:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

             IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ

Sbjct:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

 

Query:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

             ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII

Sbjct:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

 

Query:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

             DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV

Sbjct:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

 

Query:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

             IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER

Sbjct:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

 

Query:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLI 454

             FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYL+

Sbjct:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLV 454

 

>C03G6.15    C13B3.A     CYP35A2  495 aa

yellow is at intron boundary

MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVST

LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMR

DVRSKIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD

IDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFLKIKETMDASF

ETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAEAIKRVESIKSGKYVID

ENNLQDYTDAFLLKIQKEGESKDFNIETLKTMIIDLWMTGQETTTTTLISGFNQL

LLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAVIGEIQRHASILNVSFWKIN

KEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPERFIRDGKLLQKVIPF

GVGKRNCLGESLAKAELYL

VRFVFYRSSVTFSFATNSNNMANYPLPSLCHTALEKDHSSWK*

Probable correct C-term =

IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI*

 

cDNA evidence

>gi|30742705|gb|CB400978.1|CB400978  UniGene info OSTF185G5_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 544

 

 Score =  210 bits (534), Expect = 7e-56

 Identities = 100/102 (98%), Positives = 101/102 (99%)

 Frame = +3

 

Query: 1   LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVL 60

           LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVL

Sbjct: 135 LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVL 314

 

Query: 61  ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 102

           ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD

Sbjct: 315 ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 440

 

>CYP35A.e cb25.fpc0023 76% to CYP35A.b briggsae seq on bottom

          Length = 495

 

 Score = 2054 (723.0 bits), Expect = 5.9e-216, P = 5.9e-216

 Identities = 379/494 (76%), Positives = 445/494 (90%)

 

Query:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

             MFFVL     L +L+VRQYQKVSRLPPGP+SLPLIGNLPQI+YYL++TGG+VSTLD FRK

Sbjct:     1 MFFVLIVFTFLTWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLYTTGGVVSTLDFFRK 60

 

Query:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120

             RYGNIFTLWVGPIPHVSIADYETSHEVFVKNA KYADKFH+P+ R++R+  G+L  NGDH

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNANKYADKFHSPIFREMRSKRGILTANGDH 120

 

Query:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

             WQEMRRF+L  FRNMGVGKD+METRIMEEL+ARC+DIDK A NGVT T A+EFFDLTVGS

Sbjct:   121 WQEMRRFALFTFRNMGVGKDLMETRIMEELNARCADIDKAAVNGVTTTQAAEFFDLTVGS 180

 

Query:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

             IINSILVGKRFEE  K++FLKIKE MDASFETFSPFDMT PVW LK FF  R++K+ + Q

Sbjct:   181 IINSILVGKRFEEHNKNDFLKIKEVMDASFETFSPFDMTMPVWILKNFFPRRFEKMRNGQ 240

 

Query:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

             E++K FAA EA+KR++ IK+G+YVIDENN QDYTDAFL K+QK+GE++D+ +E+LKTMI+

Sbjct:   241 ESSKQFAAKEALKRIDEIKAGRYVIDENNFQDYTDAFLWKMQKDGENEDYKVESLKTMIL 300

 

Query:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

             DLW+TGQETTTTTLISGFNQLLLH + M KAR E+ +ITENGSR++SL+DR  TPY+NAV

Sbjct:   301 DLWITGQETTTTTLISGFNQLLLHSKFMEKARAELFEITENGSRNVSLSDRPKTPYLNAV 360

 

Query:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

             IGEIQRHASILNV+FW+ N E TYMGGH VD+GALVT+QLSALHVNETVF++P++F+PER

Sbjct:   361 IGEIQRHASILNVNFWRWNNEPTYMGGHMVDSGALVTAQLSALHVNETVFEHPEKFDPER 420

 

Query:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLIFGNLLLRYKFEQHGKLSTTELMPYSA 480

             FIRD KLLQKVIPFG+GKR+CLGESLA++ELYLIFGNLLLRYKF+ HG+LST E+MPYSA

Sbjct:   421 FIRDEKLLQKVIPFGLGKRSCLGESLARSELYLIFGNLLLRYKFQPHGELSTREIMPYSA 480

 

Query:   481 GKRPFKLEMKFVKI 494

             GKRPFKLEM+F+K+

Sbjct:   481 GKRPFKLEMQFIKV 494

 

C03G6.15; CE35217. = correct seq

MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGG  50

                            IVSTLDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFH 100

                            APVMRDVRNDIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEEL 150

                            DARCSDIDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFL 200

                            KIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAE 250

                            AIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

                            DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTD 350

                            RTSTPYVNAVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQL 400

                            SALHVNETVFKNPQEFNPERFIRDGKLLQKVIPFGVGKRNCLGESLAKAE 450

                            LYL IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI

 

CYP33B1

 

>CYP33B1 C25E10.2   U50311 495 aa

         Length = 496

 

 Score = 2536 (892.7 bits), Expect = 4.9e-267, P = 4.9e-267

 Identities = 485/496 (97%), Positives = 489/496 (98%)

 

My seq wrong at green region yellow is intron boundary

 

Query:     1 MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60

             MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG

Sbjct:     1 MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60

 

Query:    61 PIFTFYMGPVPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFRGGEYGIMDISGDRW 120

             PIFTFYMG  PFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFR  ++  + +SGDRW

Sbjct:    61 PIFTFYMGIAPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFREMQFTKL-LSGDRW 119

 

Query:   121 KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 180

             KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI

Sbjct:   120 KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 179

 

Query:   181 NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 240

             NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ

Sbjct:   180 NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 239

 

Query:   241 KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 300

             KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD

Sbjct:   240 KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 299

 

Query:   301 IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 360

             IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA

Sbjct:   300 IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 359

 

Query:   361 NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 420

             NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF

Sbjct:   360 NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 419

 

Query:   421 IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF 480

             IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF

Sbjct:   420 IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF 479

 

Query:   481 SLVSHVIPYKCRQYIP 496

             SLVSHVIPYKCRQYIP

Sbjct:   480 SLVSHVIPYKCRQYIP 495

 

CYP32A1

 

>CYP32A1 C26F1.2    U53148 508 aa  also Y97E10