CYP34A8

 

>CYP34A8 B0213.14 499 aa

         Length = 499

 

 Score = 2442 (859.6 bits), Expect = 4.5e-257, P = 4.5e-257

 Identities = 484/499 (96%), Positives = 484/499 (96%)

 

Query:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

             MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK

Sbjct:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

 

Query:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

             QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF

Sbjct:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

 

Query:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

             WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG

Sbjct:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

 

Query:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

             SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP

Sbjct:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

 

Query:   241 ---------------VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTYVLSII 285

                            VAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT     

Sbjct:   241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFT------ 294

 

Query:   286 RKFSIACFSLETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345

                      LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT

Sbjct:   295 ---------LETLAIDLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGT 345

 

Query:   346 RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405

             RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML

Sbjct:   346 RGVSLTDRTKTPYLNANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSML 405

 

Query:   406 HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465

             HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD

Sbjct:   406 HTDEEIFKNPQEFRPERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYD 465

 

Query:   466 LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ 499

             LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ

Sbjct:   466 LEPVGTLPKIETTTPFAPMKRPPVYDIRFVPRSQ 499

 

Compare to seq below

 

 

>CYP34A7 B0213.12 499 aa

         Length = 499

 

 Score = 2193 (772.0 bits), Expect = 1.1e-230, P = 1.1e-230

 Identities = 405/498 (81%), Positives = 456/498 (91%)

 

Query:     1 MLLILILLAVAAVLTANLWRARQKLPKGPTPLPLIGNFHQLFYLSWKTGSLVAAFNELKK 60

             ML+ILIL+A+AAVLT NLWRARQKLP GPTPLP+IGNFHQLFY  WK G LVA F++ +K

Sbjct:     1 MLIILILVAIAAVLTVNLWRARQKLPNGPTPLPIIGNFHQLFYNGWKYGGLVAGFDQFRK 60

 

Query:    61 QYGKVFTVWMGPKPSVYICDYDIAHETHVKRANIFGTRYSVGGMEYIREGRGIIGSNGDF 120

             QYGKVFTVWMGP P+V ICD+D+AHETHVK+A+ FG RY+ G MEYIREG+GIIGSNGDF

Sbjct:    61 QYGKVFTVWMGPIPAVQICDFDVAHETHVKKAHTFGHRYTFGAMEYIREGKGIIGSNGDF 120

 

Query:   121 WLEHRRFALMTLRNFGVGRTIMEDKIMDEYRYRFKDFKRTHFKNGAIQVNASSIFDLLVG 180

             WLEHRRFALMTLRNFG+GR I+EDKIM+EYRYRF+DFK+T+FK+GAIQVNASS+FDLLVG

Sbjct:   121 WLEHRRFALMTLRNFGLGRNIIEDKIMEEYRYRFEDFKKTNFKDGAIQVNASSLFDLLVG 180

 

Query:   181 SIINQLLVSERFEQDDQEFEKLKTSLAEALENISIIEGFLPLWVLKSPLMKWRTKITFAP 240

             SIINQLLVSERFEQDD+EFE+LKT+LA ALEN SIIEG LPLW+LKS  MKWRTK TFAP

Sbjct:   181 SIINQLLVSERFEQDDEEFEELKTNLAMALENGSIIEGVLPLWMLKSRFMKWRTKTTFAP 240

 

Query:   241 FDFIFELGNRGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKEGIDSSFTLETLAI 300

             FDF+FE+G +GIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKK+GIDSSFTLETLA+

Sbjct:   241 FDFVFEVGKKGIQRRVAAIENGTHTLSEEGDDFVDAFIVKMEKDKKDGIDSSFTLETLAV 300

 

Query:   301 DLFDLWQAGQETTSTTLTWACVCLLNHPEVVEKLRKELTEVTGGTRGVSLTDRTKTPYLN 360

             DLFDLWQAGQETTSTTLTWAC CLLNHPEVVEKLRKELTEVTGG RGVSLTDRTKTPYLN

Sbjct:   301 DLFDLWQAGQETTSTTLTWACACLLNHPEVVEKLRKELTEVTGGARGVSLTDRTKTPYLN 360

 

Query:   361 ANINEFQRIASILNVNLFRVLEEDTTIDSQPVPAGALVTTNLSMLHTDEEIFKNPQEFRP 420

             A INE QRI+SILNVNL R+LEED  ID  PVPAG   TT L++LHTDEE FKN +EF P

Sbjct:   361 ATINEVQRISSILNVNLLRILEEDAVIDGHPVPAGTAFTTQLALLHTDEETFKNHKEFIP 420

 

Query:   421 ERFMENNNLEKRLIPFGIGKRACPGESLARAELFLITGNMILDYDLEPVGTLPKIETTTP 480

             ERF+ENNNLEKRLIPFGIGKR+CPGESLA+AEL+LI GN+++D+DL+PVG +PKIE+ TP

Sbjct:   421 ERFLENNNLEKRLIPFGIGKRSCPGESLAKAELYLIIGNLVIDFDLKPVGAIPKIESPTP 480

 

Query:   481 FAPMKRPPVYDIRFVPRS 498

             F+P+KRPPVYDIRF+ R+

Sbjct:   481 FSPVKRPPVYDIRFISRA 498

 

CYP23A1

 

>CYP23A1 B0304.3    U39472 529 aa (revised 12/8/2003)

         Length = 529

 

 Score = 2756 (970.2 bits), Expect = 2.4e-290, P = 2.4e-290

 Identities = 529/534 (99%), Positives = 529/534 (99%)

 

Query:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

             MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP

Sbjct:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

 

Query:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

             VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD

Sbjct:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

 

Query:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

             SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS

Sbjct:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

 

Query:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

             LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV

Sbjct:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

 

Query:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

             MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED

Sbjct:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

 

Query:   301 LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 360

             LTYAYMIEVEKRKRNG     FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ

Sbjct:   301 LTYAYMIEVEKRKRNG-----FDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 355

 

Query:   361 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 420

             KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK

Sbjct:   356 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 415

 

Query:   421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 480

             KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR

Sbjct:   416 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 475

 

Query:   481 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 534

             AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK

Sbjct:   476 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 529

 

Compare to the sequence below:

 

>cb25.fpc3052 ACC=CAAC01000064 86% to CYP23A1

            Length = 534

 

 Score = 2430 (855.4 bits), Expect = 8.4e-256, P = 8.4e-256

 Identities = 454/534 (85%), Positives = 493/534 (92%)

 

Query:     1 MPSLRNAILLTIATTFLYFRIMRTLNFDNYMATYIYFLTCITLYAIYELNYKRRRLPNGP 60

             MPS++ AILL++ T+FLYFRIM++   D+Y  TYIY   C   Y  YE NYKRRRLPNGP

Sbjct:     1 MPSIKYAILLSVVTSFLYFRIMKSFEMDSYTTTYIYLFFCTISYIFYESNYKRRRLPNGP 60

 

Query:    61 VPWLVAGNMPSFINVNNVDVLFQSWKQQYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

             +PWLVAGNMPSFINV NVD LF  WKQ+YGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD

Sbjct:    61 MPWLVAGNMPSFINVKNVDDLFLYWKQRYGGIFTVWIGPIPLVMVSDLPTIKKYFIQHAD 120

 

Query:   121 SFSNRWRNFVTDSIMEGSNGIVQIDGNKWREQRRFALHTLRDFGVGKPLMEQMITLEVTS 180

             +FSNRWRNFVTDSIMEGSNGIVQIDG+KWREQRRFALHTLRDFGVG+PLMEQMITLEVT+

Sbjct:   121 AFSNRWRNFVTDSIMEGSNGIVQIDGDKWREQRRFALHTLRDFGVGRPLMEQMITLEVTT 180

 

Query:   181 LMNHMEKSCGLDGKELHLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHQLLDDQSHTV 240

             LMNHM KSCGL  KE++LCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLH LLDDQSHTV

Sbjct:   181 LMNHMAKSCGLSTKEVNLCPSIAVCVGNIINNMLFGLRFNQDNSYMHRLHSLLDDQSHTV 240

 

Query:   241 MQPIMGAYIAFPVTSKIPIINGEWNRLMGIKNELLEFLETQIEGHRMNWKDEMIEQEPED 300

             MQPIMGAYIAFPVT+K+P INGEWNRLMGIK ELLEFLE QI+ HR NWK+EM+EQEPED

Sbjct:   241 MQPIMGAYIAFPVTTKVPFINGEWNRLMGIKKELLEFLEGQIQKHRENWKEEMMEQEPED 300

 

Query:   301 LTYAYMIEVEKRKRNGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMAKNQKVQ 360

             LTYAYMIEVEKRKR GEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLM+KN +VQ

Sbjct:   301 LTYAYMIEVEKRKRAGEDVGFFDDQQLKMLLLDLFFAGMETTVTTLKWAFLLMSKNPRVQ 360

 

Query:   361 KNVQAELDSIGQPMIEIQHRTRLPYVQATINEIQRIANILPINLLRTVAEDIEIDGYNFK 420

             + VQ ELDSI QPM+EIQHRTRLPY+QATINEIQRIANILPINLLRTVAEDI+IDGY FK

Sbjct:   361 RKVQEELDSIAQPMVEIQHRTRLPYIQATINEIQRIANILPINLLRTVAEDIDIDGYQFK 420

 

Query:   421 KGDLIIPQISILMNDPEIFENPEEFNPSRFLDEDNNVKKIDEFLPFSIGRRQCLGESLAR 480

             KGDL IPQISILMNDPEIF+NP++F P RFLDE+ NVKKIDEFLPFSIGRRQCLGESLAR

Sbjct:   421 KGDLTIPQISILMNDPEIFKNPKDFCPERFLDENLNVKKIDEFLPFSIGRRQCLGESLAR 480

 

Query:   481 AELYLVFANLIQNFNFEVADDVTTERVLGLTVSPVEYSCKITRRGLDHNQNSVK 534

             AELYL+FANL+QNF FEV++DVTTERVLGLTVSPV+Y+CKI+RRGLDHN+N VK

Sbjct:   481 AELYLIFANLMQNFKFEVSEDVTTERVLGLTVSPVQYTCKISRRGLDHNENCVK 534

 

CYP29A4

 

>CYP29A4 B0331 500 aa 4 small changes, probably errors in my old sequence

         Length = 500

 

 Score = 2532 (891.3 bits), Expect = 1.3e-266, P = 1.3e-266

 Identities = 481/486 (98%), Positives = 481/486 (98%)

 

Query:     1 MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTTKDFV 60

             MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKT  DFV

Sbjct:     1 MSILIPVALALLFVYLLSFYDTIRLMRKFWIYGGKMPGPPAHPIFGNASLFKNKTXSDFV 60

 

Query:    61 ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG 120

             ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG

Sbjct:    61 ELFVQLAHEARSKGANLMRTQVMNRIYVWPLNGKTAATILESSTEVNKGDDYAFLVPWLG 120

 

Query:   121 GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSESKILIDCLEKIAETQETVDLFPF 180

             GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES ILIDCLEKIAETQETVDLFPF

Sbjct:   121 GGLLMEKGEKWKSHRRILTPAFHFAKLEGYLDVFNSES-ILIDCLEKIAETQETVDLFPF 179

 

Query:   181 FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVEYSLNPFLWNRFVYWALGYQ 240

             FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTV YSLNPFLWNRFVYWALGYQ

Sbjct:   180 FKRCTLDIICGTAMGIKLDAQNVHNLGYVQAVEGFNKLTVSYSLNPFLWNRFVYWALGYQ 239

 

Query:   241 KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 300

             KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR

Sbjct:   240 KMHDDFLYTLKKFTNDAIVERRTVIASGEIEKETSKRKMNFLDILLNSEESNELTSDEIR 299

 

Query:   301 KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE 360

             KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE

Sbjct:   300 KEVDTFMFAGHDTTSTSLSWLCWNIAHNPEVQENVYKEIISIFGEDPNQDVTSENINRLE 359

 

Query:   361 YTERVLKESKRMFPPVPGFQRKLTKDIVIDGITIPSEGNITISPTVLHCNPFVYQNPEKF 420

             YTERVLKESKRMFPPVPGFQRKLTKDIVI GITIPSEGNITISPTVLHCNPFVYQNPEKF

Sbjct:   360 YTERVLKESKRMFPPVPGFQRKLTKDIVIGGITIPSEGNITISPTVLHCNPFVYQNPEKF 419

 

Query:   421 DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 480

             DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET

Sbjct:   420 DPDRFLPEECAKRHSYDYIPFSAGLRNCIGQKFSILNEKVMLIHILRNFKLEPKLEFYET 479

 

Query:   481 KPLFEV 486

             KPLFEV

Sbjct:   480 KPLFEV 485

 

CYP35A1

 

C03G6.14; CE07888. This gene is badly  misassembled

MFLVLIFLALSCWLIIRQYQKVSRLPPGP skipped sequence here

KYGNIFTLWVGPVPHVSICDY  50

ETSHEVFVKGANKYADIAHAPLFRELRQEMGVLVTNGSHWSTMKRFALHT 100

FRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVGSI 150

INSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRS 200

RYDMMMTTQNTAKRFAAAEAVK skipped seq here

SIETLKTMIIDLWMTGQETTTTTLISGF 250

TQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHA 300

SILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP 350

ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYL

C-term is wrong seq, but my C-term may also be wrong

ISLLCICGTSNKSFLHKYLPLWNSVALLLPVKLSRPAFAALISSLPLDKLLPHI

 

>C03G6.14    C13B3.B    CYP35A1 502 aa

yellow is at an intron boundary

MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVS

TLDLFRKKYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFR

ELRRW*MGVLVTNGSHWSTMKRFALHTFRDMGVGKDLMETRIMEELDARC

ADTDKSATDGVTVAQAGDFFDLTVGSIINSILVGKRFEEHNKDDFLKIKEAMG

AAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTTQNTAKRFAAAEAVKRIED

IKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTMIIDLWMTGQETTTT

TLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLNAMIGEIQRHASI

LNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDPERFIRDEE

LLQKVIPFGVGKRSCIGESLARAELYL VRHLQYRISNFLFRLLGTFCSAINSNRMEH

CRQQSCCRIAPEKDHSSWK*

18318 IIGNLLLRYKFEPHGTLSTTELLPYSAGKRPFKLEMKFVKI 18196 correct C-term

 

Compare my 35A1 assembly to 35A2 below

>CYP35A2 C03G6.15    C13B3.A 495 aa

         Length = 496

 

 Score = 2005 (705.8 bits), Expect = 9.2e-211, P = 9.2e-211

 Identities = 386/505 (76%), Positives = 431/505 (85%)

 

Query:     1 MFLVLIFLALSCWLIIRQYQKVSRLPPGPVSFPIIGNLPHIIYYLWATGGIVSTLDLFRK 60

             MF VL F  L  +LI+RQYQKVSRLPPGP+S P+IGNLP IIYYLW+TGGIVSTLDLFRK

Sbjct:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

 

Query:    61 KYGNIFTLWVGPVPHVSICDYETSHEVFVKGANKYADIAHAPLFRELRRW*MGVLVTNGS 120

             +YGNIFTLWVGP+PHVSI DYETSHEVFVK A KYAD  HAP+ R++R   +GVL+TNG

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSK-IGVLITNGD 119

 

Query:   121 HWSTMKRFALHTFRDMGVGKDLMETRIMEELDARCADTDKSATDGVTVAQAGDFFDLTVG 180

             HW  M+RF+L  FR+MGVGKD+METRIMEELDARC+D DK AT+GVT+  A +FFDLTVG

Sbjct:   120 HWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVG 179

 

Query:   181 SIINSILVGKRFEEHNKDDFLKIKEAMGAAFEVFSPFDMAVPVWFLRTFFRSRYDMMMTT 240

             SIINSILVGKRFEE  K +FLKIKE M A+FE FSPFDM  PVWFL+TFF+ RYD + +

Sbjct:   180 SIINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSA 239

 

Query:   241 QNTAKRFAAAEAVKRIEDIKSGAYEIDESNIEDYTDAFLLKIQKDGEDLDFNIETLKTM 300

             Q TAK FAAAEA+KR+E IKSG Y IDE+N++DYTDAFLLKIQK+GE  DFNIETLKTM

Sbjct:   240 QETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTM 298

 

Query:   301 IIDLWMTGQETTTTTLISGFTQLLLHPEVMVKAREEILKITENGSRHLSLTDRTSTPYLN 360

             IIDLWMTGQETTTTTLISGF QLLLHPEVM+KAREEILKITENGSRHLSLTDRTSTPY+N

Sbjct:   299 IIDLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVN 358

 

Query:   361 AMIGEIQRHASILNVSFWKINKELTYMGGHPVDAGALVTAQLSALHVNDTIFKNPQEFDP 420

             A+IGEIQRHASILNVSFWKINKE TYMGGHPVDAGALVT+QLSALHVN+T+FKNPQEF+P

Sbjct:   359 AVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNP 418

 

Query:   421 ERFIRDEELLQKVIPFGVGKRSCIGESLARAELYLVRHLQYRISNFLFRLLGTFCSAINS 480

             ERFIRD +LLQKVIPFGVGKR+C+GESLA+AELYLVR + YR S        TF  A NS

Sbjct:   419 ERFIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLVRFVFYRSSV-------TFSFATNS 471

 

Query:   481 NRMEHCRQQSCCRIAPEKDHSSWK* 505

             N M +    S C  A EKDHSSWK*

Sbjct:   472 NNMANYPLPSLCHTALEKDHSSWK* 496

 

 

CYP35A2

 

>CYP35A2 C03G6.15    C13B3.A 495 aa

         Length = 496

 

 Score = 2342 (824.4 bits), Expect = 1.8e-246, P = 1.8e-246

 Identities = 451/454 (99%), Positives = 453/454 (99%)

 

My C-term may be wrong see below.  By comparison to the briggsae seqs

It looks like all my C-term exons for elegans 35As are wrong.  They may be in an alternative reading frame from the correct sequences. 

 

Query:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

             MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK

Sbjct:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

 

Query:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120

             RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVLITNGDH

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVLITNGDH 120

 

Query:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

             WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS

Sbjct:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

 

Query:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

             IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ

Sbjct:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

 

Query:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

             ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII

Sbjct:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

 

Query:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

             DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV

Sbjct:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

 

Query:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

             IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER

Sbjct:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

 

Query:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLI 454

             FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYL+

Sbjct:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLV 454

 

>C03G6.15    C13B3.A     CYP35A2  495 aa

yellow is at intron boundary

MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVST

LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMR

DVRSKIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD

IDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFLKIKETMDASF

ETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAEAIKRVESIKSGKYVID

ENNLQDYTDAFLLKIQKEGESKDFNIETLKTMIIDLWMTGQETTTTTLISGFNQL

LLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAVIGEIQRHASILNVSFWKIN

KEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPERFIRDGKLLQKVIPF

GVGKRNCLGESLAKAELYL

VRFVFYRSSVTFSFATNSNNMANYPLPSLCHTALEKDHSSWK*

Probable correct C-term =

IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI*

 

cDNA evidence

>gi|30742705|gb|CB400978.1|CB400978  UniGene info OSTF185G5_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 544

 

 Score =  210 bits (534), Expect = 7e-56

 Identities = 100/102 (98%), Positives = 101/102 (99%)

 Frame = +3

 

Query: 1   LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRSKIGVL 60

           LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVR+ IGVL

Sbjct: 135 LDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVL 314

 

Query: 61  ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 102

           ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD

Sbjct: 315 ITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSD 440

 

>CYP35A.e cb25.fpc0023 76% to CYP35A.b briggsae seq on bottom

          Length = 495

 

 Score = 2054 (723.0 bits), Expect = 5.9e-216, P = 5.9e-216

 Identities = 379/494 (76%), Positives = 445/494 (90%)

 

Query:     1 MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGGIVSTLDLFRK 60

             MFFVL     L +L+VRQYQKVSRLPPGP+SLPLIGNLPQI+YYL++TGG+VSTLD FRK

Sbjct:     1 MFFVLIVFTFLTWLVVRQYQKVSRLPPGPVSLPLIGNLPQIVYYLYTTGGVVSTLDFFRK 60

 

Query:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFHAPVMRDVRNDIGVLITNGDH 120

             RYGNIFTLWVGPIPHVSIADYETSHEVFVKNA KYADKFH+P+ R++R+  G+L  NGDH

Sbjct:    61 RYGNIFTLWVGPIPHVSIADYETSHEVFVKNANKYADKFHSPIFREMRSKRGILTANGDH 120

 

Query:   121 WQEMRRFSLQAFRNMGVGKDIMETRIMEELDARCSDIDKLATNGVTITHASEFFDLTVGS 180

             WQEMRRF+L  FRNMGVGKD+METRIMEEL+ARC+DIDK A NGVT T A+EFFDLTVGS

Sbjct:   121 WQEMRRFALFTFRNMGVGKDLMETRIMEELNARCADIDKAAVNGVTTTQAAEFFDLTVGS 180

 

Query:   181 IINSILVGKRFEEDTKHEFLKIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQ 240

             IINSILVGKRFEE  K++FLKIKE MDASFETFSPFDMT PVW LK FF  R++K+ + Q

Sbjct:   181 IINSILVGKRFEEHNKNDFLKIKEVMDASFETFSPFDMTMPVWILKNFFPRRFEKMRNGQ 240

 

Query:   241 ETAKNFAAAEAIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

             E++K FAA EA+KR++ IK+G+YVIDENN QDYTDAFL K+QK+GE++D+ +E+LKTMI+

Sbjct:   241 ESSKQFAAKEALKRIDEIKAGRYVIDENNFQDYTDAFLWKMQKDGENEDYKVESLKTMIL 300

 

Query:   301 DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTDRTSTPYVNAV 360

             DLW+TGQETTTTTLISGFNQLLLH + M KAR E+ +ITENGSR++SL+DR  TPY+NAV

Sbjct:   301 DLWITGQETTTTTLISGFNQLLLHSKFMEKARAELFEITENGSRNVSLSDRPKTPYLNAV 360

 

Query:   361 IGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQLSALHVNETVFKNPQEFNPER 420

             IGEIQRHASILNV+FW+ N E TYMGGH VD+GALVT+QLSALHVNETVF++P++F+PER

Sbjct:   361 IGEIQRHASILNVNFWRWNNEPTYMGGHMVDSGALVTAQLSALHVNETVFEHPEKFDPER 420

 

Query:   421 FIRDGKLLQKVIPFGVGKRNCLGESLAKAELYLIFGNLLLRYKFEQHGKLSTTELMPYSA 480

             FIRD KLLQKVIPFG+GKR+CLGESLA++ELYLIFGNLLLRYKF+ HG+LST E+MPYSA

Sbjct:   421 FIRDEKLLQKVIPFGLGKRSCLGESLARSELYLIFGNLLLRYKFQPHGELSTREIMPYSA 480

 

Query:   481 GKRPFKLEMKFVKI 494

             GKRPFKLEM+F+K+

Sbjct:   481 GKRPFKLEMQFIKV 494

 

C03G6.15; CE35217. = correct seq

MFFVLFFSVLLGYLIVRQYQKVSRLPPGPISLPLIGNLPQIIYYLWSTGG  50

                            IVSTLDLFRKRYGNIFTLWVGPIPHVSIADYETSHEVFVKNAGKYADKFH 100

                            APVMRDVRNDIGVLITNGDHWQEMRRFSLQAFRNMGVGKDIMETRIMEEL 150

                            DARCSDIDKLATNGVTITHASEFFDLTVGSIINSILVGKRFEEDTKHEFL 200

                            KIKETMDASFETFSPFDMTAPVWFLKTFFKHRYDKIWSAQETAKNFAAAE 250

                            AIKRVESIKSGKYVIDENNLQDYTDAFLLKIQKEGESKDFNIETLKTMII 300

                            DLWMTGQETTTTTLISGFNQLLLHPEVMIKAREEILKITENGSRHLSLTD 350

                            RTSTPYVNAVIGEIQRHASILNVSFWKINKEFTYMGGHPVDAGALVTSQL 400

                            SALHVNETVFKNPQEFNPERFIRDGKLLQKVIPFGVGKRNCLGESLAKAE 450

                            LYL IFGNLLLRYKFEQHGKLSTTELMPYSAGKRPFKLEMKFVKI

 

CYP33B1

 

>CYP33B1 C25E10.2   U50311 495 aa

         Length = 496

 

 Score = 2536 (892.7 bits), Expect = 4.9e-267, P = 4.9e-267

 Identities = 485/496 (97%), Positives = 489/496 (98%)

 

My seq wrong at green region yellow is intron boundary

 

Query:     1 MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60

             MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG

Sbjct:     1 MILIFFALCTLFFLIHQYLWRRRGLPPGPTPIPIFGNLFQLSGSEAPGISIFQKWKDQYG 60

 

Query:    61 PIFTFYMGPVPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFRGGEYGIMDISGDRW 120

             PIFTFYMG  PFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFR  ++  + +SGDRW

Sbjct:    61 PIFTFYMGIAPFVVLTDYQDIKETVIKDGDTYADKYLSPEFNKYFREMQFTKL-LSGDRW 119

 

Query:   121 KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 180

             KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI

Sbjct:   120 KEHRKFAVLQLRELGVGKPLMESKILIEAEELIKKLKTAEILNEDFLLQSELDVAVGSVI 179

 

Query:   181 NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 240

             NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ

Sbjct:   180 NQFLFGYRFDRSKLFEFTRIKTLVNNFMEEVGKPLGVLAFTCHGIPSFLVKLMVSGIEEQ 239

 

Query:   241 KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 300

             KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD

Sbjct:   240 KRELFRFLRKQIDGAKSQINYEEEHNEDFVEAYLRKKFQREQKNDFDSYCDSQLENVCFD 299

 

Query:   301 IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 360

             IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA

Sbjct:   300 IWAAGFDTLTNTVGFLIAYAINYPEMQMLIHQEIDNYLAHHSRLLTLADKNALVYFNAFA 359

 

Query:   361 NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 420

             NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF

Sbjct:   360 NEAQRVSNILPMNLPHALTRDVKLKGYHLKKGTGVIHQIANVMTDETIFKDSQRFDPNRF 419

 

Query:   421 IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF 480

             IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF

Sbjct:   420 IDENGKLKKIEELCPFSMGKRQCIGEGLARMEIFLLAANLFNYFEFLPASDGLPSLYKDF 479

 

Query:   481 SLVSHVIPYKCRQYIP 496

             SLVSHVIPYKCRQYIP

Sbjct:   480 SLVSHVIPYKCRQYIP 495

 

CYP32A1

 

>CYP32A1 C26F1.2    U53148 508 aa  also Y97E10 contig266

         Length = 522

 

 Score = 1508 (530.8 bits), Expect = 4.3e-158, P = 4.3e-158

 Identities = 283/321 (88%), Positives = 297/321 (92%)

 

Query:     1 MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY 60

             MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY

Sbjct:     1 MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPY 60

 

Query:    61 AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT 120

             AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT

Sbjct:    61 AFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNT 120

 

Query:   121 NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA 180

             NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA

Sbjct:   121 NISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADA 180

 

Query:   181 VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF 240

             VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF

Sbjct:   181 VELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKF 240

 

Query:   241 PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAADFKIENQKNYYYLGIVLP---EAFK 297

             PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAAD +   +K   +L ++L    E  

Sbjct:   241 PWLWLKPIWYLTGLGFEFDRNVRMTNNFVRKVDAADNEASEKKRKAFLDLLLTIQKEEGT 300

 

Query:   298 INFQVIQERKELLNEDGNEAS 318

             ++ + I+E  +    +G++ +

Sbjct:   301 LSDEDIREEVDTFMFEGHDTT 321

 

 Score = 1184 (416.8 bits), Expect = 9.2e-124, P = 9.2e-124

 Identities = 241/307 (78%), Positives = 254/307 (82%)

 

Query:   252 TGLGFEFDRNVRMTNNFVRKVDAADFKIENQKNYYYLGIV-------LPEAFKINFQVIQ 304

             T +G + +  +   N +V  V      + N   + +L +        L   F  N ++ 

Sbjct:   207 TAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWYLTGLGFEFDRNVRMTN 266

 

Query:   305 ERKELLNEDGNEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG 364

                  ++   NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG

Sbjct:   267 NFVRKVDAADNEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDTTSSGIG 326

 

Query:   365 FTILWLGFYPECQKKLQKELDEVFGFETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI 424

             FTILWLGFYPEC             FETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI

Sbjct:   327 FTILWLGFYPEC-------------FETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLI 373

 

Query:   425 ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD 484

             ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD

Sbjct:   374 ARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRD 433

 

Query:   485 PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM 544

             PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM

Sbjct:   434 PYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGM 493

 

Query:   545 KIKIKRREAADYVV 558

             KIKIKRREAADYVV

Sbjct:   494 KIKIKRREAADYVV 507

 

Cyan not in my seq (intron?) C-terms do not agree

Is my seq frameshifted at the end?

C26F1.2; CE32809.

MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVG  50

SAHLFKWNPYAFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGP 100

VPIVFLGTSECIRPVLESNTNISKPSQYDKMSEWIGTGLLTSTHEKWFHR 150

RKMLTPTFHFTIIQDYFPVFVRNAEVLADAVELHVDGDYFDAFPYFKRCT 200

LDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWY 250

LTGLGFEFDRNVRMTNNFVRKVDAAD

FKIENQKNYYYLGIVLPEAFKINFQVIQERKELLNEDG

NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDT 350

FMFEGHDTTSSGIGFTILWLGFYPECQKKLQKELDEVFGFETNQPPSMDD 400

IKKCSYLEKCIKESLRMFPSVPLIARRLSEDVTINHPSGQKIVLPAGLAA 450

CVSPIAAARDPRAWPDPDTYNPDNFDIDAIAGRDPYAYIPFSAGPRNCIG 500

QKFALLEQKTILSTFFRKYEVESLQTEENLRPVPELILRPYNGMKIKIKR 550

REAADYVVL

 

>C26F1.2    U53148    CYP32A1 508 aa  also Y97E10 contig266

 

MIIVISIVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKW

NPYAFTFQMEGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECI

RPVLESNTNISKPSQYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQD

YFPVFVRNAEVLADAVELHVDGDYFDAFPYFKRCTLDIICETAMGIQVNAQL

GHNNEYVHAVKRISEIVWNHMKFPWLWLKPIWYLTGLGFEFDRNVRMTNN

FVRKVDAAD  NEASEKKRKAFLDLLLTIQKEEGTLSDEDIREEVDTFMFEGHDT

TSSGIGFTILWLGFYPECFETNQPPSMDDIKKCSYLEKCIKESLRMFPSVPLIAR

RLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWPDPDTYNPDNFDID

AIAGRDPYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQTEENLRP

VPELILRPYNGMKIKIKRREAADYVV FTTFFRMKLDECEK*

 

CYP25A1

 

>CYP25A1 C36A4.1    Z66495 496 aa corrected 9/14/98

         Length = 497

 

Score = 1060 (373.1 bits), Expect = 4.0e-244, Sum P(2) = 4.0e-244

 Identities = 212/229 (92%), Positives = 213/229 (93%)

 

Query:     1 MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERKEKLGY 60

             MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK    

Sbjct:     1 MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK----- 55

 

Query:    61 DDANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ 120

              DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ

Sbjct:    56 -DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSIYEANQ 114

 

Query:   121 LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI 180

             LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI

Sbjct:   115 LTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDILREKASSGQKWDI 174

 

Query:   181 YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYHPVTVKITINNFTYFHS 229

             YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY  V  K  I+N    HS

Sbjct:   175 YDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY--VNAKKYISNIDIRHS 221

 

 Score = 1283 (451.6 bits), Expect = 4.0e-244, Sum P(2) = 4.0e-244

 Identities = 236/236 (100%), Positives = 236/236 (100%)

 

Query:   275 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 334

             YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL

Sbjct:   261 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 320

 

Query:   335 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED 394

             LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED

Sbjct:   321 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCRED 380

 

Query:   395 ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 454

             ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY

Sbjct:   381 ITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 440

 

Query:   455 CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN 510

             CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN

Sbjct:   441 CVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPNDPVRLHLKPRN 496

 

C36A4.1; CE03070. cyan not in my seq, green instead

MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQ  50

TAERKEKLGYDDANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVF 100

IKNFSNFSDRSVPSIYEANQLTASLLMNSYSSGWKHTRSAIAPIFSTGKM 150

KAMQETINSKVDLFLDILREKASSGQKWDIYDDFQGLTLDVIGKCAFAID 200

SNCQRDRNDVFY

HPVTVKITINNFTYFHSSSPGTFHFLESTLQIHTTGRCRNSTCRRTVKCVGFRQDKAKFCSD

YERRRGGEGSDSVDLLKLLLNREDDK 300

SKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLSKYPNVQQKLYEEIM 350

EAKENGGLTYDSIHNMKYLDCVYKETLRFYPPHFSFIRRLCREDITIRGQ 400

FYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLKWIPFGV 450

GPRYCVGMRFAEMEFKTTIVKLLDTFELKQFEGEADLIPDCNGVIMRPND 500

PVRLHLKPRN                                         510

 

>C36A4.1    Z66495   CYP25A1 496 aa corrected 9/14/98

 

MALLILSSLVISIFTFFIYIILARRERFKLREKIGLSGPEPHWFLGNLKQTAERK

DANRWFNELHEQYGETFGIYYGSQMNIVISNEKDIKEVFIKNFSNFSDRSVPSI

YEANQLTASLLMNSYSSGWKHTRSAIAPIFSTGKMKAMQETINSKVDLFLDI

LREKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFY

VNAKKYISNIDIRHSKIIAASVLLPELSTFWKALYKYTPLADAEIPLVEGLSNV

YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTA

MTYCSYLLSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDCVYKETLRFY

PPHFSFIRRLCREDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFE

NWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFEGEAD

LIPDCNGVIMRPNDPVRLHLKPRN*

 

My green seq matches other CYP25As see below

 

 

>CYP25A2 C36A4.2    Z66495 496 aa

         Length = 497

 

 Score = 234 (82.4 bits), Expect = 2.3e-22, P = 2.3e-22

 Identities = 44/54 (81%), Positives = 51/54 (94%)

 

Query:     1 VNAKKYISNIDIRHSKIIAASVLLPELSTFWKALYKYTPLADAEIPLVEGLSNV 54

             VNA+K+I+NIDIRHSK+IAAS +LPEL+  W+ALYKYTPLADAEIPLVEGLSNV

Sbjct:   207 VNARKFIANIDIRHSKVIAASFILPELAPLWRALYKYTPLADAEIPLVEGLSNV 260

 

CYP25A2

 

>CYP25A2 C36A4.2    Z66495 496 aa

         Length = 497

 

 Score = 1327 (467.1 bits), Expect = 9.8e-272, Sum P(2) = 9.8e-272

 Identities = 260/266 (97%), Positives = 260/266 (97%)

 

Query:     1 MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRKEKLGY 60

             MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRK    

Sbjct:     1 MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQIVDRK----- 55

 

Query:    61 DDSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ 120

              DSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ

Sbjct:    56 -DSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVFIKNFSNFSDRIVPPIFDSNQ 114

 

Query:   121 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI 180

             LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI

Sbjct:   115 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMQETIHSKVDLFLDILREKASSGQKWDI 174

 

Query:   181 YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA 240

             YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA

Sbjct:   175 YEDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELA 234

 

Query:   241 PLWRALYKYTPLADAEIPLVEGLSNV 266

             PLWRALYKYTPLADAEIPLVEGLSNV

Sbjct:   235 PLWRALYKYTPLADAEIPLVEGLSNV 260

 

 Score = 1277 (449.5 bits), Expect = 9.8e-272, Sum P(2) = 9.8e-272

 Identities = 236/236 (100%), Positives = 236/236 (100%)

 

Query:   287 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 346

             YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL

Sbjct:   261 YERRRGGEGSDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYL 320

 

Query:   347 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD 406

             LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD

Sbjct:   321 LSKYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFINRRCLAD 380

 

Query:   407 ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 466

             ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY

Sbjct:   381 ITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEEKSSSLKWIPFGVGPRY 440

 

Query:   467 CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN 522

             CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN

Sbjct:   441 CVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLIPDCNGVIMRPKDPVRLLLKPRN 496

 

C36A4.2; CE03071. cyan not in my seq extra intron seq?

MALLILTSILILLVSFIIYILFARREQFKLREKIGLTGPEPHWFMGNLKQ  50

IVDRKEKLGYDDSNKWFNELHKQYGETFGIYFGAQMNIVLSNEEDIKEVF 100

IKNFSNFSDRIVPPIFDSNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKM 150

KAMQETIHSKVDLFLDILREKASSGQKWDIYEDFQGLTLDVIGKCAFAID 200

SNCQRDRNDVFYVNARKFIANIDIRHSKVIAASFILPELAPLWRALYKYT 250

PLADAEIPLVEGLSNV

TPFLLENFCSFSLILTKIFS

YERRRGGEGSDSVD 300

LLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLSKY 350

PNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFIN 400

RRCLADITIRGQFYPKGSVVTCLPHTVHLNPENWDSPEEFHPERFENWEE 450

KSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIVKLLDTFELKQFKGEADLI 500

PDCNGVIMRPKDPVRLLLKPRN

 

 

CYP35C1

 

C06B3.3; CE27662. CYP35C1

MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGG  50

VTNMLNHFRKEYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHV 100

SPMIDYVRKGNGVFFSNGDKWQELRRFSMLTMRNMGMGRDLMEEKILSEL 150

DARCAEINEKSIDGTTVLQVNEFFDLTVGSIINNMLLGFRFDERTKSRFL 200

TMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQFEIIDYVSVD 250

AIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYDENNLKMLIT 300

DLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRD 350

KTETPYLNATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQL 400

SALHTNEKIFENPEKFNPERFIRNENLMQQTIPFGIGKRSCLGESLARAE 450

LYLIIGNLLLRYNFESSGKMPSTRETVPFGFAKRCEAFDMKVTKI

 

>CYP35C1 C06B3.3    Z77652 508 aa

         Length = 509

 

 Score = 2351 (827.6 bits), Expect = 3.3e-249, Sum P(2) = 3.3e-249

 Identities = 452/457 (98%), Positives = 454/457 (99%)

 

M C-term is wrong

 

Query:     1 MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK 60

             MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK

Sbjct:     1 MFFILLFVSIISFLTARQFLKAKRLPPGPFSLPLIGNAHQVGYQLWRTGGVTNMLNHFRK 60

 

Query:    61 EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK 120

             EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK

Sbjct:    61 EYGDIFTLWLGPIPHVNITNYELSHEVFVKNSTKYADKHVSPMIDYVRKGNGVFFSNGDK 120

 

Query:   121 WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS 180

             WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS

Sbjct:   121 WQELRRFSMLTMRNMGMGRDLMEEKILSELDARCAEINEKSIDGTTVLQVNEFFDLTVGS 180

 

Query:   181 IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ 240

             IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ

Sbjct:   181 IINNMLLGFRFDERTKSRFLTMKHMFDEGMDKMTPLFFTLPVWALQKFLAKDFNSIIKDQ 240

 

Query:   241 FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYD---ENNLKM 297

             FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVY+   ENNLKM

Sbjct:   241 FEIIDYVSVDAIKRSRDFMNGDYEIDPNNVEDIVDAFLLKMKQNPKSDVYESFSENNLKM 300

 

Query:   298 LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL 357

             LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL

Sbjct:   301 LITDLWITGQETTTTTLVSAFIQFLNNPQVMDTVQKELIKVTNGGSRQLSLRDKTETPYL 360

 

Query:   358 NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN 417

             NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN

Sbjct:   361 NATIAEVQRHASILNINFWRINNEPTVIGGHPVDSGCLIASQLSALHTNEKIFENPEKFN 420

 

Query:   418 PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYLI 454

             PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYL+

Sbjct:   421 PERFIRNENLMQQTIPFGIGKRSCLGESLARAELYLV 457

 

CYP31A1P

 

>gi|17538369|ref|NM_069185.1| Caenorhabditis elegans cytochrome P450 (ccp-31A1), mRNA

          Length = 1269

 

 Score =  780 bits (2014), Expect = 0.0

 Identities = 407/487 (83%), Positives = 412/487 (84%), Gaps = 1/487 (0%)

 Frame = +1

 

By attempting to make this into a functional gene without gaps or stop codons the actual sequence has been asssembled incorrectly as an mRNA

 

Query: 1   MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 60

           MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI

Sbjct: 1   MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 180

 

Query: 61  GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNRGFAYVLLEPWLGISILTS 120

           GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNR                 

Sbjct: 181 GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNR------------------ 306

 

Query: 121 QKEQWRLXXXXXXXXXXXXXXXXXXXXXXXXXKILVQKLCCLGADERVDVLSVIALCTLD 180

                                            ILVQKLCCLGADERVDVLSVIALCTLD

Sbjct: 307 ---------------------------------ILVQKLCCLGADERVDVLSVIALCTLD 387

 

Query: 181 IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMTEDGRTHEKCLHIFHDFTK 240

           IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLM            I++ + 

Sbjct: 388 IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNS--------FIYNLYGS 543

 

Query: 241 KLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDVQAEGNTFMLEGHDTTSTGL 299

            +I +      ENDYKMEGRLAFLDLLLEMVNSG MDETDVQAEGNTFMLEGHDTTSTGL

Sbjct: 544 FIINK------ENDYKMEGRLAFLDLLLEMVNSGQMDETDVQAEGNTFMLEGHDTTSTGL 705

 

Query: 300 MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 359

           MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT

Sbjct: 706 MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 885

 

Query: 360 RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 419

           RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP

Sbjct: 886 RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 1065

 

Query: 420 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 479

           FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR

Sbjct: 1066FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 1245

 

Query: 480 RRPIASP 486

           RRPIASP

Sbjct: 1246RRPIASP 1266

 

Compare to 31A2

 

>CYP31A2 F22B3 Z68336 487 aa

         Length = 488

 

 Score = 2236 (787.1 bits), Expect = 3.0e-235, P = 3.0e-235

 Identities = 435/488 (89%), Positives = 442/488 (90%)

 

Query:     1 MGVIILAVLLASATVIAWLLYKHLRMRQALKHLNQPRSYPIIGHGLITKPDPEGFMNQVI 60

             MGVII AVLLA ATVIAWLLYKHLRMRQ LKHLNQPRSYPI+GHGLITKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSGDLVEAIFSSTKHLNRGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYS DLVE IFSSTKHLN+GFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRLXXXXXXXXXXXXXXXXXXXXXXXXXKILVQKLCCLGADERVDVLSVIALCTLD 180

             QKEQWR                          KILVQKLCCLGADE VDVLSVI LCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMTEDGRTHEKCLHIFHDFTK 240

             IICETSMG AIGAQLAENNEYVWAVHTINKLISKRTNNPL+TEDGRTHEKCL I HDFTK

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

 

Query:   241 KLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDVQAEGNTFMLEGHDTTSTGL 299

             K+I ERK ALQENDYKMEGRLAFLDLLLEMV SG MDETDVQAE +TFM EGHDTTSTGL

Sbjct:   241 KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300

 

Query:   300 MWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVLIIT 359

             MWA+HLLGNHP+VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSV IIT

Sbjct:   301 MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360

 

Query:   360 RELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFHPENSIGRKSFAFIP 419

             RELSDDQVIGG NIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRF PENSIGRKSFAFIP

Sbjct:   361 RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFLPENSIGRKSFAFIP 420

 

Query:   420 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 479

             FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR

Sbjct:   421 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480

 

Query:   480 RRPIASP* 487

             RRPI SP*

Sbjct:   481 RRPIVSP* 488

 

compare to cDNA

>gi|18246951|dbj|BJ104281.1|BJ104281  UniGene info BJ104281 unpublished oligo-capped cDNA library, C. elegans L1 stage

           Caenorhabditis elegans cDNA clone yk1059h10 5'.

          Length = 601

 

 Score =  164 bits (415), Expect = 4e-42

 Identities = 84/100 (84%), Positives = 86/100 (86%), Gaps = 8/100 (8%)

 Frame = +1

 

Query: 14  DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL-------- 65

           DVLSVI LCTLDIICETSMG AIGAQLAENNEYVWAVHTINKLISKRTNNPL       

Sbjct: 1   DVLSVIALCTLDIICETSMGIAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYN 180

 

Query: 66  ITEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGR 105

           +TEDGRTHEKCL I HDFTKK+I ERK ALQENDYKMEGR

Sbjct: 181 LTEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGR 300

 

 

>gi|1122764|emb|Z68213.1|CEC01F6 Caenorhabditis elegans cosmid C01F6, complete sequence

          Length = 31281

 

 

Score = 61.6 bits (148), Expect(5) = 0.0

 Identities = 31/31 (100%), Positives = 31/31 (100%)

 Frame = -2

 

Query: 1    MGVIILAVLLASATVIAWLLYKHLRMRQALK 31

            MGVIILAVLLASATVIAWLLYKHLRMRQALK

Sbjct: 8876 MGVIILAVLLASATVIAWLLYKHLRMRQALK 8784

 

 

 Score =  162 bits (410), Expect(5) = 0.0

 Identities = 85/135 (62%), Positives = 89/135 (65%)

 Frame = -3

 

Query: 31   KHLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV 90

            +HLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV

Sbjct: 8740 QHLNQPRSYPIIGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSGDLV 8561

 

Query: 91   EAIFSSTKHLNRGFAYVLLEPWLGISILTSQKEQWRLXXXXXXXXXXXXXXXXXXXXXXX 150

            EAIFSSTKHLNR            + I  +    W                        

Sbjct: 8560 EAIFSSTKHLNR------------VRIRFA*TMAWN-------QYFDEPERTMASFFDFL 8438

 

Query: 151  XXKILVQKLCCLGAD 165

              KILVQKLCCLGA+

Sbjct: 8437 LLKILVQKLCCLGAE 8393

 

Score = 55.8 bits (133), Expect = 6e-09

 Identities = 25/25 (100%), Positives = 25/25 (100%)

 Frame = -1

 

Query: 103  GFAYVLLEPWLGISILTSQKEQWRL 127

            GFAYVLLEPWLGISILTSQKEQWRL

Sbjct: 8526 GFAYVLLEPWLGISILTSQKEQWRL 8452

 

 Score = 63.9 bits (154), Expect(5) = 0.0

 Identities = 32/32 (100%), Positives = 32/32 (100%)

 Frame = -2

 

Query: 166  ERVDVLSVIALCTLDIICETSMGIAIGAQLAE 197

            ERVDVLSVIALCTLDIICETSMGIAIGAQLAE

Sbjct: 8345 ERVDVLSVIALCTLDIICETSMGIAIGAQLAE 8250

 

Score = 56.2 bits (134), Expect(5) = 0.0

 Identities = 25/27 (92%), Positives = 26/27 (96%)

 Frame = -1

 

Query: 195  LAENNEYVWAVHTINKLISKRTNNPLM 221

            L +NNEYVWAVHTINKLISKRTNNPLM

Sbjct: 8211 LFQNNEYVWAVHTINKLISKRTNNPLM 8131

 

Score =  480 bits (1236), Expect(5) = 0.0

 Identities = 238/240 (99%), Positives = 239/240 (99%), Gaps = 1/240 (0%)

 Frame = -3

 

Query: 222  TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG-MDETDV 280

            TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSG MDETDV

Sbjct: 8059 TEDGRTHEKCLHIFHDFTKKLIGERK*ALQENDYKMEGRLAFLDLLLEMVNSGQMDETDV 7880

 

Query: 281  QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY 340

            QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY

Sbjct: 7879 QAEGNTFMLEGHDTTSTGLMWAVHLLGNHPDVQRKVQAELDEVMGDDEDVTIEHLSRMKY 7700

 

Query: 341  LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD 400

            LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD

Sbjct: 7699 LECALKEALRLFPSVLIITRELSDDQVIGGFNIPKGVTFLLNLYLVHRDPAQWKDPDVFD 7520

 

Query: 401  PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVR 460

            PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEV+

Sbjct: 7519 PDRFHPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVK 7340

 

 Score = 60.5 bits (145), Expect = 4e-09

 Identities = 28/29 (96%), Positives = 29/29 (100%)

 Frame = -2

 

Query: 458  EVRPKMEIIVRPVTPIHMKLTRRRPIASP 486

            +VRPKMEIIVRPVTPIHMKLTRRRPIASP

Sbjct: 6887 QVRPKMEIIVRPVTPIHMKLTRRRPIASP 6801

 

F23A7.3 fusion to CYP29A1 (bogus)

 

Sequence F23A7.3 matches this sequence NM_077558.  This seq is a fusion protein with P450 CYP29A1 on the bottom and some unknown sequence on the top. F23A7.3 matches the top only (CYAN).  I think this may be an artifactual fusion.  This sequence should be unfused and the annotation of F23A7.3 as a P450 should be stopped.  Note that the first exon of 29A1 has been skipped in this fusion.

 

These genes are not linked in C. briggsae

 

 

NM_077558               2199 bp    mRNA    linear   INV 21-NOV-2003

DEFINITION  Caenorhabditis elegans cytochrome 450 family member (XM328), mRNA.

MVRKSFKRNHSSTAEDDNKDFDIKKVSDGLSSSISEPFVVANFP

KNKQWDIDEMTTNEKYIAYLPTVVKVTPAHKKRLLMNAELNNLFENLRTHKTGPLDTE

IDGANFVAIEKDVEPHLIRKIQLLPNGMLKLIWPVLPRNEYESNSNNQEEIDATKKEL

SETKKQLETERQKSIESQRFLEKSCNEKVSKLAVENQKLKNKMAQKSDETDSAYRMIL

ELKKKHEQELQMKNSTIMKLEDNV KFNNSSNEQFSKMTTENQQLKDWMATKTKEVEHA

AKMMLELKQKHIEELQMKTTQITKLENDVKLNGNFMDHISELTEENHNLKEQLKQKTD

AETKMIAEMKKKHRGERHYMNSRITKLEDALEYKDNQLSKREIAHNKLLDQLS

EITEIFRQTADETRSQGKSVMKYHILGKLYVWPLDGKTIAKLVESTTELNKGDDYNFFLPWLG

GGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFYSETKIMIEHLEKFADNEETVDM

FPYIKRCALDIICGAAIGTKINAQMHHNHPYVKAVEGFNSMAISHAINPSYQIPAIYW

ALGLKKQKDAHLNTMKTFTVNVIADRKAAIASGEVEKETSKRKMNFLDILLN N

Missing sequence of CYP29A1 here.

GMTIP

SGANVSIAPLALHSNAQVFPNPNKFDPDRFLPDEIAKRNAYDFMPFSAGLRNCIGEKF

ALLNEKVMMIHILKNFRLEPMGGFYSTKPMFEAVARPSNGISVKLIRRQF"

 

>CYP29A1 C44C10.2 CE05409    Z69787 502 aa also Y102F5 AL022276 61000-63000 region

MALILIIILICLLSFKPWSWKTIQLIXKYRQYDDKIPGPPTHPIFGNTETFKNKSQT

EITEIFRQTADETRSQGKSVMKYHILGKLYVWPLDGKTIAKLVESTTELNKGDDYNFF

LPWLGGGVLVEGFGERWRTHRKLLTPTFHFAKLEGYLEVFYSETKIMIEHLEKFA

DNEETVDMFPYIKRCALDIICGAAIGTKINAQMHHNHPYVKAVEGFNSMAISHAIN

PSYQIPAIYWALGLKKQKDAHLNTMKTFTVNVIADRKAAIASGEVEKETSKRKMN

FLDILLN SEESNSLTSEDIRQEVDTFMFAGHDTTTTSVSWACWNLAHNPDIQEKVY

EEIVHIFGEEPNDEVTSEGISXLEYTERVLNESKRIIAPVPSLQRKLINDMEFG GMTIP

SGANVSIAPLALHSNAQVFPNPNKFDPDRFLPDEIAKRNAYDFMPFSAGLRNCIGE

KFALLNEKVMMIHILKNFRLEPMGGFYSTKPMFEAVARPSNGISVKLIRRQF*

 

F23A7.3; CE09577.

MVRIICKRKTAEDKPNAFEFKKVNKGLTSPSDGPFVIAHFPKNMIWEIDQ  50

ETSNEKYIAYFPTIVSSPDTKRLMENDQLKMLYEKLRDHKSGPLKTQIDG 100

TNFVAVEQDAEPHQIESIQLLPNGNLKLYWPVICRDELEEDTNKEFNEMK 150

KDLYETKRELTEVRISLMNLEAEHRTLQTNCNRKDTQLFERHEEIQQLKD 200

HIQWKSKDTEVAAKMVFQMQNKHKEEVEKYTSTISKLKEEV VAKDKEIVE 250

NKTANDNMVSALLGLINKQRRVGT

 

>gi|17550997|ref|NM_077558.1| Caenorhabditis elegans cytochrome 450 family member (XM328), mRNA

          Length = 2199

 

 Score =  208 bits (530), Expect = 2e-52

 Identities = 117/248 (47%), Positives = 167/248 (67%), Gaps = 7/248 (2%)

 Frame = +1

 

Query: 1   MVRIICKRK---TAEDKPNAFEFKKVNKGLTSPSDGPFVIAHFPKNMIWEIDQETSNEKY 57

           MVR   KR    TAED    F+ KKV+ GL+S    PFV+A+FPKN  W+ID+ T+NEKY

Sbjct: 1   MVRKSFKRNHSSTAEDDNKDFDIKKVSDGLSSSISEPFVVANFPKNKQWDIDEMTTNEKY 180

 

Query: 58  IAYFPTIV--SSPDTKRLMENDQLKMLYEKLRDHKSGPLKTQIDGTNFVAVEQDAEPHQI 115

           IAY PT+V  +    KRL+ N +L  L+E LR HK+GPL T+IDG NFVA+E+D EPH I

Sbjct: 181 IAYLPTVVKVTPAHKKRLLMNAELNNLFENLRTHKTGPLDTEIDGANFVAIEKDVEPHLI 360

 

Query: 116 ESIQLLPNGNLKLYWPVICRDELEEDTN--KEFNEMKKDLYETKRELTEVRISLMNLEAE 173

             IQLLPNG LKL WPV+ R+E E ++N  +E +  KK+L ETK++L   R    ++E++

Sbjct: 361 RKIQLLPNGMLKLIWPVLPRNEYESNSNNQEEIDATKKELSETKKQLETER--QKSIESQ 534

 

Query: 174 HRTLQTNCNRKDTQLFERHEEIQQLKDHIQWKSKDTEVAAKMVFQMQNKHKEEVEKYTST 233

            R L+ +CN K ++L     E Q+LK+ +  KS +T+ A +M+ +++ KH++E++   ST

Sbjct: 535 -RFLEKSCNEKVSKL---AVENQKLKNKMAQKSDETDSAYRMILELKKKHEQELQMKNST 702

 

Query: 234 ISKLKEEV 241

           I KL++ V

Sbjct: 703 IMKLEDNV 726

 

CYP31A2

 

>gi|4263218|gb|AC006720.1|  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid Y17G9B, complete sequence

          Length = 19620

 

 Score =  223 bits (567), Expect = 3e-59

 Identities = 119/152 (78%), Positives = 120/152 (78%), Gaps = 31/152 (20%)

 Frame = +3

 

Query: 1     DVLSVITLCTLDIICETSMGKAIGAQLAE----------------NNEYVWAVHTINKLI 44

             DVLSVITLCTLDIICETSMGKAIGAQLAE                NNEYVWAVHTINKLI

Sbjct: 12522 DVLSVITLCTLDIICETSMGKAIGAQLAEVREILQSQHF*TFFFQNNEYVWAVHTINKLI 12701

 

Query: 45    SKRTNNPLMWNSFIYNII---------------TEDGRTHEKCLRILHDFTKKVIVERKE 89

             SKRTNNPLMWNSFIYN+                TEDGRTHEKCLRILHDFTKKVIVERKE

Sbjct: 12702 SKRTNNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGRTHEKCLRILHDFTKKVIVERKE 12881

 

Y62E10A.15 = CYP31A5P (a new pseudogene)

 

This seq is almost identical to CYP31A2 (only 4 amino acid differences, plus some minor errors at intron boundaries.  There is a large fragment missing in the middle

My conclusion is that this is a very recent pseudogene of CYP31A2 = CYP31A5P

 

>CYP31A2 F22B3 Z68336 487 aa

         Length = 488

 

 Score = 1239 (436.1 bits), Expect = 4.8e-143, Sum P(2) = 4.8e-143

 Identities = 237/251 (94%), Positives = 240/251 (95%)

 

Query:     1 MGVIIPAVLLASATVIAWLIYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

             MGVIIPAVLLA ATVIAWL+YKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILIQKLCCLGVADEEVDVLSVITLCTL 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKIL+QKLCCLG ADEEVDVLSVITLCTL

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLG-ADEEVDVLSVITLCTL 179

 

Query:   181 DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYNLTEDGRTHEKC 240

             DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL        +TEDGRTHEKC

Sbjct:   180 DIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL--------ITEDGRTHEKC 231

 

Query:   241 LHILHDFTKKV 251

             L ILHDFTKKV

Sbjct:   232 LRILHDFTKKV 242

 

 Score = 148 (52.1 bits), Expect = 4.8e-143, Sum P(2) = 4.8e-143

 Identities = 30/40 (75%), Positives = 34/40 (85%)

 

Query:   239 KCLHILHDFTKKVRPKMEIIVRPVTPIHMKLTRRRPIVSP 278

             K + ++H+    VRPKMEIIVRPVTPIHMKLTRRRPIVSP

Sbjct:   452 KAVELMHE----VRPKMEIIVRPVTPIHMKLTRRRPIVSP 487

 

>CYP31A2 F22B3 Z68336 487 aa

MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFM

NQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEP

WLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEV

DVLSVITLCTLDIICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDG

RTHEKCLRILHDFTKK VIVERKEALQENDYKMEGRLAFL DLLLEMVKSGQMDETD

VQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEH

LSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPA

QWKDPDVFDPDRFLPENSIGRKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRN

FNIKAVELMHE VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

>gi|6425490|emb|AL132865.1|CEY62E10A Caenorhabditis elegans YAC Y62E10A, complete sequence

          Length = 57237

 

Score = 60.1 bits (144), Expect(3) = e-112

 Identities = 29/31 (93%), Positives = 30/31 (96%)

 Frame = +3

 

Query: 1     MGVIIPAVLLAMATVIAWLLYKHLRMRQVLK 31

             MGVIIPAVLLA ATVIAWL+YKHLRMRQVLK

Sbjct: 42477 MGVIIPAVLLASATVIAWLIYKHLRMRQVLK 42569

 

 Score =  285 bits (730), Expect(3) = e-112

 Identities = 134/145 (92%), Positives = 139/145 (95%), Gaps = 1/145 (0%)

 Frame = +1

 

Query: 23    HLRMRQV-LKHLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC 81

             HL++  +  +HLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC

Sbjct: 42586 HLKIN*INFQHLNQPRSYPIVGHGLITKPDPEGFMNQVIGMGYLYPDPRMCLLWIGPFPC 42765

 

Query: 82    LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL 141

             LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL

Sbjct: 42766 LMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDIL 42945

 

Query: 142   KDFLPIFNEQSKILVQKLCCLGADE 166

             KDFLPIFNEQSKIL+QKLCCLG  E

Sbjct: 42946 KDFLPIFNEQSKILIQKLCCLGVAE 43020

 

 

 Score =  102 bits (254), Expect(3) = e-112

 Identities = 55/73 (75%), Positives = 56/73 (76%), Gaps = 17/73 (23%)

 Frame = +3

 

Query: 166   EEVDVLSVITLCTLDIICETSMGKAIGAQLAE-----------------NNEYVWAVHTI 208

             EEVDVLSVITLCTLDIICETSMGKAIGAQLAE                 NNEYVWAVHTI

Sbjct: 43068 EEVDVLSVITLCTLDIICETSMGKAIGAQLAEVREILPSI*ISKCFFFQNNEYVWAVHTI 43247

 

Query: 209   NKLISKRTNNPLI 221

             NKLISKRTNNPL+

Sbjct: 43248 NKLISKRTNNPLM 43286

 

 

 Score = 84.3 bits (207), Expect = 2e-16

 Identities = 39/42 (92%), Positives = 41/42 (97%)

 Frame = +1

 

Query: 222   TEDGRTHEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAF 263

             TEDGRTHEKCL ILHDFTKKVIVERKEALQ++DYKMEGRLAF

Sbjct: 43885 TEDGRTHEKCLHILHDFTKKVIVERKEALQDSDYKMEGRLAF 44010

 

This gene missing 195 amino acids here

 

 

 Score = 60.5 bits (145), Expect = 4e-09

 Identities = 28/29 (96%), Positives = 29/29 (100%)

 Frame = +1

 

Query: 459   EVRPKMEIIVRPVTPIHMKLTRRRPIVSP 487

             +VRPKMEIIVRPVTPIHMKLTRRRPIVSP

Sbjct: 44824 QVRPKMEIIVRPVTPIHMKLTRRRPIVSP 44910

 

Y62E10A.15; CE28717.

MGVIIPAVLLASATVIAWLIYKHLRMRQVLK (0)

HLNQPRSYPIVGHGLITKP  50

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL 100

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE 150

QSKILIQKLCCLGVAD (2)

EEVDVLSVITLCTLDIICETSMGKAIGAQLAE (0)

NNEYVWAVHTINKLISKRTNNPI (2)

(LMWNSFIYNL) delete this seq

TEDGRTHEKCLHILHDFTKK 250

VIVERKEALQDSDYKMEGRLAF (add this seq)

(sequence gap)

VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

New pseudogene seq = CYP31A5P

MGVIIPAVLLASATVIAWLIYKHLRMRQVLK (0)

HLNQPRSYPIVGHGLITKP

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE

QSKILIQKLCCLGVAD (2)

EEVDVLSVITLCTLDIICETSMGKAIGAQLAE (0)

NNEYVWAVHTINKLISKRTNNPI (2)

TEDGRTHEKCLHILHDFTKK

VIVERKEALQDSDYKMEGRLAF

(sequence gap)

VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

 

Y5H2B.5 = CYP32B1

 

Y5H2B.5; CE26222.

MLAAALVLLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGT

LWYFKKDPVEMVYQAQAWFSEYTLAPDNCGVLKVAVNVEVS (0)

LGFQLWLGPIPAVNIARGEIAK ()

IVLDSSVNISKSSQYNKLKEWIGDGLLI ()

STGDKWRSRRKMLTQTFHF ()

AVLKEYQKIFGAQGKILVEVLQLRANNKFSFDIMPYIKRCALDIICETAM

GCSISSQRGANDEYVNSVRRLSEIVWNYEKAPQFWLKPIWYLFGDGFEFN

RHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDYLLKSQEEHPDI

LTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYQQRVHDELDEIF

GEDFERIPNSEDIQKMVYLEQCIKETLRMTPPVPFVSRKLTEDVKI ()

PHATKPDLLLPAGINCMINIITIMKDARYFERPYEFFPEHFSPERVAAREPFAFVPFSAGPRNCI ()

GQKFALLEEKVLLSWIFRNFTVTSMTKFPEEMPIPELILKPQFGTQVLLRNRRKL

 

>EMBOSS_001_4

EHFGISKRILLVGTNFYILNTFKYFRNGIPSPGMVQRVHTGSG*LWCVEGSGKC*SFSVK

LGFQLWLGPIPAVNIARGEI

>EMBOSS_001_5

TLWYFKKDPV GRN*FLYT*HF*IF*KWYTKPRHGSASTHWLRITVVC*R*R*MLKFQCEI

GFSVMVGTNTSCQHCERGDX

>EMBOSS_001_6

NTLVFQKGSCW*ELIFIYLTLLNILEMVYQAQAWFSEYTLAPDNCGVLKVAVNVEVSV*N

WVFSYGWDQYQLSTLREGRX

 

 

46% to 32A1 (below) 42% to 37A1 39% to 37B1

 

>CYP32A1 C26F1.2    U53148 508 aa  also Y97E10 contig266 revised 3/19/04

         Length = 509

 

 Score = 1169 (411.5 bits), Expect = 3.6e-122, P = 3.6e-122

 Identities = 228/495 (46%), Positives = 329/495 (66%)

 

Query:     8 LLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGTLWYFKKDPVEMVYQA 67

             +++ Y  YL+      IL+   + + C +    + GPPA+ + G+   FK +P    +Q

Sbjct:     7 IVIGYVIYLVVVNFQQILELWRINRKCAQNLSMVNGPPALPLVGSAHLFKWNPYAFTFQM 66

 

Query:    68 QAWFSEYTL---------APDN--CGVLKLWLGPIPAVNIARGEIAKIVLDSSVNISKSS 116

             + W  +Y           AP+N   G++ LW+GP+P V +   E  + VL+S+ NISK S

Sbjct:    67 EGWAQKYLFGRAKYGEIAAPNNEVDGIMLLWIGPVPIVFLGTSECIRPVLESNTNISKPS 126

 

Query:   117 QYNKLKEWIGDGLLISTGDKWRSRRKMLTQTFHFAVLKEYQKIFGAQGKILVEVLQLRAN 176

             QY+K+ EWIG GLL ST +KW  RRKMLT TFHF ++++Y  +F    ++L + ++L  +

Sbjct:   127 QYDKMSEWIGTGLLTSTHEKWFHRRKMLTPTFHFTIIQDYFPVFVRNAEVLADAVELHVD 186

 

Query:   177 NKFSFDIMPYIKRCALDIICETAMGCSISSQRGANDEYVNSVRRLSEIVWNYEKAPQFWL 236

               + FD  PY KRC LDIICETAMG  +++Q G N+EYV++V+R+SEIVWN+ K P  WL

Sbjct:   187 GDY-FDAFPYFKRCTLDIICETAMGIQVNAQLGHNNEYVHAVKRISEIVWNHMKFPWLWL 245

 

Query:   237 KPIWYLFGDGFEFNRHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDYLLKSQEE 296

             KPIWYL G GFEF+R+V++T +F R V           ++E +E K+ AFLD LL  Q+E

Sbjct:   246 KPIWYLTGLGFEFDRNVRMTNNFVRKV--------DAADNEASEKKRKAFLDLLLTIQKE 297

 

Query:   297 HPDILTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYFGEDFERIPNSEDIQKMV 356

                 L+DE IREEVDTFMFEGHDTTSSGI F + +LG +PE F  +  + P+ +DI+K 

Sbjct:   298 E-GTLSDEDIREEVDTFMFEGHDTTSSGIGFTILWLGFYPECF--ETNQPPSMDDIKKCS 354

 

Query:   357 YLEQCIKETLRMTPPVPFVSRKLTEDVKIPHATKPDLLLPAGINCMINIITIMKDARYFE 416

             YLE+CIKE+LRM P VP ++R+L+EDV I H +   ++LPAG+   ++ I   +D R +

Sbjct:   355 YLEKCIKESLRMFPSVPLIARRLSEDVTINHPSGQKIVLPAGLAACVSPIAAARDPRAWP 414

 

Query:   417 RPYEFFPEHFSPERVAAREPFAFVPFSAGPRNCIGQKFALLEEKVLLSWIFRNFTVTSMT 476

              P  + P++F  + +A R+P+A++PFSAGPRNCIGQKFALLE+K +LS  FR + V S+

Sbjct:   415 DPDTYNPDNFDIDAIAGRDPYAYIPFSAGPRNCIGQKFALLEQKTILSTFFRKYEVESLQ 474

 

Query:   477 KFPEEMPIPELILKPQFGTQVLLRNR 502

                   P+PELIL+P  G ++ ++ R

Sbjct:   475 TEENLRPVPELILRPYNGMKIKIKRR 500

 

>gi|22417476|emb|CAAC01000007.1|  Download subject sequence spanning the HSP Caenorhabditis briggsae contig cb25.fpc0023 from assembly cb25.agp8,

              whole genome shotgun sequence

          Length = 1407032

 

726375 MIAAILVALLTYFLFSIFQKRHEIQKFLYARRICIQEFEKVKGPPAVPIFGTTWYFKSDP 726554

726555 IEMIKQAQKWFVEYTLAPDSNGLLK (0) 726629

726676 FWMGPVPVVSICRGEVAK ()

726977 IFDSSTNIPKSSQYRKLKEWIGDGLLI () 727057

727102 STGPKWRSRRKMLTQTFHFAVLKEYHKVFASQGKILVDVLRLRANNTYPFDIMPYIKR 727275

727276 CTLDIIC () 727296

727682 ETAMGCSISSQMGSNDKYVESVKRLSELVWNYEK () 727783

727827 APLYWLKPIWYLFGNGFEFDRLVKLTTDFTRDVIDKRKEELKLHESEPSDKKLAFLDY 728000

728001 LLKSQTDHPEILTDEGIREEVDTFMFEGHDTTSSGIKFAIWFLGQYPEYQQQVQDEMDEI 728180

728181 FGDDYERYPNSEDIQRMIYLEQCIKETLRLTPPVPFISRQLEEDVLI () 728321

728362 AHATKPPVLLPAGMNIMINIITIMKDARYFEKPYEFFPEHFEPERVNSREAFAYVPFSA 728538

728539 GPRNCI ()

728820 GQKFALLEEKVVLSWIFRNFTVTSMSKYPEEHPIPELILKPQFGTQVLLKNRRK 728981

 

 

Score =  119 bits (299), Expect(2) = 8e-30

 Identities = 53/86 (61%), Positives = 71/86 (82%)

 Frame = +3

 

Query: 1      MLAAALVLLLTYFAYLIFRQKDDILQFLHVRKICTREFKKLRGPPAVLIFGTLWYFKKDP 60

              M+AA LV LLTYF + IF+++ +I +FL+ R+IC +EF+K++GPPAV IFGT WYFK DP

Sbjct: 726375 MIAAILVALLTYFLFSIFQKRHEIQKFLYARRICIQEFEKVKGPPAVPIFGTTWYFKSDP 726554

 

Query: 61     VEMVYQAQAWFSEYTLAPDNCGVLKL 86

              +EM+ QAQ WF EYTLAPD+ G+LK+

Sbjct: 726555 IEMIKQAQKWFVEYTLAPDSNGLLKV 726632

 

Score = 37.7 bits (86), Expect(2) = 8e-30

 Identities = 17/50 (34%), Positives = 30/50 (60%), Gaps = 2/50 (4%)

 Frame = +1

 

Query: 85     KLWLGPIPAVNIARGEIAKI--VLDSSVNISKSSQYNKLKEWIGDGLLIS 132

              + W+GP+P V+I RGE+AK+    DS  ++    +Y K+   + D  +I+

Sbjct: 726673 QFWMGPVPVVSICRGEVAKVGQKFDSIEHLGSFDRYQKVIN*LNDHHIIN 726822

 

 Score = 48.9 bits (115), Expect(2) = 2e-32

 Identities = 22/27 (81%), Positives = 23/27 (85%)

 Frame = +2

 

Query: 105    VLDSSVNISKSSQYNKLKEWIGDGLLI 131

              + DSS NI KSSQY KLKEWIGDGLLI

Sbjct: 726977 IFDSSTNIPKSSQYRKLKEWIGDGLLI 727057

 

 Score =  117 bits (293), Expect(2) = 2e-32

 Identities = 54/67 (80%), Positives = 60/67 (89%)

 Frame = +1

 

Query: 130    LISTGDKWRSRRKMLTQTFHFAVLKEYQKIFGAQGKILVEVLQLRANNKFSFDIMPYIKR 189

              + STG KWRSRRKMLTQTFHFAVLKEY K+F +QGKILV+VL+LRANN + FDIMPYIKR

Sbjct: 727096 IYSTGPKWRSRRKMLTQTFHFAVLKEYHKVFASQGKILVDVLRLRANNTYPFDIMPYIKR 727275

 

Query: 190    CALDIIC 196

              C LDIIC

Sbjct: 727276 CTLDIIC 727296

 

Score = 63.5 bits (153), Expect(3) = e-117

 Identities = 28/34 (82%), Positives = 32/34 (94%)

 Frame = +2

 

Query: 197    ETAMGCSISSQRGANDEYVNSVRRLSEIVWNYEK 230

              ETAMGCSISSQ G+ND+YV SV+RLSE+VWNYEK

Sbjct: 727682 ETAMGCSISSQMGSNDKYVESVKRLSELVWNYEK 727783

 

 Score =  285 bits (730), Expect(3) = e-117

 Identities = 133/167 (79%), Positives = 154/167 (92%)

 Frame = +3

 

Query: 230    KAPQFWLKPIWYLFGDGFEFNRHVKLTTDFTRDVIENRKKELKTHNSEQNETKKLAFLDY 289

              +AP +WLKPIWYLFG+GFEF+R VKLTTDFTRDVI+ RK+ELK H SE ++ KKLAFLDY

Sbjct: 727824 RAPLYWLKPIWYLFGNGFEFDRLVKLTTDFTRDVIDKRKEELKLHESEPSD-KKLAFLDY 728000

 

Query: 290    LLKSQEEHPDILTDEGIREEVDTFMFEGHDTTSSGITFAVWFLGQFPEYQQRVHDELDEI 349

              LLKSQ +HP+ILTDEGIREEVDTFMFEGHDTTSSGI FA+WFLGQ+PEYQQ+V DE+DEI

Sbjct: 728001 LLKSQTDHPEILTDEGIREEVDTFMFEGHDTTSSGIKFAIWFLGQYPEYQQQVQDEMDEI 728180

 

Query: 350    FGEDFERIPNSEDIQKMVYLEQCIKETLRMTPPVPFVSRKLTEDVKI 396

              FG+D+ER PNSEDIQ+M+YLEQCIKETLR+TPPVPF+SR+L EDV I

Sbjct: 728181 FGDDYERYPNSEDIQRMIYLEQCIKETLRLTPPVPFISRQLEEDVLI 728321

 

 Score =  122 bits (307), Expect(3) = e-117

 Identities = 58/89 (65%), Positives = 70/89 (78%)

 Frame = +1

 

Query: 396    IPHATKPDLLLPAGINCMINIITIMKDARYFERPYEFFPEHFSPERVAAREPFAFVPFSA 455

              + HATKP +LLPAG+N MINIITIMKDARYFE+PYEFFPEHF PERV +RE FA+VPFSA

Sbjct: 728359 LAHATKPPVLLPAGMNIMINIITIMKDARYFEKPYEFFPEHFEPERVNSREAFAYVPFSA 728538

 

Query: 456    GPRNCIGQKFALLEEKVLLSWIFRNFTVT 484

              GPRNCIG+ +   E  + + + F    V+

Sbjct: 728539 GPRNCIGK*YDKCEPILYVHFKFNYLDVS 728625

 

 Score =  104 bits (259), Expect = 4e-20

 Identities = 49/55 (89%), Positives = 54/55 (98%)

 Frame = +3

 

Query: 461    IGQKFALLEEKVLLSWIFRNFTVTSMTKFPEEMPIPELILKPQFGTQVLLRNRRK 515

              +GQKFALLEEKV+LSWIFRNFTVTSM+K+PEE PIPELILKPQFGTQVLL+NRRK

Sbjct: 728817 VGQKFALLEEKVVLSWIFRNFTVTSMSKYPEEHPIPELILKPQFGTQVLLKNRRK 728981

 

 

 

Y5H2B.6 = CYP33C12P

 

Y5H2B.6; CE21315. AC006810 one in frame stop codon = pseudogene

MLLLLFLSVLFLALFYEFHWKRRNYPAGPLPLPVIGNMWSMMRNNSGVEC

FRQWTKDFGDVYTFWFGTKPYIVVSSYKRLKEAFILDGDTFADKIRQPFQ

DQFRGGNYGVVDTNGHVWSTHRRFALSSFRDFGLGKNLLQEKMLIEVQDM

FAKFDANLGKEQNLPVVLYNAAANVINQLIFGYRFDKEREGELKKLKALM

EFQETAFTTFKVYVQFFAPAIGKHLPGKSVEDLLAEFTVDFYKFFNHQIE

EHRSKIDFDSEESLDYAEAYLKEQRKQEAQGEFELFSTKQLSNTCFDLWF

AGLSTTHITLTWIVGHVLNYPDVQRKLHKELDEVIGSDRLITNDDKNNLP

YLNAVINESQRCANIVPINQIHSTSRDTVINGITVKKGTGVIPQISAIML

DDKVFPDPYAFNPERFLDANGKLRKI*EFVPFSVGKRQCLGEGLARMEMF

IFMSNFFNRYQ (0) 17014

VSPASSGPPSLVKESLLNVAPRKFDAILKKRHV* 16469

 

This gene is a CYP33C but it is missing its C-terminal

Including the heme signature of p450s.  The missing seq has a stop codon.

 

Compare to briggsae

 

>CYP33C.b cb25.fpc0023 4 exons

          Length = 495

 

 Score = 1761 (619.9 bits), Expect = 6.6e-185, P = 6.6e-185

 Identities = 318/426 (74%), Positives = 375/426 (88%)

 

Query:     1 MLLLLFLSVLFLALFYEFHWKRRNYPAGPLPLPVIGNMWSMMRNNSGVECFRQWTKDFGD 60

             M+LLL  + L L LF+E +WKRRNYP GPLPLPVIGNM  +M+   G E FRQWTK+FGD

Sbjct:     1 MILLLLFTTLSLWLFHELYWKRRNYPNGPLPLPVIGNMVPIMKAKPGYEAFRQWTKEFGD 60

 

Query:    61 VYTFWFGTKPYIVVSSYKRLKEAFILDGDTFADKIRQPFQDQFRGGNYGVVDTNGHVWST 120

             V+TFW GTKPYIVVSSYKRLKE FILDGDT+ADK  QPFQ+QFRGG YGV+DTNGHVWST

Sbjct:    61 VFTFWLGTKPYIVVSSYKRLKETFILDGDTYADKAYQPFQEQFRGGQYGVIDTNGHVWST 120

 

Query:   121 HRRFALSSFRDFGLGKNLLQEKMLIEVQDMFAKFDANLGKEQNLPVVLYNAAANVINQLI 180

             HRRFAL++FRDFGLGK+L+Q+K+LIEV ++F KFD N+GKEQ +P V YNA ANVINQLI

Sbjct:   121 HRRFALTTFRDFGLGKDLMQQKILIEVDEIFRKFDENIGKEQEIPGVFYNAGANVINQLI 180

 

Query:   181 FGYRFDKEREGELKKLKALMEFQETAFTTFKVYVQFFAPAIGKHLPGKSVEDLLAEFTVD 240

             FGYRFD+E++ ELKKLKALMEFQETAFTTFKV VQFFAP +G+ LPGKSVE+LLAEFTVD

Sbjct:   181 FGYRFDEEKQEELKKLKALMEFQETAFTTFKVQVQFFAPIVGRMLPGKSVEELLAEFTVD 240

 

Query:   241 FYKFFNHQIEEHRSKIDFDSEESLDYAEAYLKEQRKQEAQGEFELFSTKQLSNTCFDLWF 300

             FYKFF+HQIEEHRSKIDFDSEE+LDYAEAYLKEQ+K+E++G+ ELF  +QLSN CFDLW

Sbjct:   241 FYKFFDHQIEEHRSKIDFDSEENLDYAEAYLKEQKKKESEGDMELFGNRQLSNMCFDLWV 300

 

Query:   301 AGLSTTHITLTWIVGHVLNYPDVQRKLHKELDEVIGSDRLITNDDKNNLPYLNAVINESQ 360

             AGLSTTH TL+WI+ +VLN+ +VQ+ +  ELDEVIGSDRLI+  DKNNLP++NAVINESQ

Sbjct:   301 AGLSTTHTTLSWIIAYVLNHSEVQKTMQIELDEVIGSDRLISTGDKNNLPFMNAVINESQ 360

 

Query:   361 RCANIVPINQIHSTSRDTVINGITVKKGTGVIPQISAIMLDDKVFPDPYAFNPERFLDAN 420

             RCANIVP+NQIH  S+DT+ING+ VKKGTG+IPQIS ++LD+  FPDPY FNPERF+D +

Sbjct:   361 RCANIVPLNQIHCVSKDTMINGVLVKKGTGIIPQISTVLLDETTFPDPYKFNPERFIDEH 420

 

Query:   421 GKLRKI 426

             GKL+K+

Sbjct:   421 GKLKKV 426

 

>CYP33C.b cb25.fpc0023 4 exons

723577 MILLLLFTTLSLWLFHELYWKRRNYPNGPLPLPVIGNMVPIMKAKPGYEAFRQWTK 723744

723745 EFGDVFTFWL (1) 723774

724349 GTKPYIVVSSYKRLKETFILDGDTYADKAYQP 724444

724445 FQEQFRGGQYGVIDTNGHVWSTHRRFALTTFRDFGLGKDLMQQKILIEVDEIFRKFDENI 724624

724625 GKEQEIPGVFYNAGANVINQLIFGYRFDEEKQEELKKLKALMEFQETAFTTFKVQVQFFA 724804

724805 PIVGRMLPGKSVEEL (2) 724849

724904 LAEFTVDFYKFFDHQIEEHRSKIDFDSEENLDYAEA 725009

725010 YLKEQKKKESEGDMELFGNRQLSNMCFDLWVAGLSTTHTTLSWIIAYVLNHSEVQKTMQI 725189

725190 ELDEVIGSDRLISTGDKNNLPFMNAVINESQRCANIVPLNQIHCVSKDTMINGVLVKKGT 725369

725370 GIIPQISTVLLDETTFPDPYKFNPERFIDEHGKLKKV EELCAFSVGKRQCLGEGLARMEM 725549

725550 FLFISNFFNRYQ (0) 725585

725633 VTPGSSGPPSLEKETMFNATPRKIRAVLTKRYL* 725734

 

 

CYP31A3 and pseudogene CYP31A4P

 

>CYP31A3 T16C6 486 aa this sequence revised according to Y17G9.contig 61

         Length = 488

 

Score = 1164 (409.7 bits), Expect = 1.2e-121, P = 1.2e-121

 Identities = 242/332 (72%), Positives = 263/332 (79%)

 

Query:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

             MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNS---------FIYNLYD 231

             IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL+             +++  

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

 

Query:   232 SFIIKKVNSILF--FRTEDGRTHEKCLRILHDFTKKVIVERKEALQEND-YKMEGR-LAF 287

               I+++  ++    ++ E GR     L +L +  K   ++  +   E D +  EG    

Sbjct:   241 KVIVERKEALQENDYKME-GRL--AFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTS 297

 

Query:   288 LDLLLEMVKSGQMDETD--VQAEVDTFMFEGHDTT 320

               L+  +   G   E    VQAE+D  M +  D T

Sbjct:   298 TGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVT 332

 

 Score = 1394 (490.7 bits), Expect = 5.1e-146, P = 5.1e-146

 Identities = 274/295 (92%), Positives = 277/295 (93%)

 

Query:   217 NNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGRTHEKCLRILHDFTKKVIVERKEALQE 276

             NN  +W     N     I K+ N+ L   TEDGRTHEKCLRILHDFTKKVIVERKEALQE

Sbjct:   198 NNEYVWAVHTIN---KLISKRTNNPLI--TEDGRTHEKCLRILHDFTKKVIVERKEALQE 252

 

Query:   277 NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 336

             NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE

Sbjct:   253 NDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPE 312

 

Query:   337 VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV 396

             VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV

Sbjct:   313 VQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGV 372

 

Query:   397 NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR 456

             NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR

Sbjct:   373 NIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSRNCIGQR 432

 

Query:   457 FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 511

             FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP

Sbjct:   433 FALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTRRRPIVSP 487

 

Y17G9B.3; CE24183. remove cyan seq

MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKP  50

DPEGFMNQVIGMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHL 100

NKGFAYVLLEPWLGISILTSQKEQWRPKRKLLTPTFHYDILKDFLPIFNE 150

QSKILVQKMCSLGAEEEVDVLSVITLCTLDIICETSMGKAIGAQLAENNE 200

YVWAVHTINKLISKRTNNPLMWNSFIYNLYDSFIIKKVNSILFFRTEDGR 250

THEKCLRILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQM 300

DETDVQAEVDTFMFEGHDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMG 350

DDEDVTIEHLSRMKYLECALKEALRLFPSVPIITRELSDDQVIGGVNIPK 400

GVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIPFSAGSR 450

NCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHM 500

KLTRRRPIVSP                                        511

 

with cyan seq removed we get

 

>CYP31A3 T16C6 486 aa this sequence revised according to Y17G9.contig 61

         Length = 488

 

add the green seq to the model

 Score = 2521 (887.4 bits), Expect = 1.9e-265, P = 1.9e-265

 Identities = 486/495 (98%), Positives = 487/495 (98%)

 

Query:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

             MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLMWNSFIYNLTEDGRTHEKCL 240

             IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL        +TEDGRTHEKCL

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPL--------ITEDGRTHEKCL 232

 

Query:   241 RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG 300

             RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG

Sbjct:   233 RILHDFTKKVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEG 292

 

Query:   301 HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL 360

             HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL

Sbjct:   293 HDTTSTGLMWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRL 352

 

Query:   361 FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA 420

             FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA

Sbjct:   353 FPSVPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIA 412

 

Query:   421 RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT 480

             RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT

Sbjct:   413 RKSFAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVT 472

 

Query:   481 PIHMKLTRRRPIVSP 495

             PIHMKLTRRRPIVSP

Sbjct:   473 PIHMKLTRRRPIVSP 487

 

Compare to 31A2

 

>CYP31A2 F22B3 Z68336 487 aa

         Length = 488

 

 Score = 2517 (886.0 bits), Expect = 5.1e-265, P = 5.1e-265

 Identities = 478/488 (97%), Positives = 484/488 (99%)

 

Query:     1 MGVIIPAVLLASATIIAWLLYKHLRMRQALKHLNQPRSYPIVGHGLVTKPDPEGFMNQVI 60

             MGVIIPAVLLA AT+IAWLLYKHLRMRQ LKHLNQPRSYPIVGHGL+TKPDPEGFMNQVI

Sbjct:     1 MGVIIPAVLLAMATVIAWLLYKHLRMRQVLKHLNQPRSYPIVGHGLITKPDPEGFMNQVI 60

 

Query:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

             GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS

Sbjct:    61 GMGYLYPDPRMCLLWIGPFPCLMLYSADLVEPIFSSTKHLNKGFAYVLLEPWLGISILTS 120

 

Query:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKMCSLGAEEEVDVLSVITLCTLD 180

             QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQK+C LGA+EEVDVLSVITLCTLD

Sbjct:   121 QKEQWRPKRKLLTPTFHYDILKDFLPIFNEQSKILVQKLCCLGADEEVDVLSVITLCTLD 180

 

Query:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

             IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK

Sbjct:   181 IICETSMGKAIGAQLAENNEYVWAVHTINKLISKRTNNPLITEDGRTHEKCLRILHDFTK 240

 

Query:   241 KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300

             KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL

Sbjct:   241 KVIVERKEALQENDYKMEGRLAFLDLLLEMVKSGQMDETDVQAEVDTFMFEGHDTTSTGL 300

 

Query:   301 MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360

             MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT

Sbjct:   301 MWAIHLLGNHPEVQRKVQAELDEVMGDDEDVTIEHLSRMKYLECALKEALRLFPSVPIIT 360

 

Query:   361 RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKSFAFIP 420

             RELSDDQVIGGVNIPKGVTFLLNLYLVHRDP+QWKDPDVFDPDRFLPENSI RKSFAFIP

Sbjct:   361 RELSDDQVIGGVNIPKGVTFLLNLYLVHRDPAQWKDPDVFDPDRFLPENSIGRKSFAFIP 420

 

Query:   421 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480

             FSAGSRNCIGQRFALMEEKVIMAHLLRNFN+KAVELMHEVRPKMEIIVRPVTPIHMKLTR

Sbjct:   421 FSAGSRNCIGQRFALMEEKVIMAHLLRNFNIKAVELMHEVRPKMEIIVRPVTPIHMKLTR 480

 

Query:   481 RRPIVSP* 488

             RRPIVSP*

Sbjct:   481 RRPIVSP* 488

 

I had the ends of my genes mixed up.  The pseudogene should be downstream of CYP31A3

Not in the last intron.

 

>CYP31A3 T16C6 486 aa this sequence revised according to Y17G9.contig 61

         Length = 488

 

  Plus Strand HSPs:

 

 Score = 551 (194.0 bits), Expect = 1.9e-69, Sum P(2) = 1.9e-69

 Identities = 107/113 (94%), Positives = 111/113 (98%), Frame = +3

 

Query: 13203 VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS 13382

             VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS

Sbjct:   356 VPIITRELSDDQVIGGVNIPKGVTFLLNLYLVHRDPSQWKDPDVFDPDRFLPENSIARKS 415

 

Query: 13383 FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVKH*KLKLI 13541

             FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEV+  K+++I

Sbjct:   416 FAFIPFSAGSRNCIGQRFALMEEKVIMAHLLRNFNVKAVELMHEVRP-KMEII 467

 

Score = 146 (51.4 bits), Expect = 1.9e-69, Sum P(2) = 1.9e-69

 Identities = 29/30 (96%), Positives = 30/30 (100%), Frame = +1

 

Query: 13567 QVRPKMEIIVRPVTPIHMKLTRRRPIVSP* 13656 end of CYP31A3

             +VRPKMEIIVRPVTPIHMKLTRRRPIVSP*

Sbjct:   459 EVRPKMEIIVRPVTPIHMKLTRRRPIVSP* 488

 

Score = 165 (58.1 bits), Expect = 2.3e-11, P = 2.3e-11

 Identities = 40/63 (63%), Positives = 44/63 (69%), Frame = +2

 

This is the pseudogene CYP31A4P

 

Query: 15233 MTHLLRNFSVKDVELVHEMIFESL*IKFLIKLVQVRPKMEIIVCPVSPIHMKLTRRRPII 15412

             M HLLRNF+VK VEL+HE                VRPKMEIIV PV+PIHMKLTRRRPI+

Sbjct:   442 MAHLLRNFNVKAVELMHE----------------VRPKMEIIVRPVTPIHMKLTRRRPIV 485

 

Query: 15413 SP* 15421

             SP*

Sbjct:   486 SP* 488

 

AC006720               19620 bp    DNA     linear   INV 07-JUL-2003

DEFINITION  Caenorhabditis elegans cosmid Y17G9B, complete sequence.

 

    13201 ctgttccaat tattacaaga gaattgagtg atgatcaagt gattggaggt gttaatattc

    13261 caaaaggagt cacttttctg ctcaacctgt accttgttca tcgtgatcct tcgcaatgga

    13321 aggatcctga tgtattcgat ccggatcgtt tccttcctga aaattcaatt gctcgcaagt

    13381 ctttcgcctt cattccattc tcagctggaa gtcgtaactg cattggccaa cgttttgcac

    13441 tcatggagga gaaggtcatc atggctcatt tgttgagaaa tttcaatgta aaagctgtcg

    13501 agttgatgca tgaggtgaaa cattgaaaat taaaattgat aaattaaatt ctaactgaaa

    13561 attgttcagg ttcgtccgaa aatggagatt atcgttcgcc cagtcactcc aattcacatg

    13621 aagcttacca gacgccgtcc aatcgtctct ccataatttc aaaatagcca attttattct

    13681 tgatgattat atatttttaa tgcatgataa attttaattt tcatacttta ccttgcaaat

    13741 gtgatgaaca aattcttatt tcggggagtg aactattcct ggaacaaaaa cgatttgatt

    13801 ttttaagctc tataggctct acactaaatt ctgtgaatac gtactgcaaa tatatcaaaa

    13861 ctaaattttt ctcattgcaa aaagtctgaa tttattgatt tttgaatgaa gttatagcta

    13921 gtaatattga ttctcatgga acagctcata ttgacaaatg tgacaaattt tgatacggta

    13981 tttttaaagg cgcagcaaaa aattttctct atggtcccgc aacgtacaca gtttcactag

    14041 attttgacat gccgctttaa aggcacgtgg atttatttaa atgggtaaca cggcccggca

    14101 agtggtacat ccatgcaaat gcgctctact gataatttga gtgtagacca ggtttgggcg

    14161 cgtgataacg aaaaaagctt tggtccaaaa aatttagaat ttagtttcgg acatttttta

    14221 tatgcatcac aaaaaaactg gaccaaccgt ttttgagata cacgcgccca aacgtccagg

    14281 tatacggtag acaaattgcg tacaggtacc acttctcggg ccgtgggtaa tatgggaaca

    14341 aacaattcta agaatgcgta ctgggcgaca tatttggcgc gcaaaatatc ttgtagcgaa

    14401 aactacagta attcttttaa tgaccactgt agctcttgtg tcgatttacg ggcccacttt

    14461 cgaaatgagt ttctttttga atagtgataa caaatttctc attttccttc gttattttgt

    14521 attattttct cacgttttgc ttaattttaa tattccataa atggttttaa ttcattcaga

    14581 aattcaggcc cgtaaatcga caaaagagct acagtagtca tctatagaat tactgtattt

    14641 ttcgctccga gatattttgc gcgtcaaata tattgcgtag tacgcattct cagaaattaa

    14701 tgttcccgta ataccaaagt catacaggaa aagtgaattt ttcgaaattc aaaatttgaa

    14761 ccgttgatat tttcttgttg atgttttgca aacaacactc cagcgcttga gcatattgcc

    14821 atcaaacacc aatgagaacg aaaaaggttg aatcgaacaa ttcgaagaat tttctatttt

    14881 aagagaaaag atctctaacc agtgccttct cgctccatta tcctttggaa catgtgcgtg

    14941 tctctgcacg ctttagacaa agaaaaatat gtataatatc tctaagatga gaattagaga

    15001 attggggaag gggggtcaat tggatcggac cacccaatct tcccatctgc tcattctcca

    15061 tttgtgacca tggacaacgg tgtgtctatt ccattccaac tgctcaaatc acctcaaaaa

    15121 tcgattagtt atgttcttca aagaatgact atcggagatc tgtaagtaaa atgtgaccaa

    15181 atataattat ttacagacct gcaagtggaa ctattcgcta gacattcact agatgactca

    15241 tttgttgaga aatttcagtg taaaagatgt tgagttggtg catgaaatga tatttgaaag

    15301 tttataaatt aaattcttaa taaaacttgt tcaggttcgt ccaaaaatgg aaataatcgt

    15361 ttgcccggtc tctccaatcc acatgaagct caccagacgc cgtccaatca tatctccatg

    15421 atttgaaatt aatttgaaat ttcgaacgat tttatatttt tattgcttga taaatttgag

    15481 ttttctgtaa aaagtttgaa acttcaacct tgaaattccc gttactggtc gagtccaaat

 

 

Y80D3A.5

 

>CYP42A1 CM08B12  M89401, AL020988 Y80D3 contig 01341 and 00427 504 aa

         Length = 505

 

 Score = 2290 (806.1 bits), Expect = 5.8e-241, P = 5.8e-241

 Identities = 440/450 (97%), Positives = 440/450 (97%)

 

Query:     1 MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60

             MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS

Sbjct:     1 MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQS 60

 

Query:    61 QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL 120

             QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL

Sbjct:    61 QGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSAWIGDGL 120

 

Query:   121 LIS-KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT 179

             LIS KPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT

Sbjct:   121 LISRKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGTGEYSDVFHTIT 180

 

Query:   180 LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 239

             LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE

Sbjct:   181 LCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKE 240

 

Query:   240 HDECVK--ILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGE 297

             HDECVK  ILHEFTSKAIYARK          QLLAQETAEGRRRMAFLDLMLDMNSKGE

Sbjct:   241 HDECVKVIILHEFTSKAIYARK----------QLLAQETAEGRRRMAFLDLMLDMNSKGE 290

 

Query:   298 LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 357

             LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE

Sbjct:   291 LPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEADRPVSYE 350

 

Query:   358 DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 417

             DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW

Sbjct:   351 DLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVMVPSMVHKDPRYW 410

 

Query:   418 DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 450

             DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI

Sbjct:   411 DDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 443

 

Y80D3A.5; CE23108.

MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFH  50

FSPEEFFEQSQGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSP 100

KMLNKPFLYGFLSAWIGDGLLISKPDKWRPRRKLLTPTFHYDILKDFVEV 150

YNRHGRTLLSKFEAQAGTGEYSDVFHTITLCTLDVICEAALGTSINAQKD 200

PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVK  ILHEF 250

TSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 300

EGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDEVLGEA 350

DRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSG 400

TAVVMVPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCI 450

ANIDNMGVKLTEIT end wrong

 

>gi|25815101|emb|AL132853.2|CEY80D3A  Download subject sequence spanning the HSP Caenorhabditis elegans YAC Y80D3A, complete sequence

          Length = 102495

 

The genomic seq supports the yellow regions for Y80D3A.5 so my seq is wrong there

 

 Score =  102 bits (253), Expect = 7e-23

 Identities = 45/47 (95%), Positives = 47/47 (100%)

 Frame = +1

 

Query: 1     PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVKIL 47

             PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVK++

Sbjct: 58066 PHSPYLDAVFKMKDIVFQRLLRPHYFSDTIFNLIGPGKEHDECVKVI 58206

 

 

 Score =  109 bits (272), Expect = 5e-25

 Identities = 55/56 (98%), Positives = 56/56 (100%)

 Frame = +3

 

Query: 45    KILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 100

             +ILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM

Sbjct: 58620 QILHEFTSKAIYARKAKVDAAGGVEQLLAQETAEGRRRMAFLDLMLDMNSKGELPM 58787

 

 

 

>gi|275296|gb|M89401.1|M89401  UniGene info CEL08B12 Chris Martin sorted cDNA library Caenorhabditis elegans

           cDNA clone cm08b12 5' similar to cytochrome P450

           homologous peptide.

          Length = 528

 

 

 Score = 26.6 bits (57), Expect(2) = 2e-11

 Identities = 12/12 (100%), Positives = 12/12 (100%)

 Frame = +2

 

Query: 60 AKVDAAGGVEQL 71

          AKVDAAGGVEQL

Sbjct: 2  AKVDAAGGVEQL 37

 

 

 Score = 56.2 bits (134), Expect(2) = 2e-11

 Identities = 27/28 (96%), Positives = 27/28 (96%)

 Frame = +3

 

Query: 73  AQETAEGRRRMAFLDLMLDMNSKGELPM 100

           AQETA GRRRMAFLDLMLDMNSKGELPM

Sbjct: 42  AQETAXGRRRMAFLDLMLDMNSKGELPM 125

 

 

 

 

>gi|25815101|emb|AL132853.2|CEY80D3A  Download subject sequence spanning the HSP Caenorhabditis elegans YAC Y80D3A, complete sequence

          Length = 102495

 

 Score =  226 bits (577), Expect = 5e-60

 Identities = 107/107 (100%), Positives = 107/107 (100%)

 Frame = +3

 

Query: 1     VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60

             VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM

Sbjct: 60489 VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60668

 

Query: 61    VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE 107

             VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE

Sbjct: 60669 VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE 60809

 

 

 

 

 Score =  119 bits (299), Expect = 9e-28

 Identities = 60/61 (98%), Positives = 60/61 (98%)

 Frame = +3

 

Query: 106   GERFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI 165

             G RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI

Sbjct: 61968 GMRFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSI 62147

 

Query: 166   Y 166

             Y

Sbjct: 62148 Y 62150

 

>gi|30733169|gb|CB391459.1|CB391459  UniGene info OSTF152C3_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 153

 

 Score = 99.4 bits (246), Expect = 4e-22

 Identities = 50/50 (100%), Positives = 50/50 (100%)

 Frame = +3

 

cDNAs for the end of the protein

 

Query: 117 ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY 166

           ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY

Sbjct: 3   ILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY 152

 

>gi|30733181|gb|CB391471.1|CB391471  UniGene info OSTF152C3_2 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 142

 

 Score = 89.7 bits (221), Expect = 3e-19

 Identities = 45/46 (97%), Positives = 46/46 (100%)

 Frame = +3

 

Query: 108 RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK 153

           RFAMM+EKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK

Sbjct: 3   RFAMMDEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELK 140

 

>CYP42A1 CM08B12  M89401, AL020988 Y80D3 contig 01341 and 00427 504 aa

MGIITASLIVLTITWIIHFAFRKAKFIYNKLTVFQGPAALPLIGNFHQFHFSPEEFFEQ

SQGIAYMMRKGDERITRVWLGGLPFVLLYGAHEVEAILGSPKMLNKPFLYGFLSA

WIGDGLLISRKPDKWRPRRKLLTPTFHYDILKDFVEVYNRHGRTLLSKFEAQAGT

GEYSDVFHTITLCTLDVICEAALGTSINAQKDPHSPYLDAVFKMKDIVFQRLLRPHY

FSDTIFNLIGPGKEHDECVKVIILHEFTSKAIYARKQLLAQETAEGRRRMAFLDLML

DMNSKGELPMEGICEEVDTFTFEGHDTTSAAMNWFLHLMGANPEIQSKVQKEIDE

VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTA

VVMVPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIG (1)

RFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEKREFGDYTSIY*

 

>gi|22417578|emb|CAAC01000109.1|  Download subject sequence spanning the HSP Caenorhabditis briggsae contig cb25.fpc4175 from assembly cb25.agp8,

             whole genome shotgun sequence

          Length = 116228

 

 Score =  223 bits (567), Expect = 5e-57

 Identities = 103/107 (96%), Positives = 107/107 (100%)

 Frame = +2

 

Query: 1     VLGEADRPVSYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQVRGHTLPSGTAVVM 60

             VLGEADRP+SYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQ+RGHTLPSGTAVVM

Sbjct: 66992 VLGEADRPISYEDLGKLKYLEACFKETLRLYPSVPLIARQCVEDIQIRGHTLPSGTAVVM 67171

 

Query: 61    VPSMVHKDPRYWDDPEIFNPERFITGELKHPYAYIPFSAGSRNCIGE 107

             VPSMVHKDPRYW+DPEIFNPERFI+GELKHPYAYIPFSAGSRNCIGE

Sbjct: 67172 VPSMVHKDPRYWEDPEIFNPERFISGELKHPYAYIPFSAGSRNCIGE 67312

 

 

 

 

 Score =  110 bits (275), Expect = 4e-23

 Identities = 55/66 (83%), Positives = 60/66 (90%)

 Frame = +2

 

Query: 97    FSAGSRNCIGERFAMMEEKCILAIILKNLKVKAKLRTDEMRVAAELIIRPLYGNELKFEK 156

             FS  + N  G RFAMMEEKCILAI+LKNLKVKAKLRTD+MRVAAELIIRPL+GNELKFE+

Sbjct: 68207 FSPKTSNFPGMRFAMMEEKCILAILLKNLKVKAKLRTDQMRVAAELIIRPLFGNELKFER 68386

 

Query: 157   REFGDY 162

             REFGDY

Sbjct: 68387 REFGDY 68404

 

CYP25A4

 

C36A4.6; CE03074. use this seq

MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNI  50

IKDRVSRLGYNDTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIF 100

IQNFSNFSDRMTPDIFGMNQLNQSLLQNTYATGWKHTRSAIAPIFSTGKM 150

KAMHETLVSKIDIFLEVLKEKSSSGQKWDIFENFQSLSLDIIGKCAFAID 200

SNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELSWLWRFLYKFS 250

DLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 300

IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLN 350

YDSIHNMKYLDCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLK 400

VQPYTIHRNPANWESPDEFQPERFENWEEKSSSLKWIPFGVGPRYCVGMR 450

FAEMEFKTTIAKLLDTFELSLVPGDPPMIPETNGVIFRPRSPVRLNLKLR 500

I

 

>CYP25A4 C36A4.6    Z66495 495 aa

         Length = 496

 

 Score = 2587 (910.7 bits), Expect = 1.9e-272, P = 1.9e-272

 Identities = 493/501 (98%), Positives = 493/501 (98%)

 

Query:     1 MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRVSRLGY 60

             MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRV    

Sbjct:     1 MAILIISTIFFTIITFISYSIWRRHAIFKLRSSIGIPGPPVHWLWGNLNIIKDRV----- 55

 

Query:    61 NDTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ 120

              DTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ

Sbjct:    56 -DTTQWHPTLHTKYGPIFGLYCGTQLHITVSEEEDIKEIFIQNFSNFSDRMTPDIFGMNQ 114

 

Query:   121 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI 180

             LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI

Sbjct:   115 LNQSLLQNTYATGWKHTRSAIAPIFSTGKMKAMHETLVSKIDIFLEVLKEKSSSGQKWDI 174

 

Query:   181 FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS 240

             FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS

Sbjct:   175 FENFQSLSLDIIGKCAFAIDSNCQRDRTDLFYVQARKFVGAVDLKKSWILPVSLILPELS 234

 

Query:   241 WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 300

             WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV

Sbjct:   235 WLWRFLYKFSDLSAAELPLVKGLVDLYDRRRAGEGGNDSTDLLNLLIRRETIGKMTQREV 294

 

Query:   301 IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLNYDSIHNMKYL 360

             IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGL  DSIHNMKYL

Sbjct:   295 IENCFAFLIAGYETTSTAMMFSAYLLAEYPIVQQKLYEEIKKTKENAGLYNDSIHNMKYL 354

 

Query:   361 DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ 420

             DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ

Sbjct:   355 DCVYKESLRFYPPTTHFTNRVCLNDMTIRGQIYPEDSTLKVQPYTIHRNPANWESPDEFQ 414

 

Query:   421 PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP 480

             PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP

Sbjct:   415 PERFENWEEKSSSLKWIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELSLVPGDPPMIP 474

 

Query:   481 ETNGVIFRPRSPVRLNLKLRI 501

             ETNGVIFRPRSPVRLNLKLRI

Sbjct:   475 ETNGVIFRPRSPVRLNLKLRI 495

 

CYP25A3

 

added 7 aa to the seq

 

>CYP25A3 C36A4.3    Z66495 496 aa

MAFLILTSILVSLVSFIIYVILARKERFRLRGKIGLSGPEPHWLMGNLKQIIER

KAKLGYDDSYDWYNKLHKQFGETFGIYFGTQLNINITNEEDIKEVFIKNFSNFSDRTPPPI

IEDNKLKESLLQNTYESGWKHTRSAIAPIFSTGKMKAMHETIHSKVDLFLEIL

KEKASSGQKWDIYDDFQGLTLDVIGKCAFAIDSNCQRDRNDVFYVNARKFITN

IDIRHSKIISTSFLFPELSKLWKVLYRFTDLAKAEIPLVEGLADVYERRRGGEG

SDSVDLLKLLLNREDDKSKPMTKQEVIENCFAFLLAGYETTSTAMTYCSYLLS

KYPNVQQKLYEEIMEAKENGGLTYDSIHNMKYLDYVYKETLRCYPPVIHFSNR

RCLKDITIRGQFYPKGAIVVCLPHTVHRNPENWDSPEEFHPERFENWEEKSSSLK

WIPFGVGPRYCVGMRFAEMEFKTTIAKLLDTFELKQFEGEADLIPDCNGVIMRPKD

PVRLHLKPRN*

 

CYP34A3

 

C41G6.1; CE15699.

MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGG  50

IVAGFQVFKKQYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFT 100

TGVMNYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEY 150

RYR LLKNSR incorrect seq here

KNGVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMK 200

VYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPVEYIYSLVQKEIQ 250

ERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAIDLF 300

DLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDR 350

ARTPYLMAVLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLS 400

MIHTDEELFEDHTRFDPERFIENSTLEKKLIPFNLGKRSCPGESLARAEL 450

YLIIGNLVLDFDFEAVGIKPEIKTSTPFGIMKRPPNYISD           490

 

>CYP34A3 C41G6.1     Z81047 492 aa also T13C10 Z81591

         Length = 493

 

 Score = 2526 (889.2 bits), Expect = 5.7e-266, P = 5.7e-266

 Identities = 490/492 (99%), Positives = 490/492 (99%)

 

Query:     1 MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK 60

             MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK

Sbjct:     1 MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIVAGFQVFKK 60

 

Query:    61 QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF 120

             QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF

Sbjct:    61 QYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVMNYIREGRGIIGSNGAF 120

 

Query:   121 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRY--RLLKNSRKNGVIEVNAATMFDLLVGS 178

             WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRY  RLLKNSRKNGVIEVNAATMFDLLVGS

Sbjct:   121 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYNPRLLKNSRKNGVIEVNAATMFDLLVGS 180

 

Query:   179 IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV 238

             IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV

Sbjct:   181 IINRMLVSKRFEQGNLEFETMKVYLTKALEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPV 240

 

Query:   239 EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID 298

             EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID

Sbjct:   241 EYIYSLVQKEIQERTTSIENGSHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAID 300

 

Query:   299 LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA 358

             LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA

Sbjct:   301 LFDLWLAGQDTSSTTLLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMA 360

 

Query:   359 VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE 418

             VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE

Sbjct:   361 VLNEIQRIASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPE 420

 

Query:   419 RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF 478

             RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF

Sbjct:   421 RFIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKTSTPF 480

 

Query:   479 GIMKRPPNYISD 490

             GIMKRPPNYISD

Sbjct:   481 GIMKRPPNYISD 492

 

>CYP34A3 C41G6.1     Z81047 494 aa also T13C10 Z81591

This gene has a frameshift IQDFSKTHG revised 3/20/04

MIFVTLVTTILIYFSCKLWRGRKKNPNGPLPLPLIGNLHQLVYNSWKTGGIV

AGFQVFKKQYGKFFTLWFGPIPIVFIADYDIAYETHVKKANIFGHRFTTGVM

NYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR (2)

IQDFSKTHG (frameshift)

KNGVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMKVYLTKA

LEEVSIFEAFLPVWLLKSNLLRWRTKYTLAPVEYIYSLVQKEIQERTTSIENG

SHVPSEDGDDFVDAFLIKIEKDKKEGIDSTFTLESLAIDLFDLWLAGQDTSSTT

LLWAGICLLNHPEVVEKLRSELLEVTGGIRRLSLTDRARTPYLMAVLNEIQR

IASILNINIFRELREDTEIDGQPIAAGTVVANQLSMIHTDEELFEDHTRFDPER

FIENSTLEKKLIPFNLGKRSCPGESLARAELYLIIGNLVLDFDFEAVGIKPEIKT

STPFGIMKRPPNYISD*

 

compare to CYP34A5 B0213.10

>gi|17557253|ref|NM_071698.1|  LocusLink infoUniGene info Caenorhabditis elegans cytochrome p450 2c2 family member (5E430),

           mRNA

          Length = 1500

 

 Score =  192 bits (488), Expect = 4e-50

 Identities = 91/105 (86%), Positives = 101/105 (96%)

 Frame = +1

 

Query: 1   NYIREGRGIIGSNGAFWQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYRIQDFSKTHGKN 60

           +YIREGRGI+GSNG FWQEHRRFALTTLRNFG+G+NIMED+IMDEYRYRIQDFSKTHGKN

Sbjct: 313 DYIREGRGIVGSNGDFWQEHRRFALTTLRNFGVGRNIMEDKIMDEYRYRIQDFSKTHGKN 492

 

Query: 61  GVIEVNAATMFDLLVGSIINRMLVSKRFEQGNLEFETMKVYLTKA 105

           GVIEVNA TMFDLLVGSIINRMLVS+RFEQG+ +FE +K+YLTKA

Sbjct: 493 GVIEVNATTMFDLLVGSIINRMLVSERFEQGDQDFEKLKMYLTKA 627

 

 

>gi|6435485|emb|Z81047.2|CEC41G6  Download subject sequence spanning the HSP Caenorhabditis elegans cosmid C41G6, complete sequence

          Length = 31437

 

 Score = 74.7 bits (182), Expect = 1e-14

 Identities = 36/41 (87%), Positives = 37/41 (90%), Gaps = 1/41 (2%)

 Frame = +3

 

Query: 1     WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR-IQDFSKT 40

             WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYR  + F KT

Sbjct: 21507 WQEHRRFALTTLRNFGLGKNIMEDRIMDEYRYRCFKIFQKT 21629

 

Score = 28.5 bits (62), Expect(2) = 4e-07

 Identities = 11/12 (91%), Positives = 12/12 (100%)

 Frame = +1

 

Query: 32    YRIQDFSKTHGK 43

             +RIQDFSKTHGK

Sbjct: 21844 FRIQDFSKTHGK 21879

 

 Score = 40.8 bits (94), Expect(2) = 4e-07

 Identities = 20/23 (86%), Positives = 20/23 (86%)

 Frame = +3

 

Query: 39    KTHGKNGVIEVNAATMFDLLVGS 61

             K   KNGVIEVNAATMFDLLVGS

Sbjct: 21864 KNSRKNGVIEVNAATMFDLLVGS 21932

 

CYP33C1

 

>CYP33C1 C45H4.2 502 aa

         Length = 503

 

 Score = 2603 (916.3 bits), Expect = 3.9e-274, P = 3.9e-274

 Identities = 496/502 (98%), Positives = 497/502 (99%)

 

Query:     1 MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD 60

             MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD

Sbjct:     1 MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFERWTKKYGD 60

 

Query:    61 VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE 120

             VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE

Sbjct:    61 VYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSRGGEYGIIDSNGAMWRE 120

 

Query:   121 HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL 180

             HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL

Sbjct:   121 HRRFALSTMRDFGLGKNLMQENILMEVQDVFARLDAKLGSETDVPEVFDHAVANVVNQLL 180

 

Query:   181 FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT 240

             FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT

Sbjct:   181 FGYRFMGPKENEYQELKHIIDSPAEIFGKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDT 240

 

Query:   241 TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESE-----ANSKNLNFSNIQL 295

             TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESE     ANSKNLNFSNIQL

Sbjct:   241 TLAFFNKQIEAHRHRIDFEDLNSESTDFVETFLKEQKRRESEGDSETANSKNLNFSNIQL 300

 

Query:   296 LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY 355

             LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY

Sbjct:   301 LNVCIDLWFAGLNTTTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPY 360

 

Query:   356 FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF 415

             FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF

Sbjct:   361 FNASINESQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF 420

 

Query:   416 NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYKIHPSKEGLPS 475

             NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRY+IHPSKEGLPS

Sbjct:   421 NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYQIHPSKEGLPS 480

 

Query:   476 MAKGSGPVVAPRLFTAILTRRF 497

             MAKGSGPVVAPRLFTAILTRRF

Sbjct:   481 MAKGSGPVVAPRLFTAILTRRF 502

 

compare to cDNA

 

>gi|30738516|gb|CB396805.1|CB396805  UniGene info OSTR179D10_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 592

 

 Score = 85.5 bits (210), Expect = 4e-18

 Identities = 43/53 (81%), Positives = 45/53 (84%)

 Frame = -3

 

remove the ANSKNLN fragment Keep GDSET

 

Query: 1   NSESTDFVETFLKEQKRRESEGDSETANSKNLNFSNIQLLNVCIDLWFAGLNT 53

           NSESTDFVETFLKEQKRRESEGDSET       FSN+QL NVC+DLWFAGLNT

Sbjct: 320 NSESTDFVETFLKEQKRRESEGDSET-------FSNVQLSNVCMDLWFAGLNT 183

 

>CYP33C1 C45H4.2 502 aa revised 3/20/04 small deletion

MIIILLLTFLTIYFVYELYWKRRNFPPGPCPLPVFGNLLSIANPPPGYKAFER

WTKKYGDVYTFWIGNTPHIMINTWDKIKETFIRDADTYTNKVVLPMVTLSR

GGEYGIIDSNGAMWREHRRFALSTMRDFGLGKNLMQENILMEVQDVFARLD

AKLGSETDVPEVFDHAVANVVNQLLFGYRFMGPKENEYQELKHIIDSPAEIF

GKLHIFLAMNIPIFAKLLPESLYEGPIKTFRDTTLAFFNKQIEAHRHRIDFEDL

NSESTDFVETFLKEQKRRESEGDSETFSNIQLLNVCIDLWFAGLNT

TTNTITWAISYVLHHPEVQDKIHEELDKVIGSDRLITTADKNDLPYFNASINE

SQRGINILPLNLQHATTRDTVIDGFKIPKGTGVVAQISTVMNNEEVFPDPYTF

NPDRFIDENGKLKKVDELAPFSVGKRSCPGEGLARMELFLFIANFLNRYQ IHPS

KEGLPSMAKGSGPVVAPRLFTAILTRRF*

 

CYP33E1

 

>CYP33E1 C49C8.4    U61945 494 aa

         Length = 495

 

 Score = 2605 (917.0 bits), Expect = 2.4e-274, P = 2.4e-274

 Identities = 493/494 (99%), Positives = 493/494 (99%)

 

Query:     1 MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG 60

             MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG

Sbjct:     1 MILLILTSILIIYLFNHFYWKRRKLPPGPIPLPIIGNLYLMTEDVKPGYKMYEKLKDKYG 60

 

Query:    61 PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI 120

             PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI

Sbjct:    61 PVFTFWLANLPMVTVTDWKLIKQHFIKDGANFVGRPEFPISMEMRQGPYGIIESHGDRWI 120

 

Query:   121 QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML 180

             QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML

Sbjct:   121 QQRRFALHILRDFGLGKNLMEEKVLGEVTAMIDSIRKSMEDVDMQNIFDASVGSVINNML 180

 

Query:   181 FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG 240

             FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG

Sbjct:   181 FGYRYDETNIEEFLELKNRMNKHFKLAAEPMGGLIGMNPWLGHLPFFKGYKNVIMHNWMG 240

 

Query:   241 LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGFEMEQLDSVCFDLWV 300

             LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGF MEQLDSVCFDLWV

Sbjct:   241 LMEMFRKQATDRLASIDYDSDEYSDYVEAFLKERKKHENEQDFGGFRMEQLDSVCFDLWV 300

 

Query:   301 AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ 360

             AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ

Sbjct:   301 AGMETTSNTLNWALLYVLLNPEVRQKVYEELEREIGSDRIITTTDRPKLNYINATVNESQ 360

 

Query:   361 RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD 420

             RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD

Sbjct:   361 RLANLLPMNLSRSTNADVEIAGYRIPKDTVITPQISSVMYDPEIFPEPYEFKPERFLESD 420

 

Query:   421 GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT 480

             GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT

Sbjct:   421 GSLKKVEELVPFSIGKRQCLGEGLAKMELFLYFANLFNKFDIKFHESNPNPSIKKEVGVT 480

 

Query:   481 MKAKNYRVSMKERY 494

             MKAKNYRVSMKERY

Sbjct:   481 MKAKNYRVSMKERY 494

 

E is created at intron joint

 

CYP35A4

 

>CYP35A4 C49G7 494 aa Revised 3/19/04

         Length = 494

 

 Score = 2535 (892.4 bits), Expect = 6.3e-267, P = 6.3e-267

 Identities = 492/494 (99%), Positives = 492/494 (99%)

 

Query:     1 MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK 60

             MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK

Sbjct:     1 MFVILFISAILSWLIVRQYQKVSRLPPGPVSLPLIGNLPQIFYYLWTTGSIVSTLDLFRK 60

 

Query:    61 RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIRKDMGIMVTNGDH 120

             RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIR  MGIMVTNGDH

Sbjct:    61 RYGNIFTLWVGPLPHVSIADYEISHEVFVKNGSKYADKFHAPVMQDIRSIMGIMVTNGDH 120

 

Query:   121 WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS 180

             WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS

Sbjct:   121 WQHMRRFSLQTFRNMGVGKDIMETKIMQELDARCAEIDTSAMNGVTVTQASEFFELTVGS 180

 

Query:   181 IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF 240

             IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF

Sbjct:   181 IINSILVGKRFDATTKHEFLKIIETMDASIETASFFDMMVPVWILKTFFKHRYDNLSDAF 240

 

Query:   241 EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG 300

             EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG

Sbjct:   241 EVSKAFSAAEAIKRVDQIKSGRYFIDENNLQDYTDAFLLKIEKEGQCQDFNMETLKTMIG 300

 

Query:   301 DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM 360

             DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM

Sbjct:   301 DLWITGQETTTTTLISGFNQLLLHPEVMVKAREELMKVTENGSRSLSLTDRASTPYLNAM 360

 

Query:   361 IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER 420

             IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER

Sbjct:   361 IGEIQRHASILNVNFWKINKEFTYMGGHPVDAGALVTAQLGTLHVNETVFENPLKFDPER 420

 

Query:   421 YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI 480

             YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI

Sbjct:   421 YIRDENLLQKVIPFGVGKRSCLGESLARAELYLIFGNLLLRYKFESSSKLSTTELAPYSI 480

 

Query:   481 GKRPFKLEMKFVKI 494

             GKRPFKLEMKFVKI

Sbjct:   481 GKRPFKLEMKFVKI 494

 

KD forms at intron joint

 

CYP33C9

 

>CYP33C9 C50H11 495 aa

         Length = 497

 

 Score = 2609 (918.4 bits), Expect = 9.1e-275, P = 9.1e-275

 Identities = 492/496 (99%), Positives = 493/496 (99%)

 

Query:     1 MVLNIILVVVVAFLFHHLYWKRRNWPAGPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD 60

             MVLNIILVVVVAFLFHHLYWKRRN+  GPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD

Sbjct:     1 MVLNIILVVVVAFLFHHLYWKRRNFFPGPTPLPLIGNLLSLRNPAPGYKAFARWTAKYGD 60

 

Query:    61 IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE 120

             IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE

Sbjct:    61 IYTFWLGTRPYILVSSYEALKETFIKDGETYADKKPMAFQESFRGGSYGVVETNGPFWRE 120

 

Query:   121 HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML 180

             HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML

Sbjct:   121 HRRFAIHQFRDFGLGKDRMEQRIMLEVEDIFNNCDKTIGEGVDLTDIFDRAVGNVINQML 180

 

Query:   181 FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG 240

             FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG

Sbjct:   181 FGYRFDETRADEFRTIRAFFNFNSGEFASFSMRVQFFLPWMGYIMPGPTILDRFKKYQKG 240

 

Query:   241 FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFESFSTKQLSNMCLDLWF 300

             FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFES STKQLSNMCLDLWF

Sbjct:   241 FTEFFGTQIENHKKEIDFELEENSDYVEAFLKEQRKREASGDFESSSTKQLSNMCLDLWF 300

 

Query:   301 AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ 360

             AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ

Sbjct:   301 AALMTTSNTMTWCFAYTLNYLDAQQKLHEELDRVIGSERHINTADKPNLPYTNAYINEIQ 360

 

Query:   361 RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD 420

             RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD

Sbjct:   361 RTANLVPLNLLHMTTRDTVLKGYNIPKGTGVVAQISTVMYDENVFPEPYIFKPERFLDDD 420

 

Query:   421 GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM 480

             GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM

Sbjct:   421 GKLKKVEQLVPFSVGKRQCLGEGLARMELFLFIANFFNRYRVVPDANGPPIIDKAVLGGM 480

 

Query:   481 HTKEFKAILQRRHVSE 496

             HTKEFKAILQRRHVSE

Sbjct:   481 HTKEFKAILQRRHVSE 496

 

correct seq at two exon boundaries

 

CYP43A1

 

>CYP43A1 E03E2.1 485 aa

         Length = 485

 

 Score = 2143 (754.4 bits), Expect = 9.1e-247, Sum P(2) = 9.1e-247

 Identities = 406/406 (100%), Positives = 406/406 (100%)

 

Query:   101 PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS 160

             PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS

Sbjct:    80 PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNS 139

 

Query:   161 LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG 220

             LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG

Sbjct:   140 LSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTAEKIAYIFPETKLIFKNSFVG 199

 

Query:   221 HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK 280

             HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK

Sbjct:   200 HFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTENDHYSLLGFFFEHHNEKKLIEK 259

 

Query:   281 AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA 340

             AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA

Sbjct:   260 AEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEA 319

 

Query:   341 EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 400

             EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV

Sbjct:   320 EIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 379

 

Query:   401 VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA 460

             VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA

Sbjct:   380 VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTA 439

 

Query:   461 FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR 506

             FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR

Sbjct:   440 FRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR 485

 

 Score = 225 (79.2 bits), Expect = 9.1e-247, Sum P(2) = 9.1e-247

 Identities = 43/43 (100%), Positives = 43/43 (100%)

 

Query:    58 MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 100

             MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI

Sbjct:     1 MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 43

 

 

>cb25.fpc3857 ACC=CAAC01000068 CYP43

            Length = 474

 

 Score = 1243 (437.6 bits), Expect = 3.8e-193, Sum P(2) = 3.8e-193

 Identities = 243/363 (66%), Positives = 290/363 (79%)

 

Query:   152 SEFMDHVNSLSDGQ-SVVIN---HSHSLFQNH--TSYV-LARKAFGNFAELQKSTAEKIA 204

             S  +DH +SL     S V+    + H   QNH   +++ +   AFG  ++ QKS  EKI

Sbjct:   109 SVVIDHSHSLFQNHTSYVLARCAYGHKE-QNHRVNNFLSVFSDAFGALSDFQKSMTEKIT 167

 

Query:   205 YIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKNVDNNNGICCTEND--HY 262

             Y FPE K IFKN+    FL+S  QQKFLDYLL+LIS FQSR+ ++NNN    ++ +   Y

Sbjct:   168 YFFPEMKSIFKNNLFAQFLQSTNQQKFLDYLLNLISKFQSRRVIENNNNEESSDPEAGKY 227

 

Query:   263 SLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEEITAQCKFISVAGFDTTSNTLTL 322

             SLL FFFEHH+EKK++EKAEG+IDMKKVKVEKSISY+EITAQCKFISVAGFDTT+NTLTL

Sbjct:   228 SLLEFFFEHHHEKKIVEKAEGRIDMKKVKVEKSISYQEITAQCKFISVAGFDTTANTLTL 287

 

Query:   323 LFNFLANNPDVQDKIYEAEIKNQPEQISFETVSSLRLLQNCIFETLRLFPHASPLQTRIC 382

             LFNFLA+NP +Q++IY +EIK     ++FETV SL +LQNCIFETLRLFPHASPLQ RIC

Sbjct:   288 LFNFLAHNPAIQEQIYNSEIKGNKSSMNFETVCSLPILQNCIFETLRLFPHASPLQMRIC 347

 

Query:   383 TEPFKIGKYQFLENVQIVVNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVG 442

             T P  IG+Y+F EN+Q+V+NPWGPH DR IWG+DV+CF+PSRFE+L+EQQRKAFMPFGVG

Sbjct:   348 TAPITIGQYKFDENMQVVINPWGPHRDRVIWGDDVNCFKPSRFESLSEQQRKAFMPFGVG 407

 

Query:   443 PRQCVGMRFALLEMKTTAFRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLV 502

             PRQCVGMRFALLE+KTTAFRMLQKY V +  PV DRHGK V MTVRDTGT+WPTDKLGLV

Sbjct:   408 PRQCVGMRFALLELKTTAFRMLQKYVVRSTQPVIDRHGKLVNMTVRDTGTVWPTDKLGLV 467

 

Query:   503 LKQR 506

             L +R

Sbjct:   468 LTKR 471

 

 Score = 618 (217.5 bits), Expect = 3.8e-193, Sum P(2) = 3.8e-193

 Identities = 113/134 (84%), Positives = 131/134 (97%)

 

Query:    58 MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRIPEILSDDPITSDNIHMF 117

             MCS++GD+TFS+LRGATPV++T++V+LIHAISTE FDCFHSRIPE LSDDPIT++NIHMF

Sbjct:     1 MCSVVGDKTFSMLRGATPVIVTNDVNLIHAISTESFDCFHSRIPEALSDDPITAENIHMF 60

 

Query:   118 AAKGERWKRLRTLTSYGLSTVKLKLLFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQ 177

             AAKGERWKRLRT+TSYGLSTVKLKLLFPT++TCVSEF+DHVNSLSDGQSVVI+HSHSLFQ

Sbjct:    61 AAKGERWKRLRTITSYGLSTVKLKLLFPTIETCVSEFLDHVNSLSDGQSVVIDHSHSLFQ 120

 

Query:   178 NHTSYVLARKAFGN 191

             NHTSYVLAR A+G+

Sbjct:   121 NHTSYVLARCAYGH 134

 

>CYP43A1 E03E2.1 485 aa green missed in CE29747

MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI CICCSILIVHQGSKM

KNYCNTAKILKLCIFYNNYLQ PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGL

STVKLKLLFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQNHTSYVLARKAFG

NFAELQKSTAEKIAYIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKN

VDNNNGICCTENDHYSLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEEITA

QCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEAEIKNQPEQISFETVSSLRLLQ

NCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIVVNPWGPHHDREIWGNDV

DCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMRFALLEMKTTAFRMLQKYSVF

TNSPVHDRHGKTVRMTVRDTGTIWPTDKLGLVLKQR

 

E03E2.1; CE29747. extra seq

MILLILLPVAIFTYVFLR ()

NFKLYYTLHKCGLNPRFDIFGIKGLLWLDSSA  50

AHENFTR MCSMIGDQTFSVLRGATPVVITSNVDLIHAISTEHFDCFHSRI 100

 

PEILSDDPITSDNIHMFAAKGERWKRLRTLTSYGLSTVKLKL

LFPTMDTCVSEFMDHVNSLSDGQSVVINHSHSLFQNHTSYVLARKAFGNFAELQKSTA

EKIAYIFPETKLIFKNSFVGHFLKSATQQKFLDYLLHLISNFQSRKNVDN 250

NNGICCTENDHYSLLGFFFEHHNEKKLIEKAEGQIDMKKVKVEKSISYEE 300

ITAQCKFISVAGFDTTSNTLTLLFNFLANNPDVQDKIYEAEIKNQPEQIS 350

FETVSSLRLLQNCIFETLRLFPHASPLQTRICTEPFKIGKYQFLENVQIV 400

VNPWGPHHDREIWGNDVDCFRPSRFENLTEQQRKAFMPFGVGPRQCVGMR 450

FALLEMKTTAFRMLQKYSVFTNSPVHDRHGKTVRMTVRDTGTIWPTDKLG 500

LVLKQR                                             506

 

 

TGAATCAACAACTGTACTTGTCTGCAATTTTTTCAGTTCAATGATTCTACTTTTTCTTCT

GCCGACTGCATTATTAATTTATGTTTTAATAAGGTAAAGCTAACTAAATGTCAAGACACG

GCTCTCAGAATAGATTTTCAGAAATCTCAAAATATATTTCACACTCTATAGAAATGGACT

GAACCCCCAATTTGACATATTTGGAATCAAGGGGCTCCGATGGTTGGATAGTTCTAATGC

TCATGAAAACTTTACACAAATGTGTTCTGTTGTTGGTGATAAAACATTTTCTATGCTCCG

CGGTGCAACACCGGTCATTGTGACCAATGATGTTAATTTGATACATGCAATTTCGACGGA

AAGTTTTGATTGTTTCCATTCGCGAATTGTGAGTTTTATTTAGAATAGAAAAACAATACT

GGAAAATTCATTAAATTGGAAAATATATAATTCCAGCCAGAAGCTTTATCTGATGACCCG

ATCACAGCCGAAAATATCCACATGTTTGCTGCCAAAGGTGAGCGCTGGAAACGTCTTCGA

ACAATAACGAGTTATGGATTATCCACAGTTAAGTTAAAATTGGTATGTCCCATGACAGTT

TTGTTGTCCTTTTTATCTAATCATGTTTGTTCAGTTGTTCCCAACCATTGAGACTTGTGT

TTCTGAATTTTTGGATCACGTGAACAGCCTGTCCGATGGTCAAAGTGTTGTCATCGATCA

TTCTCATAGGTTTGTATAATCATCTTCTATTTATAGATTTGACTTTTTTCAGTCTCTTTC

AAAATCATACATCTTACGTTCTAGCTCGATGTGCTTACGGTCACAAAGAGCAAAATCATC

 

>EMBOSS_001_1

*INNCTCLQFFQFNDSTFSSADCIINLCFNKVKLTKCQDTALRIDFQKSQNIFHTL*KWT

EPPI*HIWNQGAPMVG*F*CS*KLYTNVFCCW**NIFYAPRCNTGHCDQ*C*FDTCNFDG

KF*LFPFANCEFYLE*KNNTGKFIKLENI*FQPEALSDDPITAENIHMFAAKGERWKRLR

TITSYGLSTVKLKLVCPMTVLLSFLSNHVCSVVPNH*DLCF*IFGSREQPVRWSKCCHRS

FS*VCIIIFYL*I*LFSVSFKIIHLTF*LDVLTVTKSKII

>EMBOSS_001_2

ESTTVLVCNFFSSMILLFLLPTALLIYVLIR*S*LNVKTRLSE*IFRNLKIYFTLYRNGL

NPQFDIFGIKGLRWLDSSNAHENFTQMCSVVGDKTFSMLRGATPVIVTNDVNLIHAISTE

SFDCFHSRIVSFI*NRKTILENSLNWKIYNSSQKLYLMTRSQPKISTCLLPKVSAGNVFE

Q*RVMDYPQLS*NWYVP*QFCCPFYLIMFVQLFPTIETCVSEFLDHVNSLSDGQSVVIDH

SHRFV*SSSIYRFDFFQSLSKSYILRSSSMCLRSQRAKS

>EMBOSS_001_3

NQQLYLSAIFSVQ*FYFFFCRLHY*FMF**GKAN*MSRHGSQNRFSEISKYISHSIEMD*

TPNLTYLESRGSDGWIVLMLMKTLHKCVLLLVIKHFLCSAVQHRSL*PMMLI*YMQFRRK

VLIVSIREL*VLFRIEKQYWKIH*IGKYIIPARSFI**PDHSRKYPHVCCQR*ALETSSN

NNELWIIHS*VKIGMSHDSFVVLFI*SCLFSCSQPLRLVFLNFWIT*TACPMVKVLSSII

LIGLYNHLLFIDLTFFSLFQNHTSYVLARCAYGHKEQNH

 

 

GAGTCAACAATTTTCTTTCGGTGTTTTCTGATGCGTTCGGGGCATTGTCAGATTTTCAAA

AATCAATGACTGAGAAAATTACTTGTGAGTCACTAAAACATAATTCATCTAATTGAGAAA

TCTATTTGCAGATTTTTTCCCAGAAATGAAGTCAATTTTCAAAAACAACTTATTTGCCCA

ATTTTTACAAAGTACAAATCAACAAAAATTCTTGGATTATTTACTGAATCTGATCTCAAA

ATTTCAATCCCGAAGAGTTATCGAAAACAATAATAATGAAGAATCAAGCGACCCCGAAGC

TGGCAAGTACAGTTTATTGGAGTTTTTCTTCGAACATCATCATGAAAAAAAGATTGTGGA

AAAGGCGGAAGGGCGGATTGACATGAAGAAAGTGAAAGTCGAGAAAAGTATTTCTTATCA

GGTGTGAGTCTCGGTTATCTTGTAGTCTAAAAAACGCAGTACATTAGAGTTTCTGCTGGC

TACAATGTTCATTTGTTTTGCAAATATTGAAAATTGTTTTGCAAAATATTCAATGAAAGA

AAACCGAAACTAATTTATTGCAATACTGGAGAATGTAAACATTTACTATCGTTTGTTTTT

TTCAATAGAAAAATAGACATTGGAAACTTTGGAGTCCCAAAAAGTGATCCAAATATTTTT

GAATATCGGAAAATGTTAGAGCATACTAAACAAAATTGGAGCTTCGGTTTTATCGTCGAG

AACCAAACTCTTATTACAGCTTCAGAAAAGTTAAATAAATCGAATTAGAAAAGGTTTTTT

TGAGTGATACCGGTACACTACTGGACAAAAATTTATTCTCGATTTTTTTAAAAGAATATT

GAGTTGGCAGGTCTAAAAATTATTTTACTGCTCTGAAAACTTTGTTTTTCAATGTAGACA

TAAAAAGAGATTAACTTAGGTAATGAAATTTGTCTTTAGTACATTCTGAAATTTTCAGAC

AAATCTGAAATATTTAGACTA

 

>CYP37A1 F01D5    Z81493 516 aa also Y39G8 Z92851

         Length = 517

 

 Score = 2596 (913.8 bits), Expect = 2.2e-273, P = 2.2e-273

 Identities = 503/524 (95%), Positives = 505/524 (96%)

 

Query:     1 MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE 60

             MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE

Sbjct:     1 MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGTTFQFKMDPVE 60

 

Query:    61 FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES 120

             FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES

Sbjct:    61 FALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKVQTRNMSGAQNFHIFQKVLES 120

 

Query:   121 NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDFQVVFNSQSMILL 180

             NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID    ++ Q  ILL

Sbjct:   121 NALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDEFANWDFQ--ILL 178

 

Query:   181 EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF 240

             EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF

Sbjct:   179 EQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAF 238

 

Query:   241 KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 300

             KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK

Sbjct:   239 KYQRMPWLWIKPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 298

 

Query:   301 KAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE 360

             KAFLDMLIE   EGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE

Sbjct:   299 KAFLDMLIE---EGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANCPEYQKKCHEE 355

 

Query:   361 LDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS 420

             LDEIF     E    DLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS

Sbjct:   356 LDEIF---GEEFDSFDLKKMKYLEKCVKEALRMRPSVPQMARSVEEEVEIDGKILPKGCS 412

 

Query:   421 VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK 480

             VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK

Sbjct:   413 VMISPAFIQNNPRTFPNHEVFDPERFNEDEISKRHAYAYIPFSAGPRNCIGQKFAMQEEK 472

 

Query:   481 TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ 524

             TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ

Sbjct:   473 TVISWVLRRFHIHTDIGLLENMPLPETITRPSLGFPLKFTVRQQ 516

 

compare to EST

 

>gi|14837585|dbj|AU205355.1|AU205355  UniGene info AU205355 unpublished oligo-capped cDNA library, stage L4

           Caenorhabditis elegans cDNA clone yk852g07 5'.

          Length = 603

 

 Score =  183 bits (465), Expect = 6e-48

 Identities = 94/104 (90%), Positives = 95/104 (91%), Gaps = 2/104 (1%)

 Frame = +1

 

Query: 11  QKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID--EFANW 68

           +KVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLID     N

Sbjct: 208 KKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWRGRRKMMTPSFHFNVLIDFQVVFNS 387

 

Query: 69  DFQILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV 112

              ILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV

Sbjct: 388 QSMILLEQIENAAKKTDDSTIDAFPYIKRCALDIICETAMGTTV 519

 

>gi|18321101|dbj|BJ153116.1|BJ153116   BJ153116 unpublished oligo-capped cDNA library, C. elegans L1 stage

           Caenorhabditis elegans cDNA clone yk1315e09 3'.

          Length = 725

 

 Score =  243 bits (620), Expect = 2e-65

 Identities = 124/136 (91%), Positives = 124/136 (91%), Gaps = 6/136 (4%)

 Frame = -1

 

Query: 1   KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIE---EGGLGYEDIREEVDTFMFEGHD 57

           KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIE   EGGLGYEDIREEVDTFMFEGHD

Sbjct: 623 KKVIDRKLREHDETDGMVVVEEESKKKAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHD 444

 

Query: 58  TTSAGIGWSLWCLANCPEYQKKCHEELDEIF---GEEFDSFDLKKMKYLEKCVKEALRMR 114

           TTSAGIGWSLWCLANCPEYQKKCHEELDEIF     E    DLKKMKYLEKCVKEALRMR

Sbjct: 443 TTSAGIGWSLWCLANCPEYQKKCHEELDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMR 264

 

Query: 115 PSVPQMARSVEEEVEI 130

           PSVPQMARSVEEEVEI

Sbjct: 263 PSVPQMARSVEEEVEI 216

 

This is the correct seq UNIPROT

MGIAVYLLALVVIYVVFNLSKILKFVKERMRLYHLMSKIDGPLALPLLGT  50

TFQFKMDPVEFALQLYNWGLEYSTKGSSLAAFWMGPYPMVIVLTPEANKV 100

QTRNMSGAQNFHIFQKVLESNALINKSSEYDIFLPWLGTGLLLASGEKWR 150

GRRKMMTPSFHFNVLIDFQVVFNSQSMILLEQIENAAKKTDDSTIDAFPY 200

IKRCALDIICETAMGTTVSAQTNHTHPYVVAVNEMNSLAFKYQRMPWLWI 250

KPIRQLIGYEADFQRNLDIVTSFTKKVIDRKLREHDETDGMVVVEEESKK 300

KAFLDMLIEKKEEGGLGYEDIREEVDTFMFEGHDTTSAGIGWSLWCLANC 350

PEYQKKCHEELDEIFEGTSRECSVEDLKKMKYLEKCVKEALRMRPSVPQM 400

ARSVEEEVEIDGKILPKGCSVMISPAFIQNNPRTFPNHEVFDPERFNEDE 450

ISKRHAYAYIPFSAGPRNCIGQKFAMQEEKTVISWVLRRFHIHTDIGLLE 500

NMPLPETITRPSLGFPLKFTVRQQ

 

CYP13B1

 

F02C12.5a; CE09177.

F02C12.5b; CE09178.

F02C12.5c; CE23627.

MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNK  50

PRGYLIHKWTQKFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARK 100

STNHIHGNLECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQVHDV 150

MEDSAINMVDLMAKHEDGKPFNIHAYFQEFTYDVISRLAMGQPNSELFNN 200

SGVEIVKSIFMRTHRVLPWYFTVLFPQFEHLVKRMFYNHAAVQGGDIEKL 250

LLICKKTVESRIQEREENAKLGFENAENDFIDMFLNYYSEQVEDIEFGST 300

VEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEV 350

DTVVGSENVSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGD 400

IYIDKGVKIEADVMSLHRSKEIWGENADDFVPERWLEPSSRHTMSWIPFG 450

AGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEVRLTLRFFFIFKSS 500

QIQKELNILGCTTTSPEAVTLYLKPRI                        527

 

>CYP13B1 F02C12.5   Z54269  (C-terminal) C29F7 Z92827 N-terminal 511 aa

         Length = 511

 

 Score = 2512 (884.3 bits), Expect = 7.3e-276, Sum P(2) = 7.3e-276

 Identities = 483/491 (98%), Positives = 484/491 (98%)

 

Query:     1 MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT 60

             MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT

Sbjct:     1 MGAIIVLVVLFATIAGYFKWIHTYWRRRGISGPEGLPFIGNYYDLADVNKPRGYLIHKWT 60

 

Query:    61 QKFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL 120

             Q FGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL

Sbjct:    61 QVFGKVFGYYEGAVPVLVVSDMDMLQELFLKKFDNFYARKSTNHIHGNLECSKSEPRINL 120

 

Query:   121 FTSRGARWKRLRALASPGFSVKALKQV--HDVMEDSAINMVDLMAKHEDGKPFNIHAYFQ 178

             FTSRGARWKRLRALASPGFSVKALKQV  HDVMEDSAINMVDLMAKHEDGKPFNIH YFQ

Sbjct:   121 FTSRGARWKRLRALASPGFSVKALKQVYVHDVMEDSAINMVDLMAKHEDGKPFNIHRYFQ 180

 

Query:   179 EFTYDVISRLAMGQPNSELFNNSGVEIVKSIFMRTHRVLPWYFTVLFPQFEHLVKRMFYN 238

             EFTYDVISRLAMGQPNSELFNNSGVEIVK IFMRTHRVLPWYFTVLFPQFEHLVKRMFYN

Sbjct:   181 EFTYDVISRLAMGQPNSELFNNSGVEIVK-IFMRTHRVLPWYFTVLFPQFEHLVKRMFYN 239

 

Query:   239 HAAVQGGDIEKLLLICKKTVESRIQEREENAKLGFENAENDFIDMFLNYYSEQVEDIEFG 298

             HAAVQGGDIEKLLLICKKTVESRIQERE NAKLGFENAENDFIDMFLNYYSEQVEDIEFG

Sbjct:   240 HAAVQGGDIEKLLLICKKTVESRIQERE-NAKLGFENAENDFIDMFLNYYSEQVEDIEFG 298

 

Query:   299 STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN 358

             STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN

Sbjct:   299 STVEKKVTAEDVIGACFVFLLAGFDTTANSLAYASYLLAKHPEKMKLAQEEVDTVVGSEN 358

 

Query:   359 VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR 418

             VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR

Sbjct:   359 VSYDDMTKLKYLDAVVRESLRLYPVAWFACSRECVKPTTLGDIYIDKGVKIEADVMSLHR 418

 

Query:   419 SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD 478

             SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD

Sbjct:   419 SKEIWGENADDFVPERWLEPSSRHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYD 478

 

Query:   479 LVAGVETEVRLTL 491

             LVAGVETE  L +

Sbjct:   479 LVAGVETEKELNI 491

 

 Score = 131 (46.1 bits), Expect = 7.3e-276, Sum P(2) = 7.3e-276

 Identities = 27/44 (61%), Positives = 33/44 (75%)

 

Query:   484 ETEVRLTLRFFFIFKSSQIQKELNILGCTTTSPEAVTLYLKPRI 527

             +T +   LR + +    + +KELNILGCTTTSPEAVTLYLKPRI

Sbjct:   467 KTALAHLLRRYDLVAGVETEKELNILGCTTTSPEAVTLYLKPRI 510

 

compare to ests

 

>gi|30744632|gb|CB402905.1|CB402905  UniGene info OSTF221B8_1 AD-wrmcDNA Caenorhabditis elegans cDNA.

          Length = 556

 

 Score =  192 bits (489), Expect = 4e-50

 Identities = 97/100 (97%), Positives = 97/100 (97%)

 Frame = +1

 

Query: 1   ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQVYVHDVMEDSAINMVDLMAKHED 60

           ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQV  HDVMEDSAINMVDLMAKHED

Sbjct: 262 ECSKSEPRINLFTSRGARWKRLRALASPGFSVKALKQV--HDVMEDSAINMVDLMAKHED 435

 

Query: 61  GKPFNIHRYFQEFTYDVISRLAMGQPNSELFNNSGVEIVK 100

           GKPFNIH YFQEFTYDVISRLAMGQPNSELFNNSGVEIVK

Sbjct: 436 GKPFNIHAYFQEFTYDVISRLAMGQPNSELFNNSGVEIVK 555

 

 

>gi|14853552|dbj|AU215395.1|AU215395  UniGene info AU215395 unpublished oligo-capped cDNA library, stage L1

           Caenorhabditis elegans cDNA clone yk825b05 3'.

          Length = 623

 

 Score =  145 bits (365), Expect = 4e-36

 Identities = 70/70 (100%), Positives = 70/70 (100%)

 Frame = -3

 

F02C12.5a; CE09177.

F02C12.5b; CE09178.

F02C12.5c; CE23627.

These sequences fail to remove a short intron see below.

 

Query: 1   RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE 60

           RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE

Sbjct: 291 RHTMSWIPFGAGPRQCVGMRLGLSEAKTALAHLLRRYDLVAGVETEKELNILGCTTTSPE 112

 

Query: 61  AVTLYLKPRI 70

           AVTLYLKPRI

Sbjct: 111 AVTLYLKPRI 82

 

CYP14A5

 

F08F3.7; CE09262. 100%

MSVFIVALSVFIISYVISFYWKVRKYPKGPFPLPFFGNLLQFPADNIQEH  50

LDKLSKTYGPCFTVWTPLPAVVLTDYEHIKEAFVTQGDAFVNRAQRLPEI 100

LFQPHPNTGVVFSSGDNWKIQRRTALKILRDFGLGRNLMEEQVMRSVHEM 150

LAQLEHISDKKNVDMYWPIQLCVGNVINESLFGYHYKYEDAGRFEKFVKV 200

VDRHLKIAQGNASLLVSAFPWLRHLPVIGNLGYHSIKNNIKSYQQFIEEE 250

VTSQLKNYDGESEPENFVHAYMQQMKQTGNPNLDMTNLCASVLDFWLAGM 300

ETASNSLRWHLAFMMKYPEVQDKVRNEIFENIGTARLPSMSDKQNMPYTQ 350

AVIHEVQRCSNMVPILATHMNTEDVLVKEHNIPTGTLLFAQIWSVLKNDP 400

VFEENSKFNPDRYLMPDGKTLNKTVLERTIPFSVGKRNCVGEGLARMELF 450

LIFSALIQKYEFIPKTNVDLKPVYGGVITVKPYLCELVPQNA

 

CYP35D1

 

>CYP35D1 F14H3.10 498 aa

         Length = 499

 

 Score = 2595 (913.5 bits), Expect = 2.8e-273, P = 2.8e-273

 Identities = 498/499 (99%), Positives = 498/499 (99%)

 

add the extra K

 

Query:     1 MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK 60

             MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK

Sbjct:     1 MILILLFLTAITVITVRLYRKVLRFPPGPFPLPLIGNAHQIAYQAWRRGGILPALDYYRK 60

 

Query:    61 KYGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN 120

              YGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN

Sbjct:    61 -YGNAYTLWLGPKASVSITDFETSQEVFVKQGKKCYNRQLAPILEHVTGGVGLLIANGEN 119

 

Query:   121 WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS 180

             WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS

Sbjct:   120 WAEMRRFTLLTFRQMGVGTNIMEKRIMDELNGRCLEIDAQIARNDRAIVDVKFFDLTVGS 179

 

Query:   181 VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ 240

             VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ

Sbjct:   180 VINSFLIGKRFEDEEEFLKIKKLFDESSETFNIFDLNVPVWFLKTFLPSRFKLTWDSRHQ 239

 

Query:   241 IMDHVMKGVEERIRDIESGAYKIDPKKPNDVVDAFLSKMKKEEEIAGGQHPYYNLKSLKL 300